Question about image scaling for tf.classify()

ShawnHymel · May 1, 2021, 10:41pm

I’m working on a demo with the OpenMV H7 suing the tf (TensorFlow Lite) package. When I call tf.classify(net, img), how is scaling or cropping performed on the image?

For example, I have a CNN trained on Fashion-MNIST with an input tensor of 28x28. I have a 240x240 window as my input image and ROI in OpenMV. When this image data is sent to tf.classify(), does tf.classify() automatically scale this image to match the required input tensor (28x28)? If so, what scaling technique is used (e.g. area-averaging)?

Next, tf.classify() still seems to work even when the input image is not the same aspect ratio. For example, I change the sensor to sensor.set_windowing((120, 240)), and tf.classify() happily accepts this image data. From what I can tell, the image is scaled to match the smallest dimension, and the rest is cropped. So, the 120x240 image is scaled to 28x56, and the top 14 rows and bottom 14 rows are cropped out so that you’re left with a 28x28 px square image. Do I have this right?

Here is my code. Please note that “trained.tflite” is a CNN trained in Edge Impulse with Fashion-MNIST.

# Edge Impulse - OpenMV Image Classification Example

import sensor, image, time, os, tf

sensor.reset()                         # Reset and initialize the sensor.
sensor.set_pixformat(sensor.GRAYSCALE)    # Set pixel format to RGB565 (or GRAYSCALE)
sensor.set_framesize(sensor.QVGA)      # Set frame size to QVGA (320x240)
sensor.set_windowing((120, 240))       # Set 240x240 window.
sensor.skip_frames(time=2000)          # Let the camera adjust.

net = "trained.tflite"
labels = [line.rstrip('\n') for line in open("labels.txt")]

clock = time.clock()
while(True):
    clock.tick()

    img = sensor.snapshot()

    # Default classify: perform one inference on whole image
    obj = tf.classify(net, img)[0]
    predictions_list = list(zip(labels, obj.output()))
    print(max(predictions_list, key=lambda x: x[1]))

    print(clock.fps(), "fps")

kwagyeman · May 2, 2021, 2:09am

github.com

openmv/openmv/blob/master/src/omv/modules/py_tf.c#L220


      
          
          
    return tf_model;
          }
          
          
STATIC mp_obj_t py_tf_load(uint n_args, const mp_obj_t *args, mp_map_t *kw_args)
          {
              bool alloc_mode = py_helper_keyword_int(n_args, args, 1, kw_args, MP_OBJ_NEW_QSTR(MP_QSTR_load_to_fb), false);
              return int_py_tf_load(args[0], alloc_mode, false);
          }
          STATIC MP_DEFINE_CONST_FUN_OBJ_KW(py_tf_load_obj, 1, py_tf_load);
          
          
STATIC mp_obj_t py_tf_load_builtin_model(mp_obj_t path_obj)
          {
              mp_obj_t net = int_py_tf_load(path_obj, false, false);
              const char *path = mp_obj_str_get_str(path_obj);
              mp_obj_t labels = mp_obj_new_list(0, NULL);
          
          
    for (int i=0; i<MP_ARRAY_SIZE(libtf_builtin_models); i++) {
                  const libtf_builtin_model_t *model = &libtf_builtin_models[i];
                  if (!strcmp(path, model->name)) {
                      for (int l=0; l < model->n_labels; l++) {

It will be updated top area scaling soon. But, it does just you think right now with nearest neighbor.

ShawnHymel · May 2, 2021, 2:02pm

Awesome, thank you! That helps a lot.

Topic		Replies	Views
Resize the image to give it as an input to neural network OpenMV Boards	7	1576	July 16, 2022
Image Classification with EdgeImpulse - poor performance OpenMV Boards	4	736	September 13, 2021
Specify max_scale in tf.classify OpenMV Boards	1	508	June 22, 2021
Tensorflow Lite model output type error OpenMV Boards ml	14	201	June 6, 2025
Problem with imput type machine learning edge impulse OpenMV Boards tensorflow	13	284	January 25, 2025

Question about image scaling for tf.classify()

Related topics