Recorded stream annotation

johanteleman · March 10, 2021, 12:14pm

Hello!

I’m recording raw OpenMV H7 streams to use for developing a machine vision algorithm. When initially prototyping the algorithm it was fine to run in on the recorded streams, draw some relevant lines, and manually validate how the code was doing. Now that we’re getting more and more recorded streams however, this becomes time consuming and hard. I would like to instead hand-annotate the recordings with the correct output somehow, to create a dataset with known output. This way I could evaluate my algorithm using regular supervised methods, and numerically describe how well it was performing across all datasets.

Do you have any suggestions on how to achieve this? I’m thinking some tool which would display recorded frames and let me draw the correct output every now and then (and interpolate between these keyframes) would make this process sort of doable. Should I mod the IDE? Create my own tool reading raw streams?

Best,
Johan

kwagyeman · March 10, 2021, 6:09pm

Hi, what type of algorithms are you doing? Have you checked out Edge Impulse CNNs for object classification? They will have bounding box support soon.

johanteleman · March 10, 2021, 8:39pm

I work for https://elonroad.com/, and am implementing a system for real-time tracking of a rail while driving. I didn’t think CNN was the right choice because I’m under the impression it’s not fast enough (we need about 30-40fps). Essentially we’re detecting a conceptual line, and would like to compare the output (slope/offset) to a human-defined truth.

It’s also interesting to note that any frame is highly dependent on the previous one, which can be leveraged to improve robustness.

kwagyeman · March 10, 2021, 8:54pm

Ah, okay, so you are using like find line segments and find lines.

So, what is the exact pain point? The ImageReader and writer allow you to replay your code on what happened.

iabdalkader · March 10, 2021, 9:03pm

It’s now called ImageIO… Anyway I think I understand what you want to do, you want to draw the lines between key frames, and have the tool interpolate between key frames, so it’s less manual work. This is just too application specific, the IDE can’t do that for you. You can try to implement it yourself in the IDE is opensource, but it’s probably a lot easier to write something with Open-CV + Python, the raw video format is really easy to parse and you can output the truth table to txt file and read it using the frame number as index.

johanteleman · March 10, 2021, 9:38pm

@kwagyeman I could be using line_segments or find_blobs or whatever. The point is that if I come up with a new way to do it, which I want to try, I don’t want to watch minutes or hours of video to try to see if it’s better or worse than the previous one.

Yes exactly @iabdalkader :). I dunno if it’s application specific really, half of machine learning is based of having supervised data to train on - I just want to supervise. I’m completely fine with you not wanting it in the IDE though. Would you consider adding it if I made a PR?

Otherwise, a pointer to how to parse the raw-format would be helpful, I can code some custom annotator from there.

iabdalkader · March 10, 2021, 10:09pm

Yes mostly manually labeled or someone writes specific code to label it, if it’s not too generic like adding a label to an image, which the IDE can do already.

It’s not like we don’t want it in the IDE, was just explaining why it’s not there already. Yes we welcome any contributions.

The format is not documented, it starts with a 16 bytes header which you can skip, followed by frames until the EOF, each frame is:
4 bytes timestamp
4 bytes width
4 bytes height
4 bytes BPP
Image data (whbpp bytes)…

See the code here:

github.com

openmv/openmv/blob/master/src/omv/modules/py_imageio.c#L109


      
                        stream->closed ? "\"true\"" : "\"false\"",
                        stream->count,
                        stream->offset,
                        #if defined(IMLIB_ENABLE_IMAGE_FILE_IO)
                        (stream->type == IMAGE_IO_FILE_STREAM)  ? stream->version : 0,
                        #else
                        0,
                        #endif
                        (stream->type == IMAGE_IO_FILE_STREAM) ? 0 : stream->size,
                        #if defined(IMLIB_ENABLE_IMAGE_FILE_IO)
                        (stream->type == IMAGE_IO_FILE_STREAM) ? f_size(&stream->fp) : (stream->count * stream->size));
                        #else
                        stream->count * stream->size);
                        #endif
          }
          
          STATIC mp_obj_t py_imageio_get_type(mp_obj_t self) {
              py_imageio_obj_t *stream = MP_OBJ_TO_PTR(self);
              return mp_obj_new_int(stream->type);
          }
          STATIC MP_DEFINE_CONST_FUN_OBJ_1(py_imageio_get_type_obj, py_imageio_get_type);

johanteleman · March 11, 2021, 7:23pm

Great, I’ll be back!

Topic		Replies	Views
Any way to add images captured during the testing of my model to the dataset itself OpenMV Boards	22	496	October 25, 2023
Record photo from device for ML training OpenMV Boards	2	122	January 6, 2024
Use recorded video OpenMV Boards	1	1064	January 11, 2020
Image Stream OpenMV Boards	39	18967	August 24, 2017
Multiple object tracking/recognition OpenMV Boards	7	852	March 31, 2022

Recorded stream annotation

Related topics