TinyML-CAM

Code for MobiCom'22 paper 'TinyML-CAM: 80 FPS Image Recognition in 1 Kb RAM'

arduino-library c-plus-plus computer-vision edge-analytics edge-computing edgeimpulse embedded-c embedded-systems esp32-cam image-recognition iot-device tinyml

Go to file

Bharath Sudharsan 4582e0580d Update README.md		2022-07-23 09:21:01 +01:00
ESP32-image-object-classification-live-demo.mp4	Added demo video	2022-07-18 05:57:39 +01:00
LICENSE	Initial commit	2022-07-18 05:31:50 +01:00
README.md	Update README.md	2022-07-23 09:21:01 +01:00
[h]-HogClassifier.h	Updated code	2022-07-23 04:16:33 +01:00
[h]-HogPipeline.h	Updated code	2022-07-23 04:16:33 +01:00
[ino]-CameraWebServer.ino	Update [ino]-CameraWebServer.ino	2022-07-23 04:26:24 +01:00
[ino]-arduino-ESP32-code.ino	added code	2022-07-23 04:08:11 +01:00
[ipynb]-TinyML-CAM-full-code-with-markdown.ipynb	added code	2022-07-23 04:08:11 +01:00

README.md

TinyML-CAM - Image Recognition System that Runs at 80 FPS in 1 Kb RAM

Demo - HOG and Random Forest based Image Recognition on ESP32

ESP32 classifying Raspberry Pi Pico, Portenta H7, Wio Terminal from image frames

https://user-images.githubusercontent.com/16524846/179447640-d7f5efa9-3a44-431c-922d-348ee526c782.mp4

Following can be observed from the video:

Time. For image frames, the digital signal processing (DSP) based features extraction time is ≈ 12 ms, while classification time is ≈ < 20 𝜇𝑠 (1/1000^th of DSP).
FPS. It is 1000/12 ms = 83.3 FPS, which is the time taken by the TinyML-CAM image recognition system to process (DSP) plus classify using a single image frame. Since the ESP32 has a 30 FPS frame rate, just to capture frames, it takes 1000/30 = 33 ms. So the entire frame rate is 1000/(33+12) = 22 FPS.
Accuracy. As expected during Pairplot analysis, Portenta and Pi (features overlapped) are mislabelled quite often, which can be rectified by improving dataset quality.
Memory. Consumes only 1 kB of RAM - difference between the RAM calculated by Arduino IDE before and after adding the TinyML-CAM image recognition system.

Requirements

To capture images from the ESP32 with ease, install Eloquent library via Arduino IDE library manager.
To collect images on a PC and train an ML classifier, install EverywhereML Python package.
To test the TinyML-CAM pipeline, users only require an ESP32 of any variant:
- AI Thinker (the most widely used)
- Espressif
- M5Stack (recommend as it comes with 4 Mb external PSRAM)

Code

[ino]-CameraWebServer.ino - For image dataset collection. After upload to ESP32, it will connect to WiFi network and start an HTTP video streaming server that can be accessed from any web broswer.
[h]-HogClassifier.h - Contains the RandomForestClassifier trained using the collected image data.
[h]-HogPipeline.h - Contains the HOG features extrator for image frames.
[ino]-arduino-ESP32-code.ino - Upload to ESP32 along with the above two .h files. After upload, put your objects in front of the camera to see predicted labels.
[ipynb]-TinyML-CAM-full-code-with-markdown.ipynb - Contains all the required code required for this project, along with sample outputs in each step.

Future Work

To lower the DSP time (currently 12 ms) by implementing mathematical approximation methods, which will boost the frame rate - i.e., if reduced to 6 ms, then 1000/6 ms = 166.6 FPS.

Similar to the TinyML benchmark, we plan to test the pipeline on a range of datasets, ML algorithms, and IoT boards.