Equation-Reading-by-Robot-Tracking 🤖

Goal

Read a simple handwritten equation (digits and operator) from the visual tracking of robot hovering the characters.

Input

The input video is an .avi file displaying a filmed red arrow moving over different handwritten digits and operators.

Approach

Since the equation elements are steady in the video, they only need to be classified once from the first frame. The First frame is analyzed in three steps.

Get a binary mask of the object
Label objects and get their bounding box
Classify object type (digit or operators) and their value (the character represented)

The classification of type is made using K-means since operators are blue and digits are black (and the arrow is red). Due to the uniformity of the operators over videos, the classification of operator's value is made using a 5-NN models on the 5 first Fourier descriptors (translation, rotation and scale invariant). We have access to one example of of each operator, but rotating each of them by 1° gives slight variation on the mask and thus the Fourier descriptors. We can therefore train the 5-NN on 1800 samples. We obtain a test accuracy of 100% with a perfect separation of the classes. The digit are classified using a 4-layers MLP (784 -> 200 -> 100 -> 50 -> 9). Since the digit can be rotated on the original image, the MLP has to be rotation invariant. To do so we trained the MLP on randomly rotated MNIST images.

The second step is tracking the arrow and read the equation. At each frame we detect the arrow as the largest object on the segmentation mask. We get the bounding box of the arrow and add the value to the equation if the arrow overlap one of the detected element.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
data		data
models		models
outputs		outputs
resources		resources
scripts		scripts
src		src
.gitignore		.gitignore
LICENSE		LICENSE
Presentation_Team35.pdf		Presentation_Team35.pdf
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Equation-Reading-by-Robot-Tracking 🤖

Goal

Input

Approach

Result on training sequence

Result on test sequences

Without element rotation

With element rotation

About

Releases

Packages

Languages

License

antoine-spahr/Equation-Reading-by-Robot-Tracking

Folders and files

Latest commit

History

Repository files navigation

Equation-Reading-by-Robot-Tracking 🤖

Goal

Input

Approach

Result on training sequence

Result on test sequences

Without element rotation

With element rotation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages