Monocular Visual Odometry Pipeline

This repository contains the implementation of a monocular visual odometry pipeline. First, an initial set of 2D-3D correspondences is extracted from the first frames of each dataset. Then, the camera pose is estimated from the subsequent frames. Whenever possible, available OpenCV implementations of the algorithms are used for maximum efficiency.

We achieved acceptable local accuracy on the datasets. However, our implementation suffers from global scale ambiguity which is expected due to the monocular setup. We could mitigate some drift by tuning the parameters but to achieve better results on a global scale, more parameter tuning would be necessary. In addition, further optimization techniques such as Bundle Adjustment or Pose-Graph Optimization, as well as Place Recognition for loop detection and closure would be required to receive more reliable global results.

The program was developed from scratch as part of the course Vision Algorithms for Mobile Robots at UZH, taught by Prof. Scaramuzza.

Screencasts:

These are the links to the screencasts recorded for each of the datasets:

KITTI: Kitti
Malaga: Malaga
Parking: Parking

Specifications of the machine used to record the screencasts

The machine on which we recorded the screencasts was an ASUS Zenbook 14 OLED with the following specifications:

Intel Evo i9 CPU
32 GB RAM
Nvidia GeForce RTX onboard graphics
Runnning Ubuntu 22.04

The metrics while running the KITTI pipeline were the following, as seen in this screencast of the System Monitor taken while the pipeline was running: Kitti System Monitor

Putting the datasets in the right place

You can download the datasets from the Robot Perception Group website.

From the base directory of this repository, the images of the datasets should be inside the following folder structure:

KITTI:

data/kitti/05/image_0/

Malaga:

data/malaga/malaga-urban-dataset-extract-07_rectified_800x600_Images/

Parking:

data/parking/images/

You can select the desired dataset inside the code as described further down.

Running the pipeline

To run the pipeline, create a virtual environment with the requirements.txt file and make sure that you correctly place the datasets. Then run the main.py file. This executes the entire pipeline and creates the intermediate plots as well as the final metric plot.

To run the pipeline with conda, perform the following steps:

Navigate to the base directory of the repo (where this readme is also located) and install the anaconda environment from the provided environment.yml file:

conda env create -f environment.yml

Activate the conda environment:

conda activate VAMR_Project

Then run the following command:

python3 main.py

The dataset can be selected by changing the dataset integer in line 26 of the main.py file:

...
# Setup
dataset = 0 # 0: KITTI, 1: Malaga, 2: parking, 3: test <---- select desired dataset here
...

Name		Name	Last commit message	Last commit date
Latest commit History 78 Commits
.github/workflows		.github/workflows
.vscode		.vscode
img		img
test_data		test_data
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
VAMR_Project_Report.pdf		VAMR_Project_Report.pdf
bootstrap.py		bootstrap.py
environment_projectvision.yml		environment_projectvision.yml
main.py		main.py
process_frame.py		process_frame.py
requirements.txt		requirements.txt
vo_project_statement.pdf		vo_project_statement.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Monocular Visual Odometry Pipeline

Screencasts:

Specifications of the machine used to record the screencasts

Putting the datasets in the right place

KITTI:

Malaga:

Parking:

Running the pipeline

About

Releases

Packages

Contributors 4

Languages

jvw01/monocular-vo

Folders and files

Latest commit

History

Repository files navigation

Monocular Visual Odometry Pipeline

Screencasts:

Specifications of the machine used to record the screencasts

Putting the datasets in the right place

KITTI:

Malaga:

Parking:

Running the pipeline

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages