Skip to content

Latest commit

 

History

History
31 lines (21 loc) · 3.04 KB

README.md

File metadata and controls

31 lines (21 loc) · 3.04 KB

PnPRansac-mutiprocessing

Reproduction and revision of solvePnPRansac algorithm

Purpose

  1. This program is used in a project in my lab for a paper in ICCV. I will provide the paper's link in the future.

  2. In this project, we change the PnPRansac algorithm and make it finishing iteration only when the current Rotation Matrix or Translation Vector has little change with last Rotation Matrix or Translation Vector no more than a threshold which can be set by user freely rather than finishing iteration when the algorithm reach the iteration times.

Notice

  1. In our project the "ProReRansac.py" is put in a neural network later so the type of many variables is torch.Tensor(). In the program the input can be numpy.array() or t.Tensor(). But if you only use numpy.array(), you'd better to delete the related t.Tensor() variable for coding's efficiency.

  2. For fast running speed, We provide another version of this algorithm: muti-ProReRansac_for_numpy.py. It is running with the help of mutiprocessing(Recommend to use this). Of course you can change my code and make it available for cuda.

  3. In the code, notes are written by Chinese and output the state of system continuously. If you have any trouble for them please feel free to contact me. I will answer you soon.

  4. Pytorch 4.0 or higher version is ok, and it's usable for any numpy version.

File

Test Sample

  1. I provide two test samples which can interpret how my program work.

  2. 1.txt and 2.txt are two groundtruth poses for scenes_coordinate1.npy and scenes_coordinate2.npy.

  3. "scenes_coordinate1.npy" and "scenes_coordinate2.npy" is a 1×3×640×480 array. "640×480" record an image's image coordinates, and "3" record the world coordinates. In other words, if an array of a point (1,:,4,8) is (7,6,5), which mean there is a point in the world and the world coordinate of the point is (7,6,5), and this point is record in current image with an image coordinate (4,8). This file is generated by saving a numpy.array(1×3×640×480). (Notice: After reloading it to program, it will be 1×3×480×640; But the real image is 3×640×480, (640 and 480 is reverse). Because this image can't be clockwise and counterclockwise rotate, so I readjust it in the code. You may adjust the code according to your circumstances.)

  4. The transform matrices in 1.txt and 2.txt are Twc, which means:Pw=Twc·Pc, Pw is the world coordinate and Pc is the camera coordinate.

ReDefineError.py

In this file there are three functions.They compute the reprojection error,angle error,and the difference between two transform matrix. Angle error hasn't been used in any files. Of course you can try to use it to replace the reprojection error.

angle error=||(||f·K^(-1)·Pu||/||T·Pw||)·T·Pw -f·K^(-1)·Pu||, in this equation, f is focus length, K is camera matrix, Pw is world coordinate, Pu is image coordinate, "||" is F normal form.

License

For commercial use,please leave a message to let me know.