netVLAD-pytorch & RL method Usage Policy gradient: python pg_train.py Q learning python q_train.py # train python q_test.py # test Main Code Structure policy.py: Q network q_train.py: Q learning algorithm for training pg_train.py: policy net and REINFORCE algorithm