How to extract Cast and Activity features? #5
Comments
Please check out https://github.com/movienet/movienet-tools. We will keep improving it 🎉 Thanks for your interest 😄 |
Hi, Thanks for this excellent project! Just wondering where "cast_feat", "act_feat", and "aud_feat" are located in the Google drive? Running "run.sh" had a lot of errors due to the missing files under these directories. For example: FileNotFoundError: [Errno 2] No such file or directory: '../data/scene318/act_feat/tt1375666.pkl' Thanks so much and have a good day! |
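Before launching "run.sh", it can help to list which feature files are actually missing. The following is a minimal sketch, not part of SceneSeg: the helper name `find_missing_features` is hypothetical, and the layout `<data_root>/<modality>/<video_id>.pkl` is only confirmed for `act_feat` by the error message above; applying it to other modalities is an assumption.

```python
from pathlib import Path


def find_missing_features(data_root, video_ids, modalities=("cast_feat", "act_feat")):
    """Return the expected feature files that do not exist on disk."""
    root = Path(data_root)
    missing = []
    for modality in modalities:
        for video_id in video_ids:
            # Assumed layout: <data_root>/<modality>/<video_id>.pkl
            path = root / modality / f"{video_id}.pkl"
            if not path.is_file():
                missing.append(path)
    return missing


if __name__ == "__main__":
    # Path and video id taken from the error message in this thread.
    for path in find_missing_features("../data/scene318", ["tt1375666"]):
        print("missing:", path)
```

An empty result means every checked file is present under the assumed layout; it says nothing about whether the contents have the expected dimensions.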
Hi miaoqiz, Thanks for your interest. The features are already uploaded; you may follow the guidance at https://github.com/AnyiRao/SceneSeg/blob/master/docs/INSTALL.md#prepare-datasets-for-scene318 Best, |
Hi @AnyiRao |
Hello xpngzhng, You need to use the You may refer to the following place feature extraction example.
|
Hi AnyiRao I want to use SceneSeg and movienet-tools to run a video clip scene segmentation using aud_feat, place_feat, cast_feat and act_feat, so I need to split shots first and then extract all the four kinds of features. I can extract place_feat and aud_feat using SceneSeg, since the code is available, but I still need to extract cast_feat and act_feat.
|
Hello xpngzhng, Thanks for your interest in the project. You may refer to http://docs.movienet.site/movie-toolbox/tools/extract_feature or implement something like the following. The cast feature consists of the face feature and the person body feature. An example of extracting the face feature is as follows. Person body feature extractor loads the
Best,
|
Thank you for your quick response |
Hi @AnyiRao It is not difficult to extract cast_feat with movienet-tools; at least I can organize a pkl file with the same keys and feature dimension as the cast_feat of a movie in the scene318 dataset. But I failed to produce a similar act_feat. The act_feat of one movie in scene318 dataset is like: I hope I am using movienet-tools correctly. Is it possible to obtain a 512-length action feature for each shot? |
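To compare a self-extracted pkl with the one shipped in scene318, a quick summary of key count and feature shapes is often enough. This is a rough sketch under the assumption that each pkl holds a dict mapping a shot key to a feature vector; the thread only says the file has "keys" and a feature dimension, so the exact structure is not confirmed. The helper name `summarize_feature_pkl` is hypothetical.

```python
import pickle
from collections import Counter


def summarize_feature_pkl(path):
    """Return (number of keys, {feature shape: count}) for a feature pkl.

    Assumes the pkl stores a dict of key -> feature vector. Only unpickle
    files from a source you trust.
    """
    with open(path, "rb") as f:
        feats = pickle.load(f)
    shapes = Counter()
    for feat in feats.values():
        # NumPy arrays expose .shape; plain sequences fall back to their length.
        shape = getattr(feat, "shape", None)
        if shape is None:
            shape = (len(feat),)
        shapes[tuple(shape)] += 1
    return len(feats), dict(shapes)
```

Running it on both the reference file and the newly extracted file shows at a glance whether the key counts agree and whether any shot has an unexpected dimension. If the features were saved as NumPy arrays, NumPy must be installed for `pickle.load` to succeed.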
Hi @xpngzhng You have done a good job. What you did is correct. After discussing it with my collaborators, I can confirm that the feature size cannot match the previous one, since we updated the backbone and the model to a better version. As we said, the project is an ongoing effort; we are iteratively making it better, and this causes the version mismatch. We also need to clarify that the cast feature doesn't match the previous one either. In the previous version, the cast feature (dim=512) is the concatenation of the face feature (dim=256) and the body feature (dim=256). Now the dimension of the face feature is 512 and that of the body feature is 256. Please take note of this. The good news is that the release of the videos is entering its final round (this Wednesday). You may extract the features from the videos then. You are cordially invited to contact us via email if you have any further questions, as the CVPR deadline is approaching. We will try our best to adapt to your use case. Best,
|
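The dimension change described above can be illustrated with a short sketch. It assumes the cast feature is a plain concatenation with the face feature first and the body feature second; the reply gives the dimensions but does not confirm the order, and the function name `build_cast_feature` is hypothetical.

```python
import numpy as np


def build_cast_feature(face_feat, body_feat):
    """Concatenate a face feature and a body feature into one cast feature.

    The face-then-body order is an assumption, not confirmed by the authors.
    """
    face = np.asarray(face_feat, dtype=np.float32).ravel()
    body = np.asarray(body_feat, dtype=np.float32).ravel()
    return np.concatenate([face, body])


# Previous release: 256-d face + 256-d body.
old = build_cast_feature(np.zeros(256), np.zeros(256))
# Current movienet-tools extractors: 512-d face + 256-d body.
new = build_cast_feature(np.zeros(512), np.zeros(256))
print(old.shape, new.shape)  # (512,) (768,)
```

With the newer extractors the concatenated vector is 768-dimensional rather than 512, which is consistent with the size mismatch reported when such features are fed to a model trained on the earlier 512-dimensional cast features.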
Hi @AnyiRao |
Hi, Can you kindly advise how to interpret the prediction output from "python run.py ../run/xxx/xxx.py"? For example: demo 0020 1 1 What does each column represent? Thanks so much and have a good day! |
Hi @miaoqiz The function to write the output is as follows, https://github.com/AnyiRao/SceneSeg/blob/master/lgss/utilis/dataset_utilis.py#L160 And you could also find out that the template is Best,
|
Thanks so much! @AnyiRao How can I find the timecode / frame range of a predicted shot? For example, how do I find the specifics of "0020"? Thanks! |
Hi @miaoqiz If you follow my file naming rule, Best,
|
Hi @AnyiRao, Thank you for sharing your codebase. What model did you use for face feature extraction? I am trying to replicate your results; however, as you mentioned, the face feature extractor from movienet-tools has a 512-length output whereas yours is 256. |
Hi @AnyiRao, Thank you for releasing your code. When will you release the raw videos? If the raw videos cannot be released soon, would you be willing to release the model you used to extract cast_feat? Thank you! |
Hi @AnyiRao, I appreciate you sharing the codebase of the LGSS framework. Thank you very much!
|
Hi @AnyiRao Many congratulations to you on the awesome work. Going through this thread, I realize that the models for face and action feature extraction have changed. Because of this, I am getting size mismatch errors. I need to extract these features for a custom dataset that my team has created for scene boundary detection, and also compare it with your approach. I would be grateful if you could provide the models and scripts for face and action feature extraction that you used. I have successfully modified the movienet-tools library to be compatible with our dataset, but I run into the size mismatch when I run the scene segmentation model (all.py config). |
Now that there are new models uploaded to Google Drive, I am trying to process a video with all 4 modes, but I do not see how I can extract the features for the Cast and Activity modes.
Could you give me any pointers on this?
Thank you