You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for sharing your code. I have a question about the input dataset which I would need to jsonify. I download the dataset and used the respective data partitions, for example, WikiQA-test.tsv for test set which has a sample file entry below.
QuestionID Question DocumentID DocumentTitle SentenceID Sentence Label
Q0 HOW AFRICAN AMERICANS WERE IMMIGRATED TO THE US D0 African immigration to the United States D0-0 African immigration to the United States refers to immigrants to the United States who are or were nationals of Africa . 0
Now, I'm confused because in the jsonify code, the question would point to D0-0 which is the sentenceID. It seems that the question_id and the question were interchanged, am I right or did I miss out anything?
The indexing for some variables like question and queston_id seem interchanged, because jsonify.py requires some extra preprocessing beforehand (and I am sorry that it is not provided on this repo).
It is basically for removing the questions that do not contain correct answer in it, as described on the original paper.
So please fix the code if you think it is necessary.
Dear Butsugiri,
Thank you for sharing your code. I have a question about the input dataset which I would need to jsonify. I download the dataset and used the respective data partitions, for example, WikiQA-test.tsv for test set which has a sample file entry below.
Now, I'm confused because in the jsonify code, the
question
would point toD0-0
which is the sentenceID. It seems that thequestion_id
and thequestion
were interchanged, am I right or did I miss out anything?should have been the following?
Cheers,
Kurt
The text was updated successfully, but these errors were encountered: