Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Links that redirect to a different title. #1

Open
arifr1234 opened this issue Mar 11, 2021 · 1 comment
Open

Links that redirect to a different title. #1

arifr1234 opened this issue Mar 11, 2021 · 1 comment

Comments

@arifr1234
Copy link
Owner

For example:
If you open the Wikipedia Graph page of "Sunflowers (Van Gogh series)"
and click on the link: "Oil on canvas" you will be redirected to the page "Oil painting" but the title remains "Oil on canvas".
If you will now open "Oil painting" you will be able to see that they both have the same contents but different titles.
The reason for that is that the URL of the initial link: "Oil on canvas" was: https://en.wikipedia.org/wiki/Oil_on_canvas which normally immediately redirects to https://en.wikipedia.org/wiki/Oil_painting. The rest API took care of redirecting and getting the HTML content of "Oil painting", but when fetching the titles out to the links (to build the graph for instance), the pre-redirect title is considered. So as far as the graph concerns the "Oil on canvas" and "Oil painting" are different articles, even though "Oil on canvas" redirects to "Oil painting".

@arifr1234
Copy link
Owner Author

This can be solved by using pageids as keys to the dictionaries instead of the page titles.

@arifr1234 arifr1234 reopened this Jul 30, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant