Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

2 fix incomplete abstract or title #5

Merged
merged 2 commits into from
Feb 25, 2022

Conversation

iacopy
Copy link
Owner

@iacopy iacopy commented Feb 25, 2022

In some cases the title and/or abstract obtained was incomplete
(issue gijswobben#23).

This happens when the text contains html markup tags (<b>, <i>, <sub>, <sup>, ...).

Example: PMID 31689885

<ArticleTitle>Gamma Irradiated <i>Rhodiola sachalinensis</i> Extract Ameliorates [...]</ArticleTitle>
Before the fix the returned title was just: 'Gamma Irradiated '
<AbstractText>The effect of <i>Rhodiola sachalinensis</i> Boriss extract irradiated [...]</ArticleTitle>
Before the fix the returned abstract was just: 'The effect of '

Fastest solution found: cleanup of frequently used html markup tags like <b>, <i>, <sub>, <sup>.
It seems to fix most of the papers correctly, at least for the above-mentioned tags.

@iacopy iacopy added the bug Something isn't working label Feb 25, 2022
In some cases the title and/or abstract obtained was incomplete
(issue #2) -- 23 in the original pymed repo.

This happens when the text contains html markup tags
(<b>, <i>, <sub>, <sup>, ...).

Example: PMID 31689885
<ArticleTitle>Gamma Irradiated <i>Rhodiola sachalinensis</i> Extract Ameliorates [...]</ArticleTitle>
Before the fix the returned title was just: 'Gamma Irradiated '
<AbstractText>The effect of <i>Rhodiola sachalinensis</i> Boriss extract irradiated [...]</ArticleTitle>
Before the fix the returned abstract was just: 'The effect of '

Fastest solution found: cleanup of frequently used html markup tags like <b>, <i>, <sub>, <sup>.
It seems to fix most of papers correctly, at least for the above mentioned tags.
@iacopy iacopy force-pushed the 2-fix-incomplete-abstract-or-title branch from e13872b to 8f373ff Compare February 25, 2022 20:19
@iacopy iacopy merged commit ebf8880 into master Feb 25, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant