-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Consistent analysis of after all
#19
Comments
I would favor the EWT approach. Note also that "all" attaches as |
Thanks for pointing out the inconsistencies. I will fix them. As regards the choice of DET vs. PRON for all I have followed the same annotation guidelines as for the Swedish treebanks and, apparently, generally for UD v.1. The UPOS would depend on the presence or absence of a head word. One reason is that I want the annotation for English-LinES and Swedish-LinES to be as similar as possible. And I'm not sure that the guidelines for English DET vs. PRON is in total agreement with the general guidelines for DET. What is the "hypothetical modified noun" in cases such as first of all, above all, at all? I am also puzzled why demonstratives are treated differently from the quantifiers all, some, each, ... in following the v.1 guidelines? The general guidelines say that DETs in comparison to PRONs "are more likely to be used attributively (modifying a noun phrase) than substantively (replacing a noun phrase). Is there a difference here for English between quantifiers and demonstratives? I note that German has a stricter division, only words that cannot be used attributively are assigned the UPOS PRON. This means that English, German and Swedish apply different principles for the choice of DET vs PRON. It would be easy to make English_LinES follow the current principles, it would just mean changing the UPOS and FEATS in accordance with the proposed values when there is a deviation. (And it would not be so difficult to change back). However, I'd prefer waiting some time in the hope that there may be more agreement among the Germanic language family as a whole. |
You are right that there is an exception for demonstratives in the English guidelines, which otherwise generally follow Penn conventions that never treat "all", "some", etc. as pronouns. That decision predated me. Perhaps @jnivre can weigh in on whether a uniform definition of DET vs. PRON for Germanic languages is desirable (and worth the disruption to longstanding within-language practices). |
I agree with @LarsAhrenberg that it would be good to discuss this more generally, at least for Germanic languages. I think the special treatment of demonstratives in English is a heritage from the PTB. However, it is not clear that this should carry over to UPOS tags (as opposed to XPOS tags). |
There are a few different analyses of
after all
in this treebank:Actually, none of these quite agree with what is done in EWT, which treats
all
as aDET
in this expression:Open to having a PR which unifies these treatments?
The text was updated successfully, but these errors were encountered: