Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enhanced CGEL trees? #29

Open
nschneid opened this issue Jun 19, 2022 · 3 comments
Open

Enhanced CGEL trees? #29

nschneid opened this issue Jun 19, 2022 · 3 comments
Labels
enhancement New feature or request
Milestone

Comments

@nschneid
Copy link
Contributor

nschneid commented Jun 19, 2022

This is a long-term idea: I'm starting to wonder if we should have an enhanced version of trees that might incorporate additional info such as

  • punctuation (added in regular trees)
  • lemmas where different from wordform (added in regular trees)
  • PTB POS tag for morphology
  • phrase-level morphological features, WH status, clause types
  • lexeme-internal structure to match UD tokenization (added subtokens in regular trees)
  • something for control/raising? cf. UD xcomp
  • finer-grained functions as used in CGEL/SIEG itself, for the benefit of readers (e.g. "Predicate", "Nucleus", "CatComp")
  • notes indicating the name of a special construction/phenomenon, CGEL page references, ambiguity, etc.
  • token and constituent offsets for computational use

In my view one of the main reasons to have a CGEL treebank (and eventually parser) is for explanation to humans, so the more pointers we have the better. But it's good to have a simple version of the tree that avoids clutter.

@nschneid nschneid added the enhancement New feature or request label Aug 20, 2022
@nschneid
Copy link
Contributor Author

#88 is a first step re: morphology

For clausal constructions I think we should have a script that automatically adds features for at least:

  • clause type (Add clause types? #10)
  • passive—regular, prepositional
  • extraposition
  • cleft
  • existential
  • relative clause subtype—integrated vs. supplementary, bare/THAT/WH, fronted PP, function of gap?, etc.
  • and maybe x for xcomp-taking predicates

In addition, on function words, we could mark:

  • expletive it, there
  • V_aux varieties: pass, prog, perf, cop, supp (do-support)...
  • subjunctive mood? conditional would?

@nschneid nschneid added this to the v1.1 milestone Jun 12, 2023
@nschneid
Copy link
Contributor Author

nschneid commented Jul 9, 2024

  • for EUD: antecedents of WH relativizers (will it always be the preceding sister, even for supplementary relatives)?

@nschneid
Copy link
Contributor Author

Hypothesis: xcomp = PredComp + subjectless (nonfinite) Clause as Comp in VP (catenative complement) [also consider coordination of clauses]

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant