Parsing the normalized and annotated student papers

The Stanza toolkit (https://stanfordnlp.github.io/stanza/) was used to parse and POS-tag the normalized student papers in AC format.
The Stanza output is then aligned with the annotation files in AA format (Glozz) in order to retrieve the morpho-syntactic information for each annotated chain link (maillon).
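
The alignment step can be illustrated with a minimal sketch. It assumes that both the tokens and the annotated units carry character offsets into the same normalized text (Stanza tokens expose `start_char` and `end_char`; whether the Glozz offsets line up with them exactly is an assumption here). The names `Token`, `Span`, and `align` are hypothetical and are not taken from the project's scripts.

```python
from typing import List, NamedTuple, Optional, Tuple


class Token(NamedTuple):
    """A parsed token (hypothetical structure, filled from the parser output)."""
    text: str
    upos: str
    start: int  # offset of the first character
    end: int    # offset one past the last character


class Span(NamedTuple):
    """An annotated unit (hypothetical structure, filled from the AA file)."""
    unit_id: str
    start: int
    end: int


def align(tokens: List[Token], spans: List[Span]) -> List[Tuple[Token, Optional[str]]]:
    """Pair each token with the id of the first span it overlaps, or None."""
    rows = []
    for tok in tokens:
        match = None
        for span in spans:
            # Two half-open intervals overlap when each starts before the other ends.
            if tok.start < span.end and span.start < tok.end:
                match = span.unit_id
                break
        rows.append((tok, match))
    return rows


# Toy data: "Le chat dort", with one annotated unit covering "Le chat".
tokens = [
    Token("Le", "DET", 0, 2),
    Token("chat", "NOUN", 3, 7),
    Token("dort", "VERB", 8, 12),
]
spans = [Span("u1", 0, 7)]
aligned = align(tokens, spans)
```

Matching on overlap rather than on exact boundaries keeps a token attached to its unit even when the tokenizer and the annotator split the text slightly differently; nested or overlapping units would need a policy other than "first match".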

The output is a CSV file containing, for each token of a student paper:
