surfacesyntacticud/SUD_French-Rhapsodie

Summary

A Surface-Syntactic Universal Dependencies (SUD) corpus of spoken French.

Introduction

The corpus was converted automatically from the Rhapsodie treebank with manual corrections.

Structure

  • fr_rhapsodie.sud.train.conllu: 1288 sentences, 19144 tokens
  • fr_rhapsodie-ud-dev.conllu: 1082 sentences, 12908 tokens
  • fr_rhapsodie-ud-test.conllu: 840 sentences, 12191 tokens
  • total: 3210 sentences, 44243 tokens
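
Counts like the ones above can be recomputed from any CoNLL-U file. The following is a minimal Python sketch, not the maintainers' own tooling: the function name `count_conllu` and the sample sentences are made up for illustration, and it assumes that "tokens" means lines with a plain integer ID (multiword ranges such as `1-2` and empty nodes such as `1.1` are skipped). If the figures in this README were computed differently, the results may not match exactly.

```python
def count_conllu(lines):
    """Count sentences and tokens in an iterable of CoNLL-U lines.

    Assumption: a token is a line whose ID is a plain integer;
    multiword ranges (1-2) and empty nodes (1.1) are not counted.
    """
    sentences = 0
    tokens = 0
    in_sentence = False
    for line in lines:
        line = line.rstrip("\n")
        if not line.strip():
            # A blank line ends the current sentence.
            in_sentence = False
            continue
        if line.startswith("#"):
            # Comment / metadata line (sent_id, text, ...).
            continue
        token_id = line.split("\t", 1)[0]
        if "-" in token_id or "." in token_id:
            continue
        if not in_sentence:
            sentences += 1
            in_sentence = True
        tokens += 1
    return sentences, tokens


# Made-up two-sentence sample, for illustration only.
sample = (
    "# sent_id = demo-1\n"
    "# text = bonjour\n"
    "1\tbonjour\tbonjour\tINTJ\t_\t_\t0\troot\t_\t_\n"
    "\n"
    "# sent_id = demo-2\n"
    "1\tc'\tce\tPRON\t_\t_\t2\tsubj\t_\t_\n"
    "2\test\têtre\tAUX\t_\t_\t0\troot\t_\t_\n"
    "\n"
)

print(count_conllu(sample.splitlines()))
```

To run it on a real file, pass an open file handle, for example `count_conllu(open("fr_rhapsodie-ud-dev.conllu", encoding="utf-8"))`.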

Development

The corpus is maintained here in the SUD framework and automatically converted into UD_French-Rhapsodie using the Grew software with the conversion rules described here.

About

SUD version of the French spoken corpus Rhapsodie
