Skip to content

Latest commit

 

History

History
19 lines (11 loc) · 876 Bytes

README.md

File metadata and controls

19 lines (11 loc) · 876 Bytes

Summary

A Universal Dependencies corpus for spoken French.

Introduction

The corpus was converted automatically from the Rhapsodie treebank with manual corrections.

Structure

  • fr_rhapsodie.sud.train.conllu: 1288 sentences 19144 tokens
  • fr_rhapsodie-ud-dev.conllu 1082 sentences 12908 tokens
  • fr_rhapsodie-ud-test.conllu 840 sentences 12191 tokens
  • total 3210 sentences 44243 tokens

Development

The corpus is maintained here in the SUD framework and automatically converted into UD_French-Rhapsodie using the Grew software with the conversions rules described here.