Summary of the paper

Title Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation
Authors Simon Mille and Leo Wanner
Abstract The relevance of syntactic dependency annotated corpora is nowadaysunquestioned. However, a broad debate on the optimal set of dependency relationtags did not take place yet. As a result, largely varying tag sets of a largelyvarying size are used in different annotation initiatives. We propose ahierarchical dependency structure annotation schema that is more detailed andmore flexible than the known annotation schemata. The schema allows us tochoose the level of the desired detail of annotation, which facilitates the useof the schema for corpus annotation for different languages and for differentNLP applications. Thanks to the inclusion of semantico-syntactic tags into theschema, we can annotate a corpus not only with syntactic dependency structures,but also with valency patterns as they are usually found in separate treebankssuch as PropBank and NomBank. Semantico-syntactic tags and the level of detailof the schema furthermore facilitate the derivation of deep-syntactic andsemantic annotations, leading to truly multilevel annotated dependency corpora.Such multilevel annotations can be readily used for the task of ML-basedacquisition of grammar resources that map between the different levels oflinguistic representation ― something which forms part of, for instance, anynatural language text generator.
Language Multilinguality
Topics Corpus (creation, annotation, etc.), Grammar and Syntax, Multilinguality
Full paper Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation
Bibtex @InProceedings{MILLE10.697,
  author = {Simon Mille and Leo Wanner},
  title = {Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA