Title |
Interacting Semantic Layers of Annotation in SoNaR, a Reference Corpus of Contemporary Written Dutch |
Authors |
Ineke Schuurman, Véronique Hoste and Paola Monachesi |
Abstract |
This paper reports on the annotation of a corpus of 1 million words with four semantic annotation layers, including named entities, co-reference relations, semantic roles and spatial and temporal expressions. These semantic annotation layers can benefit from the manually verified part of speech tagging, lemmatization and syntactic analysis (dependency tree) information layers which resulted from an earlier project (Van Noord et al., 2006) and will thus result in a deeply syntactically and semantically annotated corpus. This annotation effort is carried out in the framework of a larger project which aims at the collection of a 500-million word corpus of contemporary Dutch, covering the variants used in the Netherlands and Flanders, the Dutch-speaking part of Belgium. All the annotation schemes used were (co-)developed by the authors within the Flemish-Dutch STEVIN-programme as no previous schemes for Dutch were available. They were created taking into account standards (either de facto or official (like ISO)) used elsewhere. |
Language |
Semantics |
Topics |
Corpus (creation, annotation, etc.), Discourse annotation, representation and processing, Semantics |
Full paper  |
Interacting Semantic Layers of Annotation in SoNaR, a Reference Corpus of Contemporary Written Dutch |
Bibtex |
@InProceedings{SCHUURMAN10.162,
  author = {Ineke Schuurman and Véronique Hoste and Paola Monachesi},
  title = {Interacting Semantic Layers of Annotation in SoNaR, a Reference Corpus of Contemporary Written Dutch},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
} |