Summary of the paper

Title The DAD Parallel Corpora and their Uses
Authors Costanza Navarretta
Abstract This paper deals with the uses of the annotations of third person singular neuter pronouns in the DAD parallel and comparable corpora of Danish and Italian texts and spoken data. The annotations contain information about the functions of these pronouns and their uses as abstract anaphora. Abstract anaphora have constructions such as verbal phrases, clauses and discourse segments as antecedents and refer to abstract objects comprising events, situations and propositions. The analysis of the annotated data shows the language-specific characteristics of abstract anaphora in the two languages compared with the uses of abstract anaphora in English. Finally, the paper presents machine learning experiments run on the annotated data in order to identify the functions of third person singular neuter personal pronouns and neuter demonstrative pronouns. The results of these experiments vary from corpus to corpus. However, they are all comparable with the results obtained in similar tasks in other languages. This is very promising because the experiments have been run on both written and spoken data using a classification of the pronominal functions which is much more fine-grained than the classifications used in other studies.
Language Tools, systems, applications
Topics Anaphora, Coreference, Corpus (creation, annotation, etc.), Tools, systems, applications
Full paper The DAD Parallel Corpora and their Uses
Bibtex @InProceedings{NAVARRETTA10.325,
  author = {Costanza Navarretta},
  title = {The DAD Parallel Corpora and their Uses},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }