Summary of the paper

Title C-3: Coherence and Coreference Corpus
Authors Cristina Nicolae, Gabriel Nicolae and Kirk Roberts
Abstract The phenomenon of coreference, covering entities, their mentions and theirproperties, is intricately linked to the phenomenon of coherence, covering thestructure of rhetorical relations in a discourse. A text corpus that has bothphenomena annotated can be used to test hypotheses about their interrelation orto detect other phenomena. We present the process by which C-3, a new corpus,was obtained by annotating the Discourse GraphBank coherence corpus with entityand mention information. The annotation followed a set of ACE guidelinesadapted to favor coreference and to include entities of unknown types in theannotation. Together with the corpus we offer a new annotation toolspecifically designed to annotate entity and mention information within asimple and functional graphical interface that combines the “best of allworlds” from available annotation tools. The potential usefulness of C-3 isdiscussed,as well as an application in which the corpus proved to be a valuable resource.
Language Discourse annotation, representation and processing
Topics Corpus (creation, annotation, etc.), Anaphora, Coreference, Discourse annotation, representation and processing
Full paper C-3: Coherence and Coreference Corpus
Bibtex @InProceedings{NICOLAE10.622,
  author = {Cristina Nicolae, Gabriel Nicolae and Kirk Roberts},
  title = {C-3: Coherence and Coreference Corpus},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA