Title |
C-3: Coherence and Coreference Corpus |
Authors |
Cristina Nicolae, Gabriel Nicolae and Kirk Roberts |
Abstract |
The phenomenon of coreference, covering entities, their mentions and theirproperties, is intricately linked to the phenomenon of coherence, covering thestructure of rhetorical relations in a discourse. A text corpus that has bothphenomena annotated can be used to test hypotheses about their interrelation orto detect other phenomena. We present the process by which C-3, a new corpus,was obtained by annotating the Discourse GraphBank coherence corpus with entityand mention information. The annotation followed a set of ACE guidelinesadapted to favor coreference and to include entities of unknown types in theannotation. Together with the corpus we offer a new annotation toolspecifically designed to annotate entity and mention information within asimple and functional graphical interface that combines the best of allworlds from available annotation tools. The potential usefulness of C-3 isdiscussed,as well as an application in which the corpus proved to be a valuable resource. |
Language |
Discourse annotation, representation and processing |
Topics |
Corpus (creation, annotation, etc.), Anaphora, Coreference, Discourse annotation, representation and processing |
Full paper  |
C-3: Coherence and Coreference Corpus |
Bibtex |
@InProceedings{NICOLAE10.622,
author = {Cristina Nicolae, Gabriel Nicolae and Kirk Roberts}, title = {C-3: Coherence and Coreference Corpus}, booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |