Title |
Creating a Coreference Resolution System for Italian |
Authors |
Massimo Poesio, Olga Uryupina and Yannick Versley |
Abstract |
This paper summarizes our work on creating a full-scale coreference resolution(CR) system for Italian, using BART ― an open-source modular CR toolkitinitially developed for English corpora. We discuss our experiments onlanguage-specific issues of the task. As our evaluation experiments show, alanguage-agnostic system (designed primarily for English) can achieve aperformance level in high forties (MUC F-score) when re-trained and tested on anew language, at least on gold mention boundaries. Compared to this level, wecan improve our F-score by around 10% introducing a small number oflanguage-specific changes. This shows that, with a modular coreferenceresolution platform, such as BART, one can straightforwardly develop a familyof robust and reliable systems for various languages. We hope that ourexperiments will encourage researchers working on coreference in otherlanguages to create their own full-scale coreference resolution systems ― aswe have mentioned above, at the moment such modules exist only for very fewlanguages other than English. |
Language |
|
Topics |
Anaphora, Coreference |
Full paper  |
Creating a Coreference Resolution System for Italian |
Bibtex |
@InProceedings{POESIO10.755,
author = {Massimo Poesio, Olga Uryupina and Yannick Versley}, title = {Creating a Coreference Resolution System for Italian}, booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |