Summary of the paper

Title Named and Specific Entity Detection in Varied Data: The Quæro Named Entity Baseline Evaluation
Authors Olivier Galibert, Ludovic Quintard, Sophie Rosset, Pierre Zweigenbaum, Claire Nédellec, Sophie Aubin, Laurent Gillard, Jean-Pierre Raysz, Delphine Pois, Xavier Tannier, Louise Deléger and Dominique Laurent
Abstract The Quæro program that promotes research and industrial innovation ontechnologies for automatic analysis and classification of multimediaand multilingual documents. Within its context a set of evaluationsof Named Entity recognition systems was held in 2009. Four tasks weredefined. The first two concerned traditional named entities in Frenchbroadcast news for one (a rerun of ESTER 2) and of OCR-ed oldnewspapers for the other. The third was a gene and protein nameextraction in medical abstracts. The last one was the detection ofreferences in patents. Four different partners participated, giving atotal of 16 systems. We provide a synthetic descriptions of all ofthem classifying them by the main approaches chosen (resource-based,rules-based or statistical), without forgetting the fact that anymodern system is at some point hybrid. The metric (the relativelystandard Slot Error Rate) and the results are also presented anddiscussed. Finally, a process is ongoing with preliminary acceptanceof the partners to ensure the availability for the community of allthe corpora used with the exception of the non-Quæro produced ESTER 2one.
Language
Topics Named Entity recognition
Full paper Named and Specific Entity Detection in Varied Data: The Quæro Named Entity Baseline Evaluation
Bibtex @InProceedings{GALIBERT10.191,
  author = {Olivier Galibert, Ludovic Quintard, Sophie Rosset, Pierre Zweigenbaum, Claire Nédellec, Sophie Aubin, Laurent Gillard, Jean-Pierre Raysz, Delphine Pois, Xavier Tannier, Louise Deléger and Dominique Laurent},
  title = {Named and Specific Entity Detection in Varied Data: The Quæro Named Entity Baseline Evaluation},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA