Summary of the paper

Title Second HAREM: Advancing the State of the Art of Named Entity Recognition in Portuguese
Authors Cláudia Freitas, Cristina Mota, Diana Santos, Hugo Gonçalo Oliveira and Paula Carvalho
Abstract In this paper, we present Second HAREM, the second edition of an evaluation campaign for Portuguese, addressing named entity recognition (NER). This second edition also included two new tracks: the recognition and normalization of temporal entities (proposed by a group of participants, and hence not covered in this paper) and ReRelEM, the detection of semantic relations between named entities. We summarize the setup of Second HAREM by showing the preserved distinctive features and discussing the changes compared to the first edition. Furthermore, we present the main results achieved and describe the available resources and tools developed under this evaluation, namely, (i) the golden collections, i.e. a set of documents whose named entities and semantic relations between those entities were manually annotated, (ii) the Second HAREM collection (which contains the unannotated version of the golden collection), as well as the participating systems' results on it, (iii) the scoring tools, and (iv) SAHARA, a Web application that allows interactive evaluation. We end the paper by offering some remarks about what was learned.
Language Evaluation methodologies
Topics Named Entity recognition, Information Extraction, Information Retrieval, Evaluation methodologies
Full paper Second HAREM: Advancing the State of the Art of Named Entity Recognition in Portuguese
Bibtex @InProceedings{FREITAS10.412,
  author = {Cláudia Freitas and Cristina Mota and Diana Santos and Hugo Gonçalo Oliveira and Paula Carvalho},
  title = {Second HAREM: Advancing the State of the Art of Named Entity Recognition in Portuguese},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }