Summary of the paper

Title Named Entity Recognition in Questions: Towards a Golden Collection
Authors Ana Cristina Mendes, Luísa Coheur and Paula Vaz Lobo
Abstract Named Entity Recognition (NER) plays a relevant role in several NaturalLanguage Processing tasks. Question-Answering (QA) is an example of such, sinceanswers are frequently named entities in agreement with the semantic categoryexpected by a given question. In this context, the recognition of namedentities is usually applied in free text data. NER in natural languagequestions can also aid QA and, thus, should not be disregarded. Nevertheless,it has not yet been given the necessary importance.In this paper, we approach the identification and classification of namedentities in natural language questions. We hypothesize that NER results canbenefit with the inclusion of previously labeled questions in the trainingcorpus. We present a broad study addressing that hypothesis, focusing on thebalance to be achieved between the amount of free text and questions in orderto build a suitable training corpus. This work also contributes by providing aset of nearly 5,500 annotated questions with their named entities, freelyavailable for research purposes.
Language Question Answering
Topics Corpus (creation, annotation, etc.), Named Entity recognition, Question Answering
Full paper Named Entity Recognition in Questions: Towards a Golden Collection
Bibtex @InProceedings{MENDES10.97,
  author = {Ana Cristina Mendes, Luísa Coheur and Paula Vaz Lobo},
  title = {Named Entity Recognition in Questions: Towards a Golden Collection},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA