Summary of the paper

Title Active Learning for Building a Corpus of Questions for Parsing
Authors Jordi Atserias, Giuseppe Attardi, Maria Simi and Hugo Zaragoza
Abstract This paper describes how we built a dependency Treebank for questions. Thequestions for the Treebank were drawn from questions from the TREC 10 QA taskand from Yahoo! Answers. Among the uses for the corpus is to train a dependencyparser achieving good accuracy on parsing questions without hurting its overallaccuracy. We also explore active learning techniques to determine the suitablesize for a corpus of questions in order to achieve adequate accuracy whileminimizing the annotation efforts.
Language
Topics Corpus (creation, annotation, etc.), Parsing
Full paper Active Learning for Building a Corpus of Questions for Parsing
Bibtex @InProceedings{ATSERIAS10.656,
  author = {Jordi Atserias, Giuseppe Attardi, Maria Simi and Hugo Zaragoza},
  title = {Active Learning for Building a Corpus of Questions for Parsing},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA