Title |
Active Learning for Building a Corpus of Questions for Parsing |
Authors |
Jordi Atserias, Giuseppe Attardi, Maria Simi and Hugo Zaragoza |
Abstract |
This paper describes how we built a dependency Treebank for questions. The questions for the Treebank were drawn from questions from the TREC 10 QA task and from Yahoo! Answers. Among the uses for the corpus is to train a dependency parser achieving good accuracy on parsing questions without hurting its overall accuracy. We also explore active learning techniques to determine the suitable size for a corpus of questions in order to achieve adequate accuracy while minimizing the annotation efforts. |
Language |
|
Topics |
Corpus (creation, annotation, etc.), Parsing |
Full paper  |
Active Learning for Building a Corpus of Questions for Parsing |
Bibtex |
@InProceedings{ATSERIAS10.656,
  author = {Jordi Atserias and Giuseppe Attardi and Maria Simi and Hugo Zaragoza},
  title = {Active Learning for Building a Corpus of Questions for Parsing},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
} |