Title |
The DARPA Machine Reading Program - Encouraging Linguistic and Reasoning Research with a Series of Reading Tasks |
Authors |
Stephanie Strassel, Dan Adams, Henry Goldberg, Jonathan Herr, Ron Keesing, Daniel Oblinger, Heather Simpson, Robert Schrag and Jonathan Wright |
Abstract |
The goal of DARPAs Machine Reading (MR) program is nothing less than makingthe worlds natural language corpora available for formal processing. Mosttext processing research has focused on locating mission-relevant text(information retrieval) and on techniques for enriching text by transforming itto other forms of text (translation, summarization) ― always for use byhumans. In contrast, MR will make knowledge contained in text available informs that machines can use for automated processing. This will be done withlittle human intervention. Machines will learn to read from a few examples andthey will read to learn what they need in order to answer questions or performsome reasoning task. Three independent Reading Teams are building universaltext engines which will capture knowledge from naturally occurring text andtransform it into the formal representations used by Artificial Intelligence.An Evaluation Team is selecting and annotating text corpora with task domainconcepts, creating model reasoning systems with which the reading systems willinteract, and establishing question-answer sets and evaluation protocols tomeasure progress toward this goal. We describe development of the MR evaluationframework, including test protocols, linguistic resources and technicalinfrastructure. |
Language |
Evaluation methodologies |
Topics |
LR national/international projects, organizational/policy issues, Corpus (creation, annotation, etc.), Evaluation methodologies |
Full paper  |
The DARPA Machine Reading Program - Encouraging Linguistic and Reasoning Research with a Series of Reading Tasks |
Bibtex |
@InProceedings{STRASSEL10.862,
author = {Stephanie Strassel, Dan Adams, Henry Goldberg, Jonathan Herr, Ron Keesing, Daniel Oblinger, Heather Simpson, Robert Schrag and Jonathan Wright}, title = {The DARPA Machine Reading Program - Encouraging Linguistic and Reasoning Research with a Series of Reading Tasks}, booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |