Title |
Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation |
Authors |
Daisuke Kawahara and Sadao Kurohashi |
Abstract |
We present a method for acquiring reliable predicate-argument structures from raw corpora for automatic compilation of case frames. Such lexicon compilation requires highly reliable predicate-argument structures to practically contribute to Natural Language Processing (NLP) applications, such as paraphrasing, text entailment, and machine translation. However, to precisely identify predicate-argument structures, case frames are required. This issue is similar to the question "what came first: the chicken or the egg?" In this paper, we propose the first step in the extraction of reliable predicate-argument structures without using case frames. We first apply chunking to raw corpora and then extract reliable chunks to ensure that high-quality predicate-argument structures are obtained from the chunks. We conducted experiments to confirm the effectiveness of our approach. We successfully extracted reliable chunks of an accuracy of 98% and high-quality predicate-argument structures of an accuracy of 97%. Our experiments confirmed that we succeeded in acquiring highly reliable predicate-argument structures that can be used to compile case frames. |
Language |
Knowledge Discovery/Representation |
Topics |
Acquisition, Lexicon, lexical database, Knowledge Discovery/Representation |
Full paper  |
Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation |
Bibtex |
author = {Daisuke Kawahara and Sadao Kurohashi}, title = {Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation}, booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |