Summary of the paper

Title A Modality Lexicon and its use in Automatic Tagging
Authors Kathrin Baker, Michael Bloodgood, Bonnie Dorr, Nathaniel W. Filardo, Lori Levin and Christine Piatko
Abstract This paper describes our resource-building results for an eight-week JHU HumanLanguage Technology Center of Excellence Summer Camp for Applied LanguageExploration (SCALE-2009) on Semantically-Informed Machine Translation.Specifically, we describe the construction of a modality annotation scheme, amodality lexicon, and two automated modality taggers that were built using thelexicon and annotation scheme. Our annotation scheme is based on identifyingthree components of modality: a trigger, a target and a holder. We describe howour modality lexicon was produced semi-automatically, expanding from an initialhand-selected list of modality trigger words and phrases. The resultingexpanded modality lexicon is being made publicly available. We demonstrate thatone tagger―a structure-based tagger―results in precision around 86%(depending on genre) for tagging of a standard LDC data set. In a machinetranslation application, using the structure-based tagger to annotate Englishmodalities on an English-Urdu training corpus improved the translation qualityscore for Urdu by 0.3 Bleu points in the face of sparse training data.
Language Knowledge Discovery/Representation
Topics Lexicon, lexical database, Semantics, Knowledge Discovery/Representation
Full paper A Modality Lexicon and its use in Automatic Tagging
Bibtex @InProceedings{BAKER10.446,
  author = {Kathrin Baker, Michael Bloodgood, Bonnie Dorr, Nathaniel W. Filardo, Lori Levin and Christine Piatko},
  title = {A Modality Lexicon and its use in Automatic Tagging},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA