Summary of the paper

Title FrAG, a Hybrid Constraint Grammar Parser for French
Authors Eckhard Bick
Abstract This paper describes a hybrid system (FrAG) for tagging / parsing French text,and presents results from ongoing development work, corpus annotation andevaluation. The core of the system is a sentence scope Constraint Grammar (CG),with linguist-written rules. However, unlike traditional CG, the system useshybrid techniques on both its morphological input side and its syntactic outputside. Thus, FrAG draws on a pre-existing probabilistic Decision Tree Tagger(DTT) before and in parallel with its own lexical stage, and feeds its outputinto a Phrase Structure Grammar (PSG) that uses CG syntactic function tagsrather than ordinary terminals in its rewriting rules. As an alternativearchitecture, dependency tree structures are also supported. In the newestversion, dependencies are assigned within the CG-framework itself, and caninteract with other rules. To provide semantic context, a semantic prototypeontology for nouns is used, covering a large part of the lexicon. In a recenttest run on Parliamentary debate transcripts, FrAG achieved F-scores of 98.7 %for part of speech (PoS) and between 93.1 % and 96.2 % for syntactic functiontags. Dependency links were correct in 95.9 %.
Language Tools, systems, applications
Topics Parsing, Corpus (creation, annotation, etc.), Tools, systems, applications
Full paper FrAG, a Hybrid Constraint Grammar Parser for French
Bibtex @InProceedings{BICK10.688,
  author = {Eckhard Bick},
  title = {FrAG, a Hybrid Constraint Grammar Parser for French},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA