Title |
WikiWoods: Syntacto-Semantic Annotation for English Wikipedia |
Authors |
Dan Flickinger, Stephan Oepen and Gisle Ytrestøl |
Abstract |
WikiWoods is an ongoing initiative to provide rich syntacto-semanticannotations for English Wikipedia. We sketch an automated processing pipelineto extract relevant textual content from Wikipedia sources, segment documentsinto sentence-like units, parse and disambiguate using a broad-coverageprecision grammar, and support the export of syntactic and semantic informationin various formats. The full parsed corpus is accompanied by a subset ofWikipedia articles for which gold-standard annotations in the same format wereproduced manually. This subset was selected to represent a coherent domain,Wikipedia entries on the broad topic of Natural Language Processing. |
Language |
Semantics |
Topics |
Corpus (creation, annotation, etc.), Grammar and Syntax, Semantics |
Full paper  |
WikiWoods: Syntacto-Semantic Annotation for English Wikipedia |
Bibtex |
@InProceedings{FLICKINGER10.432,
author = {Dan Flickinger, Stephan Oepen and Gisle Ytrestøl}, title = {WikiWoods: Syntacto-Semantic Annotation for English Wikipedia}, booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |