Summary of the paper

Title Cross-lingual Ontology Alignment using EuroWordNet and Wikipedia
Authors Gosse Bouma
Abstract This paper describes a system for linking the thesaurus of the NetherlandsInstitute for Sound and Vision to English WordNet and dbpedia. The thesauruscontains subject (concept) terms, and names of persons, locations, andmiscalleneous names. We used EuroWordNet, a multilingual wordnet, and DutchWikipedia as intermediaries for the two alignments. EuroWordNet covers most ofthe subject terms in the thesaurus, but the organization of the cross-linguallinks makes selection of the most appropriate English target term almostimpossible. Precision and recall of the automatic alignment with WordNet forsubject terms is 0.59. Using page titles, redirects, disambiguation pages, andanchor text harvested from Dutch Wikipedia gives reasonable performance onsubject terms and geographical terms. Many person and miscalleneous names inthe thesaurus could not be located in (Dutch or English) Wikipedia. Precisionfor miscellaneous names, subjects, persons and locations for the alignment withWikipedia ranges from 0.63 to 0.94, while recall for subject terms is 0.62.
Language Multilinguality
Topics Lexicon, lexical database, Ontologies, Multilinguality
Full paper Cross-lingual Ontology Alignment using EuroWordNet and Wikipedia
Bibtex @InProceedings{BOUMA10.175,
  author = {Gosse Bouma},
  title = {Cross-lingual Ontology Alignment using EuroWordNet and Wikipedia},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA