Title |
Corpus Based Analysis for Multilingual Terminology Entry Compounding |
Authors |
Andrejs Vasiljevs and Kaspars Balodis |
Abstract |
This paper proposes statistical analysis methods for improving terminology entry compounding. Terminology entry compounding is a mechanism that identifies matching entries across multiple multilingual terminology collections. Bilingual or trilingual term entries are unified into a compounded multilingual entry. We suggest that corpus analysis can improve entry compounding results by analysing the contextual terms of a given term in the corpus data. The proposed algorithm is described and implemented in an experimental setup. Results of an experiment on compounding Latvian and Lithuanian terminology resources are provided. These results encourage further research for different language pairs and in different domains. |
Language |
Multilinguality |
Topics |
Lexicon, lexical database, LR Infrastructures and Architectures, Multilinguality |
Full paper  |
Corpus Based Analysis for Multilingual Terminology Entry Compounding |
Bibtex |
@InProceedings{VASILJEVS10.855,
  author    = {Andrejs Vasiljevs and Kaspars Balodis},
  title     = {Corpus Based Analysis for Multilingual Terminology Entry Compounding},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year      = {2010},
  month     = {may},
  date      = {19-21},
  address   = {Valletta, Malta},
  editor    = {{Nicoletta Calzolari (Conference Chair)} and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn      = {2-9517408-6-7},
  language  = {english}
} |