Title |
Semi-Automated Extension of a Specialized Medical Lexicon for French |
Authors |
Bruno Cartoni and Pierre Zweigenbaum |
Abstract |
This paper describes the development of a specialized lexical resource for aspecialized domain, namely medicine. First, in order to assess the linguisticphenomena that need to be adressed, we based our observation on a largecollection of more than 300'000 terms, organised around conceptual identifiers.Based on these observations, we highlight the specificities that such a lexiconshould take into account, namely in terms of inflectional and derivationalknowledge. In a first experiment, we show that general resources lack a largepart of the words needed to process specialized language. Secondly, we describean experiment to feed semi-automatically a medical lexicon and populate it withinflectional information. This experiment is based on a semi-automatic methodsthat tries to acquire inflectional knowledge from frequent endings of wordsrecorded in existing lexicon. Thanks to this, we increased the coverage of thetarget vocabulary from 14.1% to 25.7%. |
Language |
Controlled languages |
Topics |
Lexicon, lexical database, Morphology, Controlled languages |
Full paper  |
Semi-Automated Extension of a Specialized Medical Lexicon for French |
Bibtex |
@InProceedings{CARTONI10.420,
author = {Bruno Cartoni and Pierre Zweigenbaum}, title = {Semi-Automated Extension of a Specialized Medical Lexicon for French}, booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |