Summary of the paper

Title A Swedish Scientific Medical Corpus for Terminology Management and Linguistic Exploration
Authors Dimitrios Kokkinakis and Ulla Gerdin
Abstract This paper describes the development of a new Swedish scientific medicalcorpus. We provide a detailed description of the characteristics of this newcollection as well results of an application of the corpus on term managementtasks, including terminology validation and terminology extraction. Althoughthe corpus is representative for the scientific medical domain it still coversin detail a lot of specialised sub-disciplines such as diabetes andosteoporosis which makes it suitable for facilitating the production of smallerbut more focused sub-corpora. We address this issue by making explicit somefeatures of the corpus in order to demonstrate the usability of the corpusparticularly for the quality assessment of subsets of official terminologiessuch as the Systematized NOmenclature of MEDicine - Clinical Terms (SNOMED CT).Domain-dependent language resources, labelled or not, are a crucial keycomponents for progressing R&D in the human language technology field sincesuch resources are an indispensable, integrated part for terminologymanagement, evaluation, software prototyping and design validation and aprerequisite for the development and evaluation of a number of sublanguagedependent applications including information extraction, text mining andinformation retrieval.
Language
Topics Corpus (creation, annotation, etc.), Evaluation methodologies
Full paper A Swedish Scientific Medical Corpus for Terminology Management and Linguistic Exploration
Bibtex @InProceedings{KOKKINAKIS10.60,
  author = {Dimitrios Kokkinakis and Ulla Gerdin},
  title = {A Swedish Scientific Medical Corpus for Terminology Management and Linguistic Exploration},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA