Summary of the paper

Title Processing and Extracting Data from Dicionário Aberto
Authors Alberto Simões, José João Almeida and Rita Farinha
Abstract Synonyms dictionaries are useful resources for natural language processing.Unfortunately their availability in digital format is limited, as publishingcompanies do not release their dictionaries in open digital formats.Dicionário-Aberto (Simões and Farinha, 2010) is an open and free digitalsynonyms dictionary for the Portuguese language. It is under public domain andin textual digital format, which makes it usable for any task. Synonyms dictionaries are commonly used for the extraction of relations betweenwords, the construction of complex structures like ontologies or thesaurus(comparable to WordNet (Miller et al., 1990)), or just the extraction of listsof words of specific type. This article will present Dicionário-Aberto, discussing how it was created,its main characteristics, the type of information present on it and the formatsin which it is available. Follows the description of an API designedspecifically to help Dicionário-Aberto processing without the need to tacklewith the dictionary format. Finally, we will analyze the results on some dataextraction experiments, extracting lists of words from a specific class, andextracting relationships between words.
Language Lexicon, lexical database
Topics Information Extraction, Information Retrieval, Knowledge Discovery/Representation, Lexicon, lexical database
Full paper Processing and Extracting Data from Dicionário Aberto
Bibtex @InProceedings{SIMES10.90,
  author = {Alberto Simões, José João Almeida and Rita Farinha},
  title = {Processing and Extracting Data from Dicionário Aberto},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA