Summary of the paper

Title New Features in Spoken Language Search Hawk (SpLaSH): Query Language and Query Sequence
Authors Sara Romano and Francesco Cutugno
Abstract In this work we present further development of the SpLaSH (Spoken LanguageSearch Hawk) project. SpLaSH implements a data model for annotated speechcorpora integrated with textual markup (i.e. POS tagging, syntax, pragmatics) including a toolkit used to perform complex queries across speech and textlabels. The integration of time aligned annotations (TMA), represented makinguse of Annotation Graphs, with text aligned ones (TXA), stored ingeneric XMLfiles, are provided by a data structure, the Connector Frame, acting astable-look-up linking temporal data to words in the text. SpLaSH imposes a verylimited number of constraints to the data model design, allowing theintegration of annotations developed separately within the same dataset andwithout any relative dependency. It also provides a GUI allowing three typesof queries: simple query on TXA or TMA structures, sequence query on TMAstructure and cross query on both TXA and TMA integrated structures. In thiswork new SpLaSH features will be presented: SpLaSH Query Language (SpLaSHQL)and Query Sequence.
Language Discourse annotation, representation and processing
Topics Information Extraction, Information Retrieval, Knowledge Discovery/Representation, Discourse annotation, representation and processing
Full paper New Features in Spoken Language Search Hawk (SpLaSH): Query Language and Query Sequence
Bibtex @InProceedings{ROMANO10.484,
  author = {Sara Romano and Francesco Cutugno},
  title = {New Features in Spoken Language Search Hawk (SpLaSH): Query Language and Query Sequence},
  booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }
Powered by ELDA © 2010 ELDA/ELRA