Summary of the paper

Title Experimental Deployment of a Grid Virtual Organization for Human Language Technologies
Authors Jan Jona Javoršek and Tomaž Erjavec
Abstract We propose to create a grid virtual organization for human language technologies, at first chiefly with the task of enabling linguistic researchers to use existing distributed computing facilities of the European grid infrastructure for more efficient processing of large data sets. After a brief overview of modern grid computing, a number of common use-cases of natural language processing tasks running on the grid are presented, notably corpus annotation with morpho-syntactic tagging (600+ million-word corpus annotated in less than a day), $n$-gram statistics processing of a corpus and creation of grid-backed web-accessible services with annotation and term-extraction as examples. Implementation considerations and common problems of using the grid for this type of task are laid out. We conclude with an outline of a simple action plan for evolving the infrastructure created for these experiments into a fully functional Human Language Technology grid Virtual Organization with the goal of making the power of European grid infrastructure available to the linguistic community.
Language LR Infrastructures and Architectures
Topics Tools, systems, applications, Corpus (creation, annotation, etc.), LR Infrastructures and Architectures
Full paper Experimental Deployment of a Grid Virtual Organization for Human Language Technologies
Bibtex @InProceedings{JAVOREK10.899,
  author = {Jan Jona Javoršek and Tomaž Erjavec},
  title = {Experimental Deployment of a Grid Virtual Organization for Human Language Technologies},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
 }