Title |
Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation |
Authors |
Kazuaki Maeda, Haejoong Lee, Stephen Grimes, Jonathan Wright, Robert Parker, David Lee and Andrea Mazzucchi |
Abstract |
Linguistic Data Consortium (LDC) at the University of Pennsylvania has participated as a data provider in a variety of government-sponsored programs that support development of Human Language Technologies. As the number of projects increases, the quantity and variety of the data LDC produces have increased dramatically in recent years. In this paper, we describe the technical infrastructure, both hardware and software, that LDC has built to support these complex, large-scale linguistic data creation efforts at LDC. As it would not be possible to cover all aspects of LDC's technical infrastructure in one paper, this paper focuses on recent development. We also report on our plans for making our custom-built software resources available to the community as open source software, and introduce an initiative to collaborate with software developers outside LDC. We hope that our approaches and software resources will be useful to the community members who take on similar challenges. |
Language |
|
Topics |
Tools, systems, applications, LR Infrastructures and Architectures |
Full paper  |
Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation |
Bibtex |
@InProceedings{MAEDA10.857,
  author    = {Kazuaki Maeda and Haejoong Lee and Stephen Grimes and Jonathan Wright and Robert Parker and David Lee and Andrea Mazzucchi},
  title     = {Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year      = {2010},
  month     = {may},
  date      = {19-21},
  address   = {Valletta, Malta},
  editor    = {{Nicoletta Calzolari (Conference Chair)} and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn      = {2-9517408-6-7},
  language  = {english}
} |