Computerlinguistische Implementierung (Urdu/Hindi)
- AG Butt (Allgemeine Sprachwissenschaft mit Schwerpunkt Computerlinguistik)
(2013): ParGramBank : The ParGram Parallel Treebank Proceedings of the 51 st Annual Meeting of the Association for Computational Linguistics, 4 - 9 August 2013, Sofia, Bulgaria. Stroudsburg, PA: Association for Computational Linguistics, 2013, pp. 550-560 |
This paper discusses the construction of a parallel treebank currently involving ten languages from six language families. The treebank is based on deep LFG (Lexical- Functional Grammar) grammars that were developed within the framework of the ParGram (Parallel Grammar) effort. The grammars produce output that is maximally parallelized across languages and language families. This output forms the basis of a parallel treebank covering a diverse set of phenomena. The treebank is publicly available via the INESS treebanking environment, which also allows for the alignment of language pairs. We thus present a unique, multilayered parallel treebank that represents more and different types of languages than are available in other treebanks, that represents deep linguistic knowledge and that allows for the alignment of sentences at several levels: dependency structures, constituency structures and POS information. Forschungszusammenhang (Projekte) |
Laufzeit: | 01.03.2009 – 30.04.2013 |