Computational Linguistic Implementation (Urdu/Hindi)

  • WG (Butt General and Computational Linguistics)
  Sulger, Sebastian; Butt, Miriam; King, Tracy Holloway; Meurer, Paul; Laczkó, Tibor; Rákosi, György; Bamba Dione, Cheikh; Dyvik, Helge; Rosén, Victoria; De Smedt, Koenraad; Patejuk, Agnieszka; Çetinoğlu, Özlem; Arka, I Wayan; Mistica, Meladel (2013): ParGramBank: The ParGram Parallel Treebank. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 4-9 August 2013, Sofia, Bulgaria. Stroudsburg, PA: Association for Computational Linguistics, 2013, pp. 550-560

ParGramBank: The ParGram Parallel Treebank


This paper discusses the construction of a parallel treebank currently involving ten languages from six language families. The treebank is based on deep LFG (Lexical-Functional Grammar) grammars that were developed within the framework of the ParGram (Parallel Grammar) effort. The grammars produce output that is maximally parallelized across languages and language families. This output forms the basis of a parallel treebank covering a diverse set of phenomena. The treebank is publicly available via the INESS treebanking environment, which also allows for the alignment of language pairs. We thus present a unique, multilayered parallel treebank that represents more and different types of languages than are available in other treebanks, that represents deep linguistic knowledge and that allows for the alignment of sentences at several levels: dependency structures, constituency structures and POS information.

Origin (projects)

Funding sources
Name                              Funding type        Category                   Project no.
Deutsche Forschungsgemeinschaft   third-party funds   research funding program   410/09
Further information
Period: 01.03.2009 – 30.03.2012