Published December 13, 2011
| Version v1
Dataset
Restricted
TüBa-DDC - Tübinger Baumbank des Deutschen - Diachrones Corpus
Description
Creation tools: OpenNLP Tokenizer, TCF 0.3 Sentence Border Detecter (in-house tool for detecting sentence borders), Part-of-speech tagger and lemmatiser, Berkeley Parser (constituent parser), and German Named Entity Recognizer.
Source: Gutenberg-DE Edition 11 DVD-ROM
Files
Additional details
Additional titles
- Alternative title (English)
- TüBa-DDC - Tübingen Treebank of Written German - Diachronic Corpus