Published December 13, 2011 | Version v1
Dataset Restricted

TüBa-DDC - Tübinger Baumbank des Deutschen - Diachrones Corpus

Description

Creation tools: OpenNLP Tokenizer, TCF 0.3 Sentence Border Detecter (in-house tool for detecting sentence borders), Part-of-speech tagger and lemmatiser, Berkeley Parser (constituent parser), and German Named Entity Recognizer.

Source: Gutenberg-DE Edition 11 DVD-ROM

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Additional details

Additional titles

Alternative title (English)
TüBa-DDC - Tübingen Treebank of Written German - Diachronic Corpus

Data quality

Accuracy

Not specified.

Completeness

Not specified.

Conformity

Not specified.

Consistency

Not specified.

Credibility

Not specified.

Processability

Not specified.

Relevance

Not specified.

Timeliness

Not specified.

Understandability

Not specified.