Published September 20, 2004 | Version v1
Dataset Restricted

Tübinger Partiell Geparstes Korpus des Deutschen/Schriftsprache

Description

TüPP-D/Z is a collection of articles from the taz newspaper ("die tageszeitung") which have been automatically annotated with clause structure, topological fields, and chunks, in addition to more low level annotation including parts of speech and morphological ambiguity classes.  All texts have been processed automatically, starting from paragraph, sentence and token segmentation. Word forms include information about some regular types of named entities,  including dates, telephone numbers, and number/unit combinations.

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Additional details

Additional titles

Alternative title (English)
Tübingen Partially Parsed Corpus of Written German

Data quality

Accuracy

Not specified.

Completeness

Not specified.

Conformity

Not specified.

Consistency

Not specified.

Credibility

Not specified.

Processability

Not specified.

Relevance

Not specified.

Timeliness

Not specified.

Understandability

Not specified.