Published September 16, 2020 | Version v1
Dataset Open

TüBa-D/DP: Wikipedia.

  • 1. ROR icon University of Tübingen

Description

A treebank  (dependency) for a Wikipedia corpus in CONLL format. All data is compressed via zstd (see https://facebook.github.io/zstd/).

Other

Research carried out in work package A03 of the SFB 833.

Files

CMDI.xml

Files (13.1 GB)

Name Size Download all
md5:e24ed35f0e9532bd182e50bbf7de3632
12.8 kB Preview Download
md5:46b9a111b852aaea47c5028a3281802e
13.1 GB Download
md5:df1ad935971abdb2b5225770047cb584
57.0 kB Preview Download

Additional details

Additional titles

Alternative title
Wikipedia subcorpus of the Tübingen treebank of dependency-parsed German

Funding

Deutsche Forschungsgemeinschaft
SFB 833:  Bedeutungskonstitution - Dynamik und Adaptivität sprachlicher Strukturen 75650358

Data quality

Accuracy

Not specified.

Completeness

Not specified.

Conformity

Not specified.

Consistency

Not specified.

Credibility

Not specified.

Processability

Not specified.

Relevance

Not specified.

Timeliness

Not specified.

Understandability

Not specified.