Published September 6, 2012 | Version v1
Dataset Restricted

Diskursstruktur/Diskursrelationen in der TüBa-D/Z: Subkorpus der TüBa-D/Z mit Diskursstruktur und Diskursrelationen (release 8)

Description

This subcorpus of the treebank TüBa-D/Z contains discourse structures and discourse relations. For the study on automatic classification, we use instances of two German temporal connectives that can also carry a non-temporal discourse relation, namely `während´ and `nachdem. Our data set – the connective occurrences from the current extent of the TüBa-D/Z, totaling about 60 000 sentences – contains 294 instances of nachdem and 527 instances of während. Where available, we used the syntactic annotation from the treebank; in the remaining cases, we used a syntactic parser (Versley and Rehbein, 2009) to provide syntax trees for the feature extraction. This corpus is based on release 8.

Other (English)

Research carried out in work package A03 of the SFB 833.

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Additional details

Funding

Deutsche Forschungsgemeinschaft
SFB 833:  Bedeutungskonstitution - Dynamik und Adaptivität sprachlicher Strukturen 75650358

Data quality

Accuracy

Not specified.

Completeness

Not specified.

Conformity

Not specified.

Consistency

Not specified.

Credibility

Not specified.

Processability

Not specified.

Relevance

Not specified.

Timeliness

Not specified.

Understandability

Not specified.