Published August 19, 2020 | Version v1
Dataset Open

Subject-object information in word embeddings (subj-obj-embeds)

  • 1. University of Tübingen

Description

These datasets were originally built for an experiment testing whether word embeddings encode information indicating that a word is more likely to be a subject or an object. The datasets were extracted from the TüBa-D/Z treebank with UD and Hamburg dependency annotations (for German) and the Lassy Small treebank (for Dutch).

Other (English)

Research carried out in work package A03 of the SFB 833.

Files

CMDI.xml

Files (95.3 MB)

Name Size
md5:d3dab98e1fd2d2abfed7ad95e5416072 14.2 kB
md5:4346a06e88884fb598bea82be420615d 95.3 MB

Additional details

Funding

Deutsche Forschungsgemeinschaft
SFB 833: Bedeutungskonstitution - Dynamik und Adaptivität sprachlicher Strukturen 75650358

Data quality

Accuracy

Not specified.

Completeness

Not specified.

Conformity

Not specified.

Consistency

Not specified.

Credibility

Not specified.

Processability

Not specified.

Relevance

Not specified.

Timeliness

Not specified.

Understandability

Not specified.