Published August 19, 2020
| Version v1
Dataset
Open
Subject-object information in word embeddings (subj-obj-embeds)
Description
This set of datasets was initially built for the experiment to find out wether embeddings encoded the information suggesting a word is more likely to be the subject or object. The datasets were extracted from German TüBa-D/Z treebank with UD and Hamburg dependency annotations (for German) and the Lassy small treebank (for Dutch).
Other (English)
Research carried out in work package A03 of the SFB 833.
Files
CMDI.xml
Files
(95.3 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:d3dab98e1fd2d2abfed7ad95e5416072
|
14.2 kB | Preview Download |
|
md5:4346a06e88884fb598bea82be420615d
|
95.3 MB | Preview Download |