Published March 24, 2024 | Version v1
Dataset Open

German Adjective-Noun Co-occurrences with Attributes

  • 1. ROR icon University of Tübingen

Contributors

Data collector:

  • 1. ROR icon University of Tübingen

Description

The dataset contains 3305 adjective-noun pairs extracted from the GerCo data set [1] and enriched with context sentences as described in [2]. All the phrases have been annotated by two experts with attributes from an attribute inventory of 49 relations. Each noun is also manually annotated with its semantic class.

The corresponding citation (doctoral thesis) will be added in the next version of the dataset.

References:

[1]: Hinrichs, E., Klein, W., Strakatova, Y., & Fuhrmann, I. (2017). GerCo: German Adjective-Noun Collocations Datase [Data set]. University of Tübingen. https://doi.org/10.57754/FDAT.rr563-my238

[2]: Strakatova, Y., Falk, N., Fuhrmann, I., Hinrichs, E., and Rossmann, D. (2020). All That Glitters is Not Gold: A Gold Standard of Adjective-Noun Collocations for German. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 4368–4378, Marseille, France. European Language Resources Association. https://aclanthology.org/2020.lrec-1.538

Files

GerCoAt.txt

Files (632.1 kB)

Name Size Download all
md5:566b9c5d5f7a19a23251f4200986c71a
630.3 kB Preview Download
md5:cb85e2e283415880282cb5e91eb6b15f
1.9 kB Preview Download

Additional details

Related works

Is derived from
Dataset: 10.57754/FDAT.rr563-my238 (DOI)

Funding

Deutsche Forschungsgemeinschaft
Modellierung lexikalisch-semantischer Beziehungen von Kollokationen (MoKo) 322096725

Data quality

Accuracy

Not specified.

Completeness

Not specified.

Conformity

Not specified.

Consistency

Not specified.

Credibility

Not specified.

Processability

Not specified.

Relevance

Not specified.

Timeliness

Not specified.

Understandability

Not specified.