Published March 24, 2024 | Version v1
Dataset Open

German Adjective-Noun Co-occurrences with Attributes

  • 1. ROR icon University of Tübingen
Data collector:
Strakatova, Yana1
  • 1. ROR icon University of Tübingen

Description

The dataset contains 3305 adjective-noun pairs extracted from the GerCo data set [1] and enriched with context sentences as described in [2]. All the phrases have been annotated by two experts with attributes from an attribute inventory of 49 relations. Each noun is also manually annotated with its semantic class.

The corresponding citation (doctoral thesis) will be added in the next version of the dataset.

References:

[1]: Hinrichs, E., Klein, W., Strakatova, Y., & Fuhrmann, I. (2017). GerCo: German Adjective-Noun Collocations Datase [Data set]. University of Tübingen. https://doi.org/10.57754/FDAT.rr563-my238

[2]: Strakatova, Y., Falk, N., Fuhrmann, I., Hinrichs, E., and Rossmann, D. (2020). All That Glitters is Not Gold: A Gold Standard of Adjective-Noun Collocations for German. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 4368–4378, Marseille, France. European Language Resources Association. https://aclanthology.org/2020.lrec-1.538

Files

README.txt
Files (617.3 KiB)
Name Size
md5:cb85e2e283415880282cb5e91eb6b15f
1.8 KiB Preview Download
md5:566b9c5d5f7a19a23251f4200986c71a
615.5 KiB Preview Download

Additional details

Created:
March 25, 2024
Modified:
March 25, 2024