GerCoAt: German Adjective-Noun Co-occurrences with Attributes Dataset
---------------------------------------------------------------------

The dataset contains 3305 adjective-noun pairs extracted from the GerCo dataset [1] and enriched with context sentences as described in [2]. All phrases have been annotated by two experts with attributes from an inventory of 49 relations.

The database is a tab-separated text file, "GerCoAt.txt", with 6 columns:

- "ADJ": lemma of the adjective
- "NN": lemma of the noun
- "final_attribute": relation between a noun and its adjectival modifier
- "final_semclass": semantic class of the noun according to GermaNet [3,4]
- "status": collocation or free phrase (as described for dataset [1])
- "context": a context sentence containing the given adjective-noun phrase

References:

[1]: Hinrichs, E., Klein, W., Strakatova, Y., & Fuhrmann, I. (2017). GerCo: German Adjective-Noun Collocations Dataset [Data set]. University of Tübingen. https://doi.org/10.57754/FDAT.rr563-my238

[2]: Strakatova, Y., Falk, N., Fuhrmann, I., Hinrichs, E., and Rossmann, D. (2020). All That Glitters is Not Gold: A Gold Standard of Adjective-Noun Collocations for German. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 4368–4378, Marseille, France. European Language Resources Association. https://aclanthology.org/2020.lrec-1.538

[3]: Hamp, B. and Feldweg, H. (1997). "GermaNet - a Lexical-Semantic Net for German." Proceedings of the ACL workshop Automatic Information Extraction and Building of Lexical Semantic Resources for NLP Applications. Madrid.

[4]: Henrich, V. and Hinrichs, E. (2010). "GernEdiT - The GermaNet Editing Tool". Proceedings of the Seventh Conference on International Language Resources and Evaluation (LREC 2010). Valletta, Malta, pp. 2228-2235.
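
Usage sketch:

The six-column, tab-separated layout described above can be read with Python's standard library alone. The following is a minimal sketch, not an official loader. It assumes that "GerCoAt.txt" is UTF-8 encoded and that its first row is a header with the column names; if the file has no header row, pass has_header=False. The function names read_gercoat and count_by are hypothetical helpers introduced here for illustration.

```python
import csv
from collections import Counter

# Column names as listed in this README.
COLUMNS = ["ADJ", "NN", "final_attribute", "final_semclass", "status", "context"]


def read_gercoat(path, has_header=True):
    """Read the tab-separated file into a list of dicts keyed by column name.

    Assumption: the file is UTF-8 and, when has_header is True, its first
    row holds the column names and is skipped.
    """
    with open(path, encoding="utf-8", newline="") as handle:
        # QUOTE_NONE keeps quotation marks inside context sentences as
        # literal characters instead of treating them as CSV quoting.
        reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        rows = [row for row in reader if row]  # drop empty lines
    if has_header:
        rows = rows[1:]
    return [dict(zip(COLUMNS, row)) for row in rows]


def count_by(rows, column):
    """Count how often each value occurs in the given column."""
    return Counter(row[column] for row in rows)


# Example usage (requires the dataset file in the working directory):
#   rows = read_gercoat("GerCoAt.txt")
#   print(len(rows))
#   print(count_by(rows, "status").most_common())
#   print(count_by(rows, "final_attribute").most_common(10))
```

Keeping each row as a plain dict avoids third-party dependencies. The same file can also be loaded with a dataframe library if one is already in use.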