Published February 22, 2011 | Version v1
Dataset Open

Approx. 17.000 answers to reading comprehension questions (CREG-17K r6478)

Description

Learner corpus consisting of answers to German reading comprehension questions written by American college students learning German. Along with the learner answers, the collection includes the reading comprehension questions, and the target answers that teachers prepare as reference for the grading process. The meaning of each learner answer is assessed by two independent annotators. Meaning assessment is done using a binary classification (correct vs. incorrect) as well as using a richer set of diagnosis categories encoding the nature of the divergence from the target answers specified by the teachers. Following Bailey and Meurers (2008), we distinguish "missing concept", "extra concept", "blend" (missing concept and extra material), and "non-answer" for answers which are unrelated to the topic under discussion.

Answers may not be rated by two annotators, but by none ore one instead, this is a full snapshot. The annotation furthermore includes an automatic question form detection which is not really reliable

Other (English)

Research carried out in work package A04 of the SFB 833.

Files

CMDI.xml

Files (10.2 MB)

Name Size Download all
md5:8c5dd6ff7c7356f531bd108d9c44b44e
589 Bytes Download
md5:93209bfc1a0c978cda0b9a82c0aaf66a
30.2 kB Preview Download
md5:0652f26c2d0c60bf0d03469c492140b1
5.5 MB Preview Download
md5:5e198e1ffe20843ab684b515201cd61c
23.4 kB Preview Download
md5:2b8e33a7cdafc0bb9b300d5c1073ff58
4.0 MB Preview Download
md5:3f960f9255fcd4ef3d2788fe0cdecfaa
676.9 kB Preview Download
md5:7bf1dc55704c42ebeec877abcc57f8d0
1.9 kB Preview Download

Additional details

Funding

Deutsche Forschungsgemeinschaft
SFB 833:  Bedeutungskonstitution - Dynamik und Adaptivität sprachlicher Strukturen 75650358

Data quality

Accuracy

Not specified.

Completeness

Not specified.

Conformity

Not specified.

Consistency

Not specified.

Credibility

Not specified.

Processability

Not specified.

Relevance

Not specified.

Timeliness

Not specified.

Understandability

Not specified.