Published November 25, 2013 | Version v1
Dataset Restricted

British National Corpus

Description

The BNC contains about 100 million words: 90% written text and 10% orthographically transcribed speech. The written part includes, for example, extracts from regional and national newspapers, specialist periodicals and journals for all ages and interests, academic books and popular fiction, published and unpublished letters and memoranda, and school and university essays, among many other kinds of text. The spoken part consists of orthographic transcriptions of unscripted informal conversations (recorded by volunteers selected from different age groups, regions and social classes in a demographically balanced way) and spoken language collected in different contexts, ranging from formal business or government meetings to radio shows and phone-ins.

Other (English)

The BNC project was carried out and is managed by the BNC Consortium, an industrial/academic consortium led by Oxford University Press. The other members are the major dictionary publishers Longman (now Pearson Education) and Larousse Kingfisher Chambers; academic research centres at Oxford University Computing Services (OUCS, now IT Services) and the University Centre for Computer Corpus Research on Language (UCREL) at Lancaster University; and the British Library's Research and Innovation Centre.

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Additional details

Data quality

Accuracy

Not specified.

Completeness

Not specified.

Conformity

Not specified.

Consistency

Not specified.

Credibility

Not specified.

Processability

Not specified.

Relevance

Not specified.

Timeliness

Not specified.

Understandability

Not specified.