The Corpus of Contemporary American English (COCA) is the only large and ”representative” corpus of American English. COCA is probably the most widely-used corpus of English. It is related to many other corpora of English that were formerly known as the ”BYU Corpora”, and they offer unparalleled insight into variation in English.
For general terms and conditions for this and other corpora from BYU please see https://www.corpusdata.org/restrictions.asp
For more information about access rights see Mark Davies’ downloadable corpora at Kielipankki
Latest versions/subcorpora: | |
Corpus of Contemporary American English – Kielipankki VRT version 2020 Metadata and license Attribution instructions |
Download the resource |
Corpus of Contemporary American English – Kielipankki Korp version 2020 Metadata and license Attribution instructions |
Select the corpus in Korp |
Corpus of Contemporary American English – Kielipankki download version 2020 Metadata and license Attribution instructions |
Download the resource |
Corpus of Contemporary American English – Kielipankki Korp version 2017H1 Metadata and license Attribution instructions |
Select the corpus in Korp |
Corpus of Contemporary American English – Kielipankki download version 2017H1 Metadata and license Attribution instructions |
Download the resource |
Of this language corpus different versions/subcorpora are (or might be in the future) published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. The links to the different versions can be found from the list above.
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2017061921