Corpora Available with an MSU login
- The MSU Libraries’ licensed and purchased Linguistic Corpora and Tools page, which requires an MSU login to access, currently features 76 corpora and associated tools.
Learner Corpora
The following corpora may or may not require registration, university permission, or payment, to access.
- Chinese Learner Corpus
- English Learner Corpora
- CCLE: Corpus of Chinese Learner English
- CROW: Corpus & Repository of Writing
- Gachon Korean Learner Corpus
- ICNALE: The International Corpus Network of Asian Learners of English
- MICUSP: Michigan Corpus of Upper Level Student Papers
- The LTTC English Learner Corpus (LTTC-ELC)
- UCLouvain Corpora:
- WRICLE: Written Corpus of Learner English
- French Learner Corpora
- German Learner Corpus
- Spanish Learner Corpus
- Russian Learner Corpus
Non-Learner Corpora
The following corpora may or may not require registration, university permission, or payment, to access.
- Arabic
- Bosnian
- Chinese
- English
- French
- German
- Greek
- Japanese
- Korean
- Italian
- Spanish
- Swedish
- Thai
- Turkish
Field Specific Corpora
- Legal Corpora
Free and Fee-Based Corpus Tools
Free Tools
Fee-Based Tools
Corpus Tools/Sites For Teachers
Statistical Resources
Corpus Linguistics Journals
Professional Organizations and Other Corpus Labs
Professional Organizations
Other Corpus Labs
Instructional Videos
Corpus-Related Talks
- Introduction to Machine Learning in R
- Introduction to Topic Models With R
- Introduction to Python for Corpus Research
- Automatically Assessing Lexical Features in Learner Corpora
- How can learner corpora help us better understand second language development?
- Corpus-based curriculum development for ESP: needs analysis, materials development, assessment and evaluation
- Using large corpora to look at genre-based, dialectal, and (especially) historical change in language
- Corpus linguistics informing the language classroom
Video Tutorials
- Tutorial on The Corpus of Contemporary American English’s Newest 2020 Features
- DukeWrites Enrichment Suite COCA (2020) Tutorials
- Introduction to The Corpus of Contemporary American English (COCA)
- The AntConc Youtube Channel
- Tutorial on the Python Natural Language Toolkit (NLTK)
- Tutorial For Using Sketch Engine
- R For Corpus Linguistics