Online course in Literature

Corpus Linguistics: Method, Analysis, Interpretation

Get a practical introduction to the methodology of corpus linguistics for researchers in the social sciences and humanities.

  • Duration 8 weeks
  • Weekly study 3 hours
  • Learn Free
  • Extra benefits: from $59

Learn how to build and query corpora in this practical course

On this course, you’ll get a practical introduction to corpus linguistics, an extremely versatile methodology of language analysis using computers.

Over eight weeks, you’ll build the skills necessary to collect and analyse large digital collections of text (corpora).

You’ll be introduced to a number of topics demonstrating the use of corpora in areas as diverse as discourse analysis, sociolinguistics, and language learning and teaching.

Skip to 0 minutes and 12 seconds: Language defines what we are. Some linguists argue it's the very essence of being human. It's the key skill that sets us apart from animals. Yet while we use language routinely, on a daily basis, it's something which we don't fully consciously comprehend. It's difficult to understand consciously the mechanics of language. Until recently, the sheer scale of language defied a comprehensive analysis, but that wasn't for want of evidence. As you can see, we're surrounded by the evidence for the use of language. There is a sea of words around me here, but of course, without suitable help, an analyst can drown in this sea of words, so we need to step out of the age of paper and ink.

Skip to 1 minute and 6 seconds: The computer has changed everything. For the first time, we're able to rapidly and reliably search through millions or even billions of words of data. At the same time, electronic publishing has made electronic language data, texts, available to us on a scale that's quite unprecedented. We can gather those texts together into a body of data called a corpus, the plural of which is corpora, that we use to study language on a computer. Now, the development of such corpora is leading to a golden age in the study of language. For the first time, as the vast collections of data become available, we can easily study language across a range of languages and even back through time.

Skip to 2 minutes and 2 seconds: By entering the digital age, analysts are able to search for patterns that would probably defy analysis by hand and eye alone. Take, for example, the word "tendencies". It's usually associated with negative things. Now, some of you may not have known that. Some of you may have suspected it. The great thing about using corpus data is you can look into the data. If you didn't know it, you're shown it. If you suspected it, you can confirm your suspicions. Now, this revolution in the study of language has probably touched on your everyday life already. Dictionaries, grammars, spell checkers, grammar checkers, speech synthesis systems, even web search engines, to some extent, rely on these insights into language provided by corpus data.

Skip to 2 minutes and 53 seconds: On this course, you'll learn about the range of applications of corpus data in the study of language both in linguistics and beyond it, in the social sciences for example. Importantly, you'll also get a sense of what it's like to study at Lancaster University. You'll have lectures, practical tasks, readings, additional lectures, and discussions available to you each week. So I welcome you to join me in this journey into language. I think you'll find it interesting. You'll certainly find it empowering because, by the end of the course, you too will be able to carry out some of these analyses on your own.

Skip to 3 minutes and 52 seconds: Language defines what we are.

What topics will you cover?

  • Introduction to corpus linguistics and basic techniques: concordancing
  • Further corpus techniques: collocation and keywords
  • Corpus-based discourse analysis
  • Building a corpus: tagging and processing data
  • Sociolinguistics: analysing BNC1994 and BNC2014
  • Textbook and dictionary construction
  • Language learning and corpus linguistics
  • Swearing extravaganza: looking at language and society
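As a rough illustration of the first two topics, the sketch below shows what concordancing (listing a word in its context) and a very simple form of collocation (counting the words that appear near it) might look like in Python. It is not taken from the course materials: the function names, the window sizes, and the sample sentences are all invented for illustration, and dedicated corpus tools typically use statistical association measures rather than the raw counts used here.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def concordance(tokens, node, width=3):
    """Return keyword-in-context lines: `width` tokens either side of each hit."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == node:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append(f"{left} [{tok}] {right}")
    return lines


def collocates(tokens, node, window=2):
    """Count the words occurring within `window` tokens of the node word."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok == node:
            neighbours = tokens[max(0, i - window):i] + tokens[i + 1:i + 1 + window]
            counts.update(neighbours)
    return counts


# Invented toy text for illustration only; this is not real corpus data.
SAMPLE = (
    "He showed violent tendencies as a child. "
    "The report noted suicidal tendencies in the group. "
    "Her violent tendencies worried the staff."
)

tokens = tokenize(SAMPLE)
for line in concordance(tokens, "tendencies"):
    print(line)
print(collocates(tokens, "tendencies").most_common(3))
```

On a real corpus of millions of words, the same idea lets an analyst check an intuition about a word against every context in which it occurs.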

When would you like to start?

Most FutureLearn courses run multiple times. Every run of a course has a set start date, but you can join it and work through it after it starts.

  • Available now
    This course started 16 Sep 2019

What will you achieve?

By the end of the course, you'll be able to...

  • Interpret corpus data using techniques such as concordancing, collocation and keywords.
  • Describe the main methodological underpinnings behind corpus linguistics.
  • Apply corpus linguistic techniques to the analysis of different types of data.
  • Collect your own corpora.
  • Design research studies using corpus methods.
  • Explain corpus methods as well as a range of applications of this versatile methodology.
  • Perform corpus analysis using a range of corpus tools such as #LancsBox, CQPweb, USAS and BNClab.

Who is the course for?

This course is designed for anyone with an interest in the study of language.

Who will you learn with?

Tony McEnery

Has been working for over 20 years to help pioneer new ways to use computers to analyse very large collections of language data.

Vaclav Brezina

Corpus linguist, lead developer of #LancsBox.

Who developed the course?

Lancaster University is a collegiate university, with a global reputation as a centre for research, scholarship and teaching with an emphasis on employability.

Join this course

Start this course for free, upgrade for extra benefits, or buy Unlimited to access this course and hundreds of other short courses for a year.

Free
$0

Join free and you will get:

  • Access to this course for 10 weeks

Upgrade
$59

Upgrade this course and you will get:

  • Access to this course for as long as it’s on FutureLearn
  • Access to this course’s tests as well as a print and digital Certificate of Achievement once you’re eligible

Unlimited (New!)
$239 for one year

Buy Unlimited and you will get:

  • Access to this course, and hundreds of other FutureLearn short courses and tests for a year
  • A printable digital Certificate of Achievement on all short courses once you’re eligible
  • The freedom to keep access to any course you've achieved a digital Certificate of Achievement on, for as long as the course exists on FutureLearn
  • The flexibility to complete your choice of short courses in your own time within the year
Find out more about upgrades or Unlimited.