ENGL4140 Research Methods: Corpus Linguistics Spring 2025
Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit
Research Methods: Corpus Linguistics
ENGL4140
Module Handbook
Spring 2025
Module Overview:
Welcome to Research Methods: Corpus Linguistics. Corpus linguistics involves the use of quantitative and qualitative methods in the analysis of large volumes of electronically-stored texts, including transcripts of spoken interaction, newspaper articles, literary texts and online communication.
Throughout the module you will be learning to use corpus analysis tools to investigate both freely available online corpora and a corpus you have made yourself. You may, for instance, examine newspaper articles that can be compiled through the Nexis database, use nineteenth century novels that are available on Project Gutenberg, or examine contemporary spoken interactions from around the UK. A key strength of corpus linguistics is its potential in different domains of applied linguistics research. As such, you will also learn to use different corpus software packages to analyse your own texts and pursue your own ideas, as well as considering issues of corpus design and compilation.
The module will enable you to study texts by combining your own intuitions and interpretations of language with evidence and analysis drawn from multimillion-word language datasets. While we will tackle some of the typical statistical processes involved in corpus linguistics, this will not be a statistics or coding-centred course. Rather, the application of corpus methods will mainly be motivated by research questions from discourse analysis.
Teaching format and session details
Teaching will consist of one 1-hour lecture plus one 1-hour workshop per week. The lecture will introduce key concepts and analytical approaches while the workshops will focus on hands-on practical tasks and discussion designed to give you experience of using corpus interfaces and methods to investigate different corpora.
Teaching times:
Lecture
· Monday 12-1pm – Physics C05
Workshops
· Tuesday 11am-12pm – Hallward Library LG101 (Daniel)
· Tuesday 12-1pm – Hallward Library LG101 (Daniel)
· Friday 12-1pm – Hallward Library LG101 (James)
· Friday 1-2pm – Hallward Library LG106 (James)
N.B. You will need your student card to access the Hallward Library.
Reading:
Much of the module’s preparatory reading comes from:
Baker, P. (2023). Using Corpora in Discourse Analysis (Second Edition). London: Bloomsbury.
Baker’s book is widely used in corpus linguistics modules across the world because of its considered and accessible account of specific corpus analysis techniques. Chapters from this book will be supplemented by additional articles/chapters provided on Moodle (indicated on p.4-5).
Weeks 4 and 7 each have more complex papers on corpus statistics that we recommend you read after the workshop, though you can read them before if you wish to.
Content:
Session 5 on ‘Corpus and Discourse Studies’ looks at mediated representations of the Syrian refugee crisis in the Daily Mail, a right-wing British newspaper. If you have any questions or concerns about this material then please contact me – [email protected].
Original session timetable and set reading:
|
Week beginning |
Theme |
Reading |
|
Week 1 27th January |
Corpus linguistics: Basics and background |
· Baker, chapter 1 |
|
Week 2 3rd February |
Corpus design |
· Baker, chapter 3 · Reppen (2010) ‘Building a corpus: what are the key considerations?. In A. O’Keeffe & M. McCarthy (Eds.) The Routledge Handbook of Corpus Linguistics, on Moodle. |
|
Week 3 10th February |
Collocation pt.1: Concordancing and the lexical item |
· Baker, chapter 5 |
|
Week 4 17th February |
Frequency and keywords |
· Baker, chapters 4 and 7 · Post workshop: Gabrielatos (2018) ‘Keyness analysis: nature, metrics and analysis’ on Moodle. |
|
Week 5 24th February |
Corpus and Discourse Studies |
· Baker et al. (2008) ‘A useful methodological synergy?...’ on Moodle. |
|
Week 6 3rd March |
Reading Week – complete take-home task this week |
|
|
Week 7 10th March |
Re-cap and collocation pt.2 |
· Baker, chapter 6 · Post workshop: Gablasova, Brezina & McEnery (2017) ‘Collocations in corpus-based language learning research’ on Moodle. |
|
Week 8 17th March |
Corpora and ELT (Dr Pawel Szudarski) |
· Hirata and Thompson (2022) 'Communicative data-driven learning: a two-year pilot study' on Moodle. · Post workshop: Szudarski (2023) 'Corpora and teaching vocabulary and phraseology' on Moodle. |
|
Week 9 24th March |
Learner corpora, complex searches and multiword units |
· Myles (2015) ‘Second language acquisition theory and learner corpus research’ on Moodle. |
|
Easter break |
||
|
Week 10 28th April |
Annotation and semantic domains |
Prentice (2010) ‘Using automated semantic tagging in CDA…’, on Moodle |
|
Week 11 5th May |
Essay advice: one-to-one meetings this week |
|
Assessment:
Deadline: TBC, but mid-May
As a second semester MA module, the assessment is kept deliberately wide open for you to write on a topic of your choosing:
Write a paper of 3000-3500 words on a topic of your own choice.
For your paper you should conduct a small-scale corpus analysis and demonstrate an awareness of the concepts and issues addressed in the module.
If you collect your own corpus (i.e. do not use one that is freely accessible on the web) then you should share your data on Sketch Engine with the module tutors ([email protected] and [email protected]) or by including a URL link from which the data can be downloaded in your essay.
Suggested areas for your essay are below. However, you do not have to stick with one of these and can propose another corpus-related topic if you wish. Identifying a focus for your essay will be appreciably easier if you endeavour to keep up with the reading and references you find at the end of the lecture slides for each week. Also check Moodle for further materials / advice.
We strongly recommend that you confirm your essay topic with your workshop tutor
Suggested essay formats:
§ Compare the patterns and uses of X in relevant corpora
- e.g. ‘Comparing the use of passive constructions in corpora of humanities and social science journal articles’; ‘Comparing the use of phrasal verbs in broadsheet and tabloid newspapers’; ‘Comparing the use of modal verbs in writing by L2 learners at different proficiency levels’.
- Make the title more concrete
- ‘X’ is a word or phrase that you choose
- This has to be an informed choice reflecting that you understand what kind of
variation in usage can be usefully described
- Normally two corpora for the comparison
§ An analysis of the discourse of X in a corpus of Y
- e.g. ‘An analysis of discourses of climate change in FTSE100 companies’ CSR reports’; ‘An analysis of ‘woke’ in British tabloid media’.
- The assignment needs to provide a motivation for why X is a good concept to define a discourse, and why Y is the right choice of data to investigate this discourse
- To find appropriate criteria for the analysis of this discourse you need to draw on literature that you discuss in your research context section.
§ Discuss the relevance and applicability of corpus methods to the EFL classroom
- This essay should not just be a literature review!
- You will need to find your own corpus examples to discuss in support of your arguments
- In the discussion of concrete examples you need to refer to
- the teaching context into which these examples would be embedded
- the rationale for choosing them
- the aims that you think could be achieved with them
- If you are a teacher you can write about your own situation, but avoid doing so in a narrative style.
§ Conduct a corpus stylistic analysis of a text of your own choice
- To find an appropriate focus for the analysis you may wish to either:
- focus on the applicability of a method / approach from this module
or
- start from a claim based on your research of literary criticisms for the text you chose, then choose the appropriate method to find linguistic evidence to defend or refute the claim.
§ Any other topic related to the content of this module, subject to approval by your module tutor.
2025-05-14