Module QXL-3377:
Fundamentals of Corpus Linguis
Fundamentals of Corpus Linguistics 2025-26
QXL-3377
2025-26
School of Arts, Culture And Language
Module - Semester 2
20 credits
Module Organiser:
Christopher Shank
Overview
This module provides students with a comprehensive understanding of corpora in linguistic studies, covering both theoretical and practical aspects. It aims to equip students with the necessary skills to employ corpus-based approaches in their research projects. The course introduces technical aspects, delves into diverse linguistic studies utilizing corpora, and emphasizes discussions on methodologies, results, and implications in tutorials.
Topics covered include corpus linguistics, analysis techniques like concordance and keyword analysis, statistical integration, grammatical, synchronic, and diachronic studies, sociolinguistics, critical discourse analysis, language variation, translation studies, and language education, particularly in TEFL (Teaching English as a Foreign Language).
Students in this module, depending on assessment topics, research question(s) and methodologies will have to opportunity to utilize the department labs, specialized software and resources, when carry out their assessments. This can include access to and the use of; high-level statistical modelling and analysis software and a wide range of concordance software and specialised corpora for many languages.
This module introduces students to the theoretical and practical issues of using corpora in linguistic studies and helps them to develop the background, knowledge and skills needed in order to develop and utilize a corpus based approach in their own research projects. The goals of this module are two-fold. First the students will be introduced and become familiar with the technical aspects of course based approaches and research. Then, attention will be directed to looking at how corpora and corpuses based approaches are used in a range of linguistic and language oriented studies. The lectures will provide students with the “big picture”, i.e. different research domains will be explored, central topics are summarized, important studies discussed and open questions outlined. In the tutorials, students discuss key studies in detail and reflect on methodologies, results and implications.
The following topics will be covered:
- Introducing corpus linguistics, corpus design, types of corpora and corpus annotation
- Corpus analysis: concordance, wordlist, keyword analysis
- Integrating stats and making statistic claims
- Corpora in grammatical studies
- Corpora in diachronic studies
- Metaphor and Corpus Linguistics
- Corpora in critical discourse analysis
- Corpora in language variation research
- Corpora and translation studies / research
- Corpora in sociolinguistic studies / research
- Corpora in language education - focus on TEFL.
Assessment Strategy
Threshold (D- to D+) Submitted work is adequate and shows an acceptable level of competence as follows: 1.Generally accurate but with omissions and errors.2.Assertions are made without clear supporting evidence or reasoning.3.Has structure but is lacking in clarity and therefore relies on the reader to make links and assumptions.4.Draws on a relatively narrow range of material.
Good (C- to B+) Submitted work is competent throughout and may be distinguished by superior style, approach and choice of supporting materials. It: 1.Demonstrates good or very good structure and logically developed arguments.2.Draws at least in parts on material that has been sourced and assessed as a result of independent study, or in a way unique to the student.3.Assertions are backed by evidence and sound reasoning.4.Accuracy and presentation in an appropriate academic style.
Excellent (A- to A*) Submitted work is of an outstanding quality and excellent in one or more of the following ways: 1.Has originality of exposition with the student’s own thinking being readily apparent.2.Provides clear evidence of extensive and relevant independent study.3.Arguments are laid down with clarity and provide the reader with successive stages of consideration to reach conclusions.
Learning Outcomes
- Differentiate between types of research and research questions commonly used in corpus-based approaches to language variation, use, form, register and style.
- Distinguish the basic principles underlying the scientific method in general and empirical data driven approach in particular.
- Evaluate and analyse corpus-based studies in linguistics and related disciplines that examine language use, reference, categorization, comprehension, cognition and synchronic and diachronic change and or patterns.
- Evaluate and apply the core concepts, ideas, terminology, and approaches relating to the use of corpora and corpus-based approaches in the study of language
Assessment method
Coursework
Assessment type
Summative
Description
Assignment #1 Take home CORPUS based data analysis using the BYU corpora This is a take home analysis / assignment, and it will be handed out electronically via email and Blackboard. The objective is to give you some practice working with a well know online corpora, using search syntax, and answering some basic questions about search term frequencies and the effects of different variables. The report generated will be informal and you will largely be reporting your findings. This assessment is worth 10% of the final mark.
Weighting
10%
Due date
12/02/2024
Assessment method
Coursework
Assessment type
Summative
Description
Assignment #2 Replication study The objective of this assignment is to ‘replicate’ a previously published corpus-based paper on a topic and in a subject of your interest. The goal of this assessment is to give you some structured hands-on practice to address a research question or questions using a corpus-based methodology. This assessment is worth 30% of the final mark.
Weighting
30%
Due date
15/04/2024
Assessment method
Coursework
Assessment type
Summative
Description
Assignment #3. Final Assessment - Corpus based research paper. The goal of this assessment is to develop a research question or questions that can be explored, and answered, via a data driven approach, using a corpus-based methodology and corpus resources. This research paper will follow a standard; an introduction that sets up the background for your research question and the rationale for it, a review of literature, a statement of the research questions, a methodology section, a results / discussion section, and a short conclusion. It will be worth 60% of the final mark.
Weighting
60%
Due date
17/05/2023