The York-Toronto-Helsinki Parsed Corpus of Old English Prose

The York-Toronto-Helsinki Parsed Corpus of Old English Prose is a 1.5-million-word electronic corpus of Old English prose texts, annotated with the grammatical information needed for extensive linguistic analysis. The corpus can be searched automatically for abstract grammatical structures (such as relative clauses, subject-verb inversion, and expletive subjects) as well as for words and strings of words, giving quick access to the data needed to investigate virtually any aspect of the language of the period. As part of a series of grammatically annotated corpora that together cover the entire history of English, the corpus can also be used to study long-term changes in the language.

Principal investigator
Professor Anthony Warner
Principal project staff
Professor Anthony Warner; Professor Susan Pintzuk
Start date
Saturday, January 1, 2000
Completion date
Saturday, March 1, 2003
Source material
The base text of the corpus is the Toronto Dictionary of Old English Corpus. The text is enhanced with grammatical annotation.
Data formats