Move to main content
Search

Written language corpora

The Institute has several written language corpora containing texts written in Finnish, ranging from the birth of written Finnish in the 16th century to the 21st century. Most of the collections contain texts published before 1900 due to copyright issues.

The Corpus of Old Literary Finnish contains texts created during the period of Swedish rule (from the 16th century to ca. 1810). It contains, for example, the complete works of Mikael Agricola, religious texts including the first translation of the Bible to Finnish, law texts, and secular material.

The texts in the Corpus of Early Modern Finnish date from 1809–1899. The text corpus is available on the web and contains literary works of various topics as well as magazines and newspapers, dictionaries, and single articles. There are also several collections of entry slips in the physical archives.

The corpus of writings from famous Finnish authors ranges from the late 19th century to the early 20th century. Modern Finnish is presented through a corpus of New Year’s Speeches of the President of the Republic, from 1935 to 2006.

Agricola ABCkiria kansi Kielikello
First page of Agricola's ABCkiria. Image: Kotus.