First Open Model for Latvian Handwritten Text Recognition Created at the University of Latvia

Author
Latvijas Universitāte

June 2, 2026

technologies social sciences

The first open text recognition model for Latvian handwriting has been created on the Transkribus platform. Developed for the research and digitisation of Latvian handwritten manuscripts, the model will be useful to anyone working with 19th-century Latvian handwritten materials.

csm_19th-century-latvian-model_55fb055276.png
Image on the Transkribus platform

The model was trained using materials from the collection of the Scientific Commission of the Riga Latvian Society, preserved at the Archives of Latvian Folklore (ALF, Institute of Literature, Folklore and Art, University of Latvia). This is the oldest and most extensive collection at ALF, containing unique manuscripts that cover various folklore genres, ethnographic records, linguistic materials, place-name documentation, songs, riddles, word explanations, and other testimonies of traditional culture, dialects, and Latvian cultural history.

The model was trained using artificial intelligence technologies and previously prepared manuscript transcriptions produced by the ALF volunteer community. The character error rate (CER) of the model published in Transkribus is 4.83%; it was trained on 2,671 pages, covering more than 367,000 words and 132,000 lines of text.

The model was developed as part of the University of Latvia-funded project ȬPEN: Open Knowledge Ecosystems for the Advancement of Citizen Science (ZDA-LIP 2025/2). Its development brings together the University of Latvia Digital Humanities Centre, the University of Latvia Library, Transkribus, and the Institute of Literature, Folklore and Art of the University of Latvia Archives of Latvian Folklore.

“This model is an important step towards expanding access to Latvia’s handwritten heritage. It not only accelerates manuscript transcription, but also opens up new opportunities for research, the creation of digital collections, and public participation in exploring cultural heritage. It is especially important that the model is relatively open. All registered Transkribus users can use it in their own projects and continue improving it,” says project leader and head of the University of Latvia Digital Humanities Centre Sanita Reinsone.

Work on the project is continuing with the development of a text recognition model for 20th-century Latvian handwriting, which will further expand the possibilities for automated recognition of Latvian handwritten materials. In addition, a new tool is currently being developed to make it easy for anyone to try handwritten text transcription and image description using the best-known large language models.

The model is available on the Transkribus platform under the name “Latvian 19th century”:

https://www.transkribus.org/models/latvian-19th-century 
 

Recommended articles

technologies science

Agreement reached to develop unmanned systems and dual‑use technologies in Latvia

To create an environment where unmanned systems, autonomous technologies, and other innovative solutions can be tested under real conditions, while at the same time promoting cooperation between companies, research organisations, academia, and international partners, the “Baltic Forces Hub” will be…

Rīgas Tehniskā universitāte

June 3, 2026

technologies public health

Four “BioPhoT” projects have earned the opportunity to present at the “Deep Tech Startup Pitch Competition 2026”

On May 14 and 15, four “BioPhoT” projects will represent Latvia’s science and innovation ecosystem at the international level – in the international startup presentation competition “Deep Tech Startup Pitch Competition 2026” at Northern Europe’s leading deep technology conference “Deep Tech Atelier…

BioPhoT

May 13, 2026

innovation technologies

Deep tech promises a million jobs: how the war in Ukraine accelerated innovation?

Expert estimates indicate that the deep tech, or knowledge‑intensive technologies, sector in Europe could reach a value of nearly one trillion euros by 2030 and create up to one million new jobs. Meanwhile, the Organisation for Economic Co‑operation and Development (OECD) currently highlights the d…

Labs of Latvia

May 12, 2026

technologies

Europe’s microchip industry leaders to gather in Riga for “Riga Chip Summit 2026

On 13 May from 9:00 to 17:00, the international microchip industry forum “Riga Chip Summit 2026” will take place at the VEF Chamber Hall “Spīdola”, Ieriķu Street 5a, bringing together industry leaders, investors, researchers and startups from across Europe. The event is organised by the Latvian Mic…

Riga Technical University

May 11, 2026