Digitisation of genealogical documents based on automatic text recognition technology
DOI:
https://doi.org/10.63009/lsrsi/4.2024.56
Keywords:
archive group, scanning, genealogical research, Optical Character Recognition, information technologies, automation
Abstract
This study aimed to examine existing approaches and technologies for the digitisation of genealogical documents, drawing on international experience, in order to support more efficient organisation of digitisation processes and mechanisms for archive groups, their centralised storage, accelerated genealogical research, and improved user accessibility. The digitisation of archives has become a critically important aspect of preserving cultural heritage, particularly in the context of Russia's military aggression against Ukraine. The introduction of automatic text recognition technology has contributed to the optimisation of this process, facilitating access to information and enhancing the efficiency of research, particularly in the field of genealogy. The study analysed the operating principles of optical character recognition, its advantages, the features of ready-made solutions, and the functionality of software based on this technology. The strategy for digitisation in Ukraine was assessed, along with the challenges facing the archival sector in terms of digitisation and access to archive groups. The research also examined the outcomes of implementing automatic text recognition in leading archives worldwide, as well as the capabilities of online archives that offer contextual search functions. Particular attention was given to the opportunities afforded to researchers through the integration of such systems into archival operations, notably the ease of locating required information, the increased speed of data processing, and the provision of round-the-clock access to archival resources regardless of users’ geographical location. The study also reviewed the research of scholars involved in the development and implementation of optical character recognition in archival institutions.
Drawing on international experience, the study identified the potential of modern optical character recognition technologies to modernise the archival sector in Ukraine, with positive implications for genealogical research and the preservation of cultural heritage. The practical value of the study lies in demonstrating the effectiveness of information technologies in improving the digitisation of archival documents and enhancing access to them. The proposed recommendations aim to optimise the organisation of digital archives, improve document storage and retrieval processes, and accelerate genealogical research. These developments will contribute to the preservation of cultural heritage and improve access to archival information for users.
License
Copyright (c) 2026 Артур Спектор

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).