Recognising the contents in digitised financial documents

Simas Rimašauskas; Igoris Belovas

doi:10.15388/LMITT.2025.22

Simas Rimašauskas

Vilnius University

Igoris Belovas

Vilnius University

Published 2025-05-12

https://doi.org/10.15388/LMITT.2025.22

Keywords

machine learning
natural language processing
optical character recognition
text recognition
table recognition

How to Cite

Rimašauskas, S. and Belovas, I. (2025) “Recognising the contents in digitised financial documents”, Vilnius University Open Series, pp. 187–196. doi:10.15388/LMITT.2025.22.

Abstract

The need for content recognition in digital documents is ever-increasing in the financial sector. Extracted data is used for fundamental analysis, modelling and portfolio selection. In the most prominent markets, the data can be obtained easily from a wide array of available sources, such as SEC filings. However, this is not the case in markets with less investor interest, such as the CEE region or Latin America. Often, the only sources containing the data are the primary reports published by the companies themselves, and the scarce secondary sources may provide data of dubious reliability. This leads to an excessive workload for analysts and implies the need to adapt existing intelligent methods for processing financial data.

This work is licensed under a Creative Commons Attribution 4.0 International License.
