Optimizing the performance of a server-based classification for a large business document flow

The problem of categorizing documents in a large business document flow is considered. Textual and visual embeddings were used for classification. Textual embeddings were extracted via the Tesseract OCR engine, and visual embeddings were generated with the Viola-Jones method. This paper descr...

Bibliographic Details
Published in: Системный анализ и прикладная информатика (System Analysis and Applied Information Science)
Main Author: O. A. Slavin
Format: Article
Language: English
Published: Belarusian National Technical University 2023-02-01
Online Access: https://sapi.bntu.by/jour/article/view/595