Optimizing the performance of a server-based classification for a large business document flow
The document categorization problem in the case of a large business document flow is considered. Textual and visual embeddings were employed for classification. Textual embeddings were extracted via OCR Tesseract. The Viola and Jones method was applied to generate visual embeddings. This paper descr...
| Published in: | Системный анализ и прикладная информатика |
|---|---|
| Main Author: | |
| Format: | Article |
| Language: | English |
| Published: |
Belarusian National Technical University
2023-02-01
|
| Subjects: | |
| Online Access: | https://sapi.bntu.by/jour/article/view/595 |
