Calculating the average distance between horizontal projection peaks
During the analysis and construction of the noise filtering algorithm at the stage of segmentation of character strings, the need to describe a special database or dictionary for storing model and skeletal projections of alphabet characters became clear. However, when formatting entries for alphabet...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Belarusian National Technical University
2019-12-01
|
Series: | Sistemnyj Analiz i Prikladnaâ Informatika |
Subjects: | |
Online Access: | https://sapi.bntu.by/jour/article/view/298 |
id |
doaj-10fac99fd6f74a38b22273f0d66d81fe |
---|---|
record_format |
Article |
spelling |
doaj-10fac99fd6f74a38b22273f0d66d81fe2021-07-29T08:38:33ZengBelarusian National Technical UniversitySistemnyj Analiz i Prikladnaâ Informatika2309-49232414-04812019-12-01034810.21122/2309-4923-2019-3-4-8208Calculating the average distance between horizontal projection peaksD. V. Zaerko0V. A. Lipnitski1Belarusian State University of Informatics and RadioelectronicsBelarusian State University of Informatics and RadioelectronicsDuring the analysis and construction of the noise filtering algorithm at the stage of segmentation of character strings, the need to describe a special database or dictionary for storing model and skeletal projections of alphabet characters became clear. However, when formatting entries for alphabetical projection in the dictionary, the following questions arose: how many projection values are needed to describe a single character, and also how this value may change depending on the number of strings analyzed and their characters. The objects of research in the article are the vertical projection of the characters, as well as their width of the projection segments. The subject of research is the change of the average width of the projection segment for a certain type of license plates (character string). The main goal is to calculate and justify the average size of the segment. Data about the average width of the projection segment that describes a character allows to determine the number of projection values (coordinates along the ox axis) to store the projection values of this character from an alphabet. Several language alphabets are considered, where each of them is associated with a unique type. In particular, license plates of the Republic of Belarus and the Slovak Republic are considered. Using an elementary statistical apparatus, calculations and analysis of samples were carried out, taking into account the case of their high variation. As a result, the average values of the segment width and the variance of the segment width were obtained using two types of alphabets as an example. In research an algorithm for using the obtained values in the formation of model projection records is presented. The algorithm takes into account «special» cases of going beyond the segment boundaries. The described steps and calculations can be applied to a larger number of alphabets, which indicates the possibility of describing model projection dictionaries for them, with subsequent widespread use of character strings in noise filtering.https://sapi.bntu.by/jour/article/view/298vertical projectionsymbolic stringnoise filteringaverage segment widthprojection extreme pointsmodel and skeleton projectionalphabetic projection dictionary |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
D. V. Zaerko V. A. Lipnitski |
spellingShingle |
D. V. Zaerko V. A. Lipnitski Calculating the average distance between horizontal projection peaks Sistemnyj Analiz i Prikladnaâ Informatika vertical projection symbolic string noise filtering average segment width projection extreme points model and skeleton projection alphabetic projection dictionary |
author_facet |
D. V. Zaerko V. A. Lipnitski |
author_sort |
D. V. Zaerko |
title |
Calculating the average distance between horizontal projection peaks |
title_short |
Calculating the average distance between horizontal projection peaks |
title_full |
Calculating the average distance between horizontal projection peaks |
title_fullStr |
Calculating the average distance between horizontal projection peaks |
title_full_unstemmed |
Calculating the average distance between horizontal projection peaks |
title_sort |
calculating the average distance between horizontal projection peaks |
publisher |
Belarusian National Technical University |
series |
Sistemnyj Analiz i Prikladnaâ Informatika |
issn |
2309-4923 2414-0481 |
publishDate |
2019-12-01 |
description |
During the analysis and construction of the noise filtering algorithm at the stage of segmentation of character strings, the need to describe a special database or dictionary for storing model and skeletal projections of alphabet characters became clear. However, when formatting entries for alphabetical projection in the dictionary, the following questions arose: how many projection values are needed to describe a single character, and also how this value may change depending on the number of strings analyzed and their characters. The objects of research in the article are the vertical projection of the characters, as well as their width of the projection segments. The subject of research is the change of the average width of the projection segment for a certain type of license plates (character string). The main goal is to calculate and justify the average size of the segment. Data about the average width of the projection segment that describes a character allows to determine the number of projection values (coordinates along the ox axis) to store the projection values of this character from an alphabet. Several language alphabets are considered, where each of them is associated with a unique type. In particular, license plates of the Republic of Belarus and the Slovak Republic are considered. Using an elementary statistical apparatus, calculations and analysis of samples were carried out, taking into account the case of their high variation. As a result, the average values of the segment width and the variance of the segment width were obtained using two types of alphabets as an example. In research an algorithm for using the obtained values in the formation of model projection records is presented. The algorithm takes into account «special» cases of going beyond the segment boundaries. The described steps and calculations can be applied to a larger number of alphabets, which indicates the possibility of describing model projection dictionaries for them, with subsequent widespread use of character strings in noise filtering. |
topic |
vertical projection symbolic string noise filtering average segment width projection extreme points model and skeleton projection alphabetic projection dictionary |
url |
https://sapi.bntu.by/jour/article/view/298 |
work_keys_str_mv |
AT dvzaerko calculatingtheaveragedistancebetweenhorizontalprojectionpeaks AT valipnitski calculatingtheaveragedistancebetweenhorizontalprojectionpeaks |
_version_ |
1721253089370963968 |