Extrapolating training of neural networks

Bibliographic Details
Main Authors: Ya. A. Bury, D. I. Samal
Format: Article
Language: Russian
Published: The United Institute of Informatics Problems of the National Academy of Sciences of Belarus 2019-03-01
Series: Informatika
Subjects:
Online Access: https://inf.grid.by/jour/article/view/869
Description
Summary: An approach to training neural networks is presented. The idea is to use the knowledge contained in one network to generalize input signals that belong to classes unknown to it, so that another neural network with a simpler architecture can be trained on those classes. The paper examines the possibility of using the output signal of a trained handwriting recognition system for images that are presented to it but are absent from its original training set of symbols. The aim is to generalize the system's response and then, while a second system is being trained on those unknown classes, extrapolate that response to the second system's unambiguously interpreted output. A person in the process of learning is able to grasp increasingly complex concepts and acquire new knowledge faster by building on information already acquired, while retaining in memory what was learned earlier; in a similar way, the approach uses the generalization of the input signal produced by an already trained system to acquire new knowledge in a shorter time. It also makes it possible to increase recognition accuracy without repeating the entire training cycle, and therefore without changing the knowledge previously acquired by the network. The presented approach can be used to optimize the training of recognition systems, to increase the accuracy of already trained systems, and to retrain them or additionally train them on new classes without retraining on the original training set.
ISSN:1816-0301