The Regression Learning of the Imbalanced and Big Data by the Online Mixture Model for the Mach Number Forecasting

Extracting valuable information to enhance the performance of forecasting models from the imbalanced and big data requires the scalable implementation of advanced statistical learning methods. This paper proposes the online mixture model (OMM) and applies it to the Mach number forecasting. Treating...

Full description

Bibliographic Details
Main Authors: Xiao-Jun Wang, Yan Liu, Ping Yuan, Chang-Jun Zhou, Lin Zhang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8576504/
Description
Summary:Extracting valuable information to enhance the performance of forecasting models from the imbalanced and big data requires the scalable implementation of advanced statistical learning methods. This paper proposes the online mixture model (OMM) and applies it to the Mach number forecasting. Treating the key variable (e.g., Mach number) forecasting under all working conditions as an entire task, and viewing that of each individual working condition as a subtask, the OMM separates the dense samples from the sparse ones on the basis of subtasks. The subtask models are independently learnt on the samples with reduced volume, and updated for the new working conditions without retaining samples from the old working conditions. Moreover, the tree-structure ensemble (TSE)-feature subsets ensembles (FSEs) algorithm is presented to fit the nonlinear function of a subtask model, where the FSE local models with low-dimensional input features are established on the non-overlapping sample subsets constructed by the TSE method. The TSE-FSEs not only reduce the volume of data but also perform distributed computing with parallel structure, and thus has the advantage of the learning of big data. Experiments carried out on the measurement data of wind tunnel indicate that the OMM with the TSE-FSEs outperforms other learning algorithms for the Mach number forecasting, and meets the precision and forecasting speed requirements in engineering.
ISSN:2169-3536