Accurate Quantile Estimation for Skewed Data Streams Using Nonlinear Interpolation

Quantile estimation is a fundamental method to generate the descriptions of the distribution of data for data management and analysis. Although the investigation and design of efficient quantile estimation algorithm has attracted much study, the problem of accurately finding quantiles in the case of...

Full description

Bibliographic Details
Main Authors: Jun Liu, Wenyao Zheng, Zheng Lin, Nan Lin
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8360417/
id doaj-6461a116d3344a27929738461208715b
record_format Article
spelling doaj-6461a116d3344a27929738461208715b2021-03-29T20:50:13ZengIEEEIEEE Access2169-35362018-01-016284382844610.1109/ACCESS.2018.28379068360417Accurate Quantile Estimation for Skewed Data Streams Using Nonlinear InterpolationJun Liu0https://orcid.org/0000-0003-4007-6109Wenyao Zheng1Zheng Lin2Nan Lin3School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, ChinaSchool of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, ChinaSchool of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, ChinaDepartment of Mathematics, Washington University in St. Louis, St. Louis, MO, USAQuantile estimation is a fundamental method to generate the descriptions of the distribution of data for data management and analysis. Although the investigation and design of efficient quantile estimation algorithm has attracted much study, the problem of accurately finding quantiles in the case of skewed data streams, which are prevalent in many data sources like text data and IP traffic streams, is still not well addressed. In this paper, we specifically address the problem of estimating the quantiles of skewed data streams by designing and implementing an incremental quantile estimation with nonlinear-interpolation algorithm. The comprehensive experimental evaluation results demonstrate that the estimated quantiles of the proposed algorithm are more accurate than the existing methods in the literature on both synthetic and real-world datasets, especially on important extreme quantiles.https://ieeexplore.ieee.org/document/8360417/Data streamsquantile estimation
collection DOAJ
language English
format Article
sources DOAJ
author Jun Liu
Wenyao Zheng
Zheng Lin
Nan Lin
spellingShingle Jun Liu
Wenyao Zheng
Zheng Lin
Nan Lin
Accurate Quantile Estimation for Skewed Data Streams Using Nonlinear Interpolation
IEEE Access
Data streams
quantile estimation
author_facet Jun Liu
Wenyao Zheng
Zheng Lin
Nan Lin
author_sort Jun Liu
title Accurate Quantile Estimation for Skewed Data Streams Using Nonlinear Interpolation
title_short Accurate Quantile Estimation for Skewed Data Streams Using Nonlinear Interpolation
title_full Accurate Quantile Estimation for Skewed Data Streams Using Nonlinear Interpolation
title_fullStr Accurate Quantile Estimation for Skewed Data Streams Using Nonlinear Interpolation
title_full_unstemmed Accurate Quantile Estimation for Skewed Data Streams Using Nonlinear Interpolation
title_sort accurate quantile estimation for skewed data streams using nonlinear interpolation
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description Quantile estimation is a fundamental method to generate the descriptions of the distribution of data for data management and analysis. Although the investigation and design of efficient quantile estimation algorithm has attracted much study, the problem of accurately finding quantiles in the case of skewed data streams, which are prevalent in many data sources like text data and IP traffic streams, is still not well addressed. In this paper, we specifically address the problem of estimating the quantiles of skewed data streams by designing and implementing an incremental quantile estimation with nonlinear-interpolation algorithm. The comprehensive experimental evaluation results demonstrate that the estimated quantiles of the proposed algorithm are more accurate than the existing methods in the literature on both synthetic and real-world datasets, especially on important extreme quantiles.
topic Data streams
quantile estimation
url https://ieeexplore.ieee.org/document/8360417/
work_keys_str_mv AT junliu accuratequantileestimationforskeweddatastreamsusingnonlinearinterpolation
AT wenyaozheng accuratequantileestimationforskeweddatastreamsusingnonlinearinterpolation
AT zhenglin accuratequantileestimationforskeweddatastreamsusingnonlinearinterpolation
AT nanlin accuratequantileestimationforskeweddatastreamsusingnonlinearinterpolation
_version_ 1724194047785959424