Embedded Dimension and Time Series Length. Practical Influence on Permutation Entropy and Its Applications

Permutation Entropy (PE) is a time series complexity measure commonly used in a variety of contexts, with medicine being the prime example. In its general form, it requires three input parameters for its calculation: time series length <i>N</i>, embedded dimension <i>m</i>, a...

Full description

Bibliographic Details
Main Authors: David Cuesta-Frau, Juan Pablo Murillo-Escobar, Diana Alexandra Orrego, Edilson Delgado-Trejos
Format: Article
Language:English
Published: MDPI AG 2019-04-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/21/4/385
id doaj-6f935cb9c1684910970f7384c0c63de5
record_format Article
spelling doaj-6f935cb9c1684910970f7384c0c63de52020-11-25T00:58:53ZengMDPI AGEntropy1099-43002019-04-0121438510.3390/e21040385e21040385Embedded Dimension and Time Series Length. Practical Influence on Permutation Entropy and Its ApplicationsDavid Cuesta-Frau0Juan Pablo Murillo-Escobar1Diana Alexandra Orrego2Edilson Delgado-Trejos3Technological Institute of Informatics, Universitat Politècnica de València, Alcoi Campus, 03801 Alcoi, SpainGrupo de Investigación e Innovación Biomédica (GI2B), Instituto Tecnológico Metropolitano (ITM), Medellín, ColombiaGrupo de Investigación e Innovación Biomédica (GI2B), Instituto Tecnológico Metropolitano (ITM), Medellín, ColombiaCM&amp;P, Instituto Tecnológico Metropolitano (ITM), Medellín, ColombiaPermutation Entropy (PE) is a time series complexity measure commonly used in a variety of contexts, with medicine being the prime example. In its general form, it requires three input parameters for its calculation: time series length <i>N</i>, embedded dimension <i>m</i>, and embedded delay <inline-formula> <math display="inline"> <semantics> <mi>&#964;</mi> </semantics> </math> </inline-formula>. Inappropriate choices of these parameters may potentially lead to incorrect interpretations. However, there are no specific guidelines for an optimal selection of <i>N</i>, <i>m</i>, or <inline-formula> <math display="inline"> <semantics> <mi>&#964;</mi> </semantics> </math> </inline-formula>, only general recommendations such as <inline-formula> <math display="inline"> <semantics> <mrow> <mi>N</mi> <mo>&gt;</mo> <mo>&gt;</mo> <mi>m</mi> <mo>!</mo> </mrow> </semantics> </math> </inline-formula>, <inline-formula> <math display="inline"> <semantics> <mrow> <mi>&#964;</mi> <mo>=</mo> <mn>1</mn> </mrow> </semantics> </math> </inline-formula>, or <inline-formula> <math display="inline"> <semantics> <mrow> <mi>m</mi> <mo>=</mo> <mn>3</mn> <mo>,</mo> <mo>&#8230;</mo> <mo>,</mo> <mn>7</mn> </mrow> </semantics> </math> </inline-formula>. This paper deals specifically with the study of the practical implications of <inline-formula> <math display="inline"> <semantics> <mrow> <mi>N</mi> <mo>&gt;</mo> <mo>&gt;</mo> <mi>m</mi> <mo>!</mo> </mrow> </semantics> </math> </inline-formula>, since long time series are often not available, or non-stationary, and other preliminary results suggest that low <i>N</i> values do not necessarily invalidate PE usefulness. Our study analyses the PE variation as a function of the series length <i>N</i> and embedded dimension <i>m</i> in the context of a diverse experimental set, both synthetic (random, spikes, or logistic model time series) and real&#8211;world (climatology, seismic, financial, or biomedical time series), and the classification performance achieved with varying <i>N</i> and <i>m</i>. The results seem to indicate that shorter lengths than those suggested by <inline-formula> <math display="inline"> <semantics> <mrow> <mi>N</mi> <mo>&gt;</mo> <mo>&gt;</mo> <mi>m</mi> <mo>!</mo> </mrow> </semantics> </math> </inline-formula> are sufficient for a stable PE calculation, and even very short time series can be robustly classified based on PE measurements before the stability point is reached. This may be due to the fact that there are forbidden patterns in chaotic time series, not all the patterns are equally informative, and differences among classes are already apparent at very short lengths.https://www.mdpi.com/1099-4300/21/4/385permutation entropyembedded dimensionshort time recordssignal classificationrelevance analysis
collection DOAJ
language English
format Article
sources DOAJ
author David Cuesta-Frau
Juan Pablo Murillo-Escobar
Diana Alexandra Orrego
Edilson Delgado-Trejos
spellingShingle David Cuesta-Frau
Juan Pablo Murillo-Escobar
Diana Alexandra Orrego
Edilson Delgado-Trejos
Embedded Dimension and Time Series Length. Practical Influence on Permutation Entropy and Its Applications
Entropy
permutation entropy
embedded dimension
short time records
signal classification
relevance analysis
author_facet David Cuesta-Frau
Juan Pablo Murillo-Escobar
Diana Alexandra Orrego
Edilson Delgado-Trejos
author_sort David Cuesta-Frau
title Embedded Dimension and Time Series Length. Practical Influence on Permutation Entropy and Its Applications
title_short Embedded Dimension and Time Series Length. Practical Influence on Permutation Entropy and Its Applications
title_full Embedded Dimension and Time Series Length. Practical Influence on Permutation Entropy and Its Applications
title_fullStr Embedded Dimension and Time Series Length. Practical Influence on Permutation Entropy and Its Applications
title_full_unstemmed Embedded Dimension and Time Series Length. Practical Influence on Permutation Entropy and Its Applications
title_sort embedded dimension and time series length. practical influence on permutation entropy and its applications
publisher MDPI AG
series Entropy
issn 1099-4300
publishDate 2019-04-01
description Permutation Entropy (PE) is a time series complexity measure commonly used in a variety of contexts, with medicine being the prime example. In its general form, it requires three input parameters for its calculation: time series length <i>N</i>, embedded dimension <i>m</i>, and embedded delay <inline-formula> <math display="inline"> <semantics> <mi>&#964;</mi> </semantics> </math> </inline-formula>. Inappropriate choices of these parameters may potentially lead to incorrect interpretations. However, there are no specific guidelines for an optimal selection of <i>N</i>, <i>m</i>, or <inline-formula> <math display="inline"> <semantics> <mi>&#964;</mi> </semantics> </math> </inline-formula>, only general recommendations such as <inline-formula> <math display="inline"> <semantics> <mrow> <mi>N</mi> <mo>&gt;</mo> <mo>&gt;</mo> <mi>m</mi> <mo>!</mo> </mrow> </semantics> </math> </inline-formula>, <inline-formula> <math display="inline"> <semantics> <mrow> <mi>&#964;</mi> <mo>=</mo> <mn>1</mn> </mrow> </semantics> </math> </inline-formula>, or <inline-formula> <math display="inline"> <semantics> <mrow> <mi>m</mi> <mo>=</mo> <mn>3</mn> <mo>,</mo> <mo>&#8230;</mo> <mo>,</mo> <mn>7</mn> </mrow> </semantics> </math> </inline-formula>. This paper deals specifically with the study of the practical implications of <inline-formula> <math display="inline"> <semantics> <mrow> <mi>N</mi> <mo>&gt;</mo> <mo>&gt;</mo> <mi>m</mi> <mo>!</mo> </mrow> </semantics> </math> </inline-formula>, since long time series are often not available, or non-stationary, and other preliminary results suggest that low <i>N</i> values do not necessarily invalidate PE usefulness. Our study analyses the PE variation as a function of the series length <i>N</i> and embedded dimension <i>m</i> in the context of a diverse experimental set, both synthetic (random, spikes, or logistic model time series) and real&#8211;world (climatology, seismic, financial, or biomedical time series), and the classification performance achieved with varying <i>N</i> and <i>m</i>. The results seem to indicate that shorter lengths than those suggested by <inline-formula> <math display="inline"> <semantics> <mrow> <mi>N</mi> <mo>&gt;</mo> <mo>&gt;</mo> <mi>m</mi> <mo>!</mo> </mrow> </semantics> </math> </inline-formula> are sufficient for a stable PE calculation, and even very short time series can be robustly classified based on PE measurements before the stability point is reached. This may be due to the fact that there are forbidden patterns in chaotic time series, not all the patterns are equally informative, and differences among classes are already apparent at very short lengths.
topic permutation entropy
embedded dimension
short time records
signal classification
relevance analysis
url https://www.mdpi.com/1099-4300/21/4/385
work_keys_str_mv AT davidcuestafrau embeddeddimensionandtimeserieslengthpracticalinfluenceonpermutationentropyanditsapplications
AT juanpablomurilloescobar embeddeddimensionandtimeserieslengthpracticalinfluenceonpermutationentropyanditsapplications
AT dianaalexandraorrego embeddeddimensionandtimeserieslengthpracticalinfluenceonpermutationentropyanditsapplications
AT edilsondelgadotrejos embeddeddimensionandtimeserieslengthpracticalinfluenceonpermutationentropyanditsapplications
_version_ 1725220092746137600