Techniques to Explore Time-Related Correlation in Large Datasets

The next generation of database management and computing systems will be significantly complex with data distributed both in functionality and operation. The complexity arises, at least in part, due to data types involved and types of information request rendered by the database user. Time sequence...

Full description

Bibliographic Details
Main Author:	Dua, Sumeet
Other Authors:	Bush Jones
Format:	Others
Language:	en
Published:	LSU 2002
Subjects:	Computer Science
Online Access:	http://etd.lsu.edu/docs/available/etd-0302102-175245/

id	ndltd-LSU-oai-etd.lsu.edu-etd-0302102-175245
record_format	oai_dc
spelling	ndltd-LSU-oai-etd.lsu.edu-etd-0302102-1752452013-01-07T22:48:39Z Techniques to Explore Time-Related Correlation in Large Datasets Dua, Sumeet Computer Science The next generation of database management and computing systems will be significantly complex with data distributed both in functionality and operation. The complexity arises, at least in part, due to data types involved and types of information request rendered by the database user. Time sequence databases are generated in many practical applications. Detecting similar sequences and subsequences within these databases is an important research area and has generated lot of interest recently. Previous studies in this area have concentrated on calculating similitude between (sub)sequences of equal sizes. The question of unequal sized (sub)sequence comparison to report similitude has been an open problem for some time. The problem is an important and non-trivial one. In this dissertation, we propose a solution to the problem of finding sequences, in a database of unequal sized sequences, that are similar to a given query sequence. A paradigm to search pairs of similar, equal and unequal sized, subsequences within a pair of sequences is also presented. We put forward new approaches for sequence time-scale reduction, feature aggregation and object recognition. To make the search of similar sequences efficient, we propose an indexing technique to index the unequal-sized sequence database. We also introduce a unique indexing technique to index identified subsequences within a reference sequence. This index is subsequently employed to report similar pairs of subsequences, when presented with a query sequence. We present several experimental results and also compare the proposed framework with previous work in this area. Bush Jones Jerry Trahan Aiichiro Nakano Lynn Lamotte S. S. Iyengar LSU 2002-03-15 text application/pdf http://etd.lsu.edu/docs/available/etd-0302102-175245/ http://etd.lsu.edu/docs/available/etd-0302102-175245/ en unrestricted I hereby grant to LSU or its agents the right to archive and to make available my thesis or dissertation in whole or in part in the University Libraries in all forms of media, now or hereafter known. I retain all proprietary rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part of this thesis or dissertation.
collection	NDLTD
language	en
format	Others
sources	NDLTD
topic	Computer Science
spellingShingle	Computer Science Dua, Sumeet Techniques to Explore Time-Related Correlation in Large Datasets
description	The next generation of database management and computing systems will be significantly complex with data distributed both in functionality and operation. The complexity arises, at least in part, due to data types involved and types of information request rendered by the database user. Time sequence databases are generated in many practical applications. Detecting similar sequences and subsequences within these databases is an important research area and has generated lot of interest recently. Previous studies in this area have concentrated on calculating similitude between (sub)sequences of equal sizes. The question of unequal sized (sub)sequence comparison to report similitude has been an open problem for some time. The problem is an important and non-trivial one. In this dissertation, we propose a solution to the problem of finding sequences, in a database of unequal sized sequences, that are similar to a given query sequence. A paradigm to search pairs of similar, equal and unequal sized, subsequences within a pair of sequences is also presented. We put forward new approaches for sequence time-scale reduction, feature aggregation and object recognition. To make the search of similar sequences efficient, we propose an indexing technique to index the unequal-sized sequence database. We also introduce a unique indexing technique to index identified subsequences within a reference sequence. This index is subsequently employed to report similar pairs of subsequences, when presented with a query sequence. We present several experimental results and also compare the proposed framework with previous work in this area.
author2	Bush Jones
author_facet	Bush Jones Dua, Sumeet
author	Dua, Sumeet
author_sort	Dua, Sumeet
title	Techniques to Explore Time-Related Correlation in Large Datasets
title_short	Techniques to Explore Time-Related Correlation in Large Datasets
title_full	Techniques to Explore Time-Related Correlation in Large Datasets
title_fullStr	Techniques to Explore Time-Related Correlation in Large Datasets
title_full_unstemmed	Techniques to Explore Time-Related Correlation in Large Datasets
title_sort	techniques to explore time-related correlation in large datasets
publisher	LSU
publishDate	2002
url	http://etd.lsu.edu/docs/available/etd-0302102-175245/
work_keys_str_mv	AT duasumeet techniquestoexploretimerelatedcorrelationinlargedatasets
_version_	1716476353179549696

Techniques to Explore Time-Related Correlation in Large Datasets

Similar Items