Partition-based Cluster Analysis on Time Series Gene Expression Data

碩士 === 國立嘉義大學 === 資訊工程學系研究所 === 97 === Abstract Gene expression data produced by DNA microarray experiments advance the study of functions of genes. Researchers use microarray to generate large amount of gene expression data and observe the differences among gene expression. Data of gene expression...

Full description

Bibliographic Details
Main Authors: Tsung-Lung Lee, 李宗龍
Other Authors: Huang-Cheng Kuo
Format: Others
Language:en_US
Online Access:http://ndltd.ncl.edu.tw/handle/r5u5ey
id ndltd-TW-097NCYU5392003
record_format oai_dc
spelling ndltd-TW-097NCYU53920032019-05-15T19:49:41Z http://ndltd.ncl.edu.tw/handle/r5u5ey Partition-based Cluster Analysis on Time Series Gene Expression Data 時間基因序列分割式群集分析 Tsung-Lung Lee 李宗龍 碩士 國立嘉義大學 資訊工程學系研究所 97 Abstract Gene expression data produced by DNA microarray experiments advance the study of functions of genes. Researchers use microarray to generate large amount of gene expression data and observe the differences among gene expression. Data of gene expression time series describes the trend of gene behaviors. Recently, clustering is often used and is a popular analysis for studying gene expression time series data. In the same cluster, genes have similar behavior. Cluster analysis demonstrates the relativity among genes. In the paper, we use a similarity measurement, named LCSS (Longest Common Subseries Similarity), to overcome the influence of “shift-effect.” This influence is an issue which needs to overcome when using similarity measurement. In clustering, sequential pattern mining technique is used to resolve the number of clusters for partitioning-based clustering algorithm, and several similarity measurements (like Pearson correlation coefficient) are used to determine neighbors in KNN, A mechanism based on nearest neighbor is used as the criteria for objects relocation in the clustering algorithm. Huang-Cheng Kuo 郭煌政 學位論文 ; thesis 55 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立嘉義大學 === 資訊工程學系研究所 === 97 === Abstract Gene expression data produced by DNA microarray experiments advance the study of functions of genes. Researchers use microarray to generate large amount of gene expression data and observe the differences among gene expression. Data of gene expression time series describes the trend of gene behaviors. Recently, clustering is often used and is a popular analysis for studying gene expression time series data. In the same cluster, genes have similar behavior. Cluster analysis demonstrates the relativity among genes. In the paper, we use a similarity measurement, named LCSS (Longest Common Subseries Similarity), to overcome the influence of “shift-effect.” This influence is an issue which needs to overcome when using similarity measurement. In clustering, sequential pattern mining technique is used to resolve the number of clusters for partitioning-based clustering algorithm, and several similarity measurements (like Pearson correlation coefficient) are used to determine neighbors in KNN, A mechanism based on nearest neighbor is used as the criteria for objects relocation in the clustering algorithm.
author2 Huang-Cheng Kuo
author_facet Huang-Cheng Kuo
Tsung-Lung Lee
李宗龍
author Tsung-Lung Lee
李宗龍
spellingShingle Tsung-Lung Lee
李宗龍
Partition-based Cluster Analysis on Time Series Gene Expression Data
author_sort Tsung-Lung Lee
title Partition-based Cluster Analysis on Time Series Gene Expression Data
title_short Partition-based Cluster Analysis on Time Series Gene Expression Data
title_full Partition-based Cluster Analysis on Time Series Gene Expression Data
title_fullStr Partition-based Cluster Analysis on Time Series Gene Expression Data
title_full_unstemmed Partition-based Cluster Analysis on Time Series Gene Expression Data
title_sort partition-based cluster analysis on time series gene expression data
url http://ndltd.ncl.edu.tw/handle/r5u5ey
work_keys_str_mv AT tsunglunglee partitionbasedclusteranalysisontimeseriesgeneexpressiondata
AT lǐzōnglóng partitionbasedclusteranalysisontimeseriesgeneexpressiondata
AT tsunglunglee shíjiānjīyīnxùlièfēngēshìqúnjífēnxī
AT lǐzōnglóng shíjiānjīyīnxùlièfēngēshìqúnjífēnxī
_version_ 1719096520108670976