Quality filtering and normalization for microarray-based CGH data

Altered copy numbers of DNA sequences are a characteristic of solid tumors. Microarray-based Comparative Genomic Hybridization (CGH) has emerged as a promising technology that has the potential to identify minute genomic changes, in the order of single DNA copy number changes, at the gene level. The...

Full description

Bibliographic Details
Main Author: Khojasteh Lakelayeh, Mehrnoush
Language:English
Published: 2009
Online Access:http://hdl.handle.net/2429/16526
id ndltd-UBC-oai-circle.library.ubc.ca-2429-16526
record_format oai_dc
spelling ndltd-UBC-oai-circle.library.ubc.ca-2429-165262018-01-05T17:38:25Z Quality filtering and normalization for microarray-based CGH data Khojasteh Lakelayeh, Mehrnoush Altered copy numbers of DNA sequences are a characteristic of solid tumors. Microarray-based Comparative Genomic Hybridization (CGH) has emerged as a promising technology that has the potential to identify minute genomic changes, in the order of single DNA copy number changes, at the gene level. The data to be extracted from the two microarray images of a 2-color microarray experiment, in the image analysis step, are the ratios of the fluorescent intensities of each spot of the microarray in one image and that of the corresponding spot in the other image. Without identifying the sources of experimental error, and correcting for these errors or removing the data corrupted by significant errors, microarray results can lead to incorrect experimental conclusions. This research focuses on improving the "image analysis" step of array-CGH experiments. The aim is to reduce the variability and increase the validity of the experimental results. Two issues are addressed in this work: 1) identifying spots likely to be of poor quality, and 2) normalization of the data to remove systematic errors. In this work, we present a novel approach to quality filtering of microarray spots. We use a variety of shape and image texture measures and design a binary decision tree to discriminate between the spots likely to produce meaningful data and the ones with unreliable measurement data. The proposed procedure is shown to reduce the variability of the data resulting from the low quality spots. In addressing the second issue, possible sources of systematic variations are examined and accordingly a three-step normalization scheme is used to remove these systematic variations. The normalization scheme we used consists of the following steps. The spatial bias of the ratio of each spot is estimated using a sliding window centered on each spot and the median of the ratios of the spots inside the window is calculated. The spatial bias is then removed from the data. In the next step, microplate effects are removed from the data. In the final step, the intensity dependent bias is estimated by fitting a LOESS curve to the logarithm of ratios of spots as a function of the intensities of spots. This bias is then subtracted from the log ratios. This normalization scheme was shown to increase the accuracy and precision of microarray data. Applied Science, Faculty of Electrical and Computer Engineering, Department of Graduate 2009-12-11T18:46:21Z 2009-12-11T18:46:21Z 2005 2005-05 Text Thesis/Dissertation http://hdl.handle.net/2429/16526 eng For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.
collection NDLTD
language English
sources NDLTD
description Altered copy numbers of DNA sequences are a characteristic of solid tumors. Microarray-based Comparative Genomic Hybridization (CGH) has emerged as a promising technology that has the potential to identify minute genomic changes, in the order of single DNA copy number changes, at the gene level. The data to be extracted from the two microarray images of a 2-color microarray experiment, in the image analysis step, are the ratios of the fluorescent intensities of each spot of the microarray in one image and that of the corresponding spot in the other image. Without identifying the sources of experimental error, and correcting for these errors or removing the data corrupted by significant errors, microarray results can lead to incorrect experimental conclusions. This research focuses on improving the "image analysis" step of array-CGH experiments. The aim is to reduce the variability and increase the validity of the experimental results. Two issues are addressed in this work: 1) identifying spots likely to be of poor quality, and 2) normalization of the data to remove systematic errors. In this work, we present a novel approach to quality filtering of microarray spots. We use a variety of shape and image texture measures and design a binary decision tree to discriminate between the spots likely to produce meaningful data and the ones with unreliable measurement data. The proposed procedure is shown to reduce the variability of the data resulting from the low quality spots. In addressing the second issue, possible sources of systematic variations are examined and accordingly a three-step normalization scheme is used to remove these systematic variations. The normalization scheme we used consists of the following steps. The spatial bias of the ratio of each spot is estimated using a sliding window centered on each spot and the median of the ratios of the spots inside the window is calculated. The spatial bias is then removed from the data. In the next step, microplate effects are removed from the data. In the final step, the intensity dependent bias is estimated by fitting a LOESS curve to the logarithm of ratios of spots as a function of the intensities of spots. This bias is then subtracted from the log ratios. This normalization scheme was shown to increase the accuracy and precision of microarray data. === Applied Science, Faculty of === Electrical and Computer Engineering, Department of === Graduate
author Khojasteh Lakelayeh, Mehrnoush
spellingShingle Khojasteh Lakelayeh, Mehrnoush
Quality filtering and normalization for microarray-based CGH data
author_facet Khojasteh Lakelayeh, Mehrnoush
author_sort Khojasteh Lakelayeh, Mehrnoush
title Quality filtering and normalization for microarray-based CGH data
title_short Quality filtering and normalization for microarray-based CGH data
title_full Quality filtering and normalization for microarray-based CGH data
title_fullStr Quality filtering and normalization for microarray-based CGH data
title_full_unstemmed Quality filtering and normalization for microarray-based CGH data
title_sort quality filtering and normalization for microarray-based cgh data
publishDate 2009
url http://hdl.handle.net/2429/16526
work_keys_str_mv AT khojastehlakelayehmehrnoush qualityfilteringandnormalizationformicroarraybasedcghdata
_version_ 1718590248461533184