Image segmentation and multivariate analysis in two-dimensional gel electrophoresis

The topic of this thesis is data-analysis on images from two-dimensional electrophoretic gels. Because of the complexity of these images, there are numerous steps and approaches to such an analysis, and no “golden standard” has yet been established on how to produce the desired output. In this thesi...

Full description

Bibliographic Details
Main Author: Rye, Morten Beck
Format: Doctoral Thesis
Language:English
Published: Norges teknisk-naturvitenskapelige universitet, Institutt for kjemi 2007
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-1744
http://nbn-resolving.de/urn:isbn:978-471-4414-5
id ndltd-UPSALLA1-oai-DiVA.org-ntnu-1744
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-ntnu-17442013-01-08T13:04:35ZImage segmentation and multivariate analysis in two-dimensional gel electrophoresisengRye, Morten BeckNorges teknisk-naturvitenskapelige universitet, Institutt for kjemiFakultet for naturvitenskap og teknologi2007The topic of this thesis is data-analysis on images from two-dimensional electrophoretic gels. Because of the complexity of these images, there are numerous steps and approaches to such an analysis, and no “golden standard” has yet been established on how to produce the desired output. In this thesis focus is put on two essential fields concerning 2D-gel analysis; registration of images by segregation and protein spot identification, and data-analysis on the output of such a registration by multivariate methods. Image segmentation is mainly concerned with the task of identifying individual protein spots in a gel-image. This has generally been the natural starting point of all methods and procedures developed since the introduction of 2D-gels in the mid-seventies, simply because this best reproduces the results created by a human analyst, who manually identify protein-spot entities. The amount of data produced in a 2D-gel experiment can be quite large, especially in multiple gels where the human analyst is dependent on additional statistical data-analytical tools to produce results. Because of the correlated nature of most gel-data, analysis by multivariate methods is natural choice, and are therefore adopted in this thesis. The goal of this thesis is to introduce the above mentioned procedures at different stages in the analysis pipeline where they are not yet fully exploited, rather than to improve already existing algorithms. In this way new insight and ideas on how to handle data from 2D-gel experiments are achieved. The thesis starts with a review of segmentation methodology, and introduces a selected procedure used to identify protein spots throughout. Output from the segmentation is then used to create a multivariate spot-filtering model, which aims to separate protein spots from noise and artefacts often creating problems in 2D-gel analysis. Lately the use of common spot boundaries in multiple gels have been the method of choice when gels are analysed. How such boundaries should be defined is an important subject of discussion, and thus a new method for defining common boundaries based on the individual segmentation of each gel is introduced. Segmentation may be a natural starting point when gels are analysed, but it is not necessarily the most correct. Often the introduction of fixed spot entities introduces restrictions to the data which cause problems at later stages in the analysis. Analysing pixels from multiple gels directly has no such restrictions, and it is shown in this thesis that the output of such an analysis based on multivariate methods can produce very useful results. It can also give insight to the data problematic to achieve with the spot boundary approach. At last in the thesis an improved pixel-based approach is introduced, where a less restricted segmentation is used to reduce and concentrate the amount of data analysed, improving the final output. Doctoral thesis, comprehensive summaryinfo:eu-repo/semantics/doctoralThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-1744urn:isbn:978-471-4414-5Doktoravhandlinger ved NTNU, 1503-8181 ; 2007:200application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Doctoral Thesis
sources NDLTD
description The topic of this thesis is data-analysis on images from two-dimensional electrophoretic gels. Because of the complexity of these images, there are numerous steps and approaches to such an analysis, and no “golden standard” has yet been established on how to produce the desired output. In this thesis focus is put on two essential fields concerning 2D-gel analysis; registration of images by segregation and protein spot identification, and data-analysis on the output of such a registration by multivariate methods. Image segmentation is mainly concerned with the task of identifying individual protein spots in a gel-image. This has generally been the natural starting point of all methods and procedures developed since the introduction of 2D-gels in the mid-seventies, simply because this best reproduces the results created by a human analyst, who manually identify protein-spot entities. The amount of data produced in a 2D-gel experiment can be quite large, especially in multiple gels where the human analyst is dependent on additional statistical data-analytical tools to produce results. Because of the correlated nature of most gel-data, analysis by multivariate methods is natural choice, and are therefore adopted in this thesis. The goal of this thesis is to introduce the above mentioned procedures at different stages in the analysis pipeline where they are not yet fully exploited, rather than to improve already existing algorithms. In this way new insight and ideas on how to handle data from 2D-gel experiments are achieved. The thesis starts with a review of segmentation methodology, and introduces a selected procedure used to identify protein spots throughout. Output from the segmentation is then used to create a multivariate spot-filtering model, which aims to separate protein spots from noise and artefacts often creating problems in 2D-gel analysis. Lately the use of common spot boundaries in multiple gels have been the method of choice when gels are analysed. How such boundaries should be defined is an important subject of discussion, and thus a new method for defining common boundaries based on the individual segmentation of each gel is introduced. Segmentation may be a natural starting point when gels are analysed, but it is not necessarily the most correct. Often the introduction of fixed spot entities introduces restrictions to the data which cause problems at later stages in the analysis. Analysing pixels from multiple gels directly has no such restrictions, and it is shown in this thesis that the output of such an analysis based on multivariate methods can produce very useful results. It can also give insight to the data problematic to achieve with the spot boundary approach. At last in the thesis an improved pixel-based approach is introduced, where a less restricted segmentation is used to reduce and concentrate the amount of data analysed, improving the final output.
author Rye, Morten Beck
spellingShingle Rye, Morten Beck
Image segmentation and multivariate analysis in two-dimensional gel electrophoresis
author_facet Rye, Morten Beck
author_sort Rye, Morten Beck
title Image segmentation and multivariate analysis in two-dimensional gel electrophoresis
title_short Image segmentation and multivariate analysis in two-dimensional gel electrophoresis
title_full Image segmentation and multivariate analysis in two-dimensional gel electrophoresis
title_fullStr Image segmentation and multivariate analysis in two-dimensional gel electrophoresis
title_full_unstemmed Image segmentation and multivariate analysis in two-dimensional gel electrophoresis
title_sort image segmentation and multivariate analysis in two-dimensional gel electrophoresis
publisher Norges teknisk-naturvitenskapelige universitet, Institutt for kjemi
publishDate 2007
url http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-1744
http://nbn-resolving.de/urn:isbn:978-471-4414-5
work_keys_str_mv AT ryemortenbeck imagesegmentationandmultivariateanalysisintwodimensionalgelelectrophoresis
_version_ 1716507965884727296