Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms

In recent years, there has been an explosive increase in the amount of bioinformatics data produced, but data are not information. The purpose of bioinformatics research is to obtain information with biological significance from large amounts of data. Multiple sequence alignment is widely used in se...

Full description

Bibliographic Details
Main Authors: Haihe Shi, Xuchu Zhang
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-02-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fgene.2020.00105/full
id doaj-068a50e61b2a46a19a4cc60789b8d4a5
record_format Article
spelling doaj-068a50e61b2a46a19a4cc60789b8d4a52020-11-25T02:25:55ZengFrontiers Media S.A.Frontiers in Genetics1664-80212020-02-011110.3389/fgene.2020.00105506816Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment AlgorithmsHaihe ShiXuchu ZhangIn recent years, there has been an explosive increase in the amount of bioinformatics data produced, but data are not information. The purpose of bioinformatics research is to obtain information with biological significance from large amounts of data. Multiple sequence alignment is widely used in sequence homology detection, protein secondary and tertiary structure prediction, phylogenetic tree analysis, and other fields. Existing research mainly focuses on the specific steps of the algorithm or on specific problems, and there is a lack of high-level abstract domain algorithm frameworks. As a result, multiple sequence alignment algorithms are complex, redundant, and difficult to understand, and it is not easy for users to select the appropriate algorithm, which may lead to computing errors. Here, through in-depth study and analysis of the heuristic multiple sequence alignment algorithm (HMSAA) domain, a domain-feature model and an interactive model of HMSAA components have been established according to the generative programming method. With the support of the PAR (partition and recur) platform, the HMSAA algorithm component library is formalized and a specific alignment algorithm is assembled, thus improving the reliability of algorithm assembly. This work provides a valuable theoretical reference for the applications of other biological sequence analysis algorithms.https://www.frontiersin.org/article/10.3389/fgene.2020.00105/fullheuristic multiple sequence alignment algorithmsfeature modelgenerative programmingcomponent interaction modelpartition and recur platform
collection DOAJ
language English
format Article
sources DOAJ
author Haihe Shi
Xuchu Zhang
spellingShingle Haihe Shi
Xuchu Zhang
Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms
Frontiers in Genetics
heuristic multiple sequence alignment algorithms
feature model
generative programming
component interaction model
partition and recur platform
author_facet Haihe Shi
Xuchu Zhang
author_sort Haihe Shi
title Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms
title_short Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms
title_full Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms
title_fullStr Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms
title_full_unstemmed Component-Based Design and Assembly of Heuristic Multiple Sequence Alignment Algorithms
title_sort component-based design and assembly of heuristic multiple sequence alignment algorithms
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2020-02-01
description In recent years, there has been an explosive increase in the amount of bioinformatics data produced, but data are not information. The purpose of bioinformatics research is to obtain information with biological significance from large amounts of data. Multiple sequence alignment is widely used in sequence homology detection, protein secondary and tertiary structure prediction, phylogenetic tree analysis, and other fields. Existing research mainly focuses on the specific steps of the algorithm or on specific problems, and there is a lack of high-level abstract domain algorithm frameworks. As a result, multiple sequence alignment algorithms are complex, redundant, and difficult to understand, and it is not easy for users to select the appropriate algorithm, which may lead to computing errors. Here, through in-depth study and analysis of the heuristic multiple sequence alignment algorithm (HMSAA) domain, a domain-feature model and an interactive model of HMSAA components have been established according to the generative programming method. With the support of the PAR (partition and recur) platform, the HMSAA algorithm component library is formalized and a specific alignment algorithm is assembled, thus improving the reliability of algorithm assembly. This work provides a valuable theoretical reference for the applications of other biological sequence analysis algorithms.
topic heuristic multiple sequence alignment algorithms
feature model
generative programming
component interaction model
partition and recur platform
url https://www.frontiersin.org/article/10.3389/fgene.2020.00105/full
work_keys_str_mv AT haiheshi componentbaseddesignandassemblyofheuristicmultiplesequencealignmentalgorithms
AT xuchuzhang componentbaseddesignandassemblyofheuristicmultiplesequencealignmentalgorithms
_version_ 1724849495090397184