Gene Sequence Input Formatting and MapReduce Computing
Considering the limitations of the application programming interface (API) of Hadoop in gene sequence computing, this paper puts forward an input formatting method that reads the format of gene sequence as key-value pairs in the form of records. This method relies on the rewriting of Hadoop source c...
Main Authors: | Xiaolong Feng, Jing Gao |
---|---|
Format: | Article |
Language: | English |
Published: |
Bulgarian Academy of Sciences
2019-06-01
|
Series: | International Journal Bioautomation |
Subjects: | |
Online Access: | http://www.biomed.bas.bg/bioautomation/2019/vol_23.2/files/23.2_10.pdf |
Similar Items
-
FASTA/Q data compressors for MapReduce-Hadoop genomics: space and time savings made easy
by: Umberto Ferraro Petrillo, et al.
Published: (2021-03-01) -
An Efficient Platform for Large-Scale MapReduce Processing
by: Wang, Liqiang
Published: (2009) -
Improving MapReduce Performance on Clusters
by: Gault, Sylvain
Published: (2015) -
Towards a Virtual Domain Based Authentication on MapReduce
by: Ibrahim Lahmer, et al.
Published: (2016-01-01) -
K-mer clustering algorithm using a MapReduce framework: application to the parallelization of the Inchworm module of Trinity
by: Chang Sik Kim, et al.
Published: (2017-11-01)