Construction of GC-Balanced DNA With Deletion/Insertion/Mutation Error Correction for DNA Storage System
Synthetic deoxyribonucleic acid (DNA) is a good medium for storing digital data for a long period due to its achievable high data storage density and outstanding longevity. However, synthesizing and sequencing DNA sequences in a DNA storage system are prone to a wide variety of errors, including ins...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9151948/ |
id |
doaj-30a6d246fd2a4ccc80c9e74f900cbf3c |
---|---|
record_format |
Article |
spelling |
doaj-30a6d246fd2a4ccc80c9e74f900cbf3c2021-03-30T04:30:13ZengIEEEIEEE Access2169-35362020-01-01814097214098010.1109/ACCESS.2020.30126889151948Construction of GC-Balanced DNA With Deletion/Insertion/Mutation Error Correction for DNA Storage SystemTianbo Xue0https://orcid.org/0000-0003-3758-4321Francis C. M. Lau1https://orcid.org/0000-0002-8279-0899Department of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong KongDepartment of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong KongSynthetic deoxyribonucleic acid (DNA) is a good medium for storing digital data for a long period due to its achievable high data storage density and outstanding longevity. However, synthesizing and sequencing DNA sequences in a DNA storage system are prone to a wide variety of errors, including insertion, deletion and mutation errors. At the same time, it is known that DNA sequences with 50% GC content are less susceptible to errors. This paper presents the construction of a GC-balanced DNA sequence with error correction capability. A systematic single insertion/deletion/substitution error correction code is first proposed and then used to design a GC-balanced scheme for synthesizing DNA sequences. With the proposed method, DNA sequences with exactly 50% GC content are constructed. Such DNA sequences not only have the maximum endurance to errors, but are able to correct both insertion/deletion and mutation of the nucleotide bases. The decoding procedures for the sequences are described and can readily be used in practice. Simulation results show that the proposed GC-balanced DNA sequences can correct base errors adequately.https://ieeexplore.ieee.org/document/9151948/Systematic single error correcting codemutation/insertion/deletionGC-balancedDNA storage system |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Tianbo Xue Francis C. M. Lau |
spellingShingle |
Tianbo Xue Francis C. M. Lau Construction of GC-Balanced DNA With Deletion/Insertion/Mutation Error Correction for DNA Storage System IEEE Access Systematic single error correcting code mutation/insertion/deletion GC-balanced DNA storage system |
author_facet |
Tianbo Xue Francis C. M. Lau |
author_sort |
Tianbo Xue |
title |
Construction of GC-Balanced DNA With Deletion/Insertion/Mutation Error Correction for DNA Storage System |
title_short |
Construction of GC-Balanced DNA With Deletion/Insertion/Mutation Error Correction for DNA Storage System |
title_full |
Construction of GC-Balanced DNA With Deletion/Insertion/Mutation Error Correction for DNA Storage System |
title_fullStr |
Construction of GC-Balanced DNA With Deletion/Insertion/Mutation Error Correction for DNA Storage System |
title_full_unstemmed |
Construction of GC-Balanced DNA With Deletion/Insertion/Mutation Error Correction for DNA Storage System |
title_sort |
construction of gc-balanced dna with deletion/insertion/mutation error correction for dna storage system |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Synthetic deoxyribonucleic acid (DNA) is a good medium for storing digital data for a long period due to its achievable high data storage density and outstanding longevity. However, synthesizing and sequencing DNA sequences in a DNA storage system are prone to a wide variety of errors, including insertion, deletion and mutation errors. At the same time, it is known that DNA sequences with 50% GC content are less susceptible to errors. This paper presents the construction of a GC-balanced DNA sequence with error correction capability. A systematic single insertion/deletion/substitution error correction code is first proposed and then used to design a GC-balanced scheme for synthesizing DNA sequences. With the proposed method, DNA sequences with exactly 50% GC content are constructed. Such DNA sequences not only have the maximum endurance to errors, but are able to correct both insertion/deletion and mutation of the nucleotide bases. The decoding procedures for the sequences are described and can readily be used in practice. Simulation results show that the proposed GC-balanced DNA sequences can correct base errors adequately. |
topic |
Systematic single error correcting code mutation/insertion/deletion GC-balanced DNA storage system |
url |
https://ieeexplore.ieee.org/document/9151948/ |
work_keys_str_mv |
AT tianboxue constructionofgcbalanceddnawithdeletioninsertionmutationerrorcorrectionfordnastoragesystem AT franciscmlau constructionofgcbalanceddnawithdeletioninsertionmutationerrorcorrectionfordnastoragesystem |
_version_ |
1714742229314043904 |