Jupyter notebook-based tools for building structured datasets from the Sequence Read Archive [version 2; peer review: 2 approved]

The Sequence Read Archive (SRA) is a large public repository that stores raw next-generation sequencing data from thousands of diverse scientific investigations. Despite its promise, reuse and re-analysis of SRA data have been challenged by the heterogeneity and poor quality of the metadata that des...

Bibliographic Details
Main Authors: Matthew N. Bernstein, Ariella Gladstein, Khun Zaw Latt, Emily Clough, Ben Busby, Allissa Dillman
Format: Article
Language: English
Published: F1000 Research Ltd 2020-08-01
Series: F1000Research
Online Access: https://f1000research.com/articles/9-376/v2