A comparison of topologically associating domain callers over mammals at high resolution

Background: Topologically associating domains (TADs) are locally highly-interacting genome regions, which also play a critical role in regulating gene expression in the cell. TADs have been first identified while investigating the 3D genome structure over High-throughput Chromosome Conformation Capt...

Full description

Bibliographic Details
Main Author: Sefer, E. (Author)
Format: Article
Language:English
Published: BioMed Central Ltd 2022
Subjects:
Online Access:View Fulltext in Publisher
LEADER 03565nam a2200445Ia 4500
001 10.1186-s12859-022-04674-2
008 220425s2022 CNT 000 0 und d
020 |a 14712105 (ISSN) 
245 1 0 |a A comparison of topologically associating domain callers over mammals at high resolution 
260 0 |b BioMed Central Ltd  |c 2022 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1186/s12859-022-04674-2 
520 3 |a Background: Topologically associating domains (TADs) are locally highly-interacting genome regions, which also play a critical role in regulating gene expression in the cell. TADs have been first identified while investigating the 3D genome structure over High-throughput Chromosome Conformation Capture (Hi-C) interaction dataset. Substantial degree of efforts have been devoted to develop techniques for inferring TADs from Hi-C interaction dataset. Many TAD-calling methods have been developed which differ in their criteria and assumptions in TAD inference. Correspondingly, TADs inferred via these callers vary in terms of both similarities and biological features they are enriched in. Result: We have carried out a systematic comparison of 27 TAD-calling methods over mammals. We use Micro-C, a recent high-resolution variant of Hi-C, to compare TADs at a very high resolution, and classify the methods into 3 categories: feature-based methods, Clustering methods, Graph-partitioning methods. We have evaluated TAD boundaries, gaps between adjacent TADs, and quality of TADs across various criteria. We also found particularly CTCF and Cohesin proteins to be effective in formation of TADs with corner dots. We have also assessed the callers performance on simulated datasets since a gold standard for TADs is missing. TAD sizes and numbers change remarkably between TAD callers and dataset resolutions, indicating that TADs are hierarchically-organized domains, instead of disjoint regions. A core subset of feature-based TAD callers regularly perform the best while inferring reproducible domains, which are also enriched for TAD related biological properties. Conclusion: We have analyzed the fundamental principles of TAD-calling methods, and identified the existing situation in TAD inference across high resolution Micro-C interaction datasets over mammals. We come up with a systematic, comprehensive, and concise framework to evaluate the TAD-calling methods performance across Micro-C datasets. Our research will be useful in selecting appropriate methods for TAD inference and evaluation based on available data, experimental design, and biological question of interest. We also introduce our analysis as a benchmarking tool with publicly available source code. © 2022, The Author(s). 
650 0 4 |a animal 
650 0 4 |a Animals 
650 0 4 |a chromatin 
650 0 4 |a Chromatin 
650 0 4 |a Chromatin organization 
650 0 4 |a Chromatin organization 
650 0 4 |a Gene expression 
650 0 4 |a Genes expression 
650 0 4 |a genetics 
650 0 4 |a Genome structures 
650 0 4 |a Hi-C 
650 0 4 |a Hi-C 
650 0 4 |a High resolution 
650 0 4 |a High-throughput 
650 0 4 |a mammal 
650 0 4 |a Mammals 
650 0 4 |a Mammals 
650 0 4 |a Micro-C 
650 0 4 |a Micro-C 
650 0 4 |a Performance 
650 0 4 |a Quality control 
650 0 4 |a TAD callers 
650 0 4 |a Topologically associating domain 
650 0 4 |a Topologically associating domain caller 
650 0 4 |a Topologically associating domains 
650 0 4 |a Topology 
700 1 |a Sefer, E.  |e author 
773 |t BMC Bioinformatics