LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes

Animal genes of different lineages, such as vertebrates and arthropods, are well-organized and blended into dynamic chromosomal structures that represent a primary regulatory mechanism for body development and cellular differentiation. The majority of genes in a genome are actually clustered, which...

Full description

Bibliographic Details
Main Authors: Dapeng Wang, Yubin Zhang, Zhonghua Fan, Guiming Liu, Jun Yu
Format: Article
Language:English
Published: SAGE Publishing 2012-01-01
Series:Evolutionary Bioinformatics
Online Access:https://doi.org/10.4137/EBO.S8540
id doaj-84b62a04a0844ea7a9f248871f83fab1
record_format Article
spelling doaj-84b62a04a0844ea7a9f248871f83fab12020-11-25T03:12:30ZengSAGE PublishingEvolutionary Bioinformatics1176-93432012-01-01810.4137/EBO.S8540LCGbase: A Comprehensive Database for Lineage-Based Co-regulated GenesDapeng Wang0Yubin Zhang1Zhonghua Fan2Guiming Liu3Jun Yu4These authors contributed equally to this work.These authors contributed equally to this work.These authors contributed equally to this work.CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100029, PR China.Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, PR China.Animal genes of different lineages, such as vertebrates and arthropods, are well-organized and blended into dynamic chromosomal structures that represent a primary regulatory mechanism for body development and cellular differentiation. The majority of genes in a genome are actually clustered, which are evolutionarily stable to different extents and biologically meaningful when evaluated among genomes within and across lineages. Until now, many questions concerning gene organization, such as what is the minimal number of genes in a cluster and what is the driving force leading to gene co-regulation, remain to be addressed. Here, we provide a user-friendly database—LCGbase (a comprehensive database for lineage-based co-regulated genes)—hosting information on evolutionary dynamics of gene clustering and ordering within animal kingdoms in two different lineages: vertebrates and arthropods. The database is constructed on a web-based Linux-Apache-MySQL-PHP framework and effective interactive user-inquiry service. Compared to other gene annotation databases with similar purposes, our database has three comprehensible advantages. First, our database is inclusive, including all high-quality genome assemblies of vertebrates and representative arthropod species. Second, it is human-centric since we map all gene clusters from other genomes in an order of lineage-ranks (such as primates, mammals, warmblooded, and reptiles) onto human genome and start the database from well-defined gene pairs (a minimal cluster where the two adjacent genes are oriented as co-directional, convergent, and divergent pairs) to large gene clusters. Furthermore, users can search for any adjacent genes and their detailed annotations. Third, the database provides flexible parameter definitions, such as the distance of transcription start sites between two adjacent genes, which is extendable to genes that flanking the cluster across species. We also provide useful tools for sequence alignment, gene ontology (GO) annotation, promoter identification, gene expression (co-expression), and evolutionary analysis. This database not only provides a way to define lineage-specific and species-specific gene clusters but also facilitates future studies on gene co-regulation, epigenetic control of gene expression (DNA methylation and histone marks), and chromosomal structures in a context of gene clusters and species evolution. LCGbase is freely available at http://lcgbase.big.ac.cn/LCGbase .https://doi.org/10.4137/EBO.S8540
collection DOAJ
language English
format Article
sources DOAJ
author Dapeng Wang
Yubin Zhang
Zhonghua Fan
Guiming Liu
Jun Yu
spellingShingle Dapeng Wang
Yubin Zhang
Zhonghua Fan
Guiming Liu
Jun Yu
LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes
Evolutionary Bioinformatics
author_facet Dapeng Wang
Yubin Zhang
Zhonghua Fan
Guiming Liu
Jun Yu
author_sort Dapeng Wang
title LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes
title_short LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes
title_full LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes
title_fullStr LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes
title_full_unstemmed LCGbase: A Comprehensive Database for Lineage-Based Co-regulated Genes
title_sort lcgbase: a comprehensive database for lineage-based co-regulated genes
publisher SAGE Publishing
series Evolutionary Bioinformatics
issn 1176-9343
publishDate 2012-01-01
description Animal genes of different lineages, such as vertebrates and arthropods, are well-organized and blended into dynamic chromosomal structures that represent a primary regulatory mechanism for body development and cellular differentiation. The majority of genes in a genome are actually clustered, which are evolutionarily stable to different extents and biologically meaningful when evaluated among genomes within and across lineages. Until now, many questions concerning gene organization, such as what is the minimal number of genes in a cluster and what is the driving force leading to gene co-regulation, remain to be addressed. Here, we provide a user-friendly database—LCGbase (a comprehensive database for lineage-based co-regulated genes)—hosting information on evolutionary dynamics of gene clustering and ordering within animal kingdoms in two different lineages: vertebrates and arthropods. The database is constructed on a web-based Linux-Apache-MySQL-PHP framework and effective interactive user-inquiry service. Compared to other gene annotation databases with similar purposes, our database has three comprehensible advantages. First, our database is inclusive, including all high-quality genome assemblies of vertebrates and representative arthropod species. Second, it is human-centric since we map all gene clusters from other genomes in an order of lineage-ranks (such as primates, mammals, warmblooded, and reptiles) onto human genome and start the database from well-defined gene pairs (a minimal cluster where the two adjacent genes are oriented as co-directional, convergent, and divergent pairs) to large gene clusters. Furthermore, users can search for any adjacent genes and their detailed annotations. Third, the database provides flexible parameter definitions, such as the distance of transcription start sites between two adjacent genes, which is extendable to genes that flanking the cluster across species. We also provide useful tools for sequence alignment, gene ontology (GO) annotation, promoter identification, gene expression (co-expression), and evolutionary analysis. This database not only provides a way to define lineage-specific and species-specific gene clusters but also facilitates future studies on gene co-regulation, epigenetic control of gene expression (DNA methylation and histone marks), and chromosomal structures in a context of gene clusters and species evolution. LCGbase is freely available at http://lcgbase.big.ac.cn/LCGbase .
url https://doi.org/10.4137/EBO.S8540
work_keys_str_mv AT dapengwang lcgbaseacomprehensivedatabaseforlineagebasedcoregulatedgenes
AT yubinzhang lcgbaseacomprehensivedatabaseforlineagebasedcoregulatedgenes
AT zhonghuafan lcgbaseacomprehensivedatabaseforlineagebasedcoregulatedgenes
AT guimingliu lcgbaseacomprehensivedatabaseforlineagebasedcoregulatedgenes
AT junyu lcgbaseacomprehensivedatabaseforlineagebasedcoregulatedgenes
_version_ 1724650160963715072