Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved]

For the generation of contemporary databases of bioactive compounds, activity information is usually extracted from the scientific literature. However, when activity data are analyzed, source publications are typically no longer taken into consideration. Therefore, compound activity data selected fr...

Full description

Bibliographic Details
Main Authors: Ye Hu, Jürgen Bajorath
Format: Article
Language:English
Published: F1000 Research Ltd 2016-06-01
Series:F1000Research
Subjects:
Online Access:http://f1000research.com/articles/5-1227/v1
id doaj-d5ae209b24b54cf2ba551f33c729c8e1
record_format Article
spelling doaj-d5ae209b24b54cf2ba551f33c729c8e12020-11-25T03:52:05ZengF1000 Research LtdF1000Research2046-14022016-06-01510.12688/f1000research.8792.19463Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved]Ye Hu0Jürgen Bajorath1Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Bonn, D-53113, GermanyDepartment of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Bonn, D-53113, GermanyFor the generation of contemporary databases of bioactive compounds, activity information is usually extracted from the scientific literature. However, when activity data are analyzed, source publications are typically no longer taken into consideration. Therefore, compound activity data selected from ChEMBL were traced back to thousands of original publications, activity records including compound, assay, and target information were systematically generated, and their distributions across the literature were determined. In addition, publications were categorized on the basis of activity records. Furthermore, compound promiscuity, defined as the ability of small molecules to specifically interact with multiple target proteins, was analyzed in light of publication statistics, thus adding another layer of information to promiscuity assessment. It was shown that the degree of compound promiscuity was not influenced by increasing numbers of source publications. Rather, most non-promiscuous as well as promiscuous compounds, regardless of their degree of promiscuity, originated from single publications, which emerged as a characteristic feature of the medicinal chemistry literature.http://f1000research.com/articles/5-1227/v1BioinformaticsBiomacromolecule-Ligand InteractionsMacromolecular Chemistry
collection DOAJ
language English
format Article
sources DOAJ
author Ye Hu
Jürgen Bajorath
spellingShingle Ye Hu
Jürgen Bajorath
Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved]
F1000Research
Bioinformatics
Biomacromolecule-Ligand Interactions
Macromolecular Chemistry
author_facet Ye Hu
Jürgen Bajorath
author_sort Ye Hu
title Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved]
title_short Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved]
title_full Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved]
title_fullStr Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved]
title_full_unstemmed Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved]
title_sort analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved]
publisher F1000 Research Ltd
series F1000Research
issn 2046-1402
publishDate 2016-06-01
description For the generation of contemporary databases of bioactive compounds, activity information is usually extracted from the scientific literature. However, when activity data are analyzed, source publications are typically no longer taken into consideration. Therefore, compound activity data selected from ChEMBL were traced back to thousands of original publications, activity records including compound, assay, and target information were systematically generated, and their distributions across the literature were determined. In addition, publications were categorized on the basis of activity records. Furthermore, compound promiscuity, defined as the ability of small molecules to specifically interact with multiple target proteins, was analyzed in light of publication statistics, thus adding another layer of information to promiscuity assessment. It was shown that the degree of compound promiscuity was not influenced by increasing numbers of source publications. Rather, most non-promiscuous as well as promiscuous compounds, regardless of their degree of promiscuity, originated from single publications, which emerged as a characteristic feature of the medicinal chemistry literature.
topic Bioinformatics
Biomacromolecule-Ligand Interactions
Macromolecular Chemistry
url http://f1000research.com/articles/5-1227/v1
work_keys_str_mv AT yehu analyzingcompoundactivityrecordsandpromiscuitydegreesinlightofpublicationstatisticsversion1referees2approved
AT jurgenbajorath analyzingcompoundactivityrecordsandpromiscuitydegreesinlightofpublicationstatisticsversion1referees2approved
_version_ 1724484339374227456