Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved]
For the generation of contemporary databases of bioactive compounds, activity information is usually extracted from the scientific literature. However, when activity data are analyzed, source publications are typically no longer taken into consideration. Therefore, compound activity data selected fr...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
F1000 Research Ltd
2016-06-01
|
Series: | F1000Research |
Subjects: | |
Online Access: | http://f1000research.com/articles/5-1227/v1 |
id |
doaj-d5ae209b24b54cf2ba551f33c729c8e1 |
---|---|
record_format |
Article |
spelling |
doaj-d5ae209b24b54cf2ba551f33c729c8e12020-11-25T03:52:05ZengF1000 Research LtdF1000Research2046-14022016-06-01510.12688/f1000research.8792.19463Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved]Ye Hu0Jürgen Bajorath1Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Bonn, D-53113, GermanyDepartment of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Bonn, D-53113, GermanyFor the generation of contemporary databases of bioactive compounds, activity information is usually extracted from the scientific literature. However, when activity data are analyzed, source publications are typically no longer taken into consideration. Therefore, compound activity data selected from ChEMBL were traced back to thousands of original publications, activity records including compound, assay, and target information were systematically generated, and their distributions across the literature were determined. In addition, publications were categorized on the basis of activity records. Furthermore, compound promiscuity, defined as the ability of small molecules to specifically interact with multiple target proteins, was analyzed in light of publication statistics, thus adding another layer of information to promiscuity assessment. It was shown that the degree of compound promiscuity was not influenced by increasing numbers of source publications. Rather, most non-promiscuous as well as promiscuous compounds, regardless of their degree of promiscuity, originated from single publications, which emerged as a characteristic feature of the medicinal chemistry literature.http://f1000research.com/articles/5-1227/v1BioinformaticsBiomacromolecule-Ligand InteractionsMacromolecular Chemistry |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ye Hu Jürgen Bajorath |
spellingShingle |
Ye Hu Jürgen Bajorath Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved] F1000Research Bioinformatics Biomacromolecule-Ligand Interactions Macromolecular Chemistry |
author_facet |
Ye Hu Jürgen Bajorath |
author_sort |
Ye Hu |
title |
Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved] |
title_short |
Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved] |
title_full |
Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved] |
title_fullStr |
Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved] |
title_full_unstemmed |
Analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved] |
title_sort |
analyzing compound activity records and promiscuity degrees in light of publication statistics [version 1; referees: 2 approved] |
publisher |
F1000 Research Ltd |
series |
F1000Research |
issn |
2046-1402 |
publishDate |
2016-06-01 |
description |
For the generation of contemporary databases of bioactive compounds, activity information is usually extracted from the scientific literature. However, when activity data are analyzed, source publications are typically no longer taken into consideration. Therefore, compound activity data selected from ChEMBL were traced back to thousands of original publications, activity records including compound, assay, and target information were systematically generated, and their distributions across the literature were determined. In addition, publications were categorized on the basis of activity records. Furthermore, compound promiscuity, defined as the ability of small molecules to specifically interact with multiple target proteins, was analyzed in light of publication statistics, thus adding another layer of information to promiscuity assessment. It was shown that the degree of compound promiscuity was not influenced by increasing numbers of source publications. Rather, most non-promiscuous as well as promiscuous compounds, regardless of their degree of promiscuity, originated from single publications, which emerged as a characteristic feature of the medicinal chemistry literature. |
topic |
Bioinformatics Biomacromolecule-Ligand Interactions Macromolecular Chemistry |
url |
http://f1000research.com/articles/5-1227/v1 |
work_keys_str_mv |
AT yehu analyzingcompoundactivityrecordsandpromiscuitydegreesinlightofpublicationstatisticsversion1referees2approved AT jurgenbajorath analyzingcompoundactivityrecordsandpromiscuitydegreesinlightofpublicationstatisticsversion1referees2approved |
_version_ |
1724484339374227456 |