The Development of a Combined Search for a Heterogeneous Chemistry Database

A combined search, which joins a slow molecule structure search with a fast compound property search, results in more accurate search results and has been applied in several chemistry databases. However, the problems of search speed differences and combining the two separate search results are two m...

Bibliographic Details
Main Authors: Lulu Jiang, Yuehong Zhao, Bojun Xu, Hao Wen
Format: Article
Language: English
Published: Ubiquity Press 2015-05-01
Series: Data Science Journal
Online Access: http://datascience.codata.org/articles/552
id doaj-79af6d60b8b847aeacaea0cc995325e3
record_format Article
spelling doaj-79af6d60b8b847aeacaea0cc995325e3
2020-11-24T23:48:49Z
eng
Ubiquity Press
Data Science Journal
1683-1470
2015-05-01
14
10.5334/dsj-2015-003569
The Development of a Combined Search for a Heterogeneous Chemistry Database
Lulu Jiang [0]
Yuehong Zhao [1]
Bojun Xu [2]
Hao Wen [3]
[0] State Key Laboratory of Multiphase Complex Systems, Institute of Process Engineering, Chinese Academy of Sciences, Beijing 100190; University of the Chinese Academy of Science, Beijing 100049
[1] State Key Laboratory of Multiphase Complex Systems, Institute of Process Engineering, Chinese Academy of Sciences, Beijing 100190
[2] State Key Laboratory of Multiphase Complex Systems, Institute of Process Engineering, Chinese Academy of Sciences, Beijing 100190
[3] State Key Laboratory of Multiphase Complex Systems, Institute of Process Engineering, Chinese Academy of Sciences, Beijing 100190
A combined search, which joins a slow molecule structure search with a fast compound property search, results in more accurate search results and has been applied in several chemistry databases. However, the problems of search speed differences and combining the two separate search results are two major challenges. In this paper, two kinds of search strategies, synchronous search and asynchronous search, are proposed to solve these problems in the heterogeneous structure database and the property database found in ChemDB, a chemistry database owned by the Institute of Process Engineering, CAS. Their advantages and disadvantages under different conditions are discussed in detail. Furthermore, we applied these two searches to ChemDB and used them to screen for potential molecules that can work as CO2 absorbents. The results reveal that this combined search discovers reasonable target molecules within an acceptable time frame.
http://datascience.codata.org/articles/552
collection DOAJ
language English
format Article
sources DOAJ
author Lulu Jiang
Yuehong Zhao
Bojun Xu
Hao Wen
spellingShingle Lulu Jiang
Yuehong Zhao
Bojun Xu
Hao Wen
The Development of a Combined Search for a Heterogeneous Chemistry Database
Data Science Journal
author_facet Lulu Jiang
Yuehong Zhao
Bojun Xu
Hao Wen
author_sort Lulu Jiang
title The Development of a Combined Search for a Heterogeneous Chemistry Database
title_short The Development of a Combined Search for a Heterogeneous Chemistry Database
title_full The Development of a Combined Search for a Heterogeneous Chemistry Database
title_fullStr The Development of a Combined Search for a Heterogeneous Chemistry Database
title_full_unstemmed The Development of a Combined Search for a Heterogeneous Chemistry Database
title_sort development of a combined search for a heterogeneous chemistry database
publisher Ubiquity Press
series Data Science Journal
issn 1683-1470
publishDate 2015-05-01
description A combined search, which joins a slow molecule structure search with a fast compound property search, results in more accurate search results and has been applied in several chemistry databases. However, the problems of search speed differences and combining the two separate search results are two major challenges. In this paper, two kinds of search strategies, synchronous search and asynchronous search, are proposed to solve these problems in the heterogeneous structure database and the property database found in ChemDB, a chemistry database owned by the Institute of Process Engineering, CAS. Their advantages and disadvantages under different conditions are discussed in detail. Furthermore, we applied these two searches to ChemDB and used them to screen for potential molecules that can work as CO2 absorbents. The results reveal that this combined search discovers reasonable target molecules within an acceptable time frame.
url http://datascience.codata.org/articles/552