Capturing mixture composition: an open machine-readable format for representing mixed substances

Abstract We describe a file format that is designed to represent mixtures of compounds in a way that is fully machine readable. This Mixfile format is intended to fill the same role for substances that are composed of multiple components as the venerable Molfile does for specifying individual struct...

Full description

Bibliographic Details
Main Authors: Alex M. Clark, Leah R. McEwen, Peter Gedeck, Barry A. Bunin
Format: Article
Language:English
Published: BMC 2019-05-01
Series:Journal of Cheminformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13321-019-0357-4
id doaj-fa3fbe46800940458ad76705a1e810ee
record_format Article
spelling doaj-fa3fbe46800940458ad76705a1e810ee2020-11-25T03:21:55ZengBMCJournal of Cheminformatics1758-29462019-05-0111111710.1186/s13321-019-0357-4Capturing mixture composition: an open machine-readable format for representing mixed substancesAlex M. Clark0Leah R. McEwen1Peter Gedeck2Barry A. Bunin3Collaborative Drug DiscoveryCornell UniversityCollaborative Drug DiscoveryCollaborative Drug DiscoveryAbstract We describe a file format that is designed to represent mixtures of compounds in a way that is fully machine readable. This Mixfile format is intended to fill the same role for substances that are composed of multiple components as the venerable Molfile does for specifying individual structures. This much needed datastructure is intended to replace current practices for communicating information about mixtures, which usually relies on human-readable text descriptions, drawing several species within a single molecular diagram, or mutually incompatible ad hoc solutions. We describe an open source software application for editing mixture files, which can also be used as web-ready tools for manipulating the file format. We also present a corpus of mixture examples, which we have extracted from collections of text-based descriptions. Furthermore, we present an early look at the proposed IUPAC Mixtures InChI specification, instances of which can be automatically generated using the Mixfile format as a precursor.http://link.springer.com/article/10.1186/s13321-019-0357-4MixtureMixfileMolfileInChIMInChI
collection DOAJ
language English
format Article
sources DOAJ
author Alex M. Clark
Leah R. McEwen
Peter Gedeck
Barry A. Bunin
spellingShingle Alex M. Clark
Leah R. McEwen
Peter Gedeck
Barry A. Bunin
Capturing mixture composition: an open machine-readable format for representing mixed substances
Journal of Cheminformatics
Mixture
Mixfile
Molfile
InChI
MInChI
author_facet Alex M. Clark
Leah R. McEwen
Peter Gedeck
Barry A. Bunin
author_sort Alex M. Clark
title Capturing mixture composition: an open machine-readable format for representing mixed substances
title_short Capturing mixture composition: an open machine-readable format for representing mixed substances
title_full Capturing mixture composition: an open machine-readable format for representing mixed substances
title_fullStr Capturing mixture composition: an open machine-readable format for representing mixed substances
title_full_unstemmed Capturing mixture composition: an open machine-readable format for representing mixed substances
title_sort capturing mixture composition: an open machine-readable format for representing mixed substances
publisher BMC
series Journal of Cheminformatics
issn 1758-2946
publishDate 2019-05-01
description Abstract We describe a file format that is designed to represent mixtures of compounds in a way that is fully machine readable. This Mixfile format is intended to fill the same role for substances that are composed of multiple components as the venerable Molfile does for specifying individual structures. This much needed datastructure is intended to replace current practices for communicating information about mixtures, which usually relies on human-readable text descriptions, drawing several species within a single molecular diagram, or mutually incompatible ad hoc solutions. We describe an open source software application for editing mixture files, which can also be used as web-ready tools for manipulating the file format. We also present a corpus of mixture examples, which we have extracted from collections of text-based descriptions. Furthermore, we present an early look at the proposed IUPAC Mixtures InChI specification, instances of which can be automatically generated using the Mixfile format as a precursor.
topic Mixture
Mixfile
Molfile
InChI
MInChI
url http://link.springer.com/article/10.1186/s13321-019-0357-4
work_keys_str_mv AT alexmclark capturingmixturecompositionanopenmachinereadableformatforrepresentingmixedsubstances
AT leahrmcewen capturingmixturecompositionanopenmachinereadableformatforrepresentingmixedsubstances
AT petergedeck capturingmixturecompositionanopenmachinereadableformatforrepresentingmixedsubstances
AT barryabunin capturingmixturecompositionanopenmachinereadableformatforrepresentingmixedsubstances
_version_ 1724612416142049280