Escape from 'availability bias' in compound design

Posted by
Gábor Imre
on 12 09 2018

Escape from 'availability bias' in compound design


Small molecule design is an information demanding activity, since all relevant knowledge is to be accessible within a single space and requires synchronized application of computational models to assist decision making on synthesis candidates. Our study aims to evaluate a software platform coping with this complexity (Design Hub, Marvin Live's successor). The tool provides central management of innovative ideas and helps triage them based on predicted properties and available knowledge collected from a variety of sources. The calculated properties span phys-chem descriptors, combined metrics like MPO score, 3D overlay and modelling results conducted with KNIME. Use cases of rapid freedom to operate analysis by ultra-fast searching (MadFast Similarity Search) of exemplified structures from patents (SureChEMBL, ~16M entries) and SAR by catalog via searching large set of synthesizable compounds (Enamine REAL DataBase, ~170M entries) real time will be shown to ensure that designers can seamlessly exploit the chemical space around their ideas. The presentation will walk through an example drug design cycle to obtain statistical results regarding performance as well as to demonstrate the suitability of the calculations.

Open poster in pdf


Small molecule design is an information demanding activity, since all relevant knowledge is to be accessible within a single space and requires synchronized application of computational models to assist decision making on synthesis candidates. Our study aims to evaluate a software platform coping with this complexity (Design Hub, Marvin Live's successor). The tool provides central management of innovative ideas and helps triage them based on predicted properties and available knowledge collected from a variety of sources. The calculated properties span phys-chem descriptors, combined metrics like MPO score, 3D overlay and modelling results conducted with KNIME. Use cases of rapid freedom to operate analysis by ultra-fast searching (MadFast Similarity Search) of exemplified structures from patents (SureChEMBL, ~16M entries) and SAR by catalog via searching large set of synthesizable compounds (Enamine REAL DataBase, ~170M entries) real time will be shown to ensure that designers can seamlessly exploit the chemical space around their ideas. The presentation will walk through an example drug design cycle to obtain statistical results regarding performance as well as to demonstrate the suitability of the calculations.

Open poster in pdf