Webinar: An Integrated Machine Learning Framework for Novel Small Molecule Drug Design
An Integrated Machine Learning Framework for Novel Small Molecule Drug Design
Dr. Jonathan E. Allen, Informatics Thrust Leader, Biosecurity Center at ATOM
Consortium
Wednesday July 15, 2020, 12:00 to 1:00 pm EDT
Register for free at https://www.rosaandco.com/webinars
Abstract:
The drug discovery process is costly, slow, and failure prone. It takes an
average of 5.5 years to get to the clinical testing stage, and in this time
millions of molecules are tested, thousands are made, and most fail. The ATOM
Consortium (atomscience.org), comprised of LLNL, GSK, Frederick National Lab,
and UCSF, is working to increase efficiencies in the drug discovery process
through improved integration of machine learning earlier in the drug design and
discovery process by evaluating multiple properties needed to make a viable
drug. A combination of safety, pharmacokinetic and efficacy properties are
considered simultaneously in the early drug design phase with an aim to
ultimately show that these molecules will have better success rates with
subsequent pre-clinical and clinical testing.
The purpose of this webinar will be to introduce key components of the ATOM
computational framework, highlight ongoing challenges and opportunities for
improvement. The presentation will begin with a description of AMPL, the open
source framework developed to build machine learning models that generate key
safety and pharmacokinetics parameters, used for molecule evaluation and as
input to anticipated Quantitative System Pharmacology and Toxicology models.
The end-to-end pipeline handles data curation, feature extraction, model
building, prediction generation, and data visualization.
Next, we'll describe how the best-performing models are integrated into an
active learning loop (with code in the process of being open sourced) to guide
the search for de novo compounds, with plans to integrate an in-house PBPK
model to predict in-vivo behavior. The active learning loop includes a
computational search through chemical space for candidate small molecules with
opportunities for proposed molecules to be evaluated experimentally for model
validation and re-training. Discussion of the active learning pipeline will
include an examination of the utility of machine learning model uncertainty
estimates needed to guide active learning and challenges in designing and
bounding the chemical search space. We will conclude with an examination of an
early test of one round of the active learning loop applied to the design of a
selective kinase inhibitor.