Abstract

To plan the syntheses of small organic molecules, chemists use retrosynthesis, a problem-solving technique in which target molecules are recursively transformed into increasingly simpler precursors. Computer-aided retrosynthesis would be a valuable tool but at present it is slow and provides results of unsatisfactory quality. Here we use Monte Carlo tree search and symbolic artificial intelligence (AI) to discover retrosynthetic routes. We combined Monte Carlo tree search with an expansion policy network that guides the search, and a filter network to pre-select the most promising retrosynthetic steps. These deep neural networks were trained on essentially all reactions ever published in organic chemistry. Our system solves for almost twice as many molecules, thirty times faster than the traditional computer-aided search method, which is based on extracted rules and hand-designed heuristics. In a double-blind AB test, chemists on average considered our computer-generated routes to be equivalent to reported literature routes.

Keywords

Artificial neural networkArtificial intelligenceCognitive scienceComputer sciencePsychology

MeSH Terms

Artificial IntelligenceChemistry TechniquesSyntheticChemistryOrganicMonte Carlo MethodNeural NetworksComputer

Affiliated Institutions

Related Publications

Network In Network

Abstract: We propose a novel deep network structure called In Network (NIN) to enhance model discriminability for local patches within the receptive field. The conventional con...

2014 arXiv (Cornell University) 1037 citations

Publication Info

Year
2018
Type
article
Volume
555
Issue
7698
Pages
604-610
Citations
1780
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1780
OpenAlex
39
Influential

Cite This

Marwin Segler, Mike Preuß, Mark P. Waller (2018). Planning chemical syntheses with deep neural networks and symbolic AI. Nature , 555 (7698) , 604-610. https://doi.org/10.1038/nature25978

Identifiers

DOI
10.1038/nature25978
PMID
29595767
arXiv
1708.04202

Data Quality

Data completeness: 88%