Log in to save to my catalogue

Fragment-Based Hit Discovery via Unsupervised Learning of Fragment-Protein Complexes

Fragment-Based Hit Discovery via Unsupervised Learning of Fragment-Protein Complexes

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_2739563509

Fragment-Based Hit Discovery via Unsupervised Learning of Fragment-Protein Complexes

About this item

Full title

Fragment-Based Hit Discovery via Unsupervised Learning of Fragment-Protein Complexes

Publisher

Cold Spring Harbor: Cold Spring Harbor Laboratory Press

Journal title

bioRxiv, 2022-11

Language

English

Formats

Publication information

Publisher

Cold Spring Harbor: Cold Spring Harbor Laboratory Press

More information

Scope and Contents

Contents

The process of finding molecules that bind to a target protein is a challenging first step in drug discovery. Crystallographic fragment screening is a strategy based on elucidating binding modes of small polar compounds and then building potency by expanding or merging them. Recent advances in high-throughput crystallography enable screening of large fragment libraries, reading out dense ensembles of fragments spanning the binding site. However, fragments typically have low affinity thus the road to potency is often long and fraught with false starts. Here, we take advantage of high-throughput crystallography to reframe fragment-based hit discovery as a denoising problem -- identifying significant pharmacophore distributions from a fragment ensemble amid noise due to weak binders -- and employ an unsupervised machine learning method to tackle this problem. Our method screens potential molecules by evaluating whether they recapitulate those fragment-derived pharmacophore distributions. We retrospectively validated our approach on an open science campaign against SARS-CoV-2 main protease (Mpro), showing that our method can distinguish active compounds from inactive ones using only structural data of fragment-protein complexes, without any activity data. Further, we prospectively found novel hits for Mpro and the Mac1 domain of SARS-CoV-2 non-structural protein 3. More broadly, our results demonstrate how unsupervised machine learning helps interpret high throughput crystallography data to rapidly discover of potent chemical modulators of protein function.Competing Interest StatementThe authors have declared no competing interest.Footnotes* https://github.com/wjm41/fresco...

Alternative Titles

Full title

Fragment-Based Hit Discovery via Unsupervised Learning of Fragment-Protein Complexes

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_proquest_journals_2739563509

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_journals_2739563509

Other Identifiers

E-ISSN

2692-8205

DOI

10.1101/2022.11.21.517375