论文标题

使用机器学习和集合对接模拟对COVID-19的治疗剂进行筛查

Screening of Therapeutic Agents for COVID-19 using Machine Learning and Ensemble Docking Simulations

论文作者

Batra, Rohit, Chan, Henry, Kamath, Ganesh, Ramprasad, Rampi, Cherukara, Mathew J., Sankaranarayanan, Subramanian

论文摘要

全世界目睹了由新型冠状病毒SARS-COV-2引起的Covid-19疾病的前所未有的人类和经济损失。正在全球进行广泛的研究,以鉴定针对SARS-COV-2的治疗剂。在这里,我们通过将基于机器学习的模型(ML)模型和高保真合奏对接模拟使用强大而有效的计算策略来快速筛选可能的治疗分子(或配体)。我们的筛选是基于与宿主受体区域或sprotein-human ACE2接口复合物的孤立的SARS-COV-2 S-蛋白的结合亲和力,从而可能限制和/或破坏宿主病毒相互作用。我们首先将筛查策略应用于两个药物数据集(Cureffi和drugCentral),以识别数百种与上述两个系统强烈结合的配体。然后,通过所有原子对接模拟验证候选配体。随后使用了验证的ML模型来筛选大型生物分子数据集(有近一百万个条目),以提供约19,000个潜在有用化合物的排名列表以进行进一步验证。总体而言,这项工作不仅扩大了我们对COVID-19的小分子治疗的了解,而且还提供了一种有效的途径来通过将快速ML替代模型与昂贵的高效率模拟相结合,以加速疾病的治疗方法来进行高通量计算药物筛查。

The world has witnessed unprecedented human and economic loss from the COVID-19 disease, caused by the novel coronavirus SARS-CoV-2. Extensive research is being conducted across the globe to identify therapeutic agents against the SARS-CoV-2. Here, we use a powerful and efficient computational strategy by combining machine learning (ML) based models and high-fidelity ensemble docking simulations to enable rapid screening of possible therapeutic molecules (or ligands). Our screening is based on the binding affinity to either the isolated SARS-CoV-2 S-protein at its host receptor region or to the Sprotein-human ACE2 interface complex, thereby potentially limiting and/or disrupting the host-virus interactions. We first apply our screening strategy to two drug datasets (CureFFI and DrugCentral) to identify hundreds of ligands that bind strongly to the aforementioned two systems. Candidate ligands were then validated by all atom docking simulations. The validated ML models were subsequently used to screen a large bio-molecule dataset (with nearly a million entries) to provide a rank-ordered list of ~19,000 potentially useful compounds for further validation. Overall, this work not only expands our knowledge of small-molecule treatment against COVID-19, but also provides an efficient pathway to perform high-throughput computational drug screening by combining quick ML surrogate models with expensive high-fidelity simulations, for accelerating the therapeutic cure of diseases.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源