Workshop on Integrative modelling with HADDOCK in conjunction with Workshop on Novel Computational Methods for Structural Biology

Asia/Taipei
Room 1 (BHSS, Academia Sinica)

Room 1

BHSS, Academia Sinica

Alexandre M.J.J. Bonvin (Utrecht University) , Jung-Hsin LIN (Research Center for Applied Sciences, Academia Sinica)
Description

Workshop on Integrative modelling with HADDOCK

The prediction of the quaternary structure of biomolecular macromolecules is of paramount importance for fundamental understanding of cellular processes and drug design. In the era of integrative structural biology, one way of increasing the accuracy of modelling methods used to predict the structure of biomolecular complexes is to include as much experimental or predictive information as possible in the process. We have developed for this purpose a versatile information-driven docking approach HADDOCK (https://www.bonvinlab.org/software) available as a web service at https://wenmr.science.uu.nl/haddock2.4. HADDOCK can integrate information derived from biochemical, biophysical or bioinformatics methods to guide the modelling.

This workshop will consist of lectures and computer tutorials during which some of the recent HADDOCK developments will be discussed and demonstrated in hands-on computer tutorials. In particular we will introduce HADDOCK3, the new modular version of HADDOCK which represents a redesign of the HADDOCK2.X series, implementing new ways to interact with the HADDOCK sub-routines and offering more customization to the end user. Participants are encouraged to bring their own problems to get advice on the best modelling strategy.

The workshop will consist of lectures by Prof. Alexandre Bonvin and guided computer tutorials during which participants will get experience in using both the HADDOCK2.4 web service and the new modular HADDOCK3 version (command line). For these participants are required to bring their own laptops. Knowledge of Linux and command line is required.

 

Workshop on Novel Computational Methods for Structural Biology

In this special session three invited talks will be presented, which span from the novel mathematical approaches for dealing with the distance geometry problems for determining the protein structures, their new implementation on the GPU platform; and then a novel statistical mechanics-based enhanced sampling molecular dynamics simulation approach for evaluation of binding free energy; and finally to the quantum machine learning for structure-based computational drug discovery.

 

 

    • 1
      General introduction to docking, integrative modelling and HADDOCK
      Speaker: Alexandre M.J.J. Bonvin (Utrecht University)
    • 10:30 AM
      Coffee Break
    • 2
      Computer practical: Antibody-antigen docking with the HADDOCK server

      HADDOCK2.4 antibody-antigen docking tutorial: This tutorial demonstrates the use of HADDOCK2.4 for predicting the structure of an antibody-antigen complex using information about the hypervariable loops of the antibody and either the entire surface of the antigen or a loose definition of the epitope. This tutorial does not require any Linux expertise and only makes use of our web servers and PyMol for visualisation/analysis.
      https://www.bonvinlab.org/education/HADDOCK24/HADDOCK24-antibody-antigen/

      Speaker: Alexandre M.J.J. Bonvin (Utrecht University)
    • 12:30 PM
      Lunch
    • 3
      Lecture & practical: Introduction to HADDOCK3 and practical

      HADDOCK3 antibody-antigen docking: This tutorial demonstrates the use of HADDOCK3 for predicting the structure of an antibody-antigen complex using information about the hypervariable loops of the antibody and either the entire surface of the antigen or a loose definition of the epitope. It illustrate the modularity of HADDOCK3 by introducing a new workflow not possible under the current HADDOCK2.X versions. As HADDOCK3 only exists as a command line version, this tutorial does require some basic Linux expertise.
      https://www.bonvinlab.org/education/HADDOCK3/HADDOCK3-antibody-antigen/

      Speaker: Alexandre M.J.J. Bonvin (Utrecht University)
    • 3:30 PM
      Coffee Break
    • Workshop on Novel Computational Methods for Structural Biology
      Convener: Prof. Jung-Hsin LIN (Research Center for Applied Science, Academia Sinica)
      • 4
        Different Facets of Distance Geometry

        The Distance Geometry Problem (DGP) asks whether a simple weighted undirected graph G=(V,E,d) can be realized in the K-dimensional Euclidean space so that the distance constraints implied by the weights on the graph edges are satisfied. This problem was proven to be NP-hard in the context of graph embeddability, and has several applications. In this talk, we will focus on various currently ongoing works in this very rich research context: (1) We will talk about a particular class of DGP instances where, under particular assumptions, it is possible to represent the search space as a binary tree, and where in ideal situations, vertex positions can be assigned to each of its nodes. In real-life applications, however, the distances are generally provided with low precision, and they are actually likely to carry measurement errors, so that local continuous search spaces are actually assigned to each tree node. This gives rise to special combinatorial problems which are locally continuous. (2) We will review the main applications of the DGP, ranging from structural biology, passing through sensor network localization and adaptive maps, until the dynamical component is included in DGP instances for computer graphics applications. (3) Finally, we present an alternative computing approach for the solution of the DGPs in dimension 1, where an analog optical processor is employed for the computations, which is based on some properties of laser light beams.

        Speaker: Prof. Antonio MUCHERINO (Institut de Recherche en Informatique et Systèmes Aléatoires, University of Rennes 1, France)
      • 5
        A Curvilinear-Path Umbrella Sampling Approach to Characterizing Thermodynamics and Mechanisms of Biomolecular Interactions

        Protein-protein and protein-ligand interactions are central in biological mechanisms. These interactions can be classified into thermodynamics and mechanistic pathways. Estimating accurate and reliable interaction energetics along the thermodynamic pathway is one of the ongoing challenges in computational biophysics. Umbrella sampling simulation-based potential of mean force calculations is one of the methods to estimate the interaction energetics. Previously this method was implemented by first choosing a predefined path of dissociation, which is often chosen as a straight-line/vectorial path. However, there are several unresolved issues such as choices of predefined direction, corrections of potential of mean force to standard free energy of binding, etc. To unleash these limitations, we developed a curvilinear-path umbrella sampling molecular dynamics (MD) simulation approach to addressed some of the issues. We have applied the new method for evaluating the standard free energy of binding for the barnase-barstar protein-protein system and then on a protein-ligand system, where the interaction energetics of FKBP12-rapamycin protein-ligand system is estimated. The computed energetics for both systems are in good agreement with the experimental values. The revealed mechanistic insight for the protein-protein complex matches very-well with the computationally expensive adaptive biasing MD based brute-force methods. Further, we also conducted the simulations of dissociation reactions of ternary complex FKBP12-rapalog-FRB, which indeed demonstrated a tug-of-war between FRBP12 and FRB to bind with the rapamycin, and revealed that the rapamycin prefers to bind with FKBP12 more than FRB. Thus, the glue-like molecule rapamycin and other rapalogs seems to follow a step-wise path of forming FKBP12-rapalog complex first and then the ternary complex with FRB. Thus, the developed curvilinear-path approach offers accurate and reliable binding energetic, is sensitive enough to distinguish the change in interaction energetics upon mutations, and can reliably reveal mechanistic details towards the fulfillment of the characterization.

        Speaker: Dr Dhananjay JOSHI (Research Center for Applied Science, Academia Sinica)
      • 6
        Quantum Machine Learning for Structure-Based Virtual Screening of the Entire Medicinal Chemical Space

        It has been estimated based on the graph theory that there are at least 1060 organic molecules that are relevant for small-molecule drug discovery. Using machine learning to estimate the binding free energies for screening of large chemical libraries to search for the tightly binding inhibitors would take a considerable amount of computational resources, yet it is not possible to explore the entire biologically relevant chemical space. Quantum computing provides a unique opportunity to accomplish such a computational task in the near future. Here, we demonstrate how to use 512 occupancies to describe the structures of protein-ligand complexes, how to convert the classical occupancies to the quantum states using nine qubits, and to estimate the binding free energies (Gbind) of the complexes using quantum machine learning. We showed that it is possible to use only 450 parameters to prepare the quantum states for describing the structure of one protein-ligand complex. In this work the entire 2020 PDBbind dataset was adopted as the training set, and we used 45 parameters as the first attempt to construct the model for predicting the binding free energies (Gbind). The Pearson correlation coefficient (PCC) between the estimated binding free energies and the corresponding experimental values are 0.49. By slightly increasing to 1,440 parameters for constructing the neural network model for the prediction of the Gbind, the PCC is improved to be 0.78, which is even slightly better than to the results achieved by recent classical convolutional neural network models using more than millions of parameters. In this work, for the first time, we demonstrated the feasibility of using quantum computers to explore the entire medicinal chemical space with a concrete, implementable approach.

        Speaker: Prof. It has been estimated based on the graph theory that there are at least 1060 organic molecules that are relevant for small-molecule drug discovery. Using machine learning to estimate the binding free energies for screening of large chemical libraries to search for the tightly binding inhibitors would take a considerable amount of computational resources, yet it is not possible to explore the entire biologically relevant chemical space. Quantum computing provides a unique opportunity to accomplish such a computational task in the near future. Here, we demonstrate how to use 512 occupancies to describe the structures of protein-ligand complexes, how to convert the classical occupancies to the quantum states using nine qubits, and to estimate the binding free energies (Gbind) of the complexes using quantum machine learning. We showed that it is possible to use only 450 parameters to prepare the quantum states for describing the structure of one protein-ligand complex. In this work the entire 2020 PDBbind dataset was adopted as the training set, and we used 45 parameters as the first attempt to construct the model for predicting the binding free energies (Gbind). The Pearson correlation coefficient (PCC) between the estimated binding free energies and the corresponding experimental values are 0.49. By slightly increasing to 1,440 parameters for constructing the neural network model for the prediction of the Gbind, the PCC is improved to be 0.78, which is even slightly better than to the results achieved by recent classical convolutional neural network models using more than millions of parameters. In this work, for the first time, we demonstrated the feasibility of using quantum computers to explore the entire medicinal chemical space with a concrete, implementable approach. LIN (Research Center for Applied Science, Academia Sinica)