


5th ISF
COMP012 - Application of Machine Learning Technique Assisted Prediction of Bioactivities of Ligands in Targeted Drug Discovery Process of Lung Cancer for EGFR Target


Lung cancer is a chronic non-communicable disease and is the cancer with the world’s highest incidence in the 21st century. One of the leading mechanisms underlying the development of lung cancer in nonsmokers is an amplification of the epidermal growth factor receptor (EGFR) gene. However, laboratories employing conventional processes of drug discovery and development for such targets encounter several pain-points that are cost- and time-consuming. Moreover, high failure rates are caused by efficacy and safety problems during research and development. Therefore, it is imperative to develop improved methods for drug discovery. Herein, we developed a deep learning model with spatial graph embedding and molecular descriptors based on predicting pIC50 potency estimates of small molecules and classifying hit compounds against the human epidermal growth factor receptor (LigEGFR). The model was generated with a large-scale cell line-based dataset containing broad lists of chemical features. LigEGFR outperformed baseline machine learning models for predicting pIC50. Our model was notable for higher performance in hit compound classification, compared to molecular docking and machine learning approaches. The proposed predictive model provides a powerful strategy that potentially helps researchers overcome major challenges in drug discovery and development processes, leading to a reduction of failure to discover novel hit compounds. We provide an online prediction platform and the source code that are freely available at https://ligegfr.vistec.ist, and https://github.com/scads-biochem/LigEGFR, respectively.
Show MoreName :
Email :
Advisor :
School :
PROJECT QR CODE
