Applications of Deep Neural Networks to Protein Structure Prediction

Applications of Deep Neural Networks to Protein Structure Prediction PDF Author: Chao Fang (Computer scientist)
Publisher:
ISBN:
Category :
Languages : en
Pages : 132

Get Book Here

Book Description
Protein secondary structure, backbone torsion angle and other secondary structure features can provide useful information for protein 3D structure prediction and protein functions. Deep learning offers a new opportunity to significantly improve prediction accuracy. In this dissertation, several new deep neural network architectures are proposed for protein secondary structure prediction: deep inception-inside-inception (Deep3I) networks and deep neighbor residual (DeepNRN) networks for secondary structure prediction; deep residual inception networks (DeepRIN) for backbone torsion angle prediction; deep dense inception networks (DeepDIN) for beta turn prediction; deep inception capsule networks (DeepICN) for gamma turn prediction. Every tool was then implemented as a standalone tool integrated into MUFold package and freely available to research community. A webserver called MUFold-SS-Angle is also developed for protein property prediction. The input feature to those deep neural networks is a carefully designed feature matrix corresponding to the primary amino acid sequence of a protein, which consists of a rich set of information derived from individual amino acid, as well as the context of the protein sequence. Specifically, the feature matrix is a composition of physio-chemical properties of amino acids, PSI-BLAST profile, HHBlits profile and/or predicted shape string. The deep architecture enables effective processing of local and global interactions between amino acids in making accurate prediction. In extensive experiments on multiple datasets, the proposed deep neural architectures outperformed the best existing methods and other deep neural networks significantly: The proposed DeepNRN achieved highest Q8 75.33, 72.9, 70.8 on CASP 10, 11, 12 higher than previous state-of-the-art DeepCNF-SS with 71.8, 72.3, and 69.76. The proposed MUFold-SS (Deep3I) achieved highest Q8 76.47, 74.51, 72.1 on CASP 10, 11, 12. Compared to the recently released state-of-the-art tool, SPIDER3, DeepRIN reduced the Psi angle prediction error by more than 5 degrees and the Phi angle prediction error by more than 2 degrees on average. DeepDIN outperformed significantly BetaTPred3 in both two-class and nine-class beta turn prediction on benchmark BT426 and BT6376. DeepICN is the first application of using capsule network to biological sequence analysis and outperformed all previous gamma-turn predictors on benchmark GT320.

Applications of Deep Neural Networks to Protein Structure Prediction

Applications of Deep Neural Networks to Protein Structure Prediction PDF Author: Chao Fang (Computer scientist)
Publisher:
ISBN:
Category :
Languages : en
Pages : 132

Get Book Here

Book Description
Protein secondary structure, backbone torsion angle and other secondary structure features can provide useful information for protein 3D structure prediction and protein functions. Deep learning offers a new opportunity to significantly improve prediction accuracy. In this dissertation, several new deep neural network architectures are proposed for protein secondary structure prediction: deep inception-inside-inception (Deep3I) networks and deep neighbor residual (DeepNRN) networks for secondary structure prediction; deep residual inception networks (DeepRIN) for backbone torsion angle prediction; deep dense inception networks (DeepDIN) for beta turn prediction; deep inception capsule networks (DeepICN) for gamma turn prediction. Every tool was then implemented as a standalone tool integrated into MUFold package and freely available to research community. A webserver called MUFold-SS-Angle is also developed for protein property prediction. The input feature to those deep neural networks is a carefully designed feature matrix corresponding to the primary amino acid sequence of a protein, which consists of a rich set of information derived from individual amino acid, as well as the context of the protein sequence. Specifically, the feature matrix is a composition of physio-chemical properties of amino acids, PSI-BLAST profile, HHBlits profile and/or predicted shape string. The deep architecture enables effective processing of local and global interactions between amino acids in making accurate prediction. In extensive experiments on multiple datasets, the proposed deep neural architectures outperformed the best existing methods and other deep neural networks significantly: The proposed DeepNRN achieved highest Q8 75.33, 72.9, 70.8 on CASP 10, 11, 12 higher than previous state-of-the-art DeepCNF-SS with 71.8, 72.3, and 69.76. The proposed MUFold-SS (Deep3I) achieved highest Q8 76.47, 74.51, 72.1 on CASP 10, 11, 12. Compared to the recently released state-of-the-art tool, SPIDER3, DeepRIN reduced the Psi angle prediction error by more than 5 degrees and the Phi angle prediction error by more than 2 degrees on average. DeepDIN outperformed significantly BetaTPred3 in both two-class and nine-class beta turn prediction on benchmark BT426 and BT6376. DeepICN is the first application of using capsule network to biological sequence analysis and outperformed all previous gamma-turn predictors on benchmark GT320.

Machine Learning Meets Quantum Physics

Machine Learning Meets Quantum Physics PDF Author: Kristof T. Schütt
Publisher: Springer Nature
ISBN: 3030402452
Category : Science
Languages : en
Pages : 473

Get Book Here

Book Description
Designing molecules and materials with desired properties is an important prerequisite for advancing technology in our modern societies. This requires both the ability to calculate accurate microscopic properties, such as energies, forces and electrostatic multipoles of specific configurations, as well as efficient sampling of potential energy surfaces to obtain corresponding macroscopic properties. Tools that can provide this are accurate first-principles calculations rooted in quantum mechanics, and statistical mechanics, respectively. Unfortunately, they come at a high computational cost that prohibits calculations for large systems and long time-scales, thus presenting a severe bottleneck both for searching the vast chemical compound space and the stupendously many dynamical configurations that a molecule can assume. To overcome this challenge, recently there have been increased efforts to accelerate quantum simulations with machine learning (ML). This emerging interdisciplinary community encompasses chemists, material scientists, physicists, mathematicians and computer scientists, joining forces to contribute to the exciting hot topic of progressing machine learning and AI for molecules and materials. The book that has emerged from a series of workshops provides a snapshot of this rapidly developing field. It contains tutorial material explaining the relevant foundations needed in chemistry, physics as well as machine learning to give an easy starting point for interested readers. In addition, a number of research papers defining the current state-of-the-art are included. The book has five parts (Fundamentals, Incorporating Prior Knowledge, Deep Learning of Atomistic Representations, Atomistic Simulations and Discovery and Design), each prefaced by editorial commentary that puts the respective parts into a broader scientific context.

The Science Behind AlphaFold

The Science Behind AlphaFold PDF Author: StoryBuddiesPlay
Publisher: StoryBuddiesPlay
ISBN:
Category : Computers
Languages : en
Pages : 58

Get Book Here

Book Description
AlphaFold, a groundbreaking AI system, has cracked the code on protein structure prediction, a challenge that baffled scientists for decades. This book explores the science behind AlphaFold, delving into deep learning, big data, and the inner workings of this remarkable program. Uncover how AlphaFold is revolutionizing protein science, with the potential to accelerate drug discovery, personalize medicine, and design innovative materials. This comprehensive guide explores: The significance of protein structures and the challenges of prediction How AlphaFold leverages deep learning and vast data resources The process of protein structure prediction with AlphaFold, including its strengths and limitations The ethical considerations surrounding AI in protein science The exciting future applications of AlphaFold in various scientific fields Whether you're a scientist, student, or simply curious about the future of biology, this book provides a clear and engaging exploration of AlphaFold and its transformative impact on protein science.

Prediction of Protein Secondary Structure

Prediction of Protein Secondary Structure PDF Author: Yaoqi Zhou
Publisher: Humana
ISBN: 9781493964048
Category : Science
Languages : en
Pages : 0

Get Book Here

Book Description
This thorough volume explores predicting one-dimensional functional properties, functional sites in particular, from protein sequences, an area which is getting more and more attention. Beginning with secondary structure prediction based on sequence only, the book continues by exploring secondary structure prediction based on evolution information, prediction of solvent accessible surface areas and backbone torsion angles, model building, global structural properties, functional properties, as well as visualizing interior and protruding regions in proteins. Written for the highly successful Methods in Molecular Biology series, the chapters include the kind of detail and implementation advice to ensure success in the laboratory. Practical and authoritative, Prediction of Protein Secondary Structure serves as a vital guide to numerous state-of-the-art techniques that are useful for computational and experimental biologists.

Protein Structure Accuracy Prediction with Deep Learning and Its Application to Structure Prediction and Design

Protein Structure Accuracy Prediction with Deep Learning and Its Application to Structure Prediction and Design PDF Author: Naozumi Hiranuma
Publisher:
ISBN:
Category :
Languages : en
Pages : 0

Get Book Here

Book Description
Understanding the rules of protein structure folding has always been one of the central goals in computational biology. Deep learning is gaining popularity in protein machine learning due to its ability to learn complex functions on large amounts of protein geometry data. To help understand the rules of protein folding better, we developed neural networks (DeepAccNet and Pluto) that estimate the error in protein models. In other words, these networks estimate how much a computationally modeled protein structure deviates from its experimentally determined conformation. Approximately two million conformations from 21000 protein sequences located at different local energy minima with a large diversity of errors were sampled and used for training. The network uses 3D convolutions to evaluate local atomic environments followed by 2D convolutions to provide their global contexts and outperforms other methods that similarly predict the accuracy of protein structure models. Overall accuracy predictions for X-ray and cryoEM structures in the PDB correlate with their resolution. The network should be broadly helpful in assessing the accuracy of both predicted structure models and experimentally determined structures and identifying specific regions likely to be in error. The DeepAccNet methods were selected as top-performing methods for the estimation of model accuracy (EMA) category in CASP14. We extended the accuracy prediction models for proteins to more general chemistry by training graph neural networks on a wide variety of protein and non-protein datasets. We showed that the resulting framework (GAAP) successfully estimates the accuracy of non-protein molecules, such as peptides and Protein-DNA complexes. Our results illustrate how deep learning can impact the efficiency and accuracy of large-scale simulations for both modeling and designing of molecules.

Applications of Machine Learning and Deep Learning on Biological Data

Applications of Machine Learning and Deep Learning on Biological Data PDF Author: Faheem Masoodi
Publisher: CRC Press
ISBN: 1000833798
Category : Computers
Languages : en
Pages : 233

Get Book Here

Book Description
The automated learning of machines characterizes machine learning (ML). It focuses on making data-driven predictions using programmed algorithms. ML has several applications, including bioinformatics, which is a discipline of study and practice that deals with applying computational derivations to obtain biological data. It involves the collection, retrieval, storage, manipulation, and modeling of data for analysis or prediction made using customized software. Previously, comprehensive programming of bioinformatical algorithms was an extremely laborious task for such applications as predicting protein structures. Now, algorithms using ML and deep learning (DL) have increased the speed and efficacy of programming such algorithms. Applications of Machine Learning and Deep Learning on Biological Data is an examination of applying ML and DL to such areas as proteomics, genomics, microarrays, text mining, and systems biology. The key objective is to cover ML applications to biological science problems, focusing on problems related to bioinformatics. The book looks at cutting-edge research topics and methodologies in ML applied to the rapidly advancing discipline of bioinformatics. ML and DL applied to biological and neuroimaging data can open new frontiers for biomedical engineering, such as refining the understanding of complex diseases, including cancer and neurodegenerative and psychiatric disorders. Advances in this field could eventually lead to the development of precision medicine and automated diagnostic tools capable of tailoring medical treatments to individual lifestyles, variability, and the environment. Highlights include: Artificial Intelligence in treating and diagnosing schizophrenia An analysis of ML’s and DL’s financial effect on healthcare An XGBoost-based classification method for breast cancer classification Using ML to predict squamous diseases ML and DL applications in genomics and proteomics Applying ML and DL to biological data

Feature Representation and Learning Methods With Applications in Protein Secondary Structure

Feature Representation and Learning Methods With Applications in Protein Secondary Structure PDF Author: Zhibin Lv
Publisher: Frontiers Media SA
ISBN: 2889715558
Category : Science
Languages : en
Pages : 112

Get Book Here

Book Description


Deep Learning Techniques and Optimization Strategies in Big Data Analytics

Deep Learning Techniques and Optimization Strategies in Big Data Analytics PDF Author: Thomas, J. Joshua
Publisher: IGI Global
ISBN: 1799811948
Category : Computers
Languages : en
Pages : 355

Get Book Here

Book Description
Many approaches have sprouted from artificial intelligence (AI) and produced major breakthroughs in the computer science and engineering industries. Deep learning is a method that is transforming the world of data and analytics. Optimization of this new approach is still unclear, however, and there’s a need for research on the various applications and techniques of deep learning in the field of computing. Deep Learning Techniques and Optimization Strategies in Big Data Analytics is a collection of innovative research on the methods and applications of deep learning strategies in the fields of computer science and information systems. While highlighting topics including data integration, computational modeling, and scheduling systems, this book is ideally designed for engineers, IT specialists, data analysts, data scientists, engineers, researchers, academicians, and students seeking current research on deep learning methods and its application in the digital industry.

Deep Learning Applications in Translational Bioinformatics

Deep Learning Applications in Translational Bioinformatics PDF Author: Khalid Raza
Publisher: Elsevier
ISBN: 0443222983
Category : Technology & Engineering
Languages : en
Pages : 299

Get Book Here

Book Description
Deep Learning Applications in Translational Bioinformatics, a new volume in the Advances in Ubiquitous Sensing Application for Healthcare series, offers a detailed overview of basic bioinformatics, deep learning, various applications of deep learning in translational bioinformatics including deep learning ensembles, deep learning in protein classification, detection of various diseases, prediction of antiviral peptides, identification of antibiotic resistance, computer aided drug design and drug formulation. This new volume helps researchers working in the field of machine learning and bioinformatics to foster future research and development in ensemble deep learning and inspire new bioinformatics applications that cannot be attained by using traditional machine learning models. - Addresses the practical application of deep learning algorithms to a wide range of bioinformatics challenges - Presents integrative and multidisciplinary approaches to ubiquitous healthcare - Includes case studies to illustrate the concepts discussed

Machine Learning In Bioinformatics Of Protein Sequences: Algorithms, Databases And Resources For Modern Protein Bioinformatics

Machine Learning In Bioinformatics Of Protein Sequences: Algorithms, Databases And Resources For Modern Protein Bioinformatics PDF Author: Lukasz Kurgan
Publisher: World Scientific
ISBN: 9811258597
Category : Science
Languages : en
Pages : 378

Get Book Here

Book Description
Machine Learning in Bioinformatics of Protein Sequences guides readers around the rapidly advancing world of cutting-edge machine learning applications in the protein bioinformatics field. Edited by bioinformatics expert, Dr Lukasz Kurgan, and with contributions by a dozen of accomplished researchers, this book provides a holistic view of the structural bioinformatics by covering a broad spectrum of algorithms, databases and software resources for the efficient and accurate prediction and characterization of functional and structural aspects of proteins. It spotlights key advances which include deep neural networks, natural language processing-based sequence embedding and covers a wide range of predictions which comprise of tertiary structure, secondary structure, residue contacts, intrinsic disorder, protein, peptide and nucleic acids-binding sites, hotspots, post-translational modification sites, and protein function. This volume is loaded with practical information that identifies and describes leading predictive tools, useful databases, webservers, and modern software platforms for the development of novel predictive tools.