Reilly, Ronan, Kechadi, M.-T., Kuznetosov, Yuri, Timoshenko, E.G. and Dawson, Kenneth A. (1998) Using recurrent neural networks to predict aspects of 3-D structure of folded copolymer sequences. Il Nuovo Cimento D, 20 (12). pp. 2565-2574. ISSN 0392-6737
Preview
RR-Recurrent-1998.pdf
Download (174kB) | Preview
Abstract
Neural networks have been applied with some limited success to the problem of predicting the secondary [1,2] and tertiary [3,4] structure of proteins based on their amino acid residue sequence. The number of sequences for which there is a known 3-D structure is relatively limited. The rate at which 3-D structures are being solved is at least one order of magnitude lower than the rate at which new protein sequences are being determined [3]. In addition, a limitation in the neural network approaches taken to date is their inability to deal with very long sequences, and with the possibility of dependencies between different regions of a sequence [8]. The work described here is an attempt to address these limitations. In order to obtain a large set of sequences with known 3-D structures for training the neural network, we use the approach described in [5] to generate a set of artificial copolymers consisting of hydrophobic and hydrophilic units with a known 3-D structure when folded. By employing recurrent neural networks and building on the approach described in [3, 4], we describe a way to augment a neural network with both with a facility to deal with sequences of realistic length, and with a mechanism for handling possible long-distant interactions between regions of the sequence.
These sequences are very approximate models of real proteins, given that we only encode the hydrophobicity of the amino acid side chains, and there is no attempt to model their secondary or super-secondary structure. Nonetheless, the neural network techniques developed using artificial sequences are readily applicable to real proteins.
Item Type: | Article |
---|---|
Keywords: | recurrent neural networks; 3-D structure; folded copolymer sequences; |
Academic Unit: | Faculty of Science and Engineering > Computer Science |
Item ID: | 10445 |
Identification Number: | 10.1016/j.cogbrainres.2004.03.019 |
Depositing User: | Prof. Ronan Reilly |
Date Deposited: | 22 Jan 2019 16:39 |
Journal or Publication Title: | Il Nuovo Cimento D |
Publisher: | Società Italiana di fisica |
Refereed: | Yes |
Related URLs: | |
URI: | https://mural.maynoothuniversity.ie/id/eprint/10445 |
Use Licence: | This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA). Details of this licence are available here |
Repository Staff Only (login required)
Downloads
Downloads per month over past year