Please use this identifier to cite or link to this item:
https://dspace.iiti.ac.in/handle/123456789/4838
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chauhan, Vikas | en_US |
dc.contributor.author | Tiwari, Aruna | en_US |
dc.contributor.author | Joshi, Niranjan | en_US |
dc.contributor.author | Khandelwal, Sahaj | en_US |
dc.date.accessioned | 2022-03-17T01:00:00Z | - |
dc.date.accessioned | 2022-03-17T15:35:42Z | - |
dc.date.available | 2022-03-17T01:00:00Z | - |
dc.date.available | 2022-03-17T15:35:42Z | - |
dc.date.issued | 2021 | - |
dc.identifier.citation | Chauhan, V., Tiwari, A., Joshi, N., & Khandelwal, S. (2021). Multi-label classifier for protein sequence using heuristic-based deep convolution neural network. Applied Intelligence, doi:10.1007/s10489-021-02529-6 | en_US |
dc.identifier.issn | 0924-669X | - |
dc.identifier.other | EID(2-s2.0-85108609849) | - |
dc.identifier.uri | https://doi.org/10.1007/s10489-021-02529-6 | - |
dc.identifier.uri | https://dspace.iiti.ac.in/handle/123456789/4838 | - |
dc.description.abstract | Deep learning techniques are found very useful to classify sequential data in recent times. The protein sequences belong to the functional classes based on the structure of their sequences. The annotation task of protein sequences into corresponding functional classes is multi-label in nature. The primary structure of protein contains a notable amount of vast data compared to the other secondary, tertiary, and quaternary structures. The clustering-based techniques require expert domain knowledge from the extensive data samples. Traditional methods use the n-gram features of amino acids while ignoring the relationship of motifs and amino acid sequence. This paper proposes an efficient method to classify the proteins into their functional classes using a convolution neural network based on heuristic rules. The proposed approach works on the primary structure of protein sequences which considers the relationship among motifs and amino acids. The proposed approach also takes into account the amino acid locations in the protein sequence. The proposed approach considers the affinity information between amino acids and motifs. Along with achieving high performance in the classification of protein sequences, we propose a heuristic approach to improve the precision and recall of the individual functional classes. The proposed heuristic approach improves the performance and handles the data imbalance problem. The proposed approach is compared with other competitive approaches, and our approach provides better performance metrics in terms of precision, recall, AUC, and subset accuracy. The greatest challenge with multi-label classification is to handle the data imbalance, which appears due to variance in frequencies of the labels in the data. This data imbalance is dealt with weight modulation in the loss function to influence the learning process. © 2021, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Springer | en_US |
dc.source | Applied Intelligence | en_US |
dc.subject | Amino acids | en_US |
dc.subject | Convolution | en_US |
dc.subject | Deep learning | en_US |
dc.subject | Deep neural networks | en_US |
dc.subject | Heuristic methods | en_US |
dc.subject | Learning systems | en_US |
dc.subject | Neural networks | en_US |
dc.subject | Proteins | en_US |
dc.subject | Amino acid sequence | en_US |
dc.subject | Convolution neural network | en_US |
dc.subject | Learning techniques | en_US |
dc.subject | Multi label classification | en_US |
dc.subject | Performance metrics | en_US |
dc.subject | Precision and recall | en_US |
dc.subject | Primary structures | en_US |
dc.subject | Quaternary structure | en_US |
dc.subject | Classification (of information) | en_US |
dc.title | Multi-label classifier for protein sequence using heuristic-based deep convolution neural network | en_US |
dc.type | Journal Article | en_US |
Appears in Collections: | Department of Computer Science and Engineering |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
Altmetric Badge: