Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/17597
Title: Deep learning accelerated correlation of genotypic and phenotypic data
Authors: Agrawal, Aayush Dhanesh
Supervisors: Ahuja, Kapil
Keywords: Computer Science and Engineering
Issue Date: 30-May-2025
Publisher: Department of Computer Science and Engineering, IIT Indore
Series/Report no.: MT466;
Abstract: The prediction of phenotypic values based on genetic data is referred to as genomic prediction (GP). Genome-wide association studies (GWAS), on the other hand, look for correlations between genotypic markers (single nucleotide polymorphisms, SNPs) and phenotypic traits like grain yield and plant height in order to discover the key SNPs responsible for those traits. This study aims to address the distinct challenges of both GP and SNP identification. The rrBLUP and BLINK models are widely used for GP and GWAS, respectively. However, rrBLUP can only model simple linear relationships between genotype and phenotype, and BLINK often results in false positives when identifying SNPs. To address these challenges, we use machine learning approaches capable of capturing complicated, non-linear patterns, hence improving genomic prediction performance and SNP identification. In this study, we evaluate popular ML model support vector regression (SVR) and its variants as well as the transformer-based GPformer, for their ability to improve predictive performance. Motivated by the di!culty of identifying significant SNPs in high dimensionalty low sample size SNP data, we initially create a hybrid model that combines the regression power of SVR with the feature interaction strength of self-attention. Building on this breakthrough, we then reimagine the SNP sequence as a two-dimensional, image like representation, a strategy that reveals spatial patterns in genomic variation by taming the curse of dimensionality and enabling potent image-based learning models.
URI: https://dspace.iiti.ac.in:8080/jspui/handle/123456789/17597
Type of Material: Thesis_M.Tech
Appears in Collections:Department of Computer Science and Engineering_ETD

Files in This Item:
File Description SizeFormat 
MT_466_Aayush_Dhanesh_Agrawal_2302101001.pdf2.55 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: