摘要
Advancements in genome sequencing have facilitated whole-genome characterization of numerous plant species,providing an abundance of genotypic data for genomic analysis.Genomic selection and neural networks(NNs),particularly deep learning,have been developed to predict complex traits from dense genotypic data.Autoencoders,an NN model to extract features from images in an unsupervised manner,has proven to be useful for plant phenotyping.This study introduces an autoencoder framework,GenoDrawing,for predicting and retrieving apple images from a low-depth single-nucleotide polymorphism(SNP)array,potentially useful in predicting traits that are difficult to define.GenoDrawing demonstrates proficiency in its task using a small dataset of shape-related SNPs.Results indicate that the use of SNPs associated with visual traits has substantial impact on the generated images,consistent with biological interpretation.While using substantial SNPs is crucial,incorporating additional,unrelated SNPs results in performance degradation for simple NN architectures that cannot easily identify the most important inputs.The proposed GenoDrawing method is a practical framework for exploring genomic prediction in fruit tree phenotyping,particularly beneficial for small to medium breeding companies to predict economically substantial heritable traits.Although GenoDrawing has limitations,it sets the groundwork for future research in image prediction from genomic markers.Future studies should focus on using stronger models for image reproduction,SNP information extraction,and dataset balance in terms of phenotypes for more precise outcomes.
基金
FJ.-R.is recipient of grant PRE2019-087427 funded by MCIN/AEI/10.13039/501100011033 and by“ESF Investing in your future”
supported by project PID2021128885OB-I00 funded by MCIN/AEI/10.13039/501100011033 and by“ERDF A way of making Europe”
funding from the European Union's Horizon 2020 research and innovation programme under grant agreement no.817970(INVTTE)
support from the CERCA Programme(“Generalitat de Catalunya”)
the“Severo Ochoa Programme for Centres of Excellence in R&D”2016-2019(SEV-2015-0533)and 2020-2023(CEX2019-000902-S)both funded by MCIN/AEI/10.13039/501100011033.