期刊文献+
共找到25篇文章
< 1 2 >
每页显示 20 50 100
Genome-wide association mapping and genomic prediction of stalk rot in two mid-altitude tropical maize populations
1
作者 Junqiao Song Angela Pacheco +7 位作者 Amos Alakonya Andrea S.Cruz-Morales Carlos Muoz-Zavala Jingtao Qu Chunping Wang Xuecai Zhang Felix San Vicente Thanda Dhliwayo 《The Crop Journal》 SCIE CSCD 2024年第2期558-568,共11页
Maize stalk rot reduces grain yield and quality.Information about the genetics of resistance to maize stalk rot could help breeders design effective breeding strategies for the trait.Genomic prediction may be a more e... Maize stalk rot reduces grain yield and quality.Information about the genetics of resistance to maize stalk rot could help breeders design effective breeding strategies for the trait.Genomic prediction may be a more effective breeding strategy for stalk-rot resistance than marker-assisted selection.We performed a genome-wide association study(GWAS)and genomic prediction of resistance in testcross hybrids of 677 inbred lines from the Tuxpe?o and non-Tuxpe?o heterotic pools grown in three environments and genotyped with 200,681 single-nucleotide polymorphisms(SNPs).Eighteen SNPs associated with stalk rot shared genomic regions with gene families previously associated with plant biotic and abiotic responses.More favorable SNP haplotypes traced to tropical than to temperate progenitors of the inbred lines.Incorporating genotype-by-environment(G×E)interaction increased genomic prediction accuracy. 展开更多
关键词 Maize stalk rot Genome-wide association mapping Haplotype analysis genomic prediction G×E interaction
下载PDF
The impact of genotyping strategies and statistical models on accuracy of genomic prediction for survival in pigs 被引量:1
2
作者 Tianfei Liu Bjarne Nielsen +2 位作者 Ole F.Christensen Mogens SandøLund Guosheng Su 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2023年第3期908-916,共9页
Background:Survival from birth to slaughter is an important economic trait in commercial pig productions.Increasing survival can improve both economic efficiency and animal welfare.The aim of this study is to explore ... Background:Survival from birth to slaughter is an important economic trait in commercial pig productions.Increasing survival can improve both economic efficiency and animal welfare.The aim of this study is to explore the impact of genotyping strategies and statistical models on the accuracy of genomic prediction for survival in pigs during the total growing period from birth to slaughter.Results:We simulated pig populations with different direct and maternal heritabilities and used a linear mixed model,a logit model,and a probit model to predict genomic breeding values of pig survival based on data of individual survival records with binary outcomes(0,1).The results show that in the case of only alive animals having genotype data,unbiased genomic predictions can be achieved when using variances estimated from pedigreebased model.Models using genomic information achieved up to 59.2%higher accuracy of estimated breeding value compared to pedigree-based model,dependent on genotyping scenarios.The scenario of genotyping all individuals,both dead and alive individuals,obtained the highest accuracy.When an equal number of individuals(80%)were genotyped,random sample of individuals with genotypes achieved higher accuracy than only alive individuals with genotypes.The linear model,logit model and probit model achieved similar accuracy.Conclusions:Our conclusion is that genomic prediction of pig survival is feasible in the situation that only alive pigs have genotypes,but genomic information of dead individuals can increase accuracy of genomic prediction by 2.06%to 6.04%. 展开更多
关键词 genomic prediction Genotyping strategy Simulation Statistical models SURVIVAL
下载PDF
Incorporating kernelized multi-omics data improves the accuracy of genomic prediction 被引量:1
3
作者 Mang Liang Bingxing An +10 位作者 Tianpeng Chang Tianyu Deng Lili Du Keanning Li Sheng Cao Yueying Du Lingyang Xu Lupei Zhang Xue Gao Junya Li Huijiang Gao 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2023年第1期88-97,共10页
Background:Genomic selection(GS)has revolutionized animal and plant breeding after the first implementation via early selection before measuring phenotypes.Besides genome,transcriptome and metabolome information are i... Background:Genomic selection(GS)has revolutionized animal and plant breeding after the first implementation via early selection before measuring phenotypes.Besides genome,transcriptome and metabolome information are increasingly considered new sources for GS.Difficulties in building the model with multi-omics data for GS and the limit of specimen availability have both delayed the progress of investigating multi-omics.Results:We utilized the Cosine kernel to map genomic and transcriptomic data as n×n symmetric matrix(G matrix and T matrix),combined with the best linear unbiased prediction(BLUP)for GS.Here,we defined five kernel-based prediction models:genomic BLUP(GBLUP),transcriptome-BLUP(TBLUP),multi-omics BLUP(MBLUP,M=ratio×G+(1-ratio)×T),multi-omics single-step BLUP(mss BLUP),and weighted multi-omics single-step BLUP(wmss BLUP)to integrate transcribed individuals and genotyped resource population.The predictive accuracy evaluations in four traits of the Chinese Simmental beef cattle population showed that(1)MBLUP was far preferred to GBLUP(ratio=1.0),(2)the prediction accuracy of wmss BLUP and mss BLUP had 4.18%and 3.37%average improvement over GBLUP,(3)We also found the accuracy of wmss BLUP increased with the growing proportion of transcribed cattle in the whole resource population.Conclusions:We concluded that the inclusion of transcriptome data in GS had the potential to improve accuracy.Moreover,wmss BLUP is accepted to be a promising alternative for the present situation in which plenty of individuals are genotyped when fewer are transcribed. 展开更多
关键词 BLUP Cosine kernel genomic prediction TRANSCRIPTOME
下载PDF
Improving the accuracy of genomic prediction for meat quality traits using whole genome sequence data in pigs 被引量:1
4
作者 Zhanwei Zhuang Jie Wu +14 位作者 Yibin Qiu Donglin Ruan Rongrong Ding Cineng Xu Shenping Zhou Yuling Zhang Yiyi Liu Fucai Ma Jifei Yang Ying Sun Enqin Zheng Ming Yang Gengyuan Cai Jie Yang Zhenfang Wu 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2023年第5期1880-1894,共15页
Background Pork quality can directly affect customer purchase tendency and meat quality traits have become valu-able in modern pork production.However,genetic improvement has been slow due to high phenotyping costs.In... Background Pork quality can directly affect customer purchase tendency and meat quality traits have become valu-able in modern pork production.However,genetic improvement has been slow due to high phenotyping costs.In this study,whole genome sequence(WGS)data was used to evaluate the prediction accuracy of genomic best linear unbiased prediction(GBLUP)for meat quality in large-scale crossbred commercial pigs.Results We produced WGS data(18,695,907 SNPs and 2,106,902 INDELs exceed quality control)from 1,469 sequenced Duroc×(Landrace×Yorkshire)pigs and developed a reference panel for meat quality including meat color score,marbling score,L*(lightness),a*(redness),and b*(yellowness)of genomic prediction.The prediction accuracy was defined as the Pearson correlation coefficient between adjusted phenotypes and genomic estimated breeding values in the validation population.Using different marker density panels derived from WGS data,accuracy differed substantially among meat quality traits,varied from 0.08 to 0.47.Results showed that MultiBLUP outperform GBLUP and yielded accuracy increases ranging from 17.39%to 75%.We optimized the marker density and found medium-and high-density marker panels are beneficial for the estimation of heritability for meat quality.Moreover,we conducted genotype imputation from 50K chip to WGS level in the same population and found average concord-ance rate to exceed 95%and r^(2)=0.81.Conclusions Overall,estimation of heritability for meat quality traits can benefit from the use of WGS data.This study showed the superiority of using WGS data to genetically improve pork quality in genomic prediction. 展开更多
关键词 genomic prediction Meat quality PIGS Whole genome sequence
下载PDF
Genomic prediction of yield performance among single-cross maize hybrids using a partial diallel cross design
5
作者 Ping Luo Houwen Wang +23 位作者 Zhiyong Ni Ruisi Yang Fei Wang Hongjun Yong Lin Zhang Zhiqiang Zhou Wei Song Mingshun Li Jie Yang Jianfeng Weng Zhaodong Meng Degui Zhang Jienan Han Yong Chen Runze Zhang Liwei Wang Meng Zhao Wenwei Gao Xiaoyu Chen Wenjie Li Zhuanfang Hao Junjie Fu Xuecai Zhang Xinhai Li 《The Crop Journal》 SCIE CSCD 2023年第6期1884-1892,共9页
Genomic prediction(GP)in plant breeding has the potential to predict and identify the best-performing hybrids based on the genotypes of their parental lines.In a GP experiment,34 elite inbred lines were selected to ma... Genomic prediction(GP)in plant breeding has the potential to predict and identify the best-performing hybrids based on the genotypes of their parental lines.In a GP experiment,34 elite inbred lines were selected to make 285 single-cross hybrids in a partial-diallel cross design.These lines represented a mini-core collection of Chinese maize germplasm and comprised 18 inbred lines from the Stiff Stalk heterotic group and 16 inbred lines from the Non-Stiff Stalk heterotic group.The parents were genotyped by sequencing and the 285 hybrids were phenotyped for nine yield and yield-related traits at two locations in the summer sowing area(SUS)and three locations in the spring sowing area(SPS)in the main maizeproducing regions of China.Multiple GP models were employed to assess the accuracy of trait prediction in the hybrids.By ten-fold cross-validation,the prediction accuracies of yield performance of the hybrids estimated by the genomic best linear unbiased prediction(GBLUP)model in SUS and SPS were 0.51 and 0.46,respectively.The prediction accuracies of the remaining yield-related traits estimated with GBLUP ranged from 0.49 to 0.86 and from 0.53 to 0.89 in SUS and SPS,respectively.When additive,dominance,epistasis effects,genotype-by-environment interaction,and multi-trait effects were incorporated into the prediction model,the prediction accuracy of hybrid yield performance was improved.The ratio of training to testing population and size of training population optimal for yield prediction were determined.Multiple prediction models can improve prediction accuracy in hybrid breeding. 展开更多
关键词 MAIZE genomic prediction prediction model Genetic effects Hybrid performance
下载PDF
Strategies to improve genomic predictions for 35 duck carcass traits in an F2 population 被引量:1
6
作者 Wentao Cai Jian Hu +7 位作者 Wenlei Fan Yaxi Xu Jing Tang Ming Xie Yunsheng Zhang Zhanbao Guo Zhengkui Zhou Shuisheng Hou 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2023年第5期1854-1868,共15页
Background Carcass traits are crucial for broiler ducks,but carcass traits can only be measured postmortem.Genomic selection(GS)is an effective approach in animal breeding to improve selection and reduce costs.However... Background Carcass traits are crucial for broiler ducks,but carcass traits can only be measured postmortem.Genomic selection(GS)is an effective approach in animal breeding to improve selection and reduce costs.However,the performance of genomic prediction in duck carcass traits remains largely unknown.Results In this study,we estimated the genetic parameters,performed GS using different models and marker densi-ties,and compared the estimation performance between GS and conventional BLUP on 35 carcass traits in an F2 population of ducks.Most of the cut weight traits and intestine length traits were estimated to be high and moder-ate heritabilities,respectively,while the heritabilities of percentage slaughter traits were dynamic.The reliability of genome prediction using GBLUP increased by an average of 0.06 compared to the conventional BLUP method.The Permutation studies revealed that 50K markers had achieved ideal prediction reliability,while 3K markers still achieved 90.7%predictive capability would further reduce the cost for duck carcass traits.The genomic relationship matrix nor-malized by our true variance method instead of the widely used 2pi(1-pi)could achieve an increase in prediction reliability in most traits.We detected most of the bayesian models had a better performance,especially for BayesN.Compared to GBLUP,BayesN can further improve the predictive reliability with an average of 0.06 for duck carcass traits.Conclusion This study demonstrates genomic selection for duck carcass traits is promising.The genomic prediction can be further improved by modifying the genomic relationship matrix using our proposed true variance method and several Bayesian models.Permutation study provides a theoretical basis for the fact that low-density arrays can be used to reduce genotype costs in duck genome selection. 展开更多
关键词 Bayesian model Carcass traits DUCK Genome prediction genomic relationship matrix Mark density
下载PDF
Genome-wide association study and genomic prediction of Fusarium ear rot resistance in tropical maize germplasm 被引量:6
7
作者 Yubo Liu Guanghui Hu +10 位作者 Ao Zhang Alexander Loladze Yingxiong Hu Hui Wang Jingtao Qu Xuecai Zhang Michael Olsen Felix San Vicente Jose Crossa Feng Lin Boddupalli M.Prasanna 《The Crop Journal》 SCIE CSCD 2021年第2期325-341,共17页
Fusarium ear rot(FER)is a destructive maize fungal disease worldwide.In this study,three tropical maize populations consisting of 874 inbred lines were used to perform genomewide association study(GWAS)and genomic pre... Fusarium ear rot(FER)is a destructive maize fungal disease worldwide.In this study,three tropical maize populations consisting of 874 inbred lines were used to perform genomewide association study(GWAS)and genomic prediction(GP)analyses of FER resistance.Broad phenotypic variation and high heritability for FER were observed,although it was highly influenced by large genotype-by-environment interactions.In the 874 inbred lines,GWAS with general linear model(GLM)identified 3034 single-nucleotide polymorphisms(SNPs)significantly associated with FER resistance at the P-value threshold of 1×10^(-5),the average phenotypic variation explained(PVE)by these associations was 3%with a range from 2.33%to 6.92%,and 49 of these associations had PVE values greater than 5%.The GWAS analysis with mixed linear model(MLM)identified 19 significantly associated SNPs at the P-value threshold of 1×10^(-4),the average PVE of these associations was 1.60%with a range from 1.39%to 2.04%.Within each of the three populations,the number of significantly associated SNPs identified by GLM and MLM ranged from 25 to 41,and from 5 to 22,respectively.Overlapping SNP associations across populations were rare.A few stable genomic regions conferring FER resistance were identified,which located in bins 3.04/05,7.02/04,9.00/01,9.04,9.06/07,and 10.03/04.The genomic regions in bins 9.00/01 and 9.04 are new.GP produced moderate accuracies with genome-wide markers,and relatively high accuracies with SNP associations detected from GWAS.Moderate prediction accuracies were observed when the training and validation sets were closely related.These results implied that FER resistance in maize is controlled by minor QTL with small effects,and highly influenced by the genetic background of the populations studied.Genomic selection(GS)by incorporating SNP associations detected from GWAS is a promising tool for improving FER resistance in maize. 展开更多
关键词 MAIZE Fusarium ear rot Genome-wide association study genomic prediction genomic selection
下载PDF
Using machine learning to improve the accuracy of genomic prediction of reproduction traits in pigs 被引量:5
8
作者 Xue Wang Shaolei Shi +5 位作者 Guijiang Wang Wenxue Luo Xia Wei Ao Qiu Fei Luo Xiangdong Ding 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2022年第5期1293-1304,共12页
Background:Recently,machine learning(ML)has become attractive in genomic prediction,but its superiority in genomic prediction over conventional(ss)GBLUP methods and the choice of optimal ML methods need to be investig... Background:Recently,machine learning(ML)has become attractive in genomic prediction,but its superiority in genomic prediction over conventional(ss)GBLUP methods and the choice of optimal ML methods need to be investigated.Results:In this study,2566 Chinese Yorkshire pigs with reproduction trait records were genotyped with the GenoBaits Porcine SNP 50 K and PorcineSNP50 panels.Four ML methods,including support vector regression(SVR),kernel ridge regression(KRR),random forest(RF)and Adaboost.R2 were implemented.Through 20 replicates of fivefold cross-validation(CV)and one prediction for younger individuals,the utility of ML methods in genomic prediction was explored.In CV,compared with genomic BLUP(GBLUP),single-step GBLUP(ssGBLUP)and the Bayesian method BayesHE,ML methods significantly outperformed these conventional methods.ML methods improved the genomic prediction accuracy of GBLUP,ssGBLUP,and BayesHE by 19.3%,15.0% and 20.8%,respectively.In addition,ML methods yielded smaller mean squared error(MSE)and mean absolute error(MAE)in all scenarios.ssGBLUP yielded an improvement of 3.8% on average in accuracy compared to that of GBLUP,and the accuracy of BayesHE was close to that of GBLUP.In genomic prediction of younger individuals,RF and Adaboost.R2_KRR performed better than GBLUP and BayesHE,while ssGBLUP performed comparably with RF,and ssGBLUP yielded slightly higher accuracy and lower MSE than Adaboost.R2_KRR in the prediction of total number of piglets born,while for number of piglets born alive,Adaboost.R2_KRR performed significantly better than ssGBLUP.Among ML methods,Adaboost.R2_KRR consistently performed well in our study.Our findings also demonstrated that optimal hyperparameters are useful for ML methods.After tuning hyperparameters in CV and in predicting genomic outcomes of younger individuals,the average improvement was 14.3% and 21.8% over those using default hyperparameters,respectively.Conclusion:Our findings demonstrated that ML methods had better overall prediction performance than conventional genomic selection methods,and could be new options for genomic prediction.Among ML methods,Adaboost.R2_KRR consistently performed well in our study,and tuning hyperparameters is necessary for ML methods.The optimal hyperparameters depend on the character of traits,datasets etc. 展开更多
关键词 genomic prediction Machine learning PIG prediction accuracy
下载PDF
Comparisons of improved genomic predictions generated by different imputation methods for genotyping by sequencing data in livestock populations 被引量:4
9
作者 Xiao Wang Guosheng Su +2 位作者 Dan Hao Mogens SandøLund Haja N.Kadarmideen 《Journal of Animal Science and Biotechnology》 CAS CSCD 2020年第2期316-326,共11页
Background:Genotyping by sequencing(GBS)still has problems with missing genotypes.Imputation is important for using GBS for genomic predictions,especially for low depths,due to the large number of missing genotypes.Mi... Background:Genotyping by sequencing(GBS)still has problems with missing genotypes.Imputation is important for using GBS for genomic predictions,especially for low depths,due to the large number of missing genotypes.Minor allele frequency(MAF)is widely used as a marker data editing criteria for genomic predictions.In this study,three imputation methods(Beagle,IMPUTE2 and FImpute software)based on four MAF editing criteria were investigated with regard to imputation accuracy of missing genotypes and accuracy of genomic predictions,based on simulated data of livestock population.Results:Four MAFs(no MAF limit,MAF≥0.001,MAF≥0.01 and MAF≥0.03)were used for editing marker data before imputation.Beagle,IMPUTE2 and FImpute software were applied to impute the original GBS.Additionally,IMPUTE2 also imputed the expected genotype dosage after genotype correction(GcIM).The reliability of genomic predictions was calculated using GBS and imputed GBS data.The results showed that imputation accuracies were the same for the three imputation methods,except for the data of sequencing read depth(depth)=2,where FImpute had a slightly lower imputation accuracy than Beagle and IMPUTE2.GcIM was observed to be the best for all of the imputations at depth=4,5 and 10,but the worst for depth=2.For genomic prediction,retaining more SNPs with no MAF limit resulted in higher reliability.As the depth increased to 10,the prediction reliabilities approached those using true genotypes in the GBS loci.Beagle and IMPUTE2 had the largest increases in prediction reliability of 5 percentage points,and FImpute gained 3 percentage points at depth=2.The best prediction was observed at depth=4,5 and 10 using GcIM,but the worst prediction was also observed using GcIM at depth=2.Conclusions:The current study showed that imputation accuracies were relatively low for GBS with low depths and high for GBS with high depths.Imputation resulted in larger gains in the reliability of genomic predictions for GBS with lower depths.These results suggest that the application of IMPUTE2,based on a corrected GBS(GcIM)to improve genomic predictions for higher depths,and FImpute software could be a good alternative for routine imputation. 展开更多
关键词 genomic prediction Genotyping by sequencing IMPUTATION MAF Simulation
下载PDF
Improving genomic predictions by correction of genotypes from genotyping by sequencing in livestock populations 被引量:4
10
作者 Xiao Wang Mogens SandΦ Lund +3 位作者 Peipei Ma Luc Janss Haja N.Kadarmideen Guosheng Su 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2019年第2期283-292,共10页
Background: Genotyping by sequencing(GBS) is a robust method to genotype markers. Many factors can influence the genotyping quality. One is that heterozygous genotypes could be wrongly genotyped as homozygotes,depende... Background: Genotyping by sequencing(GBS) is a robust method to genotype markers. Many factors can influence the genotyping quality. One is that heterozygous genotypes could be wrongly genotyped as homozygotes,dependent on the genotyping depths. In this study, a method correcting this type of genotyping error was demonstrated. The efficiency of this correction method and its effect on genomic prediction were assessed using simulated data of livestock populations.Results: Chip array(Chip) and four depths of GBS data was simulated. After quality control(call rate ≥ 0.8 and MAF ≥ 0.01), the remaining number of Chip and GBS SNPs were both approximately 7,000, averaged over 10 replicates. GBS genotypes were corrected with the proposed method. The reliability of genomic prediction was calculated using GBS, corrected GBS(GBSc), true genotypes for the GBS loci(GBSr) and Chip data. The results showed that GBSc had higher rates of correct genotype calls and higher correlations with true genotypes than GBS. For genomic prediction, using Chip data resulted in the highest reliability. As the depth increased to 10, the prediction reliabilities using GBS and GBSc data approached those using true GBS data. The reliabilities of genomic prediction using GBSc data were 0.604, 0.672, 0.684 and 0.704 after genomic correction, with the improved values of 0.013, 0.009, 0.006 and 0.001 at depth = 2, 4, 5 and 10, respectively.Conclusions: The current study showed that a correction method for GBS data increased the genotype accuracies and, consequently, improved genomic predictions. These results suggest that a correction of GBS genotype is necessary, especially for the GBS data with low depths. 展开更多
关键词 genomic prediction Genotype correction Genotyping by sequencing Simulation
下载PDF
Genomic Prediction of Arsenic Tolerance and Grain Yield in Rice: Contribution of Trait-Specific Markers and Multi-Environment Models 被引量:3
11
作者 Nourollah AHMADI Tuong-Vi CAO +2 位作者 Julien FROUIN Gareth J.NORTON Adam H.PRICE 《Rice science》 SCIE CSCD 2021年第3期268-278,共11页
Many rice-growing areas are affected by high concentrations of arsenic(As).Rice varieties that prevent As uptake and/or accumulation can mitigate As threats to human health.Genomic selection is known to facilitate rap... Many rice-growing areas are affected by high concentrations of arsenic(As).Rice varieties that prevent As uptake and/or accumulation can mitigate As threats to human health.Genomic selection is known to facilitate rapid selection of superior genotypes for complex traits.We explored the predictive ability(PA)of genomic prediction with single-environment models,accounting or not for trait-specific markers,multi-environment models,and multi-trait and multi-environment models,using the genotypic(1600K SNPs)and phenotypic(grain As content,grain yield and days to flowering)data of the Bengal and Assam Aus Panel.Under the base-line single-environment model,PA of up to 0.707 and 0.654 was obtained for grain yield and grain As content,respectively;the three prediction methods(Bayesian Lasso,genomic best linear unbiased prediction and reproducing kernel Hilbert spaces)were considered to perform similarly,and marker selection based on linkage disequilibrium allowed to reduce the number of SNP to 17K,without negative effect on PA of genomic predictions.Single-environment models giving distinct weight to trait-specific markers in the genomic relationship matrix outperformed the base-line models up to 32%.Multi-environment models,accounting for genotype×environment interactions,and multi-trait and multi-environment models outperformed the base-line models by up to 47%and 61%,respectively.Among the multi-trait and multi-environment models,the Bayesian multi-output regressor stacking function obtained the highest predictive ability(0.831 for grain As)with much higher efficiency for computing time.These findings pave the way for breeding for As-tolerance in the progenies of biparental crosses involving members of the Bengal and Assam Aus Panel.Genomic prediction can also be applied to breeding for other complex traits under multiple environments. 展开更多
关键词 genomic prediction model genomic selection complex trait arsenic tolerance RICE predictive ability
下载PDF
Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction 被引量:2
12
作者 Shaopan Ye Jiaqi Li Zhe Zhang 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2021年第2期508-519,共12页
Background:Presently,multi-omics data(e.g.,genomics,transcriptomics,proteomics,and metabolomics)are available to improve genomic predictors.Omics data not only offers new data layers for genomic prediction but also pr... Background:Presently,multi-omics data(e.g.,genomics,transcriptomics,proteomics,and metabolomics)are available to improve genomic predictors.Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that cannot be readily captured at the genome sequence level.Therefore,using multi-omics data to select feature markers is a feasible strategy to improve the accuracy of genomic prediction.In this study,simultaneously using whole-genome sequencing(WGS)and gene expression level data,four strategies for single-nucleotide polymorphism(SNP)preselection were investigated for genomic predictions in the Drosophila Genetic Reference Panel.Results:Using genomic best linear unbiased prediction(GBLUP)with complete WGS data,the prediction accuracies were 0.208±0.020(0.181±0.022)for the startle response and 0.272±0.017(0.307±0.015)for starvation resistance in the female(male)lines.Compared with GBLUP using complete WGS data,both GBLUP and the genomic feature BLUP(GFBLUP)did not improve the prediction accuracy using SNPs preselected from complete WGS data based on the results of genome-wide association studies(GWASs)or transcriptome-wide association studies(TWASs).Furthermore,by using SNPs preselected from the WGS data based on the results of the expression quantitative trait locus(eQTL)mapping of all genes,only the startle response had greater accuracy than GBLUP with the complete WGS data.The best accuracy values in the female and male lines were 0.243±0.020 and 0.220±0.022,respectively.Importantly,by using SNPs preselected based on the results of the eQTL mapping of significant genes from TWAS,both GBLUP and GFBLUP resulted in great accuracy and small bias of genomic prediction.Compared with the GBLUP using complete WGS data,the best accuracy values represented increases of 60.66%and 39.09%for the starvation resistance and 27.40%and 35.36%for startle response in the female and male lines,respectively.Conclusions:Overall,multi-omics data can assist genomic feature preselection and improve the performance of genomic prediction.The new knowledge gained from this study will enrich the use of multi-omics in genomic prediction. 展开更多
关键词 ACCURACY Drosophila melanogaster genomic prediction Multi-omics data SNP preselection
下载PDF
Effects of marker density and minor allele frequency on genomic prediction for growth traits in Chinese Simmental beef cattle 被引量:2
13
作者 ZHU Bo ZHANG Jing-jing +8 位作者 NIU Hong GUAN Long GUO Peng XU Ling-yang CHEN Yan ZHANG Lu-pei GAO Hui-jiang GAO Xue LI Jun-ya 《Journal of Integrative Agriculture》 SCIE CAS CSCD 2017年第4期911-920,共10页
Genomic selection has been demonstrated as a powerful technology to revolutionize animal breeding. However, marker density and minor allele frequency can affect the predictive ability of genomic estimated breeding val... Genomic selection has been demonstrated as a powerful technology to revolutionize animal breeding. However, marker density and minor allele frequency can affect the predictive ability of genomic estimated breeding values (GEBVs). To investigate the impact of marker density and minor allele frequency on predictive ability, we estimated GEBVs by constructing the different subsets of single nucleotide polymorphisms (SNPs) based on varying markers densities and minor allele frequency (MAF) for average daily gain (ADG), live weight (LW) and carcass weight (CW) in 1 059 Chinese Simmental beef cattle. Two strategies were proposed for SNP selection to construct different marker densities: 1) select evenly-spaced SNPs (Strategy 1 ), and 2) select SNPs with large effects estimated from BayesB (Strategy 2). Furthermore, predictive ability was assessed in terms of the correlation between predicted genomic values and corrected phenotypes from 10-fold cross-validation. Predictive ability for ADG, LW and CW using autosomal SNPs were 0.13+0.002, 0.21+0.003 and 0.25+0.003, respectively. In our study, the predictive ability increased dramatically as more SNPs were included in analysis until 200K for Strategy 1. Under Strategy 2, we found the predictive ability slightly increased when marker densities increased from 5K to 20K, which indicated the predictive ability of 20K (3% of 770K) SNPs with large effects was equal to the predictive ability of using all SNPs. For different MAF bins, we obtained the highest predictive ability for three traits with MAF bin 0.01-0.1. Our result suggested that designing a low-density chip by selecting low frequency markers with large SNP effects sizes should be helpful for commercial application in Chinese Simmental cattle. 展开更多
关键词 genomic prediction cross-validation Chinese Simmental beef cattle marker density minor allele frequency (MAF)
下载PDF
Genomic prediction using composite training sets is an effective method for exploiting germplasm conserved in rice gene banks 被引量:1
14
作者 Sang He Hongyan Liu +4 位作者 Junhui Zhan Yun Meng Yamei Wang Feng Wang Guoyou Ye 《The Crop Journal》 SCIE CSCD 2022年第4期1073-1082,共10页
Germplasm conserved in gene banks is underutilized,owing mainly to the cost of characterization.Genomic prediction can be applied to predict the genetic merit of germplasm.Germplasm utilization could be greatly accele... Germplasm conserved in gene banks is underutilized,owing mainly to the cost of characterization.Genomic prediction can be applied to predict the genetic merit of germplasm.Germplasm utilization could be greatly accelerated if prediction accuracy were sufficiently high with a training population of practical size.Large-scale resequencing projects in rice have generated high quality genome-wide variation information for many diverse accessions,making it possible to investigate the potential of genomic prediction in rice germplasm management and exploitation.We phenotyped six traits in nearly 2000 indica(XI)and japonica(GJ)accessions from the Rice 3K project and investigated different scenarios for forming training populations.A composite core training set was considered in two levels which targets used for prediction of subpopulations within subspecies or prediction across subspecies.Composite training sets incorporating 400 or 200 accessions from either subpopulation of XI or GJ showed satisfactory prediction accuracy.A composite training set of 600 XI and GJ accessions showed sufficiently high prediction accuracy for both XI and GJ subspecies.Comparable or even higher prediction accuracy was observed for the composite training set than for the corresponding homogeneous training sets comprising accessions only of specific subpopulations of XI or GJ(within-subspecies level)or pure XI or GJ accessions(across-subspecies level)that were included in the composite training set.Validation using an independent population of 281 rice cultivars supported the predictive ability of the composite training set.Reliability,which reflects the robustness of a training set,was markedly higher for the composite training set than for the corresponding homogeneous training sets.A core training set formed from diverse accessions could accurately predict the genetic merit of rice germplasm. 展开更多
关键词 genomic prediction Composite training set Rice germplasm Gene bank Reliability criterion
下载PDF
A novel genomic prediction method combining randomized Haseman-Elston regression with a modified algorithm for Proven and Young for large genomic data
15
作者 Hailan Liu Guo-Bo Chen 《The Crop Journal》 SCIE CSCD 2022年第2期550-554,共5页
Computational efficiency has become a key issue in genomic prediction(GP) owing to the massive historical datasets accumulated. We developed hereby a new super-fast GP approach(SHEAPY) combining randomized Haseman-Els... Computational efficiency has become a key issue in genomic prediction(GP) owing to the massive historical datasets accumulated. We developed hereby a new super-fast GP approach(SHEAPY) combining randomized Haseman-Elston regression(RHE-reg) with a modified Algorithm for Proven and Young(APY) in an additive-effect model, using the former to estimate heritability and then the latter to invert a large genomic relationship matrix for best linear prediction. In simulation results with varied sizes of training population, GBLUP, HEAPY|A and SHEAPY showed similar predictive performance when the size of a core population was half that of a large training population and the heritability was a fixed value, and the computational speed of SHEAPY was faster than that of GBLUP and HEAPY|A. In simulation results with varied heritability, SHEAPY showed better predictive ability than GBLUP in all cases and than HEAPY|A in most cases when the size of a core population was 4/5 that of a small training population and the training population size was a fixed value. As a proof of concept, SHEAPY was applied to the analysis of two real datasets. In an Arabidopsis thaliana F2 population, the predictive performance of SHEAPY was similar to or better than that of GBLUP and HEAPY|A in most cases when the size of a core population(2 0 0) was 2/3 of that of a small training population(3 0 0). In a sorghum multiparental population,SHEAPY showed higher predictive accuracy than HEAPY|A for all of three traits, and than GBLUP for two traits. SHEAPY may become the GP method of choice for large-scale genomic data. 展开更多
关键词 genomic prediction GBLUP Randomized HE-regression Modified algorithm for Proven and Young
下载PDF
Leveraging the potential of big genomic and phenotypic data for genome-wide association mapping in wheat
16
作者 Moritz Lell Yusheng Zhao Jochen C.Reif 《The Crop Journal》 SCIE CSCD 2024年第3期803-813,共11页
Genome-wide association mapping studies(GWAS)based on Big Data are a potential approach to improve marker-assisted selection in plant breeding.The number of available phenotypic and genomic data sets in which medium-s... Genome-wide association mapping studies(GWAS)based on Big Data are a potential approach to improve marker-assisted selection in plant breeding.The number of available phenotypic and genomic data sets in which medium-sized populations of several hundred individuals have been studied is rapidly increasing.Combining these data and using them in GWAS could increase both the power of QTL discovery and the accuracy of estimation of underlying genetic effects,but is hindered by data heterogeneity and lack of interoperability.In this study,we used genomic and phenotypic data sets,focusing on Central European winter wheat populations evaluated for heading date.We explored strategies for integrating these data and subsequently the resulting potential for GWAS.Establishing interoperability between data sets was greatly aided by some overlapping genotypes and a linear relationship between the different phenotyping protocols,resulting in high quality integrated phenotypic data.In this context,genomic prediction proved to be a suitable tool to study relevance of interactions between genotypes and experimental series,which was low in our case.Contrary to expectations,fewer associations between markers and traits were found in the larger combined data than in the individual experimental series.However,the predictive power based on the marker-trait associations of the integrated data set was higher across data sets.Therefore,the results show that the integration of medium-sized to Big Data is an approach to increase the power to detect QTL in GWAS.The results encourage further efforts to standardize and share data in the plant breeding community. 展开更多
关键词 Big Data Genome-wide association study Data integration genomic prediction WHEAT
下载PDF
A Novel Deep Learning-Based Model for Classification of Wheat Gene Expression
17
作者 Amr Ismail WalidHamdy +5 位作者 Aya MAl-Zoghby Wael AAwad Ahmed Ismail Ebada Yunyoung Nam Byeong-Gwon Kang Mohamed Abouhawwash 《Computer Systems Science & Engineering》 2024年第2期273-285,共13页
Deep learning(DL)plays a critical role in processing and converting data into knowledge and decisions.DL technologies have been applied in a variety of applications,including image,video,and genome sequence analysis.I... Deep learning(DL)plays a critical role in processing and converting data into knowledge and decisions.DL technologies have been applied in a variety of applications,including image,video,and genome sequence analysis.In deep learning the most widely utilized architecture is Convolutional Neural Networks(CNN)are taught discriminatory traits in a supervised environment.In comparison to other classic neural networks,CNN makes use of a limited number of artificial neurons,therefore it is ideal for the recognition and processing of wheat gene sequences.Wheat is an essential crop of cereals for people around the world.Wheat Genotypes identification has an impact on the possible development of many countries in the agricultural sector.In quantitative genetics prediction of genetic values is a central issue.Wheat is an allohexaploid(AABBDD)with three distinct genomes.The sizes of the wheat genome are quite large compared to many other kinds and the availability of a diversity of genetic knowledge and normal structure at breeding lines of wheat,Therefore,genome sequence approaches based on techniques of Artificial Intelligence(AI)are necessary.This paper focuses on using the Wheat genome sequence will assist wheat producers in making better use of their genetic resources and managing genetic variation in their breeding program,as well as propose a novel model based on deep learning for offering a fundamental overview of genomic prediction theory and current constraints.In this paper,the hyperparameters of the network are optimized in the CNN to decrease the requirement for manual search and enhance network performance using a new proposed model built on an optimization algorithm and Convolutional Neural Networks(CNN). 展开更多
关键词 Gene expression convolutional neural network optimization algorithm genomic prediction WHEAT
下载PDF
The impact of genomic relatedness between populations on the genomic estimated breeding values 被引量:3
18
作者 Peipei Ma Ju Huang +5 位作者 Weijia Gong Xiujin Li Hongding Gao Qin Zhang Xiangdong Ding Chonglong Wang 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2018年第4期841-846,共6页
In genomic selection, prediction accuracy is highly driven by the size of animals in the reference population(RP).Combining related populations from different countries and regions or using a related population with l... In genomic selection, prediction accuracy is highly driven by the size of animals in the reference population(RP).Combining related populations from different countries and regions or using a related population with large size of RP has been considered to be viable strategies in cattle breeding. The genetic relationship between related populations is important for improving the genomic predictive ability. In this study, we used 122 French bulls as test individuals. The genomic estimated breeding values(GEBVs) evaluated using French RP, America RP and Chinese RP were compared.The results showed that the GEBVs were in higher concordance using French RP and American RP compared with using Chinese population. The persistence analysis, kinship analysis and the principal component analysis(PCA) were performed for 270 French bulls, 270 American bulls and 270 Chinese bulls to interpret the results. All the analyses illustrated that the genetic relationship between French bulls and American bulls was closer compared with Chinese bulls. Another reason could be the size of RP in China was smaller than the other two RPs. In conclusion, using RP of a related population to predict GEBVs of the animals in a target population is feasible when these two populations have a close genetic relationship and the related population is large. 展开更多
关键词 genomic prediction genomic relationship Joint population prediction
下载PDF
A k-mer-based pangenome approach for cataloging seed-storage-protein genes in wheat to facilitate genotype-to-phenotype prediction and improvement of end-use quality
19
作者 Zhaoheng Zhang Dan Liu +13 位作者 Binyong Li Wenxi Wang Jize Zhang Mingming Xin Zhaorong Hu Jie Liu Jinkun Du Huiru Peng Chenyang Hao Xueyong Zhang Zhongfu Ni Qixin Sun Weilong Guo Yingyin Yao 《Molecular Plant》 SCIE CSCD 2024年第7期1038-1053,共16页
Wheat is a staple foodfor more than 35%of the world's population,with wheatflourused to make hundreds of baked goods.Superior end-use quality is a major breeding target;however,improving it is especially time-cons... Wheat is a staple foodfor more than 35%of the world's population,with wheatflourused to make hundreds of baked goods.Superior end-use quality is a major breeding target;however,improving it is especially time-consuming and expensive.Furthermore,genes encoding seed-storage proteins(ssPs)form multigene families and are repetitive,with gaps commonplace in several genome assemblies.To overcome these barriers and efficiently identify superior wheat SSP alleles,we developed"PanSK"(Pan-SSP k-mer)for genotype-to-phenotype prediction based on an SsP-based pangenome resource.PanSK uses 29-mer sequences that represent each ssP gene at the pangenomic level to reveal untapped diversity across landraces and modern cultivars.Genome-wide association studies with k-mers identified 23 Ssp genes associated with end-use quality that represent novel targets for improvement.We evaluated the effect of rye secalin genes on end-use quality and found that removal of w-secalins from 1BL/1RS wheat translocation lines is associated with enhanced end-use quality.Finally,using machine-learning-based prediction inspired by PanSK,we predicted the quality phenotypes with high accuracy from genotypes alone.This study provides an effective approach for genome design based on ssP genes,enabling the breeding of wheat varieties with superior processing capabilities and improved end-use quality. 展开更多
关键词 WHEAT seed-storage protein end-use quality k-mer pangenome genomic prediction
原文传递
Genomic selection in plant breeding:Key factors shaping two decades of progress
20
作者 Admas Alemu Johanna Astrand +9 位作者 Osval A.Montesinos-López Julio Isidro y Sanchez Javier Fernandez-Gónzalez Wuletaw Tadesse Ramesh R.Vetukuri Anders S.Carlsson Alf Ceplitis JoséCrossa Rodomiro Ortiz Aakash Chawade 《Molecular Plant》 SCIE CSCD 2024年第4期552-578,共27页
Genomic selection,the application of genomic prediction(GP)models to select candidate individuals,has significantly advanced in the past two decades,effectively accelerating genetic gains in plant breeding.This articl... Genomic selection,the application of genomic prediction(GP)models to select candidate individuals,has significantly advanced in the past two decades,effectively accelerating genetic gains in plant breeding.This article provides a holistic overview of key factors that have influenced GP in plant breeding during this period.We delved into the pivotal roles of training population size and genetic diversity,and their relationship with the breeding population,in determining GP accuracy.Special emphasis was placed on optimizing training population size.We explored its benefits and the associated diminishing returns beyond an optimum size.This was done while considering the balance between resource allocation and maximizing prediction accuracy through current optimization algorithms.The density and distribution of single-nucleotide polymorphisms,level of linkage disequilibrium,genetic complexity,trait heritability,statistical machine-learning methods,and non-additive effects are the other vital factors.Using wheat,maize,and potato as examples,we summarize the effect of these factors on the accuracy of GP for various traits.The search for high accuracy in GP—theoretically reaching one when using the Pearson’s correlation as a metric—is an active research area as yet far from optimal for various traits.We hypothesize that with ultra-high sizes of genotypic and phenotypic datasets,effective training population optimization methods and support from other omics approaches(transcriptomics,metabolomics and proteomics)coupled with deep-learning algorithms could overcome the boundaries of current limitations to achieve the highest possible prediction accuracy,making genomic selection an effective tool in plant breeding. 展开更多
关键词 genomic selection genetic gain genomic prediction optimization deep learning training population optimization
原文传递
上一页 1 2 下一页 到第
使用帮助 返回顶部