Background: A random multiple-regression model that simultaneously fit all allele substitution effects for additive markers or haplotypes as uncorrelated random effects was proposed for Best Linear Unbiased Predictio...Background: A random multiple-regression model that simultaneously fit all allele substitution effects for additive markers or haplotypes as uncorrelated random effects was proposed for Best Linear Unbiased Prediction, using whole-genome data. Leave-one-out cross validation can be used to quantify the predictive ability of a statistical model.Methods: Naive application of Leave-one-out cross validation is computationally intensive because the training and validation analyses need to be repeated n times, once for each observation. Efficient Leave-one-out cross validation strategies are presented here, requiring little more effort than a single analysis.Results: Efficient Leave-one-out cross validation strategies is 786 times faster than the naive application for a simulated dataset with 1,000 observations and 10,000 markers and 99 times faster with 1,000 observations and 100 markers. These efficiencies relative to the naive approach using the same model will increase with increases in the number of observations.Conclusions: Efficient Leave-one-out cross validation strategies are presented here, requiring little more effort than a single analysis.展开更多
Background: The frequency of recombination events varies across the genome and between individuals, which may be related to some genomic features. The objective of this study was to assess the frequency of recombinati...Background: The frequency of recombination events varies across the genome and between individuals, which may be related to some genomic features. The objective of this study was to assess the frequency of recombination events and to identify QTL(quantitative trait loci) for recombination rate in two purebred layer chicken lines.Methods: A total of 1200 white-egg layers(WL) were genotyped with 580 K SNPs and 5108 brown-egg layers(BL)were genotyped with 42 K SNPs(single nucleotide polymorphisms). Recombination events were identified within half-sib families and both the number of recombination events and the recombination rate was calculated within each0.5 Mb window of the genome. The 10% of windows with the highest recombination rate on each chromosome were considered to be recombination hotspots. A BayesB model was used separately for each line to identify genomic regions associated with the genome-wide number of recombination event per meiosis. Regions that explained more than 0.8% of genetic variance of recombination rate were considered to harbor QTL.Results: Heritability of recombination rate was estimated at 0.17 in WL and 0.16 in BL. On average, 11.3 and 23.2 recombination events were detected per individual across the genome in 1301 and 9292 meioses in the WL and BL,respectively. The estimated recombination rates differed significantly between the lines, which could be due to differences in inbreeding levels, and haplotype structures. Dams had about 5% to 20% higher recombination rates per meiosis than sires in both lines. Recombination rate per 0.5 Mb window had a strong negative correlation with chromosome size and a strong positive correlation with GC content and with CpG island density across the genome in both lines. Different QTL for recombination rate were identified in the two lines. There were 190 and 199 non-overlapping recombination hotspots detected in WL and BL respectively, 28 of which were common to both lines.Conclusions: Differences in the recombination rates, hotspot locations, and QTL regions associated with genomewide recombination were observed between lines, indicating the breed-specific feature of detected recombination events and the control of recombination events is a complex polygenic trait.展开更多
基金supported by the US Department of Agriculture,Agriculture and Food Research Initiative National Institute of Food and Agriculture Competitive grant no.2015-67015-22947
文摘Background: A random multiple-regression model that simultaneously fit all allele substitution effects for additive markers or haplotypes as uncorrelated random effects was proposed for Best Linear Unbiased Prediction, using whole-genome data. Leave-one-out cross validation can be used to quantify the predictive ability of a statistical model.Methods: Naive application of Leave-one-out cross validation is computationally intensive because the training and validation analyses need to be repeated n times, once for each observation. Efficient Leave-one-out cross validation strategies are presented here, requiring little more effort than a single analysis.Results: Efficient Leave-one-out cross validation strategies is 786 times faster than the naive application for a simulated dataset with 1,000 observations and 10,000 markers and 99 times faster with 1,000 observations and 100 markers. These efficiencies relative to the naive approach using the same model will increase with increases in the number of observations.Conclusions: Efficient Leave-one-out cross validation strategies are presented here, requiring little more effort than a single analysis.
基金supported by Hy-Line Int.,the EW group,and Agriculture and Food Research Initiative competitive grants 2009–35205-05100 and 2010–65205-20341 from the USDA National Institute of Food and Agriculture Animal Genome Program
文摘Background: The frequency of recombination events varies across the genome and between individuals, which may be related to some genomic features. The objective of this study was to assess the frequency of recombination events and to identify QTL(quantitative trait loci) for recombination rate in two purebred layer chicken lines.Methods: A total of 1200 white-egg layers(WL) were genotyped with 580 K SNPs and 5108 brown-egg layers(BL)were genotyped with 42 K SNPs(single nucleotide polymorphisms). Recombination events were identified within half-sib families and both the number of recombination events and the recombination rate was calculated within each0.5 Mb window of the genome. The 10% of windows with the highest recombination rate on each chromosome were considered to be recombination hotspots. A BayesB model was used separately for each line to identify genomic regions associated with the genome-wide number of recombination event per meiosis. Regions that explained more than 0.8% of genetic variance of recombination rate were considered to harbor QTL.Results: Heritability of recombination rate was estimated at 0.17 in WL and 0.16 in BL. On average, 11.3 and 23.2 recombination events were detected per individual across the genome in 1301 and 9292 meioses in the WL and BL,respectively. The estimated recombination rates differed significantly between the lines, which could be due to differences in inbreeding levels, and haplotype structures. Dams had about 5% to 20% higher recombination rates per meiosis than sires in both lines. Recombination rate per 0.5 Mb window had a strong negative correlation with chromosome size and a strong positive correlation with GC content and with CpG island density across the genome in both lines. Different QTL for recombination rate were identified in the two lines. There were 190 and 199 non-overlapping recombination hotspots detected in WL and BL respectively, 28 of which were common to both lines.Conclusions: Differences in the recombination rates, hotspot locations, and QTL regions associated with genomewide recombination were observed between lines, indicating the breed-specific feature of detected recombination events and the control of recombination events is a complex polygenic trait.