首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
There is an increasing interest in using whole‐genome sequence data in genomic selection breeding programmes. Prediction of breeding values is expected to be more accurate when whole‐genome sequence is used, because the causal mutations are assumed to be in the data. We performed genomic prediction for the number of eggs in white layers using imputed whole‐genome resequence data including ~4.6 million SNPs. The prediction accuracies based on sequence data were compared with the accuracies from the 60 K SNP panel. Predictions were based on genomic best linear unbiased prediction (GBLUP) as well as a Bayesian variable selection model (BayesC). Moreover, the prediction accuracy from using different types of variants (synonymous, non‐synonymous and non‐coding SNPs) was evaluated. Genomic prediction using the 60 K SNP panel resulted in a prediction accuracy of 0.74 when GBLUP was applied. With sequence data, there was a small increase (~1%) in prediction accuracy over the 60 K genotypes. With both 60 K SNP panel and sequence data, GBLUP slightly outperformed BayesC in predicting the breeding values. Selection of SNPs more likely to affect the phenotype (i.e. non‐synonymous SNPs) did not improve the accuracy of genomic prediction. The fact that sequence data were based on imputation from a small number of sequenced animals may have limited the potential to improve the prediction accuracy. A small reference population (n = 1004) and possible exclusion of many causal SNPs during quality control can be other possible reasons for limited benefit of sequence data. We expect, however, that the limited improvement is because the 60 K SNP panel was already sufficiently dense to accurately determine the relationships between animals in our data.  相似文献   

2.
We studied the effect of including GWAS results on the accuracy of single‐ and multipopulation genomic predictions. Phenotypes (backfat thickness) and genotypes of animals from two sire lines (SL1, n = 1146 and SL3, n = 1264) were used in the analyses. First, GWAS were conducted for each line and for a combined data set (both lines together) to estimate the genetic variance explained by each SNP. These estimates were used to build matrices of weights (D), which was incorporated into a GBLUP method. Single population evaluated with traditional GBLUP had accuracies of 0.30 for SL1 and 0.31 for SL3. When weights were employed in GBLUP, the accuracies for both lines increased (0.32 for SL1 and 0.34 for SL3). When a multipopulation reference set was used in GBLUP, the accuracies were higher (0.36 for SL1 and 0.32 for SL3) than in single‐population prediction. In addition, putting together the multipopulation reference set and the weights from the combined GWAS provided even higher accuracies (0.37 for SL1, and 0.34 for SL3). The use of multipopulation predictions and weights estimated from a combined GWAS increased the accuracy of genomic predictions.  相似文献   

3.
A population-based imputation procedure was used to predict the most likely genotype of un-typed loci on low density SNP maker panels to improve data integrity before genetic association and selection studies when pedigree information is not available such as in feedlot applications. It is of practical importance to evaluate the accuracy effects of imputed genotypes. In our report, a population consisting of 2246 Angus bulls that were genotyped using both Illumina Bovine3k and Bovin50 BeadChip was used. Several scenarios with varying percentages of missing SNP genotypes under a random missing pattern were simulated. Additionally, several scenarios with varying percentages of animals genotyped using the 3 k and 50 k panels assuming a structured missing pattern were considered. With the random missing scenarios, SNP genotypes on the Bovine50 panel were masked at random until reaching the desired missing percentage. With the structured missing scenarios, all SNP genotypes in the Bovine50 chip were masked, with the exception of those corresponding to the Bovine3 panel. The missing rates considered in this study ranged from 70% to 94% across chromosomes. Population-based imputation software fastPHASE1.2 was used for the separate analysis of each of the 30 pairs of chromosomes in the bovine genome. The results of the imputation of the random-missing SNP genotypes were similar to previous reports and accuracy rates, defined as the percentage of correct prediction of the true missing genotypes, ranging from 68% to 97% were influenced primarily by the proportion of missing genotypes. Moreover, imputation performance using structured-missing-pattern panels was impacted by the amount of individuals in reference population and level of linkage disequilibrium (LD) on each chromosome. In order to further elucidate the potential effect of incorrect imputation on genomic selection, wrongly imputed genotypes were grouped into two groups as a function of the number of incorrectly imputed alleles.  相似文献   

4.
旨在比较结合全基因组关联分析(genome-wide association study, GWAS)先验标记信息的基因组育种值(genomic estimated breeding value, GEBV)估计与基因组最佳线性无偏预测(genomic best linear unbiased prediction, GBLUP)方法对鸡剩余采食量性状育种值估计的准确性,为提高基因组选择准确性提供理论与技术支持。本研究选用广西金陵花鸡3个世代共2 510个个体作为素材,其中公鸡1 648只,母鸡862只,以42~56日龄期间的剩余采食量(residual feed intake, RFI)为目标性状,将试验群体随机分为两组,其中一组作为先验标记信息发现群体,用于GWAS分析并筛选最显著的top5%、top10%、top15%和top20%的位点作为先验标记信息;另外一组分别结合不同的先验标记信息进行遗传参数估计并比较基因组育种值的预测准确性,使用重复10次的五倍交叉验证法获取准确性,随后两组群体再进行交叉验证。研究结果表明,GBLUP计算RFI的遗传力为0.153,预测准确性为0.38...  相似文献   

5.
Bootstrap aggregation (bagging) is a resampling method known to produce more accurate predictions when predictors are unstable or when the number of markers is much larger than sample size, because of variance reduction capabilities. The purpose of this study was to compare genomic best linear unbiased prediction (GBLUP) with bootstrap aggregated sampling GBLUP (Bagged GBLUP, or BGBLUP) in terms of prediction accuracy. We used a 600 K Affymetrix platform with 1351 birds genotyped and phenotyped for three traits in broiler chickens; body weight, ultrasound measurement of breast muscle and hen house egg production. The predictive performance of GBLUP versus BGBLUP was evaluated in different scenarios consisting of including or excluding the TOP 20 markers from a standard genome‐wide association study (GWAS) as fixed effects in the GBLUP model, and varying training sample sizes and allelic frequency bins. Predictive performance was assessed via five replications of a threefold cross‐validation using the correlation between observed and predicted values, and prediction mean‐squared error. GBLUP overfitted the training set data, and BGBLUP delivered a better predictive ability in testing sets. Treating the TOP 20 markers from the GWAS into the model as fixed effects improved prediction accuracy and added advantages to BGBLUP over GBLUP. The performance of GBLUP and BGBLUP at different allele frequency bins and training sample sizes was similar. In general, results of this study confirm that BGBLUP can be valuable for enhancing genome‐enabled prediction of complex traits.  相似文献   

6.
The objectives of this study were to estimate the additive and dominance variance component of several weight and ultrasound scanned body composition traits in purebred and combined cross‐bred sheep populations based on single nucleotide polymorphism (SNP) marker genotypes and then to investigate the effect of fitting additive and dominance effects on accuracy of genomic evaluation. Additive and dominance variance components were estimated in a mixed model equation based on “average information restricted maximum likelihood” using additive and dominance (co)variances between animals calculated from 48,599 SNP marker genotypes. Genomic prediction was based on genomic best linear unbiased prediction (GBLUP), and the accuracy of prediction was assessed based on a random 10‐fold cross‐validation. Across different weight and scanned body composition traits, dominance variance ranged from 0.0% to 7.3% of the phenotypic variance in the purebred population and from 7.1% to 19.2% in the combined cross‐bred population. In the combined cross‐bred population, the range of dominance variance decreased to 3.1% and 9.9% after accounting for heterosis effects. Accounting for dominance effects significantly improved the likelihood of the fitting model in the combined cross‐bred population. This study showed a substantial dominance genetic variance for weight and ultrasound scanned body composition traits particularly in cross‐bred population; however, improvement in the accuracy of genomic breeding values was small and statistically not significant. Dominance variance estimates in combined cross‐bred population could be overestimated if heterosis is not fitted in the model.  相似文献   

7.
Reliable genomic prediction of breeding values for quantitative traits requires the availability of sufficient number of animals with genotypes and phenotypes in the training set. As of 31 October 2016, there were 3,797 Brangus animals with genotypes and phenotypes. These Brangus animals were genotyped using different commercial SNP chips. Of them, the largest group consisted of 1,535 animals genotyped by the GGP‐LDV4 SNP chip. The remaining 2,262 genotypes were imputed to the SNP content of the GGP‐LDV4 chip, so that the number of animals available for training the genomic prediction models was more than doubled. The present study showed that the pooling of animals with both original or imputed 40K SNP genotypes substantially increased genomic prediction accuracies on the ten traits. By supplementing imputed genotypes, the relative gains in genomic prediction accuracies on estimated breeding values (EBV) were from 12.60% to 31.27%, and the relative gain in genomic prediction accuracies on de‐regressed EBV was slightly small (i.e. 0.87%–18.75%). The present study also compared the performance of five genomic prediction models and two cross‐validation methods. The five genomic models predicted EBV and de‐regressed EBV of the ten traits similarly well. Of the two cross‐validation methods, leave‐one‐out cross‐validation maximized the number of animals at the stage of training for genomic prediction. Genomic prediction accuracy (GPA) on the ten quantitative traits was validated in 1,106 newly genotyped Brangus animals based on the SNP effects estimated in the previous set of 3,797 Brangus animals, and they were slightly lower than GPA in the original data. The present study was the first to leverage currently available genotype and phenotype resources in order to harness genomic prediction in Brangus beef cattle.  相似文献   

8.
Most genomic prediction studies fit only additive effects in models to estimate genomic breeding values (GEBV). However, if dominance genetic effects are an important source of variation for complex traits, accounting for them may improve the accuracy of GEBV. We investigated the effect of fitting dominance and additive effects on the accuracy of GEBV for eight egg production and quality traits in a purebred line of brown layers using pedigree or genomic information (42K single‐nucleotide polymorphism (SNP) panel). Phenotypes were corrected for the effect of hatch date. Additive and dominance genetic variances were estimated using genomic‐based [genomic best linear unbiased prediction (GBLUP)‐REML and BayesC] and pedigree‐based (PBLUP‐REML) methods. Breeding values were predicted using a model that included both additive and dominance effects and a model that included only additive effects. The reference population consisted of approximately 1800 animals hatched between 2004 and 2009, while approximately 300 young animals hatched in 2010 were used for validation. Accuracy of prediction was computed as the correlation between phenotypes and estimated breeding values of the validation animals divided by the square root of the estimate of heritability in the whole population. The proportion of dominance variance to total phenotypic variance ranged from 0.03 to 0.22 with PBLUP‐REML across traits, from 0 to 0.03 with GBLUP‐REML and from 0.01 to 0.05 with BayesC. Accuracies of GEBV ranged from 0.28 to 0.60 across traits. Inclusion of dominance effects did not improve the accuracy of GEBV, and differences in their accuracies between genomic‐based methods were small (0.01–0.05), with GBLUP‐REML yielding higher prediction accuracies than BayesC for egg production, egg colour and yolk weight, while BayesC yielded higher accuracies than GBLUP‐REML for the other traits. In conclusion, fitting dominance effects did not impact accuracy of genomic prediction of breeding values in this population.  相似文献   

9.
Genomic selection is a method to predict breeding values using genome‐wide single‐nucleotide polymorphism (SNP) markers. High‐quality marker data are necessary for genomic selection. The aim of this study was to investigate the effect of marker‐editing criteria on the accuracy of genomic predictions in the Nordic Holstein and Jersey populations. Data included 4429 Holstein and 1071 Jersey bulls. In total, 48 222 SNP for Holstein and 44 305 SNP for Jersey were polymorphic. The SNP data were edited based on (i) minor allele frequencies (MAF) with thresholds of no limit, 0.001, 0.01, 0.02, 0.05 and 0.10, (ii) deviations from Hardy–Weinberg proportions (HWP) with thresholds of no limit, chi‐squared p‐values of 0.001, 0.02, 0.05 and 0.10, and (iii) GenCall (GC) scores with thresholds of 0.15, 0.55, 0.60, 0.65 and 0.70. The marker data sets edited with different criteria were used for genomic prediction of protein yield, fertility and mastitis using a Bayesian variable selection and a GBLUP model. De‐regressed EBV were used as response variables. The result showed little difference between prediction accuracies based on marker data sets edited with MAF and deviation from HWP. However, accuracy decreased with more stringent thresholds of GC score. According to the results of this study, it would be appropriate to edit data with restriction of MAF being between 0.01 and 0.02, a p‐value of deviation from HWP being 0.05, and keeping all individual SNP genotypes having a GC score over 0.15.  相似文献   

10.
Background: Accurate evaluation of SNP effects is important for genome wide association studies and for genomic prediction. The genetic architecture of quantitative traits differs widely, with some traits exhibiting few if any quantitative trait loci(QTL) with large effects, while other traits have one or several easily detectable QTL with large effects.Methods: Body weight in broilers and egg weight in layers are two examples of traits that have QTL of large effect.A commonly used method for genome wide association studies is to fit a mixture model such as Bayes B that assumes some known proportion of SNP effects are zero. In contrast, the most commonly used method for genomic prediction is known as GBLUP, which involves fitting an animal model to phenotypic data with the variance-covariance or genomic relationship matrix among the animals being determined by genome wide SNP genotypes. Genotypes at each SNP are typically weighted equally in determining the genomic relationship matrix for GBLUP. We used the equivalent marker effects model formulation of GBLUP for this study. We compare these two classes of models using egg weight data collected over 8 generations from 2,324 animals genotyped with a42 K SNP panel.Results: Using data from the first 7 generations, both Bayes B and GBLUP found the largest QTL in a similar well-recognized QTL region, but this QTL was estimated to account for 24 % of genetic variation with Bayes B and less than 1 % with GBLUP. When predicting phenotypes in generation 8 Bayes B accounted for 36 % of the phenotypic variation and GBLUP for 25 %. When using only data from any one generation, the same QTL was identified with Bayes B in all but one generation but never with GBLUP. Predictions of phenotypes in generations 2 to 7 based on only 295 animals from generation 1 accounted for 10 % phenotypic variation with Bayes B but only6 % with GBLUP. Predicting phenotype using only the marker effects in the 1 Mb region that accounted for the largest effect on egg weight from generation 1 data alone accounted for almost 8 % variation using Bayes B but had no predictive power with GBLUP.Conclusions: In conclusion, In the presence of large effect QTL, Bayes B did a better job of QTL detection and its genomic predictions were more accurate and persistent than those from GBLUP.  相似文献   

11.
Significance testing for genome‐wide association study (GWAS) with increasing SNP density up to whole‐genome sequence data (WGS) is not straightforward, because of strong LD between SNP and population stratification. Therefore, the objective of this study was to investigate genomic control and different significance testing procedures using data from a commercial pig breeding scheme. A GWAS was performed in GCTA with data of 4,964 Large White pigs using medium density, high density or imputed whole‐genome sequence data, fitting a genomic relationship matrix based on a leave‐one–chromosome‐out approach to account for population structure. Subsequently, genomic inflation factors were assessed on whole‐genome level and the chromosome level. To establish a significance threshold, permutation testing, Bonferroni corrections using either the total number of SNPs or the number of independent chromosome fragments, and false discovery rates (FDR) using either the Benjamini–Hochberg procedure or the Benjamini and Yekutieli procedure were evaluated. We found that genomic inflation factors did not differ between different density genotypes but do differ between chromosomes. Also, the leave‐one‐chromosome‐out approach for GWAS or using the pedigree relationships did not account appropriately for population stratification and gave strong genomic inflation. Regarding different procedures for significance testing, when the aim is to find QTL regions that are associated with a trait of interest, we recommend applying the FDR following the Benjamini and Yekutieli approach to establish a significance threshold that is adjusted for multiple testing. When the aim is to pinpoint a specific mutation, the more conservative Bonferroni correction based on the total number of SNPs is more appropriate, till an appropriate method is established to adjust for the number of independent tests.  相似文献   

12.
Genomic selection   总被引:2,自引:0,他引:2  
Genomic selection is a form of marker-assisted selection in which genetic markers covering the whole genome are used so that all quantitative trait loci (QTL) are in linkage disequilibrium with at least one marker. This approach has become feasible thanks to the large number of single nucleotide polymorphisms (SNP) discovered by genome sequencing and new methods to efficiently genotype large number of SNP. Simulation results and limited experimental results suggest that breeding values can be predicted with high accuracy using genetic markers alone but more validation is required especially in samples of the population different from that in which the effect of the markers was estimated. The ideal method to estimate the breeding value from genomic data is to calculate the conditional mean of the breeding value given the genotype of the animal at each QTL. This conditional mean can only be calculated by using a prior distribution of QTL effects so this should be part of the research carried out to implement genomic selection. In practice, this method of estimating breeding values is approximated by using the marker genotypes instead of the QTL genotypes but the ideal method is likely to be approached more closely as more sequence and SNP data is obtained. Implementation of genomic selection is likely to have major implications for genetic evaluation systems and for genetic improvement programmes generally and these are discussed.  相似文献   

13.
Genomic selection (GS) is now practiced successfully across many species. However, many questions remain, such as long-term effects, estimations of genomic parameters, robustness of genome-wide association study (GWAS) with small and large datasets, and stability of genomic predictions. This study summarizes presentations from the authors at the 2020 American Society of Animal Science (ASAS) symposium. The focus of many studies until now is on linkage disequilibrium between two loci. Ignoring higher-level equilibrium may lead to phantom dominance and epistasis. The Bulmer effect leads to a reduction of the additive variance; however, the selection for increased recombination rate can release anew genetic variance. With genomic information, estimates of genetic parameters may be biased by genomic preselection, but costs of estimation can increase drastically due to the dense form of the genomic information. To make the computation of estimates feasible, genotypes could be retained only for the most important animals, and methods of estimation should use algorithms that can recognize dense blocks in sparse matrices. GWASs using small genomic datasets frequently find many marker-trait associations, whereas studies using much bigger datasets find only a few. Most of the current tools use very simple models for GWAS, possibly causing artifacts. These models are adequate for large datasets where pseudo-phenotypes such as deregressed proofs indirectly account for important effects for traits of interest. Artifacts arising in GWAS with small datasets can be minimized by using data from all animals (whether genotyped or not), realistic models, and methods that account for population structure. Recent developments permit the computation of P-values from genomic best linear unbiased prediction (GBLUP), where models can be arbitrarily complex but restricted to genotyped animals only, and single-step GBLUP that also uses phenotypes from ungenotyped animals. Stability was an important part of nongenomic evaluations, where genetic predictions were stable in the absence of new data even with low prediction accuracies. Unfortunately, genomic evaluations for such animals change because all animals with genotypes are connected. A top-ranked animal can easily drop in the next evaluation, causing a crisis of confidence in genomic evaluations. While correlations between consecutive genomic evaluations are high, outliers can have differences as high as 1 SD. A solution to fluctuating genomic evaluations is to base selection decisions on groups of animals. Although many issues in GS have been solved, many new issues that require additional research continue to surface.  相似文献   

14.
The objective of this paper was to investigate, for various scenarios at low and high marker density, the accuracy of imputing genotypes when using a multivariate mixed model framework using information from 2, 4, or 10 surrounding markers. This model predicts genotypes at a locus, using genotypes at nearby loci as correlated traits, and the additive genetic relationship matrix to use information from genotyped relatives. For 2 scenarios this method was compared with the population-based imputation algorithms FastPHASE and Beagle. Accuracies of imputation were obtained with Monte Carlo simulation and predicted with selection index theory, using input from the simulated data. Five different scenarios of missing genotypes were considered: 1) genotypes of some loci are missing due to genotyping errors, 2) juvenile selection candidates are genotyped using a smaller SNP panel, 3) some animals in the pedigree of a breeding population are not genotyped, 4) juvenile selection candidates are not genotyped, and 5) 1 generation of animals in the top of the pedigree are not genotyped. Surrounding marker information did not improve accuracy of imputation when animals whose genotypes were imputed were not genotyped for those surrounding markers. When those animals were genotyped for surrounding markers, results indicated a limited gain when linkage disequilibrium (LD) between SNP was low, but a substantial increase in accuracy when LD between SNP was high. For scenario 1, using 1 vs. 11 SNP, accuracy was respectively 0.75 and 0.81 at low, and 0.75 and 0.93 at high density. For scenario 2, using 1 vs. 11 SNP, accuracy was, respectively, 0.70 and 0.73 at low, and 0.71 and 0.84 at high density. Beagle outperformed the other methods at high SNP density, whereas the multivariate mixed model was clearly superior when SNP density was low and animals where genotyped with a reduced SNP panel. The results showed that extending the univariate gene content method to a multivariate BLUP model with inclusion of surrounding marker information only yields greater imputation accuracy when the animals with imputed loci are at least genotyped for some SNP that are in LD with the SNP to be imputed. The equation derived from selection index theory accurately predicted the accuracy of imputation using the multivariate mixed model framework.  相似文献   

15.
为探究基于A矩阵期望遗传关系最大化(maximizing the expected genetic relationship for matrix A,RELA)、基于A矩阵目标群体遗传方差最小化(minimized the target population genetic variance for matrix A,MCA)、平均亲缘关系最大化(the highest mean kinship coefficients,KIN)、随机选择(random selection,RAN)、共同祖先筛选(common ancestor,CA)等不同参考群筛选方法及参考群规模对基因型填充准确性的影响。本研究使用矮小型黄羽肉鸡作为试验群体,采用鸡600K SNP芯片(Affymetrix Axion HD genotyping array)进行基因分型,测定435羽子代公鸡45、56、70、84、91日龄体重。利用Beagle软件将低密度SNP芯片填充为高密度SNP芯片数据,比较不同参考群筛选方法、参考群规模对基因型填充准确性的影响,以及填充芯片基因组预测准确性。结果表明,使用Beagle 4.0结合系谱信息进行填充效果最佳,其次为Beagle 4.0,而Beagle 5.1填充效果最差。使用MCA方法筛选参考群进行基因型填充准确性最高,使用RAN方法筛选参考群进行基因型填充准确性最低,MCA、RELA、CA 3种方法基因型填充准确性差别较小。相比其他方法,使用MCA方法筛选个体作为参考群将低密度SNP芯片填充至高密度SNP芯片进行基因组选择的预测准确性较高,与真实高密度SNP芯片的基因组预测准确性相差甚微。随着参考群规模增大,基因型填充准确性也随之增加,但增速逐渐下降,最后趋于平缓。综上所述,可以通过参考群筛选方法构建参考群以及控制参考群规模,以保证基因型填充和基因组预测准确性并节省成本,本研究为基因型填充在畜禽遗传育种中的应用提供技术参考。  相似文献   

16.
Previously accurate genomic predictions for Bacterial cold water disease (BCWD) resistance in rainbow trout were obtained using a medium‐density single nucleotide polymorphism (SNP) array. Here, the impact of lower‐density SNP panels on the accuracy of genomic predictions was investigated in a commercial rainbow trout breeding population. Using progeny performance data, the accuracy of genomic breeding values (GEBV) using 35K, 10K, 3K, 1K, 500, 300 and 200 SNP panels as well as a panel with 70 quantitative trait loci (QTL)‐flanking SNP was compared. The GEBVs were estimated using the Bayesian method BayesB, single‐step GBLUP (ssGBLUP) and weighted ssGBLUP (wssGBLUP). The accuracy of GEBVs remained high despite the sharp reductions in SNP density, and even with 500 SNP accuracy was higher than the pedigree‐based prediction (0.50–0.56 versus 0.36). Furthermore, the prediction accuracy with the 70 QTL‐flanking SNP (0.65–0.72) was similar to the panel with 35K SNP (0.65–0.71). Genomewide linkage disequilibrium (LD) analysis revealed strong LD (r2 ≥ 0.25) spanning on average over 1 Mb across the rainbow trout genome. This long‐range LD likely contributed to the accurate genomic predictions with the low‐density SNP panels. Population structure analysis supported the hypothesis that long‐range LD in this population may be caused by admixture. Results suggest that lower‐cost, low‐density SNP panels can be used for implementing genomic selection for BCWD resistance in rainbow trout breeding programs.  相似文献   

17.
旨在比较不同方法对中国荷斯坦牛繁殖性状的基因组预测效果,选择最佳的基因组预测方法及信息矩阵权重组合(τ和ω)用于实际育种。本研究利用北京地区33个牧场1998—2020年荷斯坦牛群繁殖记录,分析了3个重要繁殖性状:产犊至首次配种间隔(ICF)、青年牛配种次数(NSH)和成母牛配种次数(NSC)共98 483~197 764条表型数据。同时收集了8 718头母牛和3 477头公牛的基因芯片数据,根据具有芯片数据的牛群结构划分为公牛验证群和母牛验证群。随后,通过BLUPF90软件的AIREMLF90和BLUPF90模块利用最佳线性无偏预测(BLUP)、基因组最佳线性无偏预测(GBLUP)和一步法(ssGBLUP)对3个性状进行基因组预测,不同方法的预测效果根据准确性和无偏性来评估。结果表明,3个繁殖性状均为低遗传力性状(0.03~0.08);ssGBLUP方法中,各性状信息矩阵的权重取值能够在一定程度上提升基因组预测的效果;ICF、NSH和NSC在母牛验证群下的最佳权重取值分别为:τ=1.3和ω=0,τ=0.5和ω=0.4以及τ=0.5和ω=0;在公牛验证群下最优权重组合分别为:τ=1.5和ω=0,τ=1.3和ω=0.8以及τ=0.5和ω=0;基于最佳权重的ssGBLUP方法准确性较BLUP和GBLUP方法准确性分别提升了0.10~0.39和0.08~0.15,且无偏性最接近于1。综上,使用最佳权重组合的ssGBLUP时,各性状基因组预测结果具有较高准确性和无偏性,建议作为中国荷斯坦牛繁殖性状基因组选择方法。  相似文献   

18.
New challenges have arisen with the development of large marker panels for livestock species. Models easily become overparameterized when all available markers are included. Solutions have led to the development of shrinkage or regularization techniques. The objective of this study was the application and comparison of Bayesian LASSO (B-L), thick-tailed (Student-t), and semiparametric multiple shrinkage methods. The B-L and Student-t methods were also each analyzed within a single shrinkage and a multiple shrinkage framework. Simulated and real data were used to evaluate each method's performance. Real data consisted of SNP genotypes of 4,069 Holstein sires. Traits included in analysis of real data were milk, fat, protein yield, and somatic cell score. The performance of each model was compared based on correlations between true and predicted genomic predicted transmitting abilities. Model performance was also compared with the performance of routinely used methods such as Bayes-A and GBLUP through cross-validation techniques. When using simulated data regardless of shrinkage framework, shrinkage models outperformed genomic BLUP (GBLUP). The average advantage of shrinkage models ranged from 1% to approximately 8% depending on the prior specification. When analyzing real data, shrinkage models slightly outperformed GBLUP for most traits. Shrinkage models were better able to model traits for which 1 or more SNP of large effect have been identified. Overall, results suggested a relatively small advantage in multiple shrinkage models. Multiple shrinkage methods could represent a useful alternative to current methods of prediction; however, their performance in a variety of scenarios needs to be investigated further.  相似文献   

19.
This study investigated the effect of including Nordic Holsteins in the reference population on the imputation accuracy and prediction accuracy for Chinese Holsteins. The data used in this study include 85 Chinese Holstein bulls genotyped with both 54K chip and 777K (HD) chip, 2862 Chinese cows genotyped with 54K chip, 510 Nordic Holstein bulls genotyped with HD chip, and 4398 Nordic Holstein bulls genotyped with 54K chip and with deregressed proofs for five milk production traits. Based on these data, the accuracy of imputation from 54K to HD marker data and the accuracy of genomic predictions in Chinese Holstein were assessed. The allele correct rate increased around 2.7 and 1.7% in imputation from the 54K to the HD marker data for Chinese Holstein bulls and cows, respectively, when the Nordic HD‐genotyped bulls were included in the reference data for imputation. However, the prediction accuracy was improved slightly when using the marker data imputed based on the combined HD reference data, compared with using the marker data imputed based on the Chinese HD reference data only. On the other hand, when using the combined reference population including 4398 Nordic Holstein bulls, the accuracy of genomic predictions increased 6.5 percentage points together with a reduction of prediction bias. The HD markers did not outperform the 54K markers in genomic prediction based on the present data. The results indicate that for Chinese Holsteins, it is necessary to genotype more individuals with 54K chip to increase reference population rather than increasing marker density.  相似文献   

20.
This study was carried out to evaluate the advantage of preselecting SNP markers using Markov blanket algorithm regarding the accuracy of genomic prediction for carcass and meat quality traits in Nellore cattle. This study considered 3675, 3680, 3660 and 524 records of rib eye area (REA), back fat thickness (BF), rump fat (RF), and Warner–Bratzler shear force (WBSF), respectively, from the Nellore Brazil Breeding Program. The animals have been genotyped using low-density SNP panel (30 k), and subsequently imputed for arrays with 777 k SNPs. Four Bayesian specifications of genomic regression models, namely Bayes A, Bayes B, Bayes Cπ and Bayesian Ridge Regression methods were compared in terms of prediction accuracy using a five folds cross-validation. Prediction accuracy for REA, BF and RF was all similar using the Bayesian Alphabet models, ranging from 0.75 to 0.95. For WBSF, the predictive ability was higher using Bayes B (0.47) than other methods (0.39 to 0.42). Although the prediction accuracies using Markov blanket of SNP markers were lower than those using all SNPs, for WBSF the relative gain was lower than 13%. With a subset of informative SNPs markers, identified using Markov blanket, probably, is possible to capture a large proportion of the genetic variance for WBSF. The development of low-density and customized arrays using Markov blanket might be cost-effective to perform a genomic selection for this trait, increasing the number of evaluated animals, improving the management decisions based on genomic information and applying genomic selection on a large scale.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号