Introduction

Great improvements have been made in poultry growth, efficiency, and meat yield1. Strategic improvements in management, nutrition, and genetic selection, have led to turkey toms that can weigh over 20 kg at 20 weeks of age2. However, some negative consequences of this improvement in growth and production are becoming apparent. White striping (WS) is a growth-related myopathy that is of increasing interest to the poultry industry. This myopathy presents itself as varying degrees of white striations on the surface of the muscle running parallel to the muscle fibers3. White striping is highly prevalent and has recently been shown to be as high as 88% (including mild to severe cases) in a population of purebred turkeys and as much as 60% in other turkey populations4,5. This myopathy has known negative consequences for consumer acceptance, nutritional, and functional quality of the product3,6,7,8,9. These aspects make research into the biological mechanisms behind the condition and potential methods of prevention of importance to the turkey industry.

Although the amount of research conducted on WS in turkeys is limited, there is more known about the myopathy in broiler chickens. Several studies have been conducted in broilers that investigated WS microscopically and showed that affected breasts have necrotic muscle tissue and increased presence of inflammatory cells, connective tissue, and fat10,11,12. While the exact mechanism for development of WS is still unknown, one of the main mechanisms proposed is ischemia in the affected muscle13. With the magnitude and speed of growth in modern genotypes, the limits of supporting physiological systems such as the circulatory and cardiovascular might have been reached. A major consequence of selection for muscle growth is an increase in muscle fiber hypertrophy14,15,16. This hypertrophy can then lead to insufficient vascularization and reduced blood supply to the fast-growing muscles17,18,19. A restriction in the circulatory system can lead to changes in stem cell growth and the accumulation of metabolic byproducts, inducing oxidative stress likely leading to necrosis, and increases in hypoxic conditions potentially impairing muscle cell regeneration, ultimately leading to the development of WS.

This mechanism of WS development has been supported through various -omics studies in broiler chickens at the level of the transcriptome20,21,22, proteome23, and metabolome13. However, there is a lack of research in these areas for turkeys. Consequently, the objective of this study was to investigate the genomic architecture of WS in turkeys through the estimation of genomic heritability and execution of a genome-wide association study (GWAS) followed by functional analysis for detection of metabolic pathways and gene ontologies associated with the myopathy.

Results and discussion

Estimation of genetic parameters

Genetic and phenotypic correlations of WS with other economically important traits were published in Vanderhout et al.24. Heritability of WS was estimated to be 0.20 ± 0.022 and is the first published genomic heritability estimate for WS in turkeys. The addition of genomic data resulted in a 33% increase in estimated heritability compared to pedigree information alone24. The present estimate was found to be within the range of previously published estimates of heritability (observed scale) in broiler chickens of 0.18–0.5025,26,27. The moderate heritability estimated in the present study suggests that there is a presence of genetic factors influencing WS that could potentially be exploited in selecting birds for reduced WS severity, however, environmental factors can also influence most of the phenotypic variance observed in the population25. However, it is worth noting that the comparing the present heritability estimate with what is reported in the literature may be challenging due to the different species being studied (chickens vs. turkeys), different methods of scoring WS (e.g., different levels of scoring), different breeding goals, and different prevalences of WS in the given populations. Differences in trait prevalence are well known to influence heritability estimates when using linear models which estimate parameters on the observed scale. The prevalence of WS in the present study ranged from 84 to 92% which is substantially greater than what was observed for Bailey et al.25 (18.5–33.8%), Lake et al.27 (79%), and Alnahhas et al.26 (50%). Therefore, it is reasonable to expect variability among studies with regard to heritability estimates.

Significant SNP and positional candidate genes

The percentage of genetic variance explained by each 30-SNP sliding window is presented in Fig. 1. Each window explained 0.05% of the genetic variance on average with no more than 1.00% of the variance being explained by any given window. This suggests that the inheritance of the trait is largely polygenic in nature. A total of 544 SNP windows were classified as significant (top 1% of variance explained) resulting in 95 positional candidate genes found within 50 kb upstream or downstream of these SNP. This distance has been suggested by Do et al.28 to be used when dealing with lower quality assemblies like that of the turkey. The positional candidate genes were located on Meleagris gallopavo autosomal chromosomes (MGA) 2 to 9, 11, 14, 19, 20, and 24. The 95 positional candidate genes were significantly associated (p < 0.05) with four KEGG metabolic pathways (Table 1) and 31 GO terms (21 BP, 3 CC, and 7 MF; Table 2). Positional candidate genes were further considered functional candidate genes (FCG) if they were associated with both a significant metabolic pathway and a significant GO term. Seven FCG were found and were involved mainly in the Wnt signaling pathway (NFATc1), RNA degradation (LSM6 and DHX36), and focal adhesion (COL6A3, FN1, VCL, and GRB2).

Figure 1
figure 1

Manhattan plot for percentage of genetic variance explained by a 30-SNP sliding window across the genome for white striping severity score (0–3). The top 1% of SNP windows that explain the most genetic variance are located above the horizontal line (% of variance explained > 0.330%).

Table 1 List of the KEGG metabolic pathways (p < 0.05) associated with the 95 positional candidate genes for white striping severity score (0–3) in turkeys.
Table 2 List of gene ontology terms including biological processes, cellular components, and molecular functions (p < 0.05) associated with the 95 positional candidate genes for white striping severity score (0–3) in turkeys.

Due to WS being a growth-related myopathy, it is not surprising that several growth and muscle development related BP terms (GO:0003012 muscle system process, GO:0040007 growth, and GO:0061061 muscle structure development) were found to be significantly overrepresented (p < 0.05) by the 95 positional candidate genes. The large selection pressure placed on growth for economically significant muscle groups (i.e., pectoralis) has resulted in meat producing birds that are likely reaching the limit of supporting systems, such as the circulatory system. Thus, the proposed mechanism of WS development is primarily thought to be related to poor blood flow in the breast muscle leading to hypoxia, pressure on satellite cells, and oxidative stress13,21,29. The highly conserved Wnt signalling pathway, one of the four significant metabolic pathways, plays an important role in both embryonic development, where it regulates processes such as differentiation and cell proliferation, polarity, and migration, as well as post-natally, where it regulates tissue homeostasis and biological processes involved in many disorders and cancers30,31,32. The FCG associated with the Wnt signalling pathway, nuclear factor of activated T cells 1 (NFATc1), is found on MGA3 associated with the largest peak in variance explained. This gene has been shown to play a large role in cell cycle progression of human aortic smooth muscle cells33 and promoting the response to injury in arterial smooth muscle cells34. The effect of NFATc1 and the Wnt signalling pathway in the development and repair of the vascular system may be what leads to its significant relationship to WS. Some significant BP terms found in the present study were associated with FCG (including NFATc1, DHX36, and FN1), specifically GO:0009628 response to abiotic stimulus (p = 0.01) and GO:0072359 circulatory system development (p = 0.04), further supporting the relationship between hypoxia, oxidative stress, and WS.

Functional candidate genes collagen type VI alpha 3-chain (COL6A3) and fibronectin 1 (FN1) were also previously found to be significantly associated with WS in broiler chickens35 and differentially expressed between broiler chicken breasts affected versus not affected by WS36,37. The COL6A3 gene produces collagen found in the extracellular matrix of cells that make up skeletal muscles, and mutations in the gene are associated with muscle weakness, atrophy, and necrosis in humans38. The FN1 gene encodes a glycoprotein which plays a role in the creation of extracellular matrix structures during tissue repair and increases in the expression of this gene have been linked with Duchenne muscular dystrophy in humans39. Given the increase in fat and connective tissue that replaces damaged muscle tissue in affected breast muscles, the link between these two genes and WS is reasonable. Another gene of interest is cytosine and glycine rich protein 3 (CSRP3), a positional candidate gene found to be associated with several significant GO terms including 10 BP, one CC, and 1 MF. The CSRP3 gene has been previously shown to be upregulated in broiler chicken breasts affected with WS22,40. This gene encodes a muscle LIM protein and overexpression of such protein can promote muscle differentiation, regeneration, and structural repair of skeletal muscle41,42 further emphasizing the link between WS and muscle tissue damage.

The BP term, GO: 0001816 cytokine production, was found to be significantly overrepresented (p = 0.02) by the positional candidate genes in the present study, including three of the seven FCG (NFATc1, FN1, and DXH36). A microscopic characteristic consistently found in poultry breast tissue affected by WS is an elevated presence of inflammatory cells and cytokines10,43,44. Cytokines are small proteins that play a large role in immune response and inflammation and the elevated presence of these molecules in the muscle of affected breasts is symbolic of muscle cell injury45,46. Whether these genes, and subsequent production of cytokines, was upregulated in the affected breasts of the current study is unknown, however, the expression of inflammatory cytokine genes has been shown to increase with increasing severity of WS in broiler chickens44.

To the best of our knowledge, this study provides the first published estimate of genomic heritability of WS in turkeys and provides the first look into the genomic architecture of WS in turkeys by means of a GWAS and functional analysis. The heritability estimate of WS was found to be 0.20 ± 0.022, and results of the GWAS emphasize the polygenic nature of the trait with no specific genomic region contributing a large portion to the overall genetic variance. Results of the functional analysis identified four significant KEGG metabolic pathways, 31 significant GO terms (21 BP, 3 CC, and 7 MF) and seven functional candidate genes associated with WS. Overall, pathways relating to growth, muscle development, collagen formation, circulatory system development, cell response to stimulus, and cytokine production were highlighted. The results of the present study provide support for the oxidative stress and hypoxic theory of WS development. It should be noted that the WS phenotype was analyzed using a linear model which may reduce the statistical power when considering categorical traits (i.e., compared to a threshold model). Continued -omics research on the topic of WS in turkeys is recommended to further identify relationships between the myopathy and biological processes to identify improved prevention methods. For example, using a meta-GWAS approach to provide a comprehensive assessment of genetic factors influencing WS. Future research should also focus on developing methods of quantitatively scoring WS using technologies such as machine vision algorithms. Such measures would permit an increase in phenotypic measures increasing the power of future analyses.

Materials and methods

Animals

All protocols complied with the guidelines of the Canadian Council on Animal Care and were approved by the University of Guelph Animal Care Committee (AUP 3782). The study was conducted in accordance with relevant guidelines and regulations as well as the ARRIVE guidelines47. Adult male turkeys (20–24 weeks old) from three purebred genetic lines (A, B, and C) were processed over 44 weeks between July 2018 and November 2019. The genetic lines included a sire-line with selection focused on body weight, meat yield, and feed efficiency (line A), a dam-line that was selected primarily for body weight and reproductive traits (line B), and a dam-line selected mainly for reproductive traits (line C). Birds were reared under identical housing and management conditions as specified by the breeding company management guidelines (Hybrid Turkeys, 2020). During processing at a commercial poultry processing plant, birds were electrically stunned, exsanguinated, scalded, defeathered, and eviscerated before moving to the water chiller. Upon completion of the 24 h chilling period (40 min in 5 °C water, 1.5–2 h in 1–2 °C water, and remainder of time layered in ice), birds were deboned, and meat quality and breast muscle weights were measured.

Phenotype and genotype data

Summary statistics of the data are shown in Table 3. Deboned Pectoralis major muscles (N = 8422) were photographed (Hero 6, GoPro, San Mateo, CA, USA) approximately 24 h post-mortem. Photographs were taken using the normal focal length setting from approximately 40 cm above the surface of the breast. The photographs were randomly assigned to six observers who scored the breasts for WS using a 0–3 scoring scale adapted from a system developed in broiler chickens after testing the reliability of the system5,7. In brief, a score of 0 indicated no or minimal white striations whereas a score of 3 indicated the presence of thick white striations covering the breast. Genotypes were collected on 4667 birds using a proprietary 65 K single nucleotide polymorphism (SNP) array (65,000 SNP; Illumina, Inc.). PLINK software48 was used for quality control and SNP markers located on non-autosomal regions with minor allele frequency lower than 0.05, call rate lower than 90%, or significantly deviating from Hardy Weinberg proportions (p < 1 × 10–8) were removed. The quality control resulted in 54,407 markers retained for analysis.

Table 3 Summary statistics of each genetic line (A, B, C) of turkeys.

Statistical analysis

A linear mixed model was used to estimate variance components through restricted maximum likelihood using the BLUPf90 family of programs49. The linear mixed model used can be described as follows:

$${\mathbf{y}} = {\mathbf{Xb}} + {\mathbf{Za}} + {\mathbf{e}},$$

where y is the vector of WS scores; b is a vector of fixed effects including genetic line (3 levels: A, B, and C), hatch week-year (58 levels), age at slaughter (7 levels; 141–163 days), and score observer (6 levels); a is a vector of additive genetic effects distributed as \(\mathbf{a}\sim N(0, \mathbf{H}{\sigma }_{a}^{2})\), where H is the combined pedigree-genomic relationship matrix as in Aguilar et al. (2010) constructed using the PREGSf90 program49. \({\sigma }_{a}^{2}\) is the additive genetic variance; e is the vector of residual effects which has a distribution of \(\mathbf{e}\sim N(0,{\sigma }_{e}^{2})\) where \({\sigma }_{e}^{2}\) is the residual variance; and X and Z are design matrices relating the observations to the fixed and random effects, respectively.

Estimates of SNP effects were derived from the estimated genomic breeding values (gEBV) following50, using a weighted genomic relationship matrix:

$$\widehat{\mathbf{g}}=\mathbf{D}{\mathbf{Z}}^{\mathrm{^{\prime}}}{[\mathbf{Z}\mathbf{D}{\mathbf{Z}}^{\mathrm{^{\prime}}}]}^{-1}{\widehat{\mathbf{u}}}_{{\text{g}}},$$

where \(\widehat{\mathbf{g}}\) is a vector of SNP marker effects; D is a diagonal matrix of weights for variances of SNPs; Z is a matrix relating genotype of each locus; and \({\widehat{\mathbf{u}}}_{{\text{g}}}\) is the vector of gEBV. Due to the proposed polygenic nature of WS and the relatively poor annotation of the turkey genome, a 30-SNP sliding window approach was utilized. This approach allows for accumulating the variance explained by each set of 30 adjacent SNP, which would lead to identify potential genomic regions associated with WS that may not be detected due to the low variance explained by single SNPs. These analyses were carried out using the BLUPf90 family of programs49.

Functional analysis

An arbitrary threshold for markers in the 99th percentile of variance explained were considered significant. Using the Turkey 5.1 assembly51, positional candidate genes within ± 50 kb of the significant SNP were retrieved using the Ensembl Genes database version 104 (https://useast.ensembl.org/Meleagris_gallopavo/Info/Index) implemented through the GALLO R package52. Gene ontology (GO) enrichment analysis including biological processes (BP), cellular components (CC), and molecular functions (MF) as well as metabolic pathway analysis using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database53 were performed on the positional candidate genes using the WebGestaltR R package54 and the Gallus gallus database.