Introduction

Parkinson’s disease (PD) is a chronic neurodegenerative disease affecting roughly 8 million people globally1. Comorbidities frequently occur in PD patients and at a rate higher than similar age controls, which complicates the management of the disease2,3,4. This includes a variety of health conditions, including type 2 diabetes mellitus (T2DM)2,5. PD patients with comorbidities are an especially vulnerable subpopulation among all PD patients, as these patients often have worse prognosis, reduced quality of life, and increased medical costs5,6,7.

Previous research has indicated that T2DM, characterized as insulin resistance that results in impairment of glucose regulation and metabolism, may contribute to the onset of neurodegenerative diseases, such as PD, and also influence the progression of these conditions8,9. Research points toward dysregulation of shared pathophysiologic pathways between T2DM and PD, including insulin resistance, mitochondrial dysfunction, inflammation, or metabolic dysregulation9,10. Few epidemiologic studies, however, have analyzed metabolic profiles associated with T2DM among PD patients to help identify physiologic responses that are similar or different in those suffering from both medical conditions. To date, no published study has investigated metabolic features associated with T2DM in PD patients using untargeted metabolomics.

High-resolution metabolomics (HRM)11 has emerged as a useful tool that can profile thousands of small molecules produced from metabolism (metabolites) in different biospecimens. Previous research has described metabolomic profiles in PD12,13,14, including our own15, and T2DM16 separately. Two previous studies including both targeted and untargeted metabolomics found that lower levels of low-density lipoprotein cholesterol, higher level of fibrinogen, and lipid metabolic dysregulation were associated with a higher risk of dementia in PD patients with T2DM than without T2DM17,18. The current study builds on this previous work, specifically assessing the metabolic profile of T2DM among PD patients. We applied untargeted HRM to serum samples of 636 PD patients of whom 96 also suffered from T2DM and 253 older adults without PD, among whom 36 were diagnosed with T2DM. The goal of the study is to describe shared and unique metabolic features in PD patients with and without T2DM and shed light on unique dysregulated metabolic pathways in PD with T2DM. Given the aging of populations and the projected increase of prevalence in both PD19 and T2DM20, understanding the links between PD and T2DM is a public health priority21,22.

Results

Study population

As shown in Table 1, 15% of the 636 PD patients had T2DM and the mean age of PD diagnosis was similar in PD patients with T2DM (68 y, SD = ±10 y) and without T2DM (67 y, SD = ±10 y). PD patients with and without T2DM were also similar in terms of the percentages of men, people with ≥12 years of education, and have never smoked. However, more of the PD patients with T2DM reported non-European ancestry than those without T2DM. Race and ethnicity of the study population is shown in Supplementary Table 1. In total, 253 non-PD participants were included in the analysis, with 36 (14%) having T2DM. The patterns of age, lifestyle, and ethnicity were also similar to T2DM (see Table 1).

Table 1 Parkinson’s disease (PD) patients’ and non-PD participants’ characteristics by diabetes among those with metabolomics data

Metabolome-wide analysis and annotation

For untargeted, high-resolution metabolomics, we analyzed serum samples using both hydrophilic interactions (HILIC) chromatography with positive electrospray ionization (ESI) and C18 chromatography with negative ESI. After feature alignment and quality control, we included 2913 metabolic features detected in the positive ion mode (HILIC) and 2222 metabolic features from the negative ion mode (C18) in our MWAS analysis. MWAS results are shown in Figs. 1 and 2 and complete summary statistics for the metabolic features associated with T2DM are available through figshare (https://doi.org/10.6084/m9.figshare.22589464). After adjusting for multiple testing and considering a significance level of FDR < 0.05, 192 metabolic features from the HILIC column and 133 in the C18 column were associated with T2DM among the PD patients. When considering only replication of the T2DM MWAS metabolites discovered in this PD population, after multiple testing correction, 49 of the 192 metabolic features associated in HILIC and 38 out of 133 metabolic features in C18 were also associated with T2DM at a replication FDR < 0.05 in the non-PD participants (Supplementary Table 2). The coefficients for T2DM association across metabolic features from the MWAS among the PD patient group and non-PD participant group were moderately correlated (HILIC: r = 0.4, p < 2.2e−16; C18: r = 0.45, p < 2.2e−16). This is displayed in Fig. 2, which plots metabolite feature coefficients from the T2DM MWAS in PD and non-PD participants, colored features by association status (i.e., associated with T2DM in PD patient group, non-PD participant group or both). Most features showed concordance in the direction of association with T2DM among both the PD and non-PD participants.

Fig. 1: Manhattan plots of results of metabolome-wide association study for diabetes mellitus (DM).
figure 1

a Among Parkinson’s disease (PD) patients in HILIC column, b among PD patients in C18 column, c among non-PD participants in HILIC column, and d among non-PD participants in C18 column.

Fig. 2: Correlations between coefficients of metabolic features from metabolome-wide association study for diabetes mellitus (DM) among Parkinson’s disease (PD) patients and non-PD participants.
figure 2

a HILIC column and b C18 column colored by significance level adjusting for multiple testing.

Annotations for all metabolic features using the in-house library and xMSannotator are included in Supplementary Tables 3 and 4. Among the T2DM-associated metabolic features, 9 metabolites were uniquely annotated at high confidence, while 9 features were annotated at high confidence to multiple metabolites based on both the in-house library and xMSannotator (see Table 2). For example, the top feature (mz = 215.0328, retention time = 30.913) associated with T2DM among PD patients (OR = 1.80 per SD, 95% CI = 1.46, 2.25, FDR = 1.30e−05) and non-PD patients (OR = 3.62, 95% CI = 2.22, 5.91, FDR = 1.33e−04) was annotated at high confidence to glucose. Similarly, another top feature (mz = 179.0562, retention time = 36.803) in both groups (PD patients, OR = 1.75, 95% CI = 1.41, 2.15, FDR = 2.81e−05; non-PD participants, OR = 2.80, 95% CI = 1.84, 4.26, FDR = 2.83e−04) was annotated to simple sugars (i.e., glucose, fructose and mannose). The amino acid metabolite citrulline (mz = 174.088, rt = 30.043) was inversely associated with T2DM in both PD patients (OR = 0.54, 95% CI = 0.43, 0.68, FDR = 2.35e−05) and non-PD participants (OR = 0.48, 95% CI = 0.33, 0.70, FDR = 1.07e−02).

Table 2 Annotation of metabolic features associated with diabetes mellitus among Parkinson’s disease (PD) patients and non-PD participants

Most were associated with T2DM only among PD patients, though many showed similar trends among the non-PD participants. Metabolites with unique annotation matches included several amino acids (cystathionine, cystine, glutamic acid, gulonolactone, proline, aspartic acid, and indoleacetaldehyde), nucleic acids (uridine, hypoxanthine), and tricarboxylic acid (TCA) cycle/coenzyme A (CoA) related metabolites (oxoglutaric acid, pantothenic acid, trans-aconitic acid, and mevalonic acid).

Pathway enrichment analysis

We identified 55 metabolic pathways enriched (p < 0.05) among the features associated with T2DM in PD patients and 28 metabolic pathways from the features associated with T2DM among non-PD participants (see Table 3). After removing duplicate pathways identified in both HILIC and C18 in either group, 17 metabolic pathways remained associated with T2DM (p < 0.05) in both PD patients and non-PD participants, 26 pathways were only enriched in PD patients, and 5 only in non-PD participants. Figure 3 shows these pathways, along with each pathway’s biochemical classification and functional group.

Table 3 Metabolic pathways associated with diabetes mellitus (DM) using metabolites captured by positive ion mode (HILIC) and negative ion mode (C18) among Parkinson’s disease (PD) patients and non-PD participants
Fig. 3: Pathway enriched among Parkinson’s disease (PD) patients and non-PD participants.
figure 3

a Annotation of pathways by chemical groups, b by functional proporties.

The pathways enriched for T2DM-associated metabolites in both PD and non-PD participants included carbohydrate-associated metabolic pathways (e.g., simple sugar metabolism, such as fructose, mannose, galactose metabolism and glycolysis, and complex sugar metabolism, such as starch and sucrose metabolism, n-glycan pathway, amino sugars metabolism) and glycosphingolipid metabolism.

The T2DM-associated pathways enriched only among PD patients included multiple amino acid metabolism pathways (e.g., alanine, aspartate, arginine, proline, glutamate, and glutathione metabolism), nucleic acid metabolism pathways (e.g., purine metabolism and pyrimidine metabolism), and fatty acid metabolisms (e.g., butanoate metabolism and mono-unsaturated fatty acid beta-oxidation). In addition, the T2DM-associated metabolites from PD patients were also enriched for urea cycle and TCA cycle-related pathways, including CoA catabolism and vitamin B5 - CoA biosynthesis from pantothenate.

Full pathway analysis results are included in Supplementary Table 5. The pathway classifications and supporting references are provided in Supplementary Table 6. In addition, when we used a significance level of 0.15 for pathway enrichment analysis among non-PD participants, the T2DM-associated metabolic pathways were similar to the pathways identified using a significance level of 0.05 among non-PD participants (See Supplementary Table 7).

Power analysis

Based on power analysis for replication of the T2DM metabolite associations discovered among the PD patients in the non-PD participants, we were powered to detect betas >0.81 and <−0.82 [power = 0.8, alpha = 0.05/192 of tests (HILIC), sample size of 253, event rate of 0.14] and betas >0.80 and <−0.80 [power = 0.8, alpha = 0.05/133 of tests (C18)]. For a full, untargeted MWAS among the non-PD participants with logistic regression, we were powered to detect betas >0.93 and <−0.94 [power = 0.8, alpha = 0.05/2931 of tests (HILIC)] and betas >0.92 and <−0.92 [power = 0.8, alpha = 0.05/2222 of tests (C18)]. Additionally, for the pathway analysis among the T2DM-associated metabolites from non-PD controls, we assessed enrichment with more relaxed significance thresholds from the MWAS for metabolite inclusion. This included an unadjusted p = 0.15, which we were powered to detect betas >0.41 and <−0.42 (power = 0.8, alpha = 0.15) and unadjusted p = 0.05, which we were powered to detect betas >0.51 and <−0.51 (power = 0.8, alpha=0.05).

Comparison between T2DM and PD MWAS

We have previously published the findings from a PD MWAS using the same population15. Overall, there were 13 metabolites identified as associated with T2DM among the PD patients and PD (Supplementary Table 8). These metabolites include indoleacetaldehyde, glutamic acid, indoleacrylic acid, 3-methyldioxyindole, tryptophan, glutamate, uridine, serine, oxoglutaric acid, threonine, aminobutyric acid, squamolone, and 3-mercapto-3-methylbutan-1-ol.

Sensitivity analyses

The results from the T2MD MWAS among the PD patients stratified by gender (Supplementary Table 9) and European Ancestry (Supplementary Table 10) can be found in the supplement. Beta coefficients were highly correlated between those found among the men and the primary MWAS (r = 0.91 and 0.93, p < 2.2e−16 for HILIC and C18) and somewhat less, though still correlated, comparing the women and the primary MWAS (r = 0.63 and 0.67, p < 2.2e−16 for HILIC and C18), which shown in Supplementary Fig. 1. Beta coefficients were also highly correlated between the European ancestry only group and the primary MWAS (r = 0.91 and 0.93, p < 2.2e−16 for HILIC and C18) and between the non-European ancestry group and our main analysis (r = 0.74 and 0.75, p < 2.2e−16 for HILIC and C18), which shown in Supplementary Fig. 2.

The results of 3-fold cross-validation can be found in Supplementary Table 11. The significance rates for the validation are the number of significantly replicated metabolic features in the replicate sample divided by the total number of significant metabolic features in the discovery sample for each fold. For the T2DM MWAS among PD patients, the significance rates were 49%, 50%, and 70% in HILIC with the correlation of the betas between discovery and the replication fold at 0.88, 0.85, and 0.92 (Supplementary Fig. 3), and 25%, 77%, and 58% in C18 with beta correlations at 0.69, 0.98, and 0.94 (Supplementary Fig. 3). For the T2DM MWAS among non-PD participants, the significance rates and beta correlations were 95%, 75%, and 96% and 0.93, 0.88, and 0.88 (Supplementary Fig. 4) for HILIC metabolites, and 85%, 48%, and 100% and 0.98, 0.70, and 0.87 (Supplementary Fig. 4) for C18 metabolites.

Further sensitivity analyses, additionally controlling for levodopa equivalent daily dose (Supplementary Table 12 and Supplementary Fig. 5) and sample year (Supplementary Tables 13, 14 and Supplementary Figs. 6 and 7) showed very similar results (MWAS beta correlations >0.99). Similarly, considering T2DM with T2DM medication use as the exposure resulted in very similar results (MWAS beta correlations >0.96, Supplementary Table 15 and Supplementary Figs. 8 and 9).

Discussion

Comorbidities including type 2 diabetes mellitus (T2DM) can significantly impact the prognosis of PD patients and increase their financial burden5. Recent studies have suggested that T2DM contributes to faster motor progression and more severe cognitive impairment among PD patients23,24,25,26. However, the underlying mechanisms are unknown27. To better understand the influence of T2DM on PD physiology, we used untargeted HRM to profile thousands of metabolites in the serum of PD patients and non-PD elderly participants with and without T2DM.

Our results suggest that—as one would expect—some of the most dysregulated T2DM-related metabolic pathways are shared by PD and non-PD study participants, confirming the impact of T2DM on the human metabolome independent of PD. Even though the number of statistically significant metabolic features was small, the correlation of the coefficients across metabolic features was somewhat high (R > 0.4) (see Fig. 2). These metabolites were enriched in sixteen pathways of high biologic plausibility for T2DM, that is, one would expect to find them enriched in T2DM, such as carbohydrate metabolism, glycosaminoglycans, glycan metabolism, and amino sugars metabolism, as well as glycosphingolipid metabolism, the latter has been linked to insulin resistance28.

In terms of individual metabolites, we observed many expected associations, such as glucose and simple sugars, along with several interesting associations, including biliverdin, which was negatively associated with T2DM among PD patients. Biliverdin is derived from the breakdown of heme, which is then generally quickly broken down to bilirubin by biliverdin reductase29. Interestingly, both a buildup of and reduction in biliverdin have been associated with disease29. For instance, biliverdin has been shown to have anti-inflammatory effects and be protective against insulin resistance30. However, liver failure and impaired metabolism also result in elevated biliverdin29.

We also observed several pathways that were altered with T2DM among PD patients. These include essential amino acid pathways such as histidine metabolism, lysine metabolism, and tryptophan metabolism. These amino acids are important for many physiologic functions, including energy production and nitrogen balance, which are essential for growth, development, and tissue repair31,32, as well as regulating neurological function, gene expression, cell signaling, antioxidative responses, and immunity33. Essential amino acids have also been linked to mitochondrial dysfunction via disrupted energy production and oxidative stress34. Mitochondrial dysfunction is strongly linked to the pathogenesis of both T2DM35 and PD36. Tryptophan is also metabolized into indole derivatives, including 3-methyldioxyindole, indoleacetic acid, and indoleacetaldehyde via gut microbiota37,38,39. In the present study, these indole derivatives were also negatively associated with T2DM among the PD patients. Interestingly, tryptophan metabolism, indole metabolism, and metabolic disease have been linked recently through the gut microbiome38,40,41. The importance of the gut microbiome and gut-brain axis is also increasingly recognized in PD42.

There is also evidence that several non-essential amino acids, including glutamate43 and glycine44, are involved in oxidative stress pathways. Our study showed elevated levels of glutamate metabolites associated with T2DM in PD patients. Glutamate can be converted to α-ketoglutarate to participate TCA cycle45. However, excessive amounts of glutamate, called glutamate excitotoxicity, can lead to Ca2+ influx that can promote mitochondria dysfunction and cell death46. Furthermore, these amino acid pathways have other links to PD. For instance, through overexpression of PGC-1α, which can result in mitochondrial function and a decrease in phenylalanine, tyrosine, and glutamine47,48, or involvement of PINK1, which has been linked to an increased glutamate and dysregulation of the TCA cycle49. Oxidative stress has been suggested as a shared pathway between PD and T2DM50 and is closely linked to mitochondria dysfunction among other mechanisms51. In T2DM, oxidative stress is induced by hyperglycemia52. In PD, oxidative stress is thought to play an important role in the death of dopaminergic neurons53. It is especially interesting that nearly 20% of the pathways enriched in PD patients with T2DM are linked to oxidative stress versus <1% of the pathways enriched among non-PD participants with T2DM.

Dysregulation of the urea cycle and nitrogen metabolism was also linked to T2DM among PD patients. Increased levels of urea in the kidneys are associated with insulin resistance and suppression of insulin, with evidence from both experimental and epidemiological studies54,55. Elevated urea levels were also found in the brains of PD patients in a brain tissue study56. Urea accumulated in the blood can slowly cross the blood-brain barrier as it becomes compromised with PD, resulting in elevated levels of urea in the brain57,58. In addition, previous metabolomics studies also identified urea metabolism associated with PD and T2DM independently59,60.

Several other pathways with compelling links to both PD and T2DM were also implicated by our results. Both pyrimidine and purine metabolism were enriched in the T2DM-associated metabolites of PD patients. Purines have roles in energy metabolism and signaling, and, along with pyrimidines, DNA and RNA production. A previous metabolomics study identified altered purine metabolism with T2DM61 and mouse models have linked purine metabolites to faster T2DM progression62. Similarly, purine metabolism has also been linked to faster PD motor progression in epidemiological studies and DNA damage has been implicated as contributing to PD pathogenesis by epidemiological and experimental studies63,64. Two fatty acid metabolism pathways, butanoate metabolism, and mono-unsaturated fatty acid β oxidation were also associated with T2DM among PD patients. Higher levels of fatty acid metabolites can impact the severity of insulin resistance65,66. Fatty acid oxidation has been linked to PD onset and mild cognitive impairment in a previous metabolic study of PD patients67. In addition, β oxidation has been suggested as a potential biomarker for the diagnosis of PD at an early stage68. Butanoate or butyrate metabolism refers to short-chain fatty acid metabolism produced by the bacterial formation in the human colon and butyrate is important for maintaining energy balance69. A previous metabolic profiling of T2DM patients also showed an elevated level of butanoate70 and a meta-analysis of gut microbiome-focused studies found bacterial butanoate metabolism to be upregulated among PD patients71. Our results suggest that PD patients with T2DM have levels of these important PD-linked metabolites that are more strongly altered than in PD patients without T2DM.

Our study had several limitations to note. First, HRM analyses were performed after the onset of both T2DM and PD and therefore, detected metabolite alterations likely capture metabolic responses related to existing PD and T2DM. This information can be very informative for understanding disease progression and biological pathways affected by both diseases. However, further longitudinal studies assessing metabolomic patterns prior to disease onset will be needed for causal inferences. The analyses among non-PD participants were also underpowered. So, while we report associations related to T2DM among PD patients, many of which were not related to T2DM among the non-PD participants, the smaller sample size limits conclusions related to the specificity. There are also limitations related to metabolite annotation and pathway analysis. Many of the metabolic features could not be annotated or were annotated one to many. However, we did make use on an in-house library, providing confidence in features we were able to match72. Pathway analysis also depends on linking metabolites to pathways, which requires some level of annotation. mummichog pathway analysis is designed for untargeted, unannotated MWAS data73. But many top features could not be linked to annotation even using mummichog. However, given that the identification of metabolites and metabolic pathways is based on existing knowledge, when new information is made available, future analyses can use our summary statistics for annotation and pathway analysis. In addition, our summary statistics provide an opportunity for future meta-analytical explorations. Given our sample size, we were also unable to consider the effects due to better or worse diabetes disease management. In addition, T2DM medication use was almost perfectly correlated with T2DM status and therefore, we were not able to adjust T2DM medication use. This would be of interest for future studies as perhaps better T2DM disease management may mitigate the contributions of T2DM to PD progression. Finally, it is important to note that this is an initial discovery study to uncover the molecular link between PD and T2DM. Our untargeted metabolomics analysis needs to be replicated, but the knowledge gained by this approach may help to eventually improve the prognosis of PD patients with T2DM in the future.

Our study provides insights into metabolic profiles related uniquely to diabetes among PD patients. The differences in metabolites and dysregulated metabolic pathways shed some light on mechanisms that possibly contributed to PD progression or pathogenesis and may eventually provide targets for disease management for comorbid patients. We found that the most important metabolic pathways associated with T2DM do not depend on PD status as expected. However, more informative are pathways we found to be uniquely associated with diabetes among PD patients, including amino acid, nucleic acid, and fatty acid metabolisms pathways. These pathways implicate the enhanced dysregulation of energy metabolism, nitrogen balance, mitochondrial dysfunction, and oxidative stress as potential contributing factors to progression in PD patients with T2DM as these pathways were found to be enriched even compared to PD patients without T2DM.

Methods

Study population

PD patients were recruited as part of a community-based case-control study, the Parkinson’s Gene and Environment (PEG) study. Idiopathic PD patients were recruited between 2000 and 2017 from central California (Kern, Fresno, and Tulare) in two separate study waves, referred to as PEG1 and PEG2. PEG1 participants were recruited between 2001 and 2007 and PEG2 participants between 2011 and 2017. Eligibility criteria for cases included: living in California for five years at minimum, having been diagnosed with PD for ≤3 years for PEG1 and ≤5 years for PEG2, and agreeing to participate in the study. PD patients were recruited from local clinics, neurologists, medical groups, radio advertisements, and the California PD registry. Most of the PD patients (>70%) were taking PD medications (see Supplementary Table 1). Population controls for PD were randomly sampled from the study area using either Medicare enrollee lists (prior to Health Insurance Portability and Accountability Act) or residential parcels listed in property tax assessor records. PEG1 and PEG2 also enrolled a set of household controls for PD patients. These household controls for PD were limited to one per household. Eligibility criteria for controls included: being more than 35 years old, living in California for five years at minimum, and not being diagnosed with PD. Household controls had an additional requirement of living in the same household with the PD participants for at least 1 year. In total, 831 PD patients and 873 non-PD older adults were enrolled in the PEG study. Detailed study population and recruitment methods were published elsewhere74,75. For our analysis, we included 636 PD patients and 253 non-PD older adults with metabolomics data. These were all study participants with serum available for metabolic profiling. The participants contributing to the metabolomics data included in the analysis were similar in covariate distribution to those without metabolomics data (see Table 1 and Supplementary Table 1).

We collected demographic and lifestyle information including age (age at interview for controls and diagnosis for PD patients), gender, race/ethnicity, education, smoking status, as well as their self-reported medical history, including T2DM status and age of diagnosis for T2DM. We excluded PD patients and non-PD participants without blood samples or who did not report their medical history. We imputed education, smoking status, and race/ethnicity for 7 PD patients (education only) and 5 non-PD patient controls.

We obtained blood samples at enrollment and all lab staff followed the same sample processing and storing procedure. A detailed procedure was published previously76. Briefly, after blood was drawn, we let the blood sit for 30 min to clot. We then centrifuged the blood for 20 min to remove the clot and aliquoted 1 mL of the serum per 1.5 mL microcentrifuge tubes. Biospecimens were then transferred from the field on dry ice and stored in a −80 °C freezer at UCLA within the same day. Serum samples were mailed out on dry ice and analyzed for metabolomics at Emory University. We used serum instead of plasma primarily due to sample availability.

High-resolution metabolomics (HRM)

Briefly, serum samples along with quality control samples were centrifuged and analyzed using both hydrophilic interactions (HILIC) chromatography with positive electrospray ionization (ESI) and C18 chromatography with negative ESI (Ultimate 3000, Q-Exactive HF, Thermo Fisher, m/z range 85–1275)72,76. The samples were analyzed in batches of 40. Two quality control samples were used: NIST 1950, which was run as the first and last sample, and commercially purchased plasma pooled from an unknown number of males and females, which were run at the beginning, middle, and end of each batch. HRM detected ions by producing metabolic features including mass-to-charge ratio (m/z), retention time (rt), and abundance. The raw data were extracted and aligned using apLCMS77 with modifications by xMSanalyzer78. Batch effects were corrected by using ComBat from the sva package79. For data analyses, we only included metabolomic features detected in >50% of all samples, with median coefficients of variation (CV) among technical replicates <75% and Pearson correlation >0.7. The metabolic abundance was imputed for missing values with minimum value of each metabolic feature, median-normalized, and auto-scaled.

Annotation

Confirmed metabolites were identified by comparison to an in-house library of reference standards that includes over 300 metabolites that were previously analyzed using identical analytical parameters72,80 using a mass error of <10 ppm and a retention time difference of <30 s. Metabolites without standard available were annotated using xMSannotator81. We retained annotations with a confidence score >2, mass error <10 ppm for m/z, and retention time difference <30 s for grouping adducts and isotopes.

Statistical analysis

We conducted a metabolome-wide association analysis (MWAS) to assess each metabolic feature’s association with T2DM among PD patients (i.e., comparing PD patients with and without T2DM to each other) using logistic regression and controlling for age of PD diagnosis, gender, education, smoking, non-European ancestry, and study wave. We applied a false discovery rate (FDR) to correct for multiple testing. We then conducted pathway enrichment analysis using mummichog version 2 with metabolic features including m/z, retention time, and test statistics from the logistic model (z-value and crude p-value)82. Mummichog primarily uses the Kyoto Encyclopedia of Genes and Genomes (KEGG), UCSD Recon1, and the Edinburgh human metabolic network. Unlike the traditional method that identified metabolites and then maps them to the metabolic network, Mummichog takes untargeted data and maps all possible metabolites, and then searches for local enrichment. The algorithm has been previously validated73. For the mummichog analysis parameters, we selected the M[1+], M + H[1+], or M+Na[1+] adduct types for positive ion mode feature annotation and M-H[-], M-2H[2-], or M-H2O-H[-] for the negative ions. We repeated the same MWAS and pathway enrichment analysis to assess associations with T2DM among the non-PD participant group. We controlled for the same set of potential confounders except for the age of enrollment in place of the age of PD diagnosis. We adjusted for multiple testing, both based on just replication of the T2DM-associated metabolites among PD patients and for the full MWAS. The sample size for the non-PD participant group was limited, therefore, we also present results from a power analysis. Furthermore, to take into account a potential lack of statistical power due to the smaller number of non-PD participants, we compared enrichment analyses using a significance level of 0.15 and 0.05 for metabolic features.

We additionally performed 3-fold cross-validation to assess the generalizability of the findings using an FDR-adjusted p-value of 0.05 in the discovery population to identify associated metabolic features and a crude p-value of 0.10 in the replication population to identify whether metabolites in the discovery population were replicated. We chose these cut-off p values due to the smaller sample size when splitting the population into 3 folds. We then calculated a significance rate as the number of significantly replicated metabolic features in the replicate sample divided by the total number of significant metabolic features in the discovery sample.

To identify metabolites associated with both T2DM and PD in the study population, we compared our present T2DM MWAS results with previously performed PD MWAS results15. We also performed several sensitivity analyses for the metabolic features identified in the main analysis. We present gender-stratified and race/ethnicity-stratified results to assess the potential effect measure modification. To assess potential confounding, we additionally adjusted the T2DM MWAS among PD patients for the amount of levodopa equivalent daily dose taken in mg, and for the sample collection among both PD and non-PD participants. We also compared the results using T2DM diagnosis with T2DM medication treatment, comparing the metabolic differences between participants who reported both having T2DM and using T2DM medication with those participants who reported not having T2DM and not using T2DM medication.

All analyses were performed in R version 4.1.3.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.