Intended for healthcare professionals

CCBY Open access
Research

Chocolate intake and risk of type 2 diabetes: prospective cohort studies

BMJ 2024; 387 doi: https://doi.org/10.1136/bmj-2023-078386 (Published 04 December 2024) Cite this as: BMJ 2024;387:e078386
  1. Binkai Liu, doctoral student1,
  2. Geng Zong, professor23,
  3. Lu Zhu, research assistant1,
  4. Yang Hu, research scientist1,
  5. JoAnn E Manson, professor456,
  6. Molin Wang, associate professor457,
  7. Eric B Rimm, professor145,
  8. Frank B Hu, professor145,
  9. Qi Sun, associate professor145
  1. 1Department of Nutrition, Harvard T.H. Chan School of Public Health, Boston, MA, USA
  2. 2Key Laboratory of Nutrition, Metabolism and Food Safety, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
  3. 3Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, Shanghai, China
  4. 4Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
  5. 5Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
  6. 6Division of Preventive Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
  7. 7Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
  1. Correspondence to: Q Sun qisun{at}hsph.harvard.edu
  • Accepted 5 October 2024

Abstract

Objective To prospectively investigate the associations between dark, milk, and total chocolate consumption and risk of type 2 diabetes (T2D) in three US cohorts.

Design Prospective cohort studies.

Setting Nurses’ Health Study (NHS; 1986-2018), Nurses’ Health Study II (NHSII; 1991-2021), and Health Professionals Follow-Up Study (HPFS; 1986-2020).

Participants At study baseline for total chocolate analyses (1986 for NHS and HPFS; 1991 for NHSII), 192 208 participants without T2D, cardiovascular disease, or cancer were included. 111 654 participants were included in the analysis for risk of T2D by intake of chocolate subtypes, assessed from 2006 in NHS and HPFS and from 2007 in NHSII.

Main outcome measure Self-reported incident T2D, with patients identified by follow-up questionnaires and confirmed through a validated supplementary questionnaire. Cox proportional hazards regression was used to estimate hazard ratios and 95% confidence intervals (CIs) for T2D according to chocolate consumption.

Results In the primary analyses for total chocolate, 18 862 people with incident T2D were identified during 4 829 175 person years of follow-up. After adjusting for personal, lifestyle, and dietary risk factors, participants consuming ≥5 servings/week of any chocolate showed a significant 10% (95% CI 2% to 17%; P trend=0.07) lower rate of T2D compared with those who never or rarely consumed chocolate. In analyses by chocolate subtypes, 4771 people with incident T2D were identified. Participants who consumed ≥5 servings/week of dark chocolate showed a significant 21% (5% to 34%; P trend=0.006) lower risk of T2D. No significant associations were found for milk chocolate intake. Spline regression showed a linear dose-response association between dark chocolate intake and risk of T2D (P for linearity=0.003), with a significant risk reduction of 3% (1% to 5%) observed for each serving/week of dark chocolate consumption. Intake of milk, but not dark, chocolate was positively associated with weight gain.

Conclusions Increased consumption of dark, but not milk, chocolate was associated with lower risk of T2D. Increased consumption of milk, but not dark, chocolate was associated with long term weight gain. Further randomized controlled trials are needed to replicate these findings and further explore the mechanisms.

Introduction

The global prevalence of type 2 diabetes (T2D) has increased noticeably over the past few decades, with an estimated 463 million people affected worldwide in 2019 and projected to increase to 700 million by 2045.1 T2D is a multifactorial disease characterized by insulin resistance and impaired insulin secretion, which can lead to numerous severe complications such as cardiovascular disease, renal failure, and loss of vision.2 A growing body of research has highlighted the importance of lifestyle factors, including healthy diets, in the prevention and management of T2D.34

Higher consumption of total dietary flavonoids, as well as specific flavonoid subclasses, has been associated with a decreased risk of T2D.56 In randomized controlled trials, these flavonoids exerted antioxidant, anti-inflammatory, and vasodilatory effects that might confer cardiometabolic benefits and reduce the risk of T2D,789 although data were not consistent.10 Chocolate, derived from the beans of the cacao tree (Theobroma cacao), is among foods with the highest flavanol content and is a popular snack globally.111213 However, the association between chocolate consumption and risk of T2D remains controversial owning to inconsistent findings in observational studies.1415 Furthermore, most previous studies have primarily focused on total chocolate intake, without considering the potential differences in health effects between chocolate subtypes (ie, dark, milk, and white chocolate).1617 These subtypes differ in cocoa content and proportions of other ingredients such as sugar and milk, which may influence the association with risk of T2D.18

Using data from three prospective cohort studies that repeatedly assess participants’ diet during longitudinal follow-up, we investigated the association between subtypes of chocolate intake and risk of T2D, as well as change in bodyweight, which is a strong predictor for risk of T2D.19

Methods

Study population

This study used data from three large prospective cohorts: the Nurses’ Health Study (NHS), initiated in 1976 and comprising 121 700 female registered nurses20; the Nurses’ Health Study II (NHSII), launched in 1989 and comprising 116 340 female nurses21; and the Health Professionals Follow-Up Study (HPFS), initiated in 1986 and comprising 51 529 male health professionals.22

For analyses on total chocolate consumption, the study baselines (baseline 1) were 1986 for NHS and HPFS and 1991 for NHSII, when diet was first comprehensively assessed using a validated semiquantitative food frequency questionnaire.232425 For analyses on chocolate subtypes, the study baselines (baseline 2) were 2006 for NHS and HPFS and 2007 for NHSII, coinciding with the availability of data on chocolate subtypes in the food frequency questionnaire.

In the primary analyses, we excluded participants who had prevalent diabetes, cardiovascular disease (myocardial infarction, coronary artery bypass surgery, stroke), or cancer (except non-melanoma skin cancer) at baseline; had missing information on baseline age and chocolate intake, or an unusual total energy intake (ie, <600 or >3500 kcal/d for women and <800 or >4200 kcal/d for men); only completed the baseline food frequency questionnaire; or had an undetermined diagnosis date for T2D.

In a secondary analysis of change in bodyweight by levels of chocolate intake, we excluded participants with self-reported diabetes, cardiovascular disease, or cancer at baseline. Additional baseline exclusions were extreme total energy intake and missing data on chocolate intake or bodyweight. During follow-up, we censored individuals who reported cardiovascular disease, cancer, or a diagnosis of diabetes. We also censored individuals with missing data on chocolate intake or bodyweight during follow-up. The flowchart in supplementary figure S1 shows the selection process for participants.

The study protocol was approved by the Human Research Committee of Brigham and Women’s Hospital and the Harvard T.H. Chan School of Public Health. Participants provided informed consent by completing and returning study questionnaires.

Assessment of diet

Diet was assessed every four years using a semiquantitative food frequency questionnaire.23 Participants reported their average frequency of consumption for a standard food portion size (eg, one chocolate bar/pack or 1 oz) in the past year, choosing from nine levels ranging from “never, or less than once per month” to “≥6 per day.” Questions about chocolate consumption were included from 1980 in NHS (study baseline was set at 1986 for total chocolate when the comprehensive food frequency questionnaire was first administered in NHS), 1991 in NHSII, and 1986 in HPFS. Questions about specific subtypes of chocolate consumption, specifically “How often do you consume milk chocolate (bar or pack),” and “How often do you consume dark chocolate,” were added to the food frequency questionnaire from 2006 for NHS and HPFS and from 2007 for NHSII, which were study baselines for the analyses by chocolate subtypes. Information on consumption of chocolate subtypes was collected in 2006 and 2010 for NHS; 2007, 2001, and 2015 for NHSII; and 2006, 2010, 2014, and 2018 for HPFS. Nutrient intakes were measured based on the US Department of Agriculture food composition database. We calculated the average daily intake of nutrients by multiplying the frequency of consumption of each food item that contains the nutrient by the nutrient content, and then summing across all food items. The intake level of flavan-3-ols was calculated by summing the individual components (catechin, epicatechin, gallocatechin, epigallocatechin, epicatechin 3-gallate, and epigallocatechin 3-gallate). The intake level of total flavonoids was further calculated by summing intakes of flavonols, flavones, flavanones, flavan-3-ols, total theaflavins, and polymers proanthocyanidins. Total energy intake was derived from the food frequency questionnaires using the same algorithm.

The validity and reproducibility of the food frequency questionnaires have been reported in detail previously.232425 In validation studies among subgroups of participants in NHS and HPFS, the correlation coefficient between chocolate intake assessed by diet records over seven days and food frequency questionnaire was, respectively, 0.53 and 0.65 for dark chocolate and 0.42 and 0.47 for milk chocolate.26 Additionally, the food frequency questionnaire shows reasonable consistency of food intake measurements over time, with mean intraclass correlation coefficients of 0.71 (dark: 0.71; milk: 0.59) in NHS and 0.72 (dark: 0.66; milk: 0.60) in HPFS.26

Assessment of covariates

In all three cohorts, information on participants’ personal and lifestyle factors, as well as diseases or medical conditions, was collected biennially through questionnaires. Variables assessed included race/ethnicity, bodyweight, waist circumference, smoking status, alcohol consumption, multivitamin use, menopausal status and postmenopausal hormone use (women only), oral contraceptive use (NHSII only), hypertension, hypercholesterolemia, and family history of diabetes. Specialty of health professionals was assessed in HPFS, and academic or professional degrees earned were assessed in NHS. Physical activity was measured using a validated questionnaire,272829 which derived metabolic equivalent tasks in hours per week based on time spent on 10 recreational activities. Height was measured in the baseline questionnaire of these three cohorts, and time varying body mass index (BMI) was calculated as biennially updated bodyweight in kilograms divided by the square of height in meters. The Alternate Healthy Eating Index-2010 (AHEI) was calculated based on participants’ responses in the food frequency questionnaire to reflect overall diet quality, which emphasizes foods and nutrients, such as whole grains, nuts, red and processed meats, sugar sweetened beverages, and alcohol, that predict the risk of chronic disease.30 For NHS and NHSII, we further calculated z scores of standardized neighborhood socioeconomic status using census tract variables from the Neighborhood Change Database. More details on computation are described elsewhere.31

Assessment of T2D

T2D was self-reported in the biennial follow-up questionnaires, and the diagnosis was further confirmed by study doctors through a supplementary questionnaire, which collected data on factors such as glucose concentration, glycated hemoglobin (HbA1c) concentration (after 2010), symptoms and treatments for T2D. People with confirmed T2D, which constituted the outcome of this analysis, were defined according to the National Diabetes Data Group criteria before 1998,32 where at least one of the following items was reported: raised fasting plasma glucose level (≥7.8 mmol/L (140 mg/dL)) or plasma glucose (≥11.1 mmol/L (200 mg/dL)), and at least one T2D symptom (excessive thirst, polyuria, weight loss, hunger); no T2D symptom, but had at least two raised plasma glucose measurements on different occasions; or treatment with insulin or other hypoglycemic drug (insulin or oral hypoglycemic agent). For people with T2D diagnosed after 1998, the threshold for raised fasting plasma glucose level was lowered to 7.0 mmol/L (126 mg/dL) according to the criteria of the American Diabetes Association.33 For people with a diagnosis after January 2010, we further added HbA1c ≥6.5% to the diagnostic criteria.34 The validity of the supplementary questionnaire was established in two previous studies in NHS and HPFS through blinded medical record reviews, with T2D diagnosis confirmed in 98% and 97% of people, respectively.3536

Ascertainment of weight change

Bodyweight was self-reported at baseline and in the biennial follow-up questionnaires, and it has been previously validated in these cohorts, with correlation coefficients of 0.97 in NHS and HPFS and 0.84 in NHSII.3738 We calculated the outcome of interest, weight change every four years, by subtracting bodyweight at the start of each four year interval from bodyweight at the end of the four year interval.

Statistical analysis

We present participants’ characteristics at baseline 1 (1986 for NHS and HPFS, 1991 for NHSII) and baseline 2 (2006 for NHS and HPFS, 2007 for NHSII), stratified by consumption levels of total, dark, and milk chocolate and further categorized by study cohort. This breakdown allowed for a more nuanced understanding of the characteristics of participants, as individuals consuming similar levels of dark or milk chocolate might have distinct baseline characteristics across cohorts. To reflect the long term usual intake of total chocolate at baseline 2, we calculated cumulatively averaged total chocolate intake at or before baseline 2 (1980-2006 for NHS, 1991-2007 for NHSII, and 1986-2006 for HPFS). Person years were calculated as the duration between the return date of the baseline food frequency questionnaire to the date of T2D diagnosis, date of death, date of the last return of a valid follow-up questionnaire, or end of follow-up (2018 for NHS, 2021 for NHSII, and 2020 for HPFS), whichever came first. To reduce potential reverse causality that participants changed their diet after diagnosis of certain diseases, we stopped diet updates once participants reported myocardial infarction, stroke, coronary artery bypass graft, or cancer during follow-up. The rates for missing covariates were low in the cohorts. Any missing values for physical activity, alcohol intake, total energy intake, food and nutrients intake, BMI, AHEI, or z scores for socioeconomic status during follow-up were first replaced by valid values in the previous questionnaire cycle. For the remaining missing data of individual food items (<0.5%), we replaced the missing values with the median intake in the cohort. For missing values of smoking status, oral contraceptive use, menopausal status, and postmenopausal hormone use, we used the missing covariate indicator method.

In primary analyses, we used Cox proportional hazards models to estimate the hazard ratios and corresponding 95% confidence intervals (CIs) for the association between consumption levels of total, dark, and milk chocolate and risk of T2D.39 To maximize statistical power and reduce within person variations, we cumulatively averaged total chocolate intake from baseline 1 and dark and milk chocolate intake from baseline 2. We categorized regular chocolate consumption into four groups of levels, including never or <1 serving/month (reference level), 1 serving/month to <1 serving/week, 1-4 servings/week, and ≥5 servings/week. The Cox models were stratified by age and calendar time. Fully adjusted models accounted for age, calendar year, ethnicity (white, African American, Asian, and other ethnicities), smoking status (never, former, current (cigarettes/day: 1-14, 15-24, or ≥25), or missing), alcohol intake (g/day: 0, 0.1-4.9, 5.0-14.9, and ≥15.0 in women, 0, 0.1-4.9, 5.0-14.9, 15.0-29.9, and ≥30.0 in men, or missing), family history of diabetes (yes/no), menopausal status and postmenopausal hormone use (premenopausal, postmenopausal (never, former, or current hormone use), or missing, for women), use of oral contraceptives (yes, no, NHSII only), physical activity (metabolic equivalent task hours (MET-h)/week: <3, 3.0-8.9, 9.0-17.9, 18.0-26.9, ≥27.0, or missing), baseline BMI (<21.0, 21.0-22.9, 23.0-24.9, 25.0-26.9, 27.0-29.9, 30.0-32.9, 33.0-34.9, ≥35.0, or missing), multivitamin use (yes/no), baseline hypertension, baseline hypercholesterolemia, total energy intake (continuous, kcal/day), and AHEI (fifths). For analyses of chocolate subtypes, we further adjusted total chocolate intake before baseline in the models (as cumulative averages for 1980-2002 (NHS), 1991-2003 (NHSII), 1986-2002 (HPFS)). Analyses were run separately in each cohort first, and then data were pooled to calculate overall study estimates, with cohort origin further adjusted in the model.

Sensitivity analyses were conducted using the pooled dataset: using baseline chocolate intake level instead of time varying intake; using most recently updated data on chocolate intake instead of cumulatively averaged intake; adjusting for baseline BMI as a continuous instead of categorical variable; adjusting for time varying BMI instead of baseline BMI to explore the mediation effect of BMI; adjusting for time varying waist circumference instead of baseline BMI; adjusting for consumption of specific food and beverages high in flavan-3-ols (eg, tea, red wine, apples, pears, blueberries, bananas, and peppers) instead of AHEI; adjusting for consumption of red and processed meat, fruit and vegetables, sugar sweetened beverages, and whole grains instead of AHEI; further adjusting for each individual intake of flavonoids; further adjusting for saturated fat intake; and further adjusting for added sugar intake. In the pooled dataset, we examined potential effect modification by age (<70 years, ≥70 years), sex (male, female), baseline BMI (<30, ≥30), physical activity (<median, ≥median), AHEI (<median, ≥median), and family history of diabetes (yes, no). Stratified analyses were conducted for these binary modifiers, and all models were multivariable adjusted. To address any residual confounding by BMI, we further adjusted baseline BMI as a continuous variable in the BMI stratified models. Interaction terms were created between the binary effect modifiers and the categorical chocolate consumption levels, and P values for interaction were examined using the likelihood ratio test.

Moreover, we explored the potential dose-response association between intake of total, dark, and milk chocolate and risk of T2D using restricted cubic spline analysis (SAS Macro %LGTPHCURV9)40 with three knots, adjusting for the previously mentioned covariates. To reduce the effect of outliers, we combined and truncated data from the three cohorts at the first and 99th centiles. We performed a sensitivity analysis of spline regression not excluding outliers. Additionally, linear regressions were run between total, dark, and milk chocolate intake with selected nutrients, food, and drink at baseline, such as saturated fat, added sugar, flavan-3-ols, and sweets and desserts.

To further determine the association of chocolate intake with weight change, we conducted secondary analyses using multivariable generalized linear regression models with independent correlation matrix and robust variance to examine the associations of four year changes in chocolate intake with concomitant four year changes in bodyweight. For total chocolate analyses, eight four year cycles were available for HPFS (1986-2018) and six four year cycles for both NHS (1986-2010) and NHSII (1991-2015). For analyses of chocolate subtypes, three four year cycles were available for HPFS (2006-18), one four year cycle for NHS (2006-10), and two four year cycles for NHSII (2007-15). For each four year interval, we examined the associations between chocolate intake (increased or decreased versus no change) and weight change. We combined data from three cohorts for this analysis. In the multivariable adjusted model, we adjusted for age, ethnicity (white, African American, Asian, other), family history of diabetes (yes/no), baseline hypertension (yes/no), baseline hypercholesterolemia (yes/no), baseline total energy intake (continuous, kcal/day), postmenopausal hormone use (women only; premenopausal, never, former, current, or missing; time varying), and oral contraceptive use (NHSII only; yes, no; time varying), baseline and change in physical activity (MET-h/week), change in smoking status (stayed never smoker, stayed former smoker, stayed current smoker, changed from former to current smoker, changed from never to current smoker, and changed from current to former smoker), change in alcohol consumption (continuous, g/day), change in AHEI (fifths), and study origin (NHS, NHSII, HPFS). We also conducted stratified weight change analyses by baseline BMI (<25.0, 25.0-29.9, or ≥30.0) to explore any potential effect modifications. We adjusted for the same covariates as in the main analysis, and we further adjusted for baseline BMI in the continuous form. P values for interactions were calculated by general score tests requested by the type 3 command in the model statement (equivalent to the likelihood ratio test).

Analyses were conducted using SAS for Unix (version 9.4, SAS Institute) and RStudio (version 4.2.3, RStudio). We considered two sided P values <0.05 to be statistically significant.

Patient and public involvement

We did not have the infrastructure, resources, funding, or time to involve the public in study design, interpretation of results, or publication.

Results

Baseline characteristics

Supplementary figure S1 shows the selection of participants at specific baselines for primary analysis. Table 1, table 2, and supplementary table S1 present age standardized characteristics of participants at baselines 1 and 2, stratified by cohort, chocolate consumption levels, and types of chocolate (total, dark, and milk, respectively). A total of 192 208 participants (63 798 women in NHS, 88 383 women in NHSII, and 40 027 men in HPFS) were included in the analysis of total chocolate intake and 111 654 (39 400 women in NHS, 58 187 women in NHSII, and 14 067 men in HPFS) in the analyses of chocolate subtypes. The mean ages at baseline 1 were 52.3 years in NHS, 36.1 years in NHSII, and 53.1 years in HPFS. The mean ages at baseline 2 were 70.4 years in NHS, 52.3 years in NHSII, and 68.3 years in HPFS. Most participants were of non-Hispanic white ethnicity. Across all three cohorts, participants with higher levels of chocolate intake also had higher energy intakes, saturated fat, and added sugar. Higher consumption of dark chocolate was associated with higher AHEI and consumption of fruit and vegetables, epicatechin, and total flavonoids. Associations between milk chocolate consumption and these dietary variables were, however, inverse. The distribution of participants’ characteristics across total chocolate intake groups were similar to those for milk chocolate groups.

Table 1

Age standardized baseline characteristics of participants according to dark chocolate* intake at baseline. Values are mean (SD) unless stated otherwise

View this table:
Table 2

Age standardized baseline characteristics according to milk chocolate* intake at baseline. Values are mean (SD) unless stated otherwise

View this table:

When examining the crude associations between chocolate consumption and selected nutrients, foods, and drinks at study baseline, milk chocolate consumption showed stronger positive associations with the consumption of less healthy food and nutrients, including saturated fat, added sugar, red and processed meat, and sweets and desserts. Dark chocolate consumption was positively associated with intakes of flavan-3-ols, particularly epicatechin. Overall, dark chocolate consumption was more positively associated with intakes of other flavan-3-ols-rich food and beverages, such as blueberries, tea, and red wine. The β coefficients between total chocolate consumption and these nutrients or food and beverages were similar to the coefficients observed for milk chocolate (data not shown).

Chocolate intake and T2D

In the primary analyses for total chocolate, 18 862 people with incident T2D were identified during 4 829 175 person years of follow-up. In the age and calendar time stratified Cox proportional hazards models, no significant associations were observed between total chocolate consumption and risk of T2D in the pooled dataset (table 3). After adjusting for lifestyle and dietary risk factors, we found that participants who consumed ≥5 servings/week of any chocolate showed a 10% (95% CI 2% to 17%; P trend=0.07) lower relative risk of T2D compared with those who never or rarely consumed chocolate. A marginally significant 1% (0% to 2%) reduction in risk of T2D was observed for each serving/week consumption of total chocolate.

Table 3

Adjusted hazard ratios (95% CIs)* of T2D for total chocolate intake in NHS (1986-2018), NHSII (1991-2021), and HPFS (1986-2020)

View this table:

For analyses by chocolate subtypes, 4771 people with incident T2D were identified during 1 270 348 person years of follow-up (table 4). After adjusting for confounding factors, participants who consumed ≥5 servings/week of dark chocolate had a significant 21% (5% to 34%) lower rate of T2D compared with those who never or rarely ate dark chocolate, and a significant linear trend across four groups was observed (P trend=0.006). A significant 3% (1% to 5%) reduction in risk of T2D was observed for each serving/week consumption of dark chocolate.

Table 4

Adjusted hazard ratios (95% CIs)* of T2D for chocolate intake by subtype in NHS (2006-18), NHSII (2007-21), and HPFS (2006-20)

View this table:

Associations between milk chocolate intake and risk of T2D were largely null. The multivariable adjusted hazard ratio for consumption of milk chocolate and risk of T2D was 0.94 (95% CI 0.79 to 1.12; P trend=0.75), comparing extreme consumption groups (high versus low).

Statistically significant heterogeneity was observed when examining results across the three cohorts. In the analysis of total chocolate, stronger associations were observed in NHSII, with consumption of ≥5 servings/week of total chocolate associated with a 16% (5% to 26%) lower risk of T2D compared with those who rarely consumed chocolate. In the analyses by chocolate subtypes, HPFS showed the most significant associations for dark chocolate consumption, with consumption of ≥5 servings/week of dark chocolate associated with a 51% (8% to 74%) lower risk of T2D compared with those who never or rarely consumed dark chocolate. Similar associations and trends were observed in NHSII, but the results were less significant. In NHS, neither total nor subtypes of chocolate consumption were statistically significantly associated with risk of T2D.

Restricted cubic spline regression revealed a linear dose-response association between dark chocolate intake and risk of T2D (P for linearity=0.003; fig 1). The association between total chocolate intake and risk of T2D appeared to be non-linear (P for non-linearity=0.008). The dose-response association between milk chocolate intake with risk of T2D remained essentially null. The results continued to be robust without excluding outliers (supplementary figure S2).

Fig 1
Fig 1

Multivariable adjusted, pooled, dose-response associations between chocolate intake and risk of type 2 diabetes in NHS, NHSII, and HPFS. Dose-response analysis was performed using SAS Macro %LGTPHCURV9 (number of knots=3). Data from three cohorts were combined and truncated at the first and 99th centiles. P values for linearity and non-linearity were, respectively, P=0.003 and P=0.96 for dark chocolate, P=0.39 and P=0.99 for milk chocolate, and P=0.08 and P=0.008 for total chocolate. P values for non-linearity were obtained as the significance of the spline terms, and P values for linearity were obtained as the significance of the linear terms. P<0.05 indicates statistical significance. For dark and milk chocolate, follow-up periods were 2006-18 for NHS, 2007-21 for NHSII, and 2006-20 for HPFS. For total chocolate, follow-up periods were 1986-2018 for NHS, 1991-2021 for NHSII, and 1986-2020 for HPFS. Hazard ratios were adjusted for age, calendar year, ethnicity (white, African American, Asian, and others), smoking status (never, former, current (1-14, 15-24, or ≥25 cigarettes/day), or missing), alcohol intake (g/day: 0, 0.1-4.9, 5.0-14.9, and ≥15.0 in women, 0, 0.1-4.9, 5.0-14.9, 15.0-29.9, and ≥30.0 in men, or missing), family history of diabetes (yes/no), menopausal status and postmenopausal hormone use (premenopausal, postmenopausal (never, former, or current hormone use), or missing, women only), use of oral contraceptives (yes, no, NHSII only), physical activity (<3, 3.0-8.9, 9.0-17.9, 18.0-26.9, ≥27.0 MET-h/week, or missing), BMI (<21.0, 21.0-22.9, 23.0-24.9, 25.0-26.9, 27.0-29.9, 30.0-32.9, 33.0-34.9, ≥35.0, or missing), multivitamin use (yes/no), baseline hypertension, baseline hypercholesterolemia, total energy intake, AHEI (five groups), and study origin (NHS, NHSII, HPFS). For dark and milk chocolate dose-response analysis, total chocolate intake before baseline (as cumulative averages 1980-2002 for NHS, 1991-2003 for NHSII, and 1986-2002 for HPFS) were further adjusted. AHEI=Alternate Healthy Eating Index; BMI=body mass index; NHS=Nurses’ Health Study; NHSII=Nurses’ Health Study II; HPFS=Health Professionals Follow-up Study; MET-h=metabolic equivalent tasks per hour

Subgroup analyses

In stratified analyses, significant effect modifications were observed for overall dietary quality and associations of dark chocolate with risk of T2D (P for interaction=0.007; table 5). Specifically, participants with a higher quality diet, measured by AHEI (≥median), showed stronger associations of dark chocolate consumption with risk of T2D (34% (95% CI 12% to 51%) lower risk of T2D comparing ≥5 serving/week to reference group) than individuals with a lower quality diet (hazard ratio 0.87, 95% CI 0.68 to 1.10, comparing extreme groups). Among those younger than 70 years, male participants, those with higher physical activity, and those with no family history of diabetes, the risk of T2D was lower when comparing extreme groups of dark chocolate intake, although P values for interactions were not statistically significant. Age, sex, BMI, physical activity, AHEI, and family history of diabetes did not seem to significantly modify the associations of total and milk chocolate intake with risk of T2D.

Table 5

Stratified analyses by selected characteristics. Values are hazard ratio (95% CI)* unless stated otherwise

View this table:

Sensitivity analyses

In sensitivity analyses comparing extreme groups, using baseline chocolate consumption level as a non-time varying intake in the model largely weakened the associations between dark chocolate and risk of T2D (hazard ratio 0.90 (95% CI 0.75 to 1.07)), whereas this association was slightly strengthened when modeling dark chocolate consumption based on the most recent update (0.75, 0.63 to 0.90) (table 6). Overall results remained unchanged after adjusting for baseline BMI as a continuous variable or time varying categorical BMI instead of baseline categorical BMI (table 6 and supplementary table S2). Adjusting for waist circumference appeared to strengthen the association between dark chocolate and risk of T2D, and further attenuated the milk chocolate association towards the null (supplementary table S3). Adjusting for specific foods and drinks high in flavan-3-ols instead of overall dietary quality also did not appear to significantly change the findings. Adjusting for specific foods that predicted T2D slightly attenuated the results toward the null. Further adjustment for epicatechin levels attenuated the associations between dark chocolate and risk of T2D in the highest consumption group (hazard ratio 0.86 (95% CI 0.71 to 1.04)). Findings did not change after further adjustment for other individual flavonoids (supplementary table S4). Adjusting for added sugar slightly attenuated the associations between dark chocolate and risk of T2D (supplementary table S4). Further adjustment for saturated fat intake appeared to strengthen the association for intake of both total chocolate and chocolate subtypes and risk of T2D (total chocolate: 0.88 (0.81 to 0.96), dark chocolate: 0.77 (0.64 to 0.93), milk chocolate: 0.92 (0.78 to 1.10)). In addition, the associations persisted when adjusting for neighborhood socioeconomic status in two female cohorts (NHS and NHSII; supplementary table S5) and when adjusting for educational levels in NHS or specialties of professions in HPFS (supplementary table S6).

Table 6

Sensitivity analysis. Values are hazard ratio (95% CI)* unless stated otherwise

View this table:

Chocolate intake and weight change

Table 7 shows the results for bodyweight change analyses by chocolate subtypes. Compared with those who did not change their chocolate intake, increased intake of milk chocolate over four year periods was associated with 0.35 kg (95% CI 0.27 to 0.43) more four year weight gain over time. Increasing dark chocolate intake was not associated with weight change over time (−0.06 kg, −0.13 to 0.02). The association between changes in total chocolate intake with long term weight change was also positive (supplementary table S7). The results across the three cohorts were largely consistent, although the associations for total and milk chocolate were the strongest in NHSII compared with the other two cohorts (supplementary table S8).

Table 7

Weight change over four year periods according to four year change in dark and milk chocolate intake

View this table:

In the stratified weight change analyses, we identified baseline BMI as a significant modifier for the associations between four year changes in dark or milk chocolate intake and four year weight change (table 8; P values for interaction <0.001). Increased intake of milk chocolate was associated with more increase in bodyweight in those with obesity at baseline (BMI ≥30) compared with those with a BMI within normal range (<25). Specifically, increased consumption of milk chocolate compared with no change in milk chocolate intake was associated with an average weight gain of 0.68 kg (0.42 to 0.95) more in people with obesity, whereas weight gain in those with normal BMI was only 0.33 kg (0.24 to 0.42) more over four year intervals. Increased dark chocolate intake was not associated with long term weight change in any BMI subgroups. The associations for total chocolate were also stronger in the group with a higher BMI (supplementary table S7). In the primary analyses for diabetes outcome, however, adjusting for time varying BMI did not explain the associations for any chocolate intake and risk of T2D.

Table 8

Weight change over four year periods according to four year change in chocolate subtype intake, stratified by baseline BMI

View this table:

Discussion

The findings of the present study showed that higher consumption of dark, but not milk, chocolate was associated with a lower risk of T2D. Spline regression analysis showed a linear dose-response association between intake of dark chocolate and risk of T2D (P for linearity=0.003). These findings were independent of established and potential risk factors for diabetes and were robust in multiple sensitivity analyses, although statistically significant heterogeneity was also noted among cohorts. Stratified analyses indicated that the association of dark chocolate intake was more apparent among younger individuals. Epicatechin intake may partially account for the inverse associations of dark chocolate. Similar differential associations between milk and dark chocolate intake were also observed for change in bodyweight. Milk chocolate consumption was statistically significantly associated with more weight gain but increased dark chocolate intake was not associated with weight gain. In fact, time varying BMI did not explain the associations for dark chocolate and risk of T2D in our data.

Comparison with other studies

Our study’s finding that intake of total chocolate was statistically significantly associated with lower risk of diabetes was in line with previously published studies. The Physicians’ Health Study reported that consuming ≥2 servings/week of total chocolate was associated with a 17% (95% CI 1% to 31%) lower risk of T2D among 18 235 participants during a median follow-up of 9.2 years.16 In the Multiethnic Cohort Study, consuming chocolate ≥4 times/week was associated with a 19% (95% CI 9% to 28%) lower risk of T2D compared with a lower frequency of consumption (<once/month).41 In the Maine-Syracuse Longitudinal Study, individuals who never or rarely ate chocolate had a 91% (95% CI 3% to 255%) higher risk of T2D than those who consumed chocolate more than once per week.42

Dark and milk chocolate could have a differential effect on T2D; however, evidence for associations pertinent to intake of chocolate subtypes is sparse. The National Health and Nutrition Examination Survey (2007-08 and 2013-14) found that 11.1% of US adults consumed chocolate on a regular basis, whereas only 1.4% reported consuming dark chocolate (≥45% cocoa content).43 Major subtypes of chocolate (dark, milk, and white) differ mainly in their cocoa and sugar content and presence of milk.12 Among the three subtypes of chocolate products, dark chocolate has the highest cocoa content (50-80% cacao) and is the richest chocolate in flavan-3-ols (mean 3.65 mg/g).44 Milk chocolate comprises less than one fifth of the flavan-3-ols (mean 0.69 mg/g) in dark chocolate, has a lower cocoa content (~35%), and has a higher sugar content.44 White chocolate has the highest sugar content, and, since it does not contain cocoa, has no polyphenols. Flavan-3-ol content of chocolate is highly variable depending on the type of processing used. The current analysis investigated the association between chocolate subtypes and risk of T2D, and we found evidence of a linear, inverse association of dark chocolate intake with risk of T2D. Our findings are largely in line with randomized controlled trials that examined dark chocolate or cocoa in relation to T2D or cardiometabolic risk factors. For example, a 15 day randomized controlled trial in glucose intolerant, hypertensive participants found that daily consumption of 100 g high polyphenol dark chocolate led to significant reductions in blood pressure and improvements in insulin sensitivity, compared with a placebo group consuming 90 g white chocolate.45 Studies examining the consumption of high versus low flavanol cocoa products also showed health benefits in improving insulin resistance among individuals with overweight and obesity,46 increasing high density lipoprotein cholesterol levels, and lowering blood pressure among patients with diabetes.4748 However, the recent large scale randomized controlled trial, COSMOS (cocoa supplement and multivitamin outcomes study), which administered 500 mg cocoa flavanols supplements per day (including 80 mg epicatechin) among 21 442 US men and women aged, on average, 72.0 years, yielded somewhat unexpected findings after a median of 3.5 years of follow-up. Flavanol supplementation significantly lowered the risk of death due to cardiovascular disease compared with placebo (hazard ratio 0.73, 95% CI 0.54 to 0.98), but no effects were found on risk of T2D (1.04, 0.91 to 1.20).1049 Reasons for the discrepancy between our observations and the COSMOS findings are unknown. Meanwhile, the inverse associations between dark chocolate intake and risk of T2D in our cohorts were primarily observed among younger individuals (<70 years old) (table 5). Further studies are warranted to elucidate potential age specific effects of cocoa or flavanol intake.

Furthermore, our results suggest that increasing total chocolate intake was associated with more gain in bodyweight, but this was likely mainly driven by milk chocolate. These findings were consistent with some previously published longitudinal studies on total chocolate. In the Atherosclerosis Risk in Communities cohort, every 1 oz/day intake of total chocolate was associated with 0.19 (95% CI 0.04 to 0.15) more change in BMI over six years of the study period.14 In the Women’s Health Initiative, each 1 oz/day intake of chocolate candy and candy bar was associated with 0.92 kg (95% CI 0.80 to 1.05) more three year weight gain.50 Another intriguing observation in our study was the heterogeneity across the three cohorts, with more statistically significant results observed in men than in elderly postmenopausal women, consistent with previously published sex stratified results.51 Oba and colleagues reported similar findings to ours among Japanese people; that consumption of chocolate once or more per week was associated with a 35% (95% CI 3% to 57%) lower risk of T2D in men but no significant effect in women (0.73, 0.48 to 1.13), compared with those who never ate chocolate.51 The reasons underlying these potential gender differences are unknown, although sex hormones might play a role in modifying the association between chocolate consumption and risk of T2D. These findings might also be ascribed to milk chocolate as the primary type of chocolate consumed by study participants, yet neither of these longitudinal studies differentiated between subtypes of chocolate.50 In contrast with total or milk chocolate intake, our study showed that increasing dark chocolate intake was not associated with more weight gain over ≥20 years of follow-up. This finding was consistent with evidence from previous short term (≤8 weeks) randomized controlled trials, which showed no significant weight changes after regular intake of dark chocolate.4552

Dark chocolate, with a higher cocoa content than milk chocolate, may lower the risk of T2D through various mechanisms.53 It has been suggested that bioactive compounds in cocoa, such as flavan-3-ols and their monomeric form, epicatechin, mitigate risk of T2D by improving insulin sensitivity,1154 protecting pancreatic β cells from oxidative stress,55 lowering pro-inflammatory cytokines such as tumor necrosis factor-α and IL-6,56 and improving endothelial function by stimulating the production of nitric oxide, a vasodilator, which may lead to improved glucose metabolism and reduced risk of T2D.575859 Milk and white chocolate intake might not lead to the same metabolic health benefits owing to their higher added sugar content—an established dietary risk factor for cardiometabolic diseases.6061 In contrast, although dark chocolate contains similar levels of energy and saturated fat to milk chocolate (mean percentage of fat in commercially available chocolate (dark: 34.7%; milk: 32.6%44), the rich polyphenols in dark chocolate might offset the effects of saturated fat and sugar on weight gain and the risk of other cardiometabolic diseases.62

Strengths and limitations of this study

A major strength of our study is that we differentiated chocolate intake by subtypes and examined their associations with risk of T2D and weight change among healthy individuals over long follow-up periods. However, our study also has several limitations. First, we cannot entirely rule out the role of confounding in our observed associations. We controlled for multiple lifestyle and dietary covariates that might confound the associations of interest, although residual or unmeasured confounding, or both, may still exist owing to the observational nature of the analysis. Second, the relatively limited number of people with T2D in the higher chocolate consumption groups may have led to reduced statistical power for detecting modest associations between dark chocolate consumption and risk of T2D. Third, most of our study population for chocolate subtype analyses were non-Hispanic white adults older than 50 years at baseline, which, together with their professions, may limit the generalizability of our findings to other populations with different socioeconomic or personal characteristics. Nevertheless, the observed associations persisted when further adjusting for neighborhood z scores for socioeconomic status, educational levels, and specialty of professions. Fourth, chocolate consumption was relatively low in our study population, compared with the national average in the US Department of Agriculture Nationwide Food Consumption Survey 1987-88 of about three servings/week.63 This may have hindered our ability to assess the dose-response association at higher intake. Lastly, food frequency questionnaires are subject to measurement errors, although the prospective design may make this error non-differential, and, as such, the associations may be more likely to be biased toward the null.

Conclusion and policy implications

Intake of dark chocolate instead of milk chocolate may be associated with a lower risk of T2D. Increased consumption of milk chocolate but not dark chocolate, however, was associated with increased weight gain. Further research, especially randomized controlled trials among middle aged participants and of longer duration, is needed to confirm these findings.

What is already known on this topic

  • Chocolate contains high levels of flavanols, which promote cardiometabolic health and reduce the risk of type 2 diabetes (T2D), as shown in randomized controlled trials

  • The associations between chocolate consumption and risk of T2D remain controversial owing to inconsistent findings in observational studies

  • Most previous studies did not differentiate between chocolate subtypes (dark, milk), which differ in their cocoa content and proportions of other ingredients such as sugar and milk, and may have differential associations with risk of T2D

What this study adds

  • Consumption of ≥5 servings/week of dark chocolate compared with rare consumption was statistically significantly associated with lower risk of T2D

  • The association for milk chocolate was, however, null

  • Increased consumption of milk chocolate but not dark chocolate was associated with increased weight gain

Ethics statements

Ethical approval

The Nurses’ Health Study, Nurses’ Health Study II, and Health Professionals Follow-up Study were approved by the institutional review boards at Brigham and Women’s Hospital and Harvard T.H. Chan School of Public Health. The return of the completed self-administered questionnaires was considered to imply informed consent.

Data availability statement

No additional data available.

Acknowledgments

We thank the participants and staff of the Nurses’ Health Study, Nurses’ Health Study II, and Health Professionals Follow-up Study for their valuable contributions.

Footnotes

  • Contributors: BL, GZ, and QS designed the study. JEM, FBH, and QS collected the data. BL and GZ conducted the data analyses. BL prepared the first draft of the manuscript. All authors contributed to the interpretation of the results and critical revision of the manuscript for important intellectual content and approved the final version of the manuscript. QS is the guarantor. The corresponding author attests that all listed authors meet authorship criteria and that no others meeting the criteria have been omitted.

  • Funding: This study was supported by the National Institutes of Health (grants UM1 CA186107, P01 CA87969, R01 HL034594, U01 CA176726, U01 CA167552, R01 HL035464, R01 HL60712, R01 DK120870, R01 DK126698, R01 DK119268, U2C DK129670, R01 ES022981, and R21 AG070375). The funders had no role in considering the study design or in the collection, analysis, interpretation of data, writing of the report, or decision to submit the article for publication.

  • Competing interests: All authors have completed the ICMJE uniform disclosure form at https://www.icmje.org/disclosure-of-interest/ and declare: support from the National Institutes of Health for the submitted work. JEM reports receiving investigator initiated grants from Mars Edge. EBR is on the scientific advisory board and has received research funding from the US Department of Agriculture/US Highbush Blueberry growers commodity group.

  • Transparency: The first author (BL) affirms that the manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned have been explained.

  • Dissemination to participants and related patient and public communities: There are no plans to disseminate the results of the research to study participants or the relevant patient community. The participants are updated on the study outcomes and developments through the study websites (www.nurseshealthstudy.org and www.hsph.harvard.edu/hpfs) and newsletters.

  • Provenance and peer review: Not commissioned; externally peer reviewed.

http://creativecommons.org/licenses/by/4.0/

This is an Open Access article distributed in accordance with the terms of the Creative Commons Attribution (CC BY 4.0) license, which permits others to distribute, remix, adapt and build upon this work, for commercial use, provided the original work is properly cited. See: http://creativecommons.org/licenses/by/4.0/.

References

View Abstract