A computational analysis of mouse behavior in the sucrose preference test

Verharen, Jeroen P. H.; de Jong, Johannes W.; Zhu, Yichen; Lammel, Stephan

doi:10.1038/s41467-023-38028-0

Download PDF

Article
Open access
Published: 27 April 2023

A computational analysis of mouse behavior in the sucrose preference test

Nature Communications volume 14, Article number: 2419 (2023) Cite this article

7401 Accesses
5 Citations
17 Altmetric
Metrics details

Subjects

Abstract

The sucrose preference test (SPT) measures the relative preference of sucrose over water to assess hedonic behaviors in rodents. Yet, it remains uncertain to what extent the SPT reflects other behavioral components, such as learning, memory, motivation, and choice. Here, we conducted an experimental and computational decomposition of mouse behavior in the SPT and discovered previously unrecognized behavioral subcomponents associated with changes in sucrose preference. We show that acute and chronic stress have sex-dependent effects on sucrose preference, but anhedonia was observed only in response to chronic stress in male mice. Additionally, reduced sucrose preference induced by optogenetics is not always indicative of anhedonia but can also reflect learning deficits. Even small variations in experimental conditions influence behavior, task outcome and interpretation. Thus, an ostensibly simple behavioral task can entail high levels of complexity, demonstrating the need for careful dissection of behavior into its subcomponents when studying the underlying neurobiology.

An operant social self-administration and choice model in mice

Article 24 March 2023

A novel multidimensional reinforcement task in mice elucidates sex-specific behavioral strategies

Article 06 May 2020

Whole-brain tracking of cocaine and sugar rewards processing

Article Open access 23 January 2023

Introduction

The sucrose preference test (SPT) measures the relative preference of rodents for a 1-2% sucrose solution over water as a proxy for reward sensitivity^1,2. Rodents typically exhibit a natural preference for palatable sweet solutions, and it is therefore assumed that such preference is correlated with the pleasure an animal experiences when it consumes sucrose. As such, a reduction in sucrose preference is interpreted as an inability to feel pleasure, a condition that is commonly known as anhedonia³. Because anhedonia is often observed in individuals with substance use disorders, major depressive disorders, and other neuropsychiatric disorders, the SPT is extensively used as a rodent assay to study the neurobiological basis of disease^4,5.

In rodents, both acute and chronic stress as well as other stressors (e.g., social defeat stress, foot shock stress, maternal deprivation) substantially reduce sucrose preference, which is used as a criterion for anhedonia^{1,6,7,8,9,10,11,12,13,14}. When anhedonia is observed in concert with other stress-induced behavioral adaptations (e.g., increased passive coping, deficits in social behaviors, changes in sleep patterns, altered circadian rhythm), animals are often classified as susceptible to a depression-like phenotype^{4,15,16,17,18,19,20} (but see ref. ²¹). In these models, validity is provided by the fact that stress is a major risk factor for the development of depression in humans and that antidepressant administration reverses depression-related behaviors in rodents, including anhedonia^2,22. However, methodological differences in how the SPT is conducted across different laboratories may account for difficulties with its reproducibility as indicated by meta-analyses from previous SPT studies^23,24,25 (Supplementary Fig. 1).

Anhedonia, in its most narrow meaning, is often considered a reduced ability to experience pleasure. In humans, however, it can reflect a diverse array of deficits in hedonic functions, encompassing reward expectation, reward evaluation, effort, reward learning and reward planning^3,4,26,27. Consistent with this are efforts from preclinical studies recognizing that hedonic capacity can be divided into different subcomponents including the ability to experience pleasure (‘liking’ or reward appreciation) and the motivational effort to obtain a reward (‘wanting’)²⁶. The distinction between ‘liking’ and ‘wanting’ is important because these behavioral components likely involve discrete brain areas, cell types and circuits^28,29,30,31. Indeed, preclinical studies have shown that reward appreciation and motivation involve distinct cell types within the brain’s reward systems³².

The SPT is often considered to selectively reflect an animal’s capacity to experience hedonic pleasure evoked by the sucrose solution (‘liking’) rather than motivational effort (‘wanting’). However, the SPT requires animals to learn and recognize the caloric and hedonic value of a sucrose solution, and sucrose and water intake can reflect how well animals can integrate sensory, ingestive and motivational signals that drive choice³³. Whether the main behavioral outcome measure of the SPT, sucrose preference, reflects additional behavioral components or sub-routines that are an integral part of the broader hedonic domain remains uncertain.

Results

A microstructure analysis of licking behavior in the SPT

To perform a detailed analysis of animal behavior in the SPT, we subjected a group of 25 C57Bl/6 mice (adult, male and female) to a 12-h SPT¹. During this test, animals had ad libitum access to a bottle of water and a bottle of 1% sucrose solution inside an operant chamber, while licks were measured using an electrical lickometer (Fig. 1a, b). Mice had no access to food during the SPT, and were not water or food deprived prior to the test. As expected, all animals showed a strong preference for the sucrose bottle after 12 h, with little inter-animal variability (90.5 ± 6.7%, mean ± s.d.). Interestingly, over the 12-h session, animals progressively shifted licking behavior towards the sucrose bottle (Fig. 1c), suggesting that the SPT involves some form of learning.

**Fig. 1: A microstructure analysis of licking behavior in the sucrose preference test.**

To further assess the component processes subserving the SPT, we next performed a microstructure analysis of licking behavior^34,35. To do this, we divided licks into different licking bouts (henceforth called ‘choices’) that were separated by a pause of at least 5 s (Fig. 1b). Thus, animals would make a choice between the sucrose or the water bottle, with each choice containing a certain number of licks for one of these fluids, typically yielding a licking frequency in the range of 8–10 Hz.

Our microstructure analysis of SPT behavior demonstrates that sucrose preference is established through both a higher number of choices for sucrose than for water (Fig. 1c, d; choices for sucrose, >50%), as well as a higher average number of licks within a sucrose choice than within a water choice (choice size ratio sucrose/water, average licks per choice for sucrose divided by the average licks per choice for water, >1). Thus, animal behavior in the SPT can be deconstructed based on analysis of (i) % of choices for sucrose and (ii) the number of licks within these choices.

A computational model of the SPT

An analysis of choice behavior can be approached from a reinforcement learning perspective³⁶, and as such could reveal potential changes in choice strategy that cannot be captured by conventional measures of the SPT. In this case, the mouse can be viewed as a reinforcement learning agent that ought to maximize reward, thus finding the highest valued bottle (i.e., sucrose) by attributing value to both bottles through sampling and subsequent learning. As a result, after each choice, the mouse assigns value to the selected bottle by means of reward prediction error-based learning and uses these reward expectations to guide future choices between the two bottles. To test which choice and learning strategy best described the behavior of the mice, we fit the raw choice data of the 25 mice to 13 different reinforcement learning models and performed Bayesian model selection³⁷, using the log-model-evidence estimates (Fig. 2a, Supplementary Fig. 2; see “Methods” for a description of the models).

**Fig. 2: A computational model for the sucrose preference test.**

The selected model described behavior of the animals on the basis of three free parameters (Fig. 2b): (1) hedonia parameter ρ, indicating the extent to which sucrose is valued over water (ρ > 1, sucrose valued over water; ρ < 1, water valued over sucrose), (2) learning rate α, measuring the extent to which a single choice affects bottle value (α = 0, no learning; α = 1, absolute learning), and (3) discount/attraction parameter η, indicating whether not choosing a certain bottle will decrease (discounting of value; η < 0) or increase (attraction to unchosen bottle, η > 0) the value of the unchosen bottle. The selected model was further described by a choice size-dependent choice rule, meaning that learning was stronger for choices with more licks, and by choice behavior according to a Softmax equation (with inverse temperature β set at 1; see “Methods”). For each session, best-fit parameters {ρ, α, η} were estimated using maximum likelihood estimation (Fig. 2c, Supplementary Movie 1), allowing for a point-estimate comparison of each of the parameters between different mice and SPT sessions. Performing this parameter estimation procedure for the 25 mice of Fig. 1 showed robust hedonia across the population (ρ > 1 for all mice; parameter normally distributed), a log-normal distribution in learning rate α, and an average negative value for the normally distributed parameter η, indicating that most animals progressively reduce (‘forget’) the value of a bottle when it is not chosen (Fig. 2d). A cross-correlational analysis demonstrated a multitude of relationships between the different conventional and computational parameters of the SPT, with sucrose preference being determined by more than merely hedonia factor ρ (Fig. 2e, first column). This suggests that the SPT is a paradigm with more complex behavioral patterns than previously assumed.

Model validations

We validated this computational model in different ways. First, we performed a successful parameter recovery procedure in a simulated dataset (Supplementary Fig. 3a). This demonstrated that parameters can be accurately estimated from the raw data, that parameters are independent, and that the three different parameters have qualitatively different effects on choice behavior in the SPT. Second, we performed a successful posterior predictive check of the model (Supplementary Fig. 3b), indicating that the three parameters are a good estimator of behavior in the SPT, even in (noisy) experimental data. Third, we performed an experimental manipulation to assess whether the model fitting procedure is sensitive to an artificial change in hedonia parameter ρ. To do this, we subjected 16 mice to two different SPT sessions, one in which the water solution was adulterated with bitter quinine—a manipulation that increased the relative value difference between water and sucrose by reducing the absolute value of water (Fig. 2f). As expected, sucrose preference was higher in sessions in which water was adulterated with quinine, an effect that was driven by a combined increase in the % choices for sucrose and an increase in the sucrose/water choice size ratio. Parameter estimation on the raw choice data revealed that the increased % of choices for sucrose was indeed driven by an isolated increase in the value of hedonia parameter ρ, providing direct empirical evidence that our model fitting procedure can detect a shift in the appreciation of sucrose relative to water.

Because a previous study observed diurnal fluctuations in motivation and hedonic processing³⁸, we also analyzed mice in a SPT that was conducted entirely during the animals’ dark cycle (Supplementary Fig. 4a, b). However, their sucrose preference behavior did not differ significantly from mice that underwent an SPT that was conducted 3 h in the animals’ light and 9 h in the animals’ dark cycle (Figs. 1 and 2). Furthermore, in an additional experiment, we analyzed mice that had ad libitum access to regular chow during the SPT (Supplementary Fig. 4a, b). This reduced the number of total licks during the SPT and decreased sucrose preference through an isolated reduction in hedonia parameter ρ. Thus, the presence of food reduces the relative value of a sucrose solution, suggesting that the high levels of sucrose preference observed in Fig. 1 and 2 are at least in part due to a metabolic need for calories. Finally, we successfully used our model fitting procedure to analyze sessions in which animals had to discriminate between a 1% and 10% sucrose solution (Supplementary Fig. 4c), indicating that this type of computational analysis may be applied to other two-bottle choice paradigms.

Variance in choices is explained by hedonia and learning

We next performed data simulations to determine the extent to which each of the three parameters {ρ, α, η} contributes to the % of choices for sucrose in the SPT. Sample simulations show how agents perform random sampling at the beginning of a session but achieve a preference for the sucrose bottle later in the session (Supplementary Fig. 5a), similar to the behavior observed in mice (Fig. 1b, c). By manipulating the value of one of the parameters, we can, for example, assess how a learning deficit may lead to incomplete learning at the end of an SPT session, preventing the development of a sucrose preference (Supplementary Fig. 5b).

Simulating such datasets for a wide variety of combinations of {ρ, α, η} allowed us to assess how each of the three model parameters drives the % choices for sucrose in the SPT (Supplementary Fig. 5c). This analysis indicated that hedonia parameter ρ and learning rate α together mainly establish the % choices for sucrose, with little effects on the discounting/attraction parameter η. To quantify the inter-animal variance in % of choices for sucrose explained by each of the three parameters ρ, α and η in experimental data, we drew parameter values from a Gaussian distribution with average and variance based on the experimental data from Fig. 2d, simulated SPT data of different lengths (ranging from 1 to 800 choices), and performed polynomial linear regression to calculate the contribution of each of the parameters to the % of choices for sucrose (Fig. 2g). This analysis revealed several insights into the processes that drive choice behavior in the SPT. First, the variance explained by the combined set of parameters gradually increased together with the number of choices made (i.e., the length of the SPT session), together reaching ~50% after 800 choices (Fig. 2g, dashed line). Second, for short SPT sessions (up to 183 choices), the % choices for sucrose is more reflective of learning rate α than of hedonia parameter ρ, meaning that many short (minutes to hours) SPT sessions are highly influenced by learning. Third, the longer an SPT session, the more the % of choices becomes a proxy of hedonia, instead of learning; in a 2-h SPT session, ρ explains a mere 8.3% in inter-animal variability in choices, which increases to 38.7% for a 12-h session. Lastly, discount/attraction parameter η does not have a major impact on the % of choices for sucrose (it explains at maximum 1.1% of inter-animal variance), although lower values of η may be associated with more inter-animal variability (Supplementary Fig. 5c). Together, these results suggest that the % choices for sucrose in the SPT depend on (i) hedonia, (ii) learning (especially at the beginning of a session), and (iii) the total number of choices the animal has made (which in turn depends on the animal’s motivation and the total duration of the test).

Because many SPT studies switch the position of the water and sucrose solution bottles during the test (Supplementary Fig. 1), we next sought to examine whether such a bottle switch influences the experimental outcome. In simulations (Supplementary Fig. 6a), we switched the position of the sucrose and water bottles every 75 choices (i.e., approximately the number of choices mice make within a 2-h session). The same polynomial regression analysis for this bottle switch-based SPT (Fig. 2h) revealed a strikingly higher contribution of learning rate α to the % of choices for sucrose compared to the regular (i.e., no bottle switch) SPT, with hedonia parameter ρ explaining only 9.9% of choices at maximum. Thus, switching the position of water and sucrose bottles increases the contribution of learning to choice behavior in the SPT. Indeed, when we switched the bottles in the middle of an experimental SPT session, we evoked disruptions in sucrose consumption for up to 2 h (Supplementary Fig. 6b and 6c), which further supports the idea that learning is required when bottles are switched. In summary, the results of our simulations suggest that variations in task structure, including task duration and switching the position of sucrose solution and water bottles, may result in the assessment of behavioral domains that do not necessarily reflect hedonia.

Effects of stress on SPT behavior

Next, we sought to determine whether a computational analysis of SPT behavior influences the interpretation of animal behavior in response to two widely used stress paradigms: chronic mild and acute restraint stress. Both paradigms are known to evoke a reduction in sucrose preference that is typically interpreted as anhedonia in studies of depression-related behaviors in mice^4,5,39. For the chronic mild stress paradigm, C57Bl/6 mice were exposed to 4 weeks of chronic mild stress, which involved 1 or 2 mild stressors per day, such as wet bedding, cage tilting or flashing lights. For the acute stress paradigm, C57Bl/6 mice were placed in a plastic restrainer for 4 h prior to the SPT. For both paradigms, the control mice that were used were housed under identical conditions but not exposed to chronic or acute stress. We then conducted a 12-h SPT that did not involve switching the position of the sucrose solution and water bottles.

Consistent with previous reports^7,13,17, we observed a significant reduction in sucrose preference after chronic mild stress in male and female C57Bl/6 mice (Fig. 3a). Interestingly, for both sexes, this difference did not emerge until the second half of the test, indicating that a short (i.e., <6 h) SPT would have led to false-negative results. Additionally, for both sexes, stress significantly reduced the total number of licks. It is important to note that a lower number of licks by itself may hamper the development of a sucrose preference, by reducing the exposure to (and thus learning of) the reward contingencies of the bottles. A subsequent microstructural and computational analysis of SPT behavior showed that for male mice (Fig. 3a, top), the observed sucrose preference deficit was driven by a combined reduction in % choices for sucrose and sucrose/water choice size ratio. Computational parameter estimation indicated that the observed reduction in choices for sucrose was indeed driven by anhedonia (i.e., a reduction in hedonia parameter ρ), confirming that chronic mild stress evokes anhedonia-like choice behavior in male mice. For female mice (Fig. 3a, bottom), the reduction in sucrose preference was driven by a lower % choices for sucrose, but not by changes in choice size ratio. Surprisingly, computational analyses showed no effects of stress on any of the model parameters values, suggesting that the observed reduction in % choices for sucrose was solely driven by lower liquid consumption. As such, a reduced exposure to the bottles may have prevented these animals from fully learning its reward contingencies. Thus, 4 weeks of chronic stress was sufficient to induce anhedonia-like choice behavior in male, but not female mice.

**Fig. 3: Different outcome measures following chronic and acute stress.**

Acute restraint stress evoked a more complex behavioral response in the SPT (Fig. 3b and Supplementary Fig. 7). In male C57Bl/6 mice (Fig. 3b, top), we did not observe a change in sucrose preference across the 12-h SPT session. However, computational parameter estimation on the raw choice data revealed a striking increase in hedonia parameter ρ, indicative of a paradoxically higher preference for the sucrose bottle. Interestingly, this did not lead to a change in the % of choices for sucrose (and hence sucrose preference), since it was masked by a concurrent learning deficit (i.e., a reduction in α). These data indicate that (i) in male C57Bl/6 mice, acute restraint stress increases hedonia but impairs learning, and (ii) computational parameter estimation can reveal latent differences in behavior that are not obvious in traditional measures of the SPT. Because previous studies reported potential sex differences in susceptibility to acute stress in rodents⁴⁰, we repeated this experiment in female C57BLl/6 mice (Fig. 3b, bottom). Here, we observed a stress-induced reduction in sucrose preference, which was driven by a reduction in choice size ratio, but not in the % choices for sucrose. Accordingly, we did not observe changes in any of the computational model parameters. Interestingly, the reduction in choice size ratio was mainly driven by a change in water consumption, rather than sucrose consumption (Supplementary Fig. 8c), suggesting that in female C57Bl/6 mice, reward-related behaviors remain largely unaffected by acute stress. Thus, acute stress may evoke sex-dependent effects on different latent components of the SPT, but we did not find evidence that it induces anhedonia.

Optogenetic inhibition of mPFC neurons reduces sucrose preference by impairing learning

Previous studies have demonstrated that acute optogenetic manipulations of various brain regions can reduce sucrose preference, which is typically interpreted as anhedonia^8,12,13,41. Because we found that a reduction in sucrose preference does not reflect anhedonia per se, we sought to establish an optogenetic manipulation that reduces sucrose preference to reflect deficits in learning rather than anhedonia. To do this, we focused on the medial prefrontal cortex (mPFC) given its suggested role in reward learning⁴². We expressed the inhibitory opsin halorhodopsin (eNpHR3.0) in mPFC pyramidal neurons of C57Bl/6 mice (male and female). Control mice were injected with an adeno-associated virus (AAV) carrying eYFP into the mPFC (Fig. 4a). 4 weeks later, mice were subjected to the SPT, and we used 590 nm light to inhibit mPFC neurons during specific phases of the SPT (Fig. 4b). Specifically, we tested animals three times in different versions of a 60-min SPT, separated into three 20-min epochs: one baseline session (i.e., three 20-min epochs with light OFF), one session in which cells were inhibited during the second 20-min epoch of the task (i.e., in the middle of the session; OFF-ON-OFF), and one session in which cells were inhibited during the first 20-min epoch of the task (i.e., in the beginning of the session; ON-OFF-OFF). In this short-duration SPT, mice were tested under water-restricted conditions to increase the number of choices. As a result, the relative contribution of hedonia to behavior was increased on such a short timescale (Fig. 2g).

We found that optogenetic inhibition of mPFC neurons reduced sucrose preference only when light stimulation occurred at the beginning of the session (i.e., ON-OFF-OFF), but not in the middle of the session (i.e., OFF-ON-OFF) (Fig. 4c, top panel). Accordingly, optogenetic inhibition at the beginning of the SPT (i.e., ON-OFF-OFF) reduced sucrose preference through a reduction in the % of choices for sucrose. In contrast, we did not find differences between sessions in terms of the total number of licks. Importantly, no changes in SPT behavior were observed in control mice expressing eYFP in the mPFC (Fig. 4c, bottom panel).

In parallel, we performed computational modeling to predict how optogenetically induced changes in each of the parameters {ρ, α, η} may affect choice behavior in the SPT. We performed parameter estimation on the baseline (OFF-OFF-OFF) sessions to predict how a deficit in each of the three parameters {ρ, α, η} during the ON epochs may affect choice behavior (Fig. 4b, Supplementary Fig. 9). Simulated data predicted that anhedonia (ρ → 1 during ON epoch) would lead to a change in choices for sucrose, regardless of whether inhibition occurred at the beginning or in the middle of the session. In contrast, a learning impairment (α → 0) would only affect behavior when inhibition occurred at the beginning of the session, when learning has not yet been established (Fig. 4d). Simulations further predicted that setting the value of the discount/attraction parameter η to 0 has no effect on % choices for sucrose. Together, these simulations suggest that the sequence of the optogenetic inhibition procedure can distinguish between anhedonia and learning deficits.

When comparing the simulated and experimental datasets, we found that the behavioral effects of mPFC optogenetic inhibition indeed matched the pattern of a learning impairment (i.e., α → 0), since sucrose preference was only reduced when optogenetic inhibition occurred at beginning of the session (i.e., ON-OFF-OFF). Therefore, the reduction in sucrose preference in our behavior experiment is likely mediated through a learning deficit, rather than anhedonia. Collectively, these results indicate that acute changes in SPT behavior in response to optogenetic manipulations may not necessarily indicate anhedonia.

Discussion

In this study, we found that the main outcome measure of the SPT, a reduction in sucrose preference, does not reflect anhedonia per se. Our results suggest that even ostensibly simple behavioral assays can entail high levels of complexity that should be considered when investigating the neural basis of behavior.

Utility of the SPT for neuropsychiatric research

As researchers have developed a deeper understanding of the utility of behavioral assays used to study neuropsychiatric diseases in rodents, it has become clear that previous notions about the utility of some of these assays need to be revised. For example, the forced swim test, which, like the SPT, is commonly used to assess depression-related behaviors in rodents, has been questioned in terms of its reproducibility, ethical nature, and utility for translational depression research⁴³. Although the SPT has remained the gold standard for characterizing hedonic behaviors in rodents, previous studies have raised concerns about the validity of the SPT. Unlike its effect in rodents, chronic stress does not appear to reduce the appreciation of sweet taste in humans^44,45, nor do antidepressants targeting the monoamine system always effectively alleviate anhedonia in humans^46,47. Additionally, large differences exist in how the SPT is conducted across labs (Supplementary Fig. 1), providing additional variability to sucrose preference behavior, potentially hampering its reproducibility. Despite these criticisms, and the fact that it is generally challenging to extrapolate from experiments in animals to humans, the validity of the SPT is deemed to be relatively high²³. In fact, its popularity may be explained by its relatively low technical demands and time consumption as well as its single—ostensibly easy-to-interpret—outcome measure. Despite these advantages, our results demonstrate that the main outcome measure of the SPT, reduced sucrose preference, is not always indicative of anhedonia. This finding is consistent with previous concerns that have been raised regarding the interpretation of reduced sucrose consumption as a manifestation of anhedonia. For example, a previous study showed that the standard practice of handling mice by their tails can decrease sucrose consumption⁴⁸. But how should scientists interpret changes in sucrose preference in the context of external perturbations (e.g., stress, optogenetics) in the absence of confounding factors (e.g., leaking bottles, bottle sizes etc.)? By performing a detailed experimental and computational decomposition of behavior in the SPT, we discovered previously unrecognized behavioral subcomponents that can be associated with reduced sucrose preference such as deficits in learning (learning rate α), motivation (through the total number of choices made) and fluctuations in value memory (i.e., discount/attraction parameter η). We argue that this distinction is important because these subcomponents likely involve neurochemically and anatomically distinct structures. Importantly, we have also developed a Python-based toolbox to assist researchers with the interpretation of data from the SPT (see below). For researchers interested in using the SPT to measure hedonia, we further recommend using long sessions that do not include switching of the sucrose and water bottles. Thus, the careful delineation of behavioral assays and identification of previously unrecognized behavioral subdomains could prove beneficial towards developing better and more specific treatments of neuropsychiatric disorders.

Role of SPT in measuring anhedonia

The SPT is considered to selectively reflect the animals’ capacity to experience hedonic pleasure evoked by the sucrose solution (‘liking’). Accordingly, a reduced sucrose preference in response to stress or other neural perturbations (e.g., optogenetics) is typically interpreted as anhedonia in its most stringent definition of “loss of pleasure”. Our study demonstrates that reduced sucrose preference in the SPT can also reflect deficits in other behavioral domains, such as motivation (i.e., through the number of choices made) and learning, that are essential to the processing of rewards. Still, arguments can be made that perturbations in motivational drive, reward learning and decision-making all belong to a spectrum of anhedonia symptoms that, beyond the failure to “experience pleasure”, encompass the whole domain of reward-associated disorders^3,49. Our results show that some of these different behavioral components can be separated experimentally through detailed behavioral analysis of sucrose preference behavior and the integration of computational methods. The identification of these distinct behavioral profiles is important, as they likely involve different cell types and circuits. Thus, in combination with additional behavioral approaches such as intracranial self-stimulation or operant assays (i.e., rodents have to perform work to receive rewards) that are tailored towards assessing pleasure or motivation, respectively, the SPT can be a very useful paradigm for studying the neural basis of behavior.

Choice size ratio as a proxy for hedonia

The % of choices for sucrose is only one of two factors that drive sucrose preference (Fig. 1d). The second factor, number of licks within choices, is captured by the choice size ratio. It indicates the ratio between the average number of licks within a sucrose choice relative to a water choice. This ratio typically falls between 1 and 7 (Fig. 1c, d), indicating that, on average, mice make 1–7 times more licks during a sucrose choice than during a water choice. Previous studies have suggested that the choice size ratio can be used as a direct measure of hedonia^50,51. To determine if choice size ratio is a good proxy for hedonia parameter ρ, we also carefully examined the correlation matrix of Fig. 2e and observed that choice size ratio indeed has a significant positive correlation with hedonia parameter ρ, but not with learning rate α or discount/attraction parameter η (Supplementary Fig. 8a). Therefore, in line with our hypothesis, choice size ratio can be used as a proxy for hedonia across mice, albeit with a low amount of variance explained (R² = 0.26). Furthermore, we more thoroughly analyzed the data from the stress experiments of Fig. 3 in which we observed a change in choice size ratio or hedonia parameter ρ in the experimental versus control condition (Supplementary Fig. 8b, c). In these data, we examined whether the change in choice size ratio was driven by a change in licking for water or licking for sucrose. This analysis suggests that choice size ratio can be used as a proxy for hedonia only if the change in choice size ratio is driven by an isolated change in sucrose choice size (i.e., a change in the number of licks per choice for sucrose, but not for water). Thus, in line with previous work⁵⁰, we confirm that in certain cases choice size ratio can be used as a proxy for hedonia parameter ρ.

A toolbox for the interpretation of SPT experiments

SPT data entail high levels of complexity, and sucrose preference—its main outcome measure—depends on many factors beyond hedonia. Conclusions based on SPT data should therefore be considered in light of appropriate control experiments, analyses and simulations, as reported in this study. To support researchers with the analysis of SPT data, we provide SweetiePy, which is a Python-based toolbox to estimate model parameters {ρ, α, η}. With this toolbox, researchers can enter timestamps of individual licks (in seconds) and follow a Jupyter notebook to estimate the best-fit parameters for an SPT session in a step-by-step manner. These parameter values can subsequently be applied towards traditional statistical analyses. Using this approach together with optimized experimental conditions could lead to new insights in previously confounding observations of sucrose preference variability in the context of strain³⁶, sex⁵², nutritive state⁵³, time of day⁵⁴ or social status⁵⁵. Ultimately, this may enhance the utility of the SPT for both the detection of novel targets for treatment of neuropsychiatric disorders and contributing to a better understanding of the neural basis of behavior.

Methods

Subjects

C57Bl/6 (Jackson Laboratory; 25–35 g, 8–20 weeks old at start of experiments) male and female mice were used for all experiments. Mice were maintained on a 12:12 h light cycle (lights on at 7:00 AM). Behavioral tests with a duration of 12 h (Fig. 1, 2 and 3) started ~9 h into the animals’ light phase, around 4:00 PM. Behavioral tests with a short duration (Fig. 4) were performed entirely during the light phase. Animals were housed in a temperature (20–23 °C) and humidity (40%–60%) controlled room that was illuminated by eight 32 W fluorescent lights each producing 2925 lumens. All procedures complied with the animal care standards set forth by the National Institutes of Health and were approved by University of California Berkeley’s Administrative Panel on Laboratory Animal Care.

Stereotaxic surgeries

Stereotaxic surgeries were performed under general anesthesia with ketamine-dexmedetomidine using a stereotaxic apparatus (Model 1900, Kopf Instruments, Germany). For optogenetic inhibition of mPFC pyramidal neurons, 250 nl of AAV5-CamKII-eNpHR3.0-eYFP-WPRE-PA or (UNC Vector Core, titer 5 × 10¹²) or AAV5-CamKII-eYFP (UNC Vector Core, titer 5 × 10¹²) was bilaterally infused into the prelimbic cortex (AP + 2.0, ML ± 0.4, DV −2.4 mm from Bregma) of C57Bl/6 mice using a glass pipette attached to tubing and a 1 μl Hamilton syringe in a syringe pump (Harvard apparatus; rate: 100 μl/min). The injection pipette was slowly withdrawn 5 min after the end of the infusion. A single optic fiber (200 μm diameter, 0.37 NA, 2.5 mm ferrule) was lowered to approximately the midline between the two infusion sites (AP + 2.0, ML + 0.4 from Bregma; DV −2.1 from skull; 10° mediolateral angle). One layer of adhesive cement (C&B Metabond; Parkell) followed by cranioplastic cement (Dental cement) was used to secure the fiber to the skull. The incision was closed with a suture and tissue adhesive (Vetbond; 3 M). The animal was woken up with an I.P. injection of atipamezole and kept on a heating pad until it recovered from anesthesia. Experiments were performed 4–5 weeks after stereotactic injection. Injection sites and optical fiber placements were confirmed in all animals by preparing coronal sections (80 µm) of injection and implantation sites.

Sucrose preference test

The sucrose preference test (SPT) was performed in operant chambers (Med Associates, Inc.; 8.5” L × 7.12” W × 5” H) equipped with a house light (40 lux) and two electrical lickometers and located in sound-attenuated cubicles. Bottles containing tap water or 1% sucrose in tap water were secured to the lickometers, so that each lick could be detected by the MedPC IV software (Med Associates, Inc.). No additional food was present in the operant chambers (except when indicated as in Supplementary Fig. 4a and 4b). The operant chambers were further equipped with some nesting material. For the 1-h SPTs (Fig. 4), no nesting material was provided. The bottle configuration was different in each of the 6 operant chambers used (i.e., bottles were located in different parts of the wall), so that for repeated measures experiments (Fig. 2f), animals could be re-tested, such that they had to re-establish learning in each session.

Microstructure analysis of licking behavior

Individual licks in the SPT, extracted from the MedPC data files, were first pre-processed using a microstructure analysis of licking behavior. In this analysis, licks were separated into different “lick bouts” (here called “choices”), determined by a cut-off of 5 s. For example, if an animal started licking for sucrose, made 15 licks, then took a pause for 6 s, and continued licking for sucrose, it was counted as 2 choices for sucrose (with the first choice containing 15 licks). Different cut-off values were tested (between 500 ms and 10 s), but this did not result in different experimental outcomes. Accordingly, the total number of licks for sucrose L_sucrose was defined as:

$${L}_{{{{{\rm{sucrose}}}}}}= {{{{{\rm{number}}}}}} \, {{{\rm{of}}}} \, {{{\rm{choices}}}} \, {{{{{\rm{for}}}}}} \, {{{\rm{sucrose}}}} \\ \times {{{{{\rm{avg}}}}}}. \, {{{\rm{choice}}}} \, {{{\rm{size}}}}\ ({{{{{\rm{avg}}}}}}. \, {{{\rm{licks}}}} \, {{{\rm{per}}}} \, {{{\rm{choice}}}} \, {{{\rm{for}}}} \, {{{\rm{sucrose}}}}).$$

(1)

And sucrose preference, the conventional outcome measure of the task, was defined as:

$${{{{{\rm{Sucrose}}}}}} \, {{{{{\rm{preference}}}}}}=\frac{{L}_{{{{{\rm{sucrose}}}}}}}{{L}_{{{{{\rm{sucrose}}}}}}+{L}_{{{{{\rm{water}}}}}}}.$$

(2)

To determine the difference in sucrose versus water consumption, we used two different measures. The first was % of choices for sucrose and it ranges from 0% to 100%. Here, 50% was the indifference point (i.e., the animal made the same number of choices for sucrose and water). The second measure was choice size ratio (S/W), which indicates how many more licks, on average, a sucrose choice contained relative to a water choice:

$${{{{\rm{Choice}}}}} \, {{{\rm{size}}}} \, {{{\rm{ratio}}}}=\frac{{{{\rm{avg}}}}. \, {{{\rm{licks}}}} \, {{{\rm{per}}}} \, {{{\rm{choice}}}} \, {{{\rm{for}}}} \, {{{\rm{sucrose}}}}} {{{{\rm{avg}}}}. \, {{{\rm{licks}}}} \, {{{\rm{per}}}} \, {{{\rm{choice}}}} \, {{{\rm{for}}}} \, {{{\rm{water}}}}}.$$

(3)

with values > 1 indicating a higher number of average licks for the sucrose choice than for the water choice.

Computational models

Individual choices in the task (including the number of licks within those choices) were used to fit different reinforcement learning models to the data. As such, a mouse is considered a reinforcement learning agent that aims to maximize reward by sampling from both bottles, subsequent learning, and letting future choices be guided by the value representation of each of the bottles. Both bottles are initially assigned a value of 0, and with repeated experience (consumption), the value representation of the bottles will more accurately resemble the true value of the bottles’ content. Hedonia parameter ρ, present in all models, represents the extent to which sucrose is valued over water, so that ρ > 1 indicates that the value of sucrose Q_s (after full learning) is higher than that of water Q_w:

$$\rho=\frac{{Q}_{s}}{{Q}_{w}}$$

(4)

Learning

Learning may not always be absolute, and the value representation of the two bottles, Q_SB and Q_WB, may more slowly approach their true values (ρ and 1, respectively). To test this notion, we included 3 different learning models in our model selection procedure (Fig. 2a). In the first learning model, learning is absolute (‘Absolute learning’ in Fig. 2a), so that the value representation of the bottle matches that of its content after a single choice. In the second learning model, learning is gradual, based on a Rescorla-Wagner learning rule, and is independent from the size (i.e., number of licks) of that choice (‘Rescorla-Wagner, Choice-size independent’ in Fig. 2a). In other words, the strength of learning is the same, regardless of whether the animal made a few or many licks within that choice. The third learning model consists of the same Rescorla-Wagner learning rule, but states that learning is stronger for choices in which more licks were made (‘Rescorla-Wagner, Choice-size dependent’ in Fig. 2a). For each of the learning rules, the value of the sucrose bottle, Q_SB, for each choice number t would be defined as:

$${Q}_{{SB},t}=\left\{\begin{array}{ccc}{Q}_{{SB},t-1}+{\delta }_{t}\hfill & {{\mbox{for absolute learning}}}\hfill\\ {Q}_{{SB},t-1}+\alpha \times {\delta }_{t}\hfill & {{\mbox{for Rescorla}}}{\mbox{-}}{{\mbox{Wagner}}},\,{{\mbox{choice size-independent learning}}}\hfill\\ {Q}_{{SB},t-1}+\alpha \times \tanh ({{\mbox{licks}}}/10)\times {\delta }_{t} & {{\mbox{for Rescorla}}}{\mbox{-}}{{\mbox{Wagner}}},\,{{\mbox{choice size-dependent learning}}}\hfill\end{array}\right.$$

(5)

In this equation, Q_SB,t can be replaced with Q_WB,t to get the value representation of the water bottle. α defines the Rescorla-Wagner learning rate, which equals 1 for absolute learning and is thus removed from the equation. In all equations, δ_t represents the reward prediction error δ on trial t so that:

$${\delta }_{t}=\left\{\begin{array}{cc}\rho -{Q}_{{SB},t-1} & {{{{\rm{when}}}}} \, {{{\rm{sucrose}}}} \, {{{\rm{is}}}} \, {{{\rm{chosen}}}}\\ 1-{Q}_{{WB},t-1} & {{{{\rm{when}}}}} \, {{{\rm{water}}}} \, {{{\rm{is}}}} \, {{{\rm{chosen}}}}\end{array}\right.$$

(6)

Thus, after full learning, Q_SB approaches Q_s (which is equal to hedonia parameter ρ), and Q_WB approaches Q_w (which is equal to 1).

Unchosen option modulation

Additionally, we tested the contribution of a model parameter that may modulate the value of a bottle, Q_{WB, t+1} or Q_{SB, t+1}, if this was not chosen on a certain trial t. To do this, an additional value component Q_UB,t was added to the unchosen bottle. Thus, in the case of the sucrose bottle, value is defined as:

$${Q}_{{SB},t}=\left\{\begin{array}{cc}{Q}_{{SB},t}\hfill & {{{\rm{if}}}}\; {Q}_{{SB},t}\; {{{{{\rm{was}}}}}}\; {{{{{\rm{chosen}}}}}}\; {{{{{\rm{on}}}}}}\; {{{{{\rm{trial}}}}}} \, {t}-1\hfill\\ {Q}_{{SB},t}+{Q}_{{UB},t} & {{{\rm{if}}}}\; {Q}_{{SB},t}\; {{{{{\rm{was}}}}}}\; {{{{{\rm{not}}}}}}\; {{{{{\rm{chosen}}}}}}\; {{{{{\rm{on}}}}}}\; {{{{{\rm{trial}}}}}} \, {t}-1\end{array}\right.$$

(7)

Here, the value of the unchosen sucrose bottle Q_UB,t is defined as:

$${Q}_{{UB},t}=\left\{\begin{array}{cc}{{\tanh }}\left(\eta \times \left[{{{{{\rm{times\; unchosen}}}}}}\right]\right)\hfill & {for} \, \eta \, > \, 0\\ {{\tanh }}\left(\eta \times \left[{{{{{\rm{times}}}}}}\,{{{{{\rm{unchosen}}}}}}\right]\right)\times {Q}_{{SB},t} & {for} \, \eta \, < \, 0\end{array}\right.$$

(8)

η > 0 indicates that the bottle that has not been chosen in the past choice(s) acquires an additional positive amount of value; this value is at maximum 1, as defined by the hyperbolic tangent function. For example, if a certain bottle has not been chosen 3 times in a row, and η = 0.2, the unchosen bottle acquires an additional value of tanh(0.2 × 3) = 0.537. This value will be attributed to this bottle in addition to the value that it already acquired through learning, i.e., Q_WB (which is 1 at maximum) or Q_SB (which is ρ at maximum). As such, η > 0 indicates attraction to the unchosen option.

η < 0 indicates that the bottle that has not been chosen in the past choice(s) acquires an additional negative amount of value which is at maximum the learned value Q_WB or Q_SB. For example, if a sucrose bottle, at some time in the test, has a value Q_SB = 1.5, but it has not been chosen 3 times in a row, and η = –0.2, it will gain an additional value of tanh(−0.2 × 3) × 1.5 = –0.806. Thus, the true value of the sucrose bottle will become Q_SB = 1.5 – 0.806 = 0.694. At maximum, η can fully reduce the value of an unchosen bottle to 0, given that a hyperbolic tangent asymptotes to 1. As such, η < 0 indicates discounting of the unchosen options.

Since discounting of and attraction to the unchosen option are mutually exclusive and two extremes on a single scale, we included these parameters as a single free parameter η in the model, which we defined as the discounting/attraction parameter.

Choice policy

Two different choice policies were tested. The first one is a Softmax choice policy, which states that choice behavior is described by a sigmoidal function of the value difference between the two options, Q_WB and Q_SB. The probability of choosing the sucrose bottle P_SB,t on choice t is defined as:

$${P}_{{{{SB}}},{{{t}}}}=\frac{\exp (\beta \cdot {Q}_{{{{SB}}},{{{t}}}})}{\exp (\beta \cdot {Q}_{{{{SB}}},{{{t}}}})+\exp (\beta \cdot {Q}_{{{{WB}}},{{{t}}}})}$$

(9)

and

$${P}_{{WB},t}=1-{P}_{{SB},t}$$

(10)

Here β is the Softmax’ inverse temperature, which determines the extent to which choices are driven by value; β = 0 indicates that choice is random, whereas β → ∞ indicates consistent choice for the highest valued bottle. For parameter estimation, β was set to 1, and thus was not a free variable in the model, since β correlated with the value of hedonia parameter ρ. In other words, some forms of anhedonia could both be described as a reduction in ρ or a reduction in β, and modeling fitting procedures were not able to discern between the two since they had qualitatively similar effects on choice behavior. This may intuitively make sense; an increase in noisy/random choice behavior over the entire length of the session may be caused by a reduced appreciation of sucrose or by more random choice behavior—both could be interpreted as a form of anhedonia³⁶.

The second choice rule we tested was an ε-greedy policy. Here, ε indicates a value between 0 and 1, and controls the extent to which the agent consistently chooses the highest valued option versus making a random choice. For example, in the case that the sucrose bottle is valued over water, Q_SB > Q_WB:

$${P}_{{{{{SB}}}},{{{{t}}}}}=\epsilon+0.5\times (1-\epsilon )$$

(11)

$${P}_{{{{{WB}}}},{{{{t}}}}}=0.5\times (1-\epsilon )$$

(12)

And in the (rare) case that the water bottled is valued over sucrose, Q_WB > Q_SB:

$${P}_{{{{{WB}}}},{{{{t}}}}}=\epsilon+0.5\times (1-\epsilon )$$

(13)

$${P}_{{{{{SB}}}},{{{{t}}}}}=0.5\times (1-\epsilon )$$

(14)

Thus, if ε = 0, the animal makes random choices (similar to β = 0 with the Softmax choice policy), and if ε = 1, the animal consistently chooses for the highest valued bottle (typically sucrose).

Model selection

Choice data of the 25 animals from Fig. 1 were fit to a total of 12 model combinations (3 learning rules × 2 for with or without discount/attraction parameter × 2 choice policies) plus a null model (that assumes that choice is fully random); see Fig. 2a. The log likelihoods were computed for each model by finding the combination of parameter values that maximizes the likelihood of the observed choice sequence from first choice t = 1 to final choice T:

$$\log \left(P\left(\frac{{data}}{{model},{parameters}}\right)\right)=\mathop{\sum }\limits_{t=1}^{T}\log \left(P\left({{choice}}_{t}|{Q}_{{WB},t},{Q}_{{SB},t},{Q}_{{UB},{t}}\right)\right)$$

(15)

The log-model evidences were subsequently penalized for model complexity by computing the Akaike Information Criterion (AIC):

$${{\mbox{AIC}}}= 2\times [{{{{\rm{number}}}}\ {{{\rm{of}}}}\ {{{\rm{free}}}}\ {{{\rm{parameters}}}}\ {{{\rm{in}}}}\ {{{\rm{model}}}}}] \\ -2\times \log (P({{{{\rm{data}}}}}{{{{{\rm{|}}}}}}{{{{\rm{model}}}}},\ {{{{\rm{parameters}}}}}))$$

(16)

so that a lower AIC resembles a better fit of the model. AIC values were input to a random effects model selection algorithm, using the function VBA_groupBMC of the VBA toolbox⁵⁶ for Matlab (MathWorks Inc.). The outcome measure used to determine the ‘selected model’ was exceedance probability, which indicates how likely it is that a given model is more frequent than the other models among the population of mice³⁷.

The selected model (Exceedance Probability = 0.98) was model #4 in Fig. 2a. It describes the behavior of the mice on the basis of (i) a Rescorla-Wagner learning rule with choice size-dependent learning (i.e., learning is stronger when more licks are made within a choice), (ii) the presence of an unchosen option modulation (through discounting/attraction parameter η), and (iii) a Softmax choice policy. To obtain point-estimate parameter value for this model, priors were used to obtain more realistic model parameter estimates on a population level, thus using maximum a posteriori probability estimation. These priors were based on the meaning of the parameters in context of the behavior (e.g., learning rate α between 0 and 1, hedonia parameter ρ > 1 on average with a right-skewed distribution). The priors that we used were:

ρ betapdf(ρ/10, 1.3, 3)

α betapdf(α, 1.1, 5)

η normpdf(η, 0, 0.2)

Model validations

Parameter recovery in simulated dataset

For the parameter recovery procedure (Supplementary Fig. 3a), we simulated SPT sessions of 250 choices with a variety of {ρ, α, η} combinations, and tried to estimate the best-fit parameter values based on the raw choice data. Input values of ρ were in the range of [1.2, 1.6, 2.0, 2.4, 2.8], approximately matching the population data observed in Fig. 2d. The inputs values of α were [0.01, 0.03, 0.05, 0.07, 0.09], and for η were [−0.2, −0.1, 0, 1, 2]. For each combination of parameter values, we simulated 50 different sessions and plotted the recovered parameters in each of these 50 simulations (one circle in Supplementary Fig. 3a represents an individual simulated session; black line indicates the median of those 50 simulations).

Posterior predictive check

For the posterior predictive check (Supplementary Fig. 3b), we used data from the 25 animals shown in Fig. 2d and simulated 50 sessions for each of these 25 mice, based on the best-fit model parameters and the number of choices each mouse made in the test. We calculated the % of choices for sucrose in the simulated data and plotted this as a function of the % of choices for sucrose in the experimental data. Each box in Supplementary Fig. 3b represents one mouse (i.e., session), with the box representing the range of % of choices for sucrose in those 50 simulated sessions.

Quinine adulteration

For the quinine experiment (Fig. 2f), 16 animals were tested twice in the SPT. Between sessions, the configuration of the walls of the operant chambers was changed. Accordingly, the sucrose and water bottles were located at different positions in each new session. This was achieved by randomly positioning the bottles in one of the six customizable wall panels of the Med Associates chambers. To achieve a higher value difference between the two bottles, the value of water was reduced rather than the value of sucrose increased (i.e., through a higher concentration of sucrose). The reason is that we observed in pilot experiments that a higher % of sucrose promoted an extremely high sucrose consumption which ultimately led to increased water consumption (possibly through increased thirst). Sessions were counterbalanced between days so that half of the animals first received a water versus sucrose session, and the other half a water+quinine versus sucrose session. Quinine hemisulphate salt monohydrate was added to water in a concentration of 0.1 mg/ml (~250 μM).

Acute and chronic stress paradigms

For the chronic mild stress experiments, cages of male and female C57Bl/6 mice were randomly allocated to the stress or control group, and the stress group received a series of random chronic mild stressors for 4 weeks⁷, twice per day during weekdays (one morning stressor and one overnight stressor) and constantly during the weekend. Morning stressors included 6 h of cage shaking, 6 h of crowded housing, 6 h of no bedding, 6 h of stroboscope light, or 3 × 30 min of cold stress. Overnight stressors included 45° cage tilting, food deprivation, water deprivation, and wet bedding. During the weekend, animals were in constant light with bobcat urine in their cages. In the last week before testing, food and water restriction were not used as stressors since it may interfere with SPT performance. They were replaced with other overnight stressors. After 4 weeks, all animals were tested in a 12-h SPT. The stress group received their last stressor the day before the SPT. Control mice were housed under the same conditions as stressed mice but did not receive any stressors.

For the acute restraint stress experiments^57,58,59 tests were also performed in male and female mice. Animals in the stress group were placed in a custom-made restrainer (made by drilling holes in 50 ml Eppendorf tubes) for 4 h⁵⁸. Stressed animals were removed from the restrainer and immediately moved to the operant chamber for a 12-h SPT. Control animals were moved to the operant chamber directly from the home cage, although a subset of control animals were food- and water-restricted for 4 h before the SPT to match the restriction experienced by stressed animals (Supplementary Fig. 7).

Optogenetics

Optogenetic experiments (Fig. 4) comprised of four SPT sessions: one habituation session followed by three experimental sessions. Experiments were performed on four consecutive days. Animals were water restricted one day before the habituation session. During the 1-h sessions, C57Bl/6 mice had a choice between 1% sucrose and water. Between sessions, the configuration of the walls of the operant chambers was changed. Accordingly, the sucrose and water bottles were located at different positions in each new session. This was achieved by randomly positioning the bottles in one of the six customizable wall panels of the Med Associates chambers. As a result, animals had to re-learn reward location and bottle content at the start of each session. The habituation session (day 1) was identical to the experimental sessions but without any laser stimulation. The experimental sessions (day 2–4) were: (i) OFF-OFF-OFF session, in which no light was applied during any epoch (each epoch was 20 min); (ii) OFF-ON-OFF session, in which 8 mW, 589 nm (at fiber tip) constant laser light (DPSS laser; Laserglow) was applied during the second epoch (ON); (iii) ON-OFF-OFF session, in which 8 mW, 589 nm (at fiber tip) constant laser light (DPSS laser; Laserglow) was applied during the first epoch (ON). The order of the experimental sessions across days was counterbalanced between animals.

Histology and microscopy

After the final day of the optogenetic experiments (Fig. 4), animals were injected with 0.05 ml pentobarbital (390 mg/ml; IP) and transcardially perfused with 4% paraformaldehyde in PBS, pH 7.4. The brains were post-fixed overnight and coronal brain sections (80 µm) were prepared. Image acquisition was performed on a Zeiss AxioImager M2 upright widefield fluorescence/differential interference contrast microscope with charge-coupled device camera. Images were analyzed using Zeiss ZEN microscopy software. Sections were labeled relative to bregma using landmarks and neuroanatomical nomenclature as described in “The Mouse Brain in Stereotaxic Coordinates”⁶⁰. eNpHR or eYFP expression patterns were verified by an experimenter blinded to behavioral outcome.

Statistics

Comparative statistical tests were performed in GraphPad Prism 8; linear regressions shown in Fig. 2e were performed in Python (statsmodels). Comparative tests were unpaired except for the repeated measures experiments shown in Figs. 1c, 2f and 4. T-tests were performed for normally distributed data and log-normally distributed data (on log-transformed data). If data for one of the experimental groups was neither normally nor log-normally distributed, the non-parametric Mann-Whitney test was used. If data was normally distributed, but the variation between different groups was unequal, a Welch t-test was used. Tests were two-tailed unless a specific direction of effects was expected based on published data, which was the case for measure ‘% sucrose preference’ shown in Fig. 3. Statistical significance was *p < 0.05, **p < 0.01, ***p < 0.001. All data are presented as means ± standard deviation (Fig. 2d and Supplementary Fig. 5c) or standard error of the mean (all other figures). All details of the statistical analysis are summarized in Supplementary Table 1.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Source data are provided as a Source Data file. Python package SweetiePy, available at www.github.com/jeroenphv/SweetiePy, also contains an example of a raw data file with time stamps of individual licks in the SPT. Source data are provided with this paper.

Code availability

Python package SweetiePy is available at www.github.com/jeroenphv/SweetiePy.

References

Liu, M.-Y. et al. Sucrose preference test for measurement of stress-induced anhedonia in mice. Nat. Protoc. 13, 1686–1698 (2018).
Article CAS PubMed Google Scholar
Willner, P., Towell, A., Sampson, D., Sophokleous, S. & Muscat, R. Reduction of sucrose preference by chronic unpredictable mild stress, and its restoration by a tricyclic antidepressant. Psychopharmacology 93, 358–364 (1987).
Article CAS PubMed Google Scholar
Scheggi, S., De Montis, M. G. & Gambarana, C. Making Sense of Rodent Models of Anhedonia. Int. J. Neuropsychopharmacol. 21, 1049–1065 (2018).
Article CAS PubMed PubMed Central Google Scholar
Pizzagalli, D. A. Depression, stress, and anhedonia: toward a synthesis and integrated model. Annu. Rev. Clin. Psychol. 10, 393–423 (2014).
Article PubMed PubMed Central Google Scholar
Willner, P., Muscat, R. & Papp, M. Chronic mild stress-induced anhedonia: a realistic animal model of depression. Neurosci. Biobehav. Rev. 16, 525–534 (1992).
Article CAS PubMed Google Scholar
Agudelo, L. Z. et al. Skeletal muscle PGC-1α1 modulates kynurenine metabolism and mediates resilience to stress-induced depression. Cell 159, 33–45 (2014).
Article CAS PubMed Google Scholar
Cerniauskas, I. et al. Chronic stress induces activity, synaptic, and transcriptional remodeling of the lateral habenula associated with deficits in motivated behaviors. Neuron 104, 899–915.e8 (2019).
Article CAS PubMed PubMed Central Google Scholar
Chaudhury, D. et al. Rapid regulation of depression-related behaviors by control of midbrain dopamine neurons. Nature 493, 532–536 (2013).
Article ADS CAS PubMed Google Scholar
Krishnan, V. et al. Molecular adaptations underlying susceptibility and resistance to social defeat in brain reward regions. Cell 131, 391–404 (2007).
Article CAS PubMed Google Scholar
Lim, B. K., Huang, K. W., Grueter, B. A., Rothwell, P. E. & Malenka, R. C. Anhedonia requires MC4R-mediated synaptic adaptations in nucleus accumbens. Nature 487, 183–189 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Pignatelli, M. et al. Cooperative synaptic and intrinsic plasticity in a disynaptic limbic circuit drive stress-induced anhedonia and passive coping in mice. Mol. Psychiatry 26, 1860–1879 (2021).
Article PubMed Google Scholar
Ramirez, S. et al. Activating positive memory engrams suppresses depression-like behavior. Nature 522, 335–339 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Tye, K. M. et al. Dopamine neurons modulate neural encoding and expression of depression-related behavior. Nature 493, 537–541 (2012).
Article ADS PubMed PubMed Central Google Scholar
Yang, Y. et al. Ketamine blocks bursting in the lateral habenula to rapidly relieve depression. Nature 554, 317–322 (2018).
Article ADS CAS PubMed Google Scholar
Cheeta, S., Ruigt, G., van Proosdij, J. & Willner, P. Changes in sleep architecture following chronic mild stress. Biol. Psychiatry 41, 419–427 (1997).
Article CAS PubMed Google Scholar
Monteggia, L. M., Heimer, H. & Nestler, E. J. Meeting report: can we make animal models of human mental illness? Biol. Psychiatry 84, 542–545 (2018).
Article PubMed PubMed Central Google Scholar
Moreau, J.-L., Scherschlicht, R., Jenck, F. & Martin, J. R. Chronic mild stress-induced anhedonia model of depression; sleep abnormalities and curative effects of electroshock treatment. Behav. Pharmacol. 6, 682–687 (1995).
Article PubMed Google Scholar
Russo, S. J. & Nestler, E. J. The brain reward circuitry in mood disorders. Nat. Rev. Neurosci. 14, 609–625 (2013).
Article CAS PubMed Google Scholar
Solberg, L. C., Horton, T. H. & Turek, F. W. Circadian rhythms and depression: effects of exercise in an animal model. Am. J. Physiol.-Regul. Integr. Comp. Physiol. 276, R152–R161 (1999).
Article CAS Google Scholar
Willner, P. Animal models as simulations of depression. Trends Pharmacol Sci. 12, 131–136 (1991).
Article CAS PubMed Google Scholar
Boonstra, R. Reality as the leading cause of stress: rethinking the impact of chronic stress in nature. Funct. Ecol. 27, 11–23 (2013).
Article Google Scholar
Papp, M., Moryl, E. & Willner, P. Pharmacological validation of the chronic mild stress model of depression. Eur. J. Pharmacol. 296, 129–136 (1996).
Article CAS PubMed Google Scholar
Antoniuk, S., Bijata, M., Ponimaskin, E. & Wlodarczyk, J. Chronic unpredictable mild stress for modeling depression in rodents: Meta-analysis of model reliability. Neurosci. Biobehav. Rev. 99, 101–116 (2019).
Article PubMed Google Scholar
Mao, Y., Xu, Y. & Yuan, X. Validity of chronic restraint stress for modeling anhedonic-like behavior in rodents: a systematic review and meta-analysis. J. Int. Med. Res. 50, 3000605221075816 (2022).
Article CAS PubMed Google Scholar
Moreira, P. S., Almeida, P. R., Leite-Almeida, H., Sousa, N. & Costa, P. Impact of chronic stress protocols in learning and memory in rodents: systematic review and meta-analysis. PLoS ONE 11, e0163245 (2016).
Article PubMed PubMed Central Google Scholar
Berridge, K. C., Robinson, T. E. & Aldridge, J. W. Dissecting components of reward: ‘liking’, ‘wanting’, and learning. Curr. Opin. Pharmacol. 9, 65–73 (2009).
Article CAS PubMed PubMed Central Google Scholar
Coccurello, R. Anhedonia in depression symptomatology: appetite dysregulation and defective brain reward processing. Behav. Brain Res. 372, 112041 (2019).
Article PubMed Google Scholar
Berridge, K. C. Affective valence in the brain: modules or modes? Nat. Rev. Neurosci. 20, 225–234 (2019).
Article CAS PubMed PubMed Central Google Scholar
Berridge, K. C. & Kringelbach, M. L. Pleasure systems in the brain. Neuron 86, 646–664 (2015).
Article CAS PubMed PubMed Central Google Scholar
Berridge, K. C. & Robinson, T. E. Liking, wanting, and the incentive-sensitization theory of addiction. Am. Psychol. 71, 670–679 (2016).
Article PubMed PubMed Central Google Scholar
Treadway, M. T. & Zald, D. H. Reconsidering anhedonia in depression: lessons from translational neuroscience. Neurosci. Biobehav. Rev. 35, 537–555 (2011).
Article PubMed Google Scholar
Castro, D. C. & Berridge, K. C. Advances in the neurobiological bases for food ‘liking’ versus ‘wanting’. Physiol. Behav. 136, 22–30 (2014).
Article CAS PubMed Google Scholar
Robinson, O. J. & Chase, H. W. Learning and choice in mood disorders: searching for the computational parameters of anhedonia. Comput. Psychiatry Camb. Mass 1, 208–233 (2017).
Article Google Scholar
Naneix, F., Peters, K. Z. & McCutcheon, J. E. Investigating the effect of physiological need states on palatability and motivation using microstructural analysis of licking. Neuroscience 447, 155–166 (2020).
Article CAS PubMed Google Scholar
Sclafani, A., Thompson, B. & Smith, J. C. The rat’s acceptance and preference for sucrose, maltodextrin, and saccharin solutions and mixtures. Physiol. Behav. 63, 499–503 (1998).
Article CAS PubMed Google Scholar
Huys, Q. J., Pizzagalli, D. A., Bogdan, R. & Dayan, P. Mapping anhedonia onto reinforcement learning: a behavioral meta-analysis. Biol. Mood Anxiety Disord. 3, 12 (2013).
Article PubMed PubMed Central Google Scholar
Stephan, K. E., Penny, W. D., Daunizeau, J., Moran, R. J. & Friston, K. J. Bayesian model selection for group studies. NeuroImage 46, 1004–1017 (2009).
Article PubMed Google Scholar
Acosta, J. et al. Circadian modulation of motivation in mice. Behav. Brain Res. 382, 112471 (2020).
Article CAS PubMed Google Scholar
Strekalova, T., Spanagel, R., Bartsch, D., Henn, F. A. & Gass, P. Stress-induced anhedonia in mice is associated with deficits in forced swimming and exploration. Neuropsychopharmacology 29, 2007–2017 (2004).
Article PubMed Google Scholar
Heck, A. L. & Handa, R. J. Sex differences in the hypothalamic–pituitary–adrenal axis’ response to stress: an important role for gonadal hormones. Neuropsychopharmacology 44, 45–58 (2019).
Article CAS PubMed Google Scholar
Ferenczi, E. A. et al. Prefrontal cortical regulation of brainwide circuit dynamics and reward-related behavior. Science 351, aac9698 (2016).
Article PubMed PubMed Central Google Scholar
Verharen, J. P. H., den Ouden, H. E. M., Adan, R. A. H. & Vanderschuren, L. J. M. J. Modulation of value-based decision making behavior by subregions of the rat prefrontal cortex. Psychopharmacology 237, 1267–1280 (2020).
Article CAS PubMed PubMed Central Google Scholar
Reardon, S. Depression researchers rethink popular mouse swim tests. Nature 571, 456–457 (2019).
Article ADS CAS PubMed Google Scholar
Dess, N. K. & Edelheit, D. The bitter with the sweet: The taste/stress/temperament nexus. Biol. Psychol. 48, 103–119 (1998).
Article CAS PubMed Google Scholar
Dichter, G. S., Smoski, M. J., Kampov-Polevoy, A. B., Gallop, R. & Garbutt, J. C. Unipolar depression does not moderate responses to the Sweet Taste Test. Depress. Anxiety 27, 859–863 (2010).
Article PubMed PubMed Central Google Scholar
Berton, O. & Nestler, E. J. New approaches to antidepressant drug discovery: beyond monoamines. Nat. Rev. Neurosci. 7, 137–151 (2006).
Article CAS PubMed Google Scholar
Uher, R. et al. Depression symptom dimensions as predictors of antidepressant treatment outcome: replicable evidence for interest-activity symptoms. Psychol. Med. 42, 967–980 (2012).
Article CAS PubMed Google Scholar
Clarkson, J. M., Dwyer, D. M., Flecknell, P. A., Leach, M. C. & Rowe, C. Handling method alters the hedonic value of reward in laboratory mice. Sci. Rep. 8, 2448 (2018).
Article ADS PubMed PubMed Central Google Scholar
Cooper, J. A., Arulpragasam, A. R. & Treadway, M. T. Anhedonia in depression: biological mechanisms and computational models. Curr. Opin. Behav. Sci. 22, 128–135 (2018).
Article PubMed PubMed Central Google Scholar
Berridge, K. Measuring hedonic impact in animals and infants: microstructure of affective taste reactivity patterns. Neurosci. Biobehav. Rev. 24, 173–198 (2000).
Article CAS PubMed Google Scholar
Dwyer, D. M. Licking and liking: the assessment of hedonic responses in rodents. Q. J. Exp. Psychol. 65, 371–394 (2012).
Article Google Scholar
Kokras, N. & Dalla, C. Sex differences in animal models of psychiatric disorders: sex differences in models of psychiatric disorders. Br. J. Pharmacol. 171, 4595–4619 (2014).
Article CAS PubMed PubMed Central Google Scholar
Campbell, B. A. Absolute and relative sucrose preference thresholds for hungry and satiated rats. J. Comp. Physiol. Psychol. 51, 795–800 (1958).
Article CAS PubMed Google Scholar
Bainier, C., Mateo, M., Felder-Schmittbuhl, M.-P. & Mendoza, J. Circadian rhythms of hedonic drinking behavior in mice. Neuroscience 349, 229–238 (2017).
Article CAS PubMed Google Scholar
Willner, P., D’Aquila, P. S., Coventry, T. & Brain, P. Loss of social status: preliminary evaluation of a novel animal model of depression. J. Psychopharmacol. 9, 207–213 (1995).
Article CAS PubMed Google Scholar
Daunizeau, J., Adam, V. & Rigoux, L. VBA: a probabilistic treatment of nonlinear models for neurobiological and behavioral data. PLoS Comput. Biol. 10, e1003441 (2014).
Article ADS PubMed PubMed Central Google Scholar
Chiba, S. et al. Chronic restraint stress causes anxiety- and depression-like behaviors, downregulates glucocorticoid receptor expression, and attenuates glutamate release induced by brain-derived neurotrophic factor in the prefrontal cortex. Prog. Neuropsychopharmacol. Biol. Psychiatry 39, 112–119 (2012).
Article CAS PubMed Google Scholar
Domingues, M. et al. Effects of a selanylimidazopyridine on the acute restraint stress-induced depressive- and anxiety-like behaviors and biological changes in mice. Behav. Brain Res. 366, 96–107 (2019).
Article CAS PubMed Google Scholar
Wang, M. et al. Acute restraint stress enhances hippocampal endocannabinoid function via glucocorticoid receptor activation. J. Psychopharmacol. Oxf. Engl. 26, 56–70 (2012).
Article Google Scholar
Franklin, K. B. J. & Paxinos, G. Paxinos and Franklin’s The Mouse Brain in Stereotaxic Coordinates. (Academic Press, an imprint of Elsevier, 2013).

Download references

Acknowledgements

We thank Kurt Fraser for critical reading of the manuscript. S.L. is a John P. Stock Faculty Fellow, Rita Allen Scholar and Weill Neurohub Investigator. This work was supported by NIH grants (R01-DA042889, R01-MH123246; S.L.), the One Mind Foundation (047483; S.L.), the Rita Allen Foundation (S.L.), the Wayne and Gladys Valley Foundation (S.L.), the Weill Neurohub (S.L.) and the Netherlands Organization of Scientific Research (Rubicon postdoctoral fellowship; J.P.H.V.).

Author information

Authors and Affiliations

Department of Molecular and Cell Biology and Helen Wills Neuroscience Institute, University of California, Berkeley, CA, 94720, USA
Jeroen P. H. Verharen, Johannes W. de Jong, Yichen Zhu & Stephan Lammel

Authors

Jeroen P. H. Verharen
View author publications
You can also search for this author in PubMed Google Scholar
Johannes W. de Jong
View author publications
You can also search for this author in PubMed Google Scholar
Yichen Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Stephan Lammel
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Behavior: J.P.H.V., Y.Z. Stereotactic injections: J.P.H.V. Immunohistochemistry: J.P.H.V. Computational modeling: J.P.H.V., J.W.d.J. Study design, analysis, and interpretation: J.P.H.V., S.L. Manuscript written by J.P.H.V., S.L. and edited by all authors.

Corresponding author

Correspondence to Stephan Lammel.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Stan Floresco and the other, anonymous, reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Movie 1

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Verharen, J.P.H., de Jong, J.W., Zhu, Y. et al. A computational analysis of mouse behavior in the sucrose preference test. Nat Commun 14, 2419 (2023). https://doi.org/10.1038/s41467-023-38028-0

Download citation

Received: 11 December 2022
Accepted: 12 April 2023
Published: 27 April 2023
DOI: https://doi.org/10.1038/s41467-023-38028-0

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

A microstructure analysis of licking behavior in the SPT

A computational model of the SPT

Model validations

Variance in choices is explained by hedonia and learning

Effects of stress on SPT behavior

Optogenetic inhibition of mPFC neurons reduces sucrose preference by impairing learning

Discussion

Utility of the SPT for neuropsychiatric research

Role of SPT in measuring anhedonia

Choice size ratio as a proxy for hedonia

A toolbox for the interpretation of SPT experiments

Methods

Subjects

Stereotaxic surgeries

Sucrose preference test

Microstructure analysis of licking behavior

Computational models

Learning

Unchosen option modulation

Choice policy

Model selection

Model validations

Parameter recovery in simulated dataset

Posterior predictive check

Quinine adulteration

Acute and chronic stress paradigms

Optogenetics

Histology and microscopy

Statistics

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links