Specific topics, specific symptoms: linking the content of recurrent involuntary memories to mental health using computational text analysis

Yeung, Ryan C.; Fernandes, Myra A.

doi:10.1038/s44184-023-00042-x

Download PDF

Article
Open access
Published: 18 December 2023

Specific topics, specific symptoms: linking the content of recurrent involuntary memories to mental health using computational text analysis

npj Mental Health Research volume 2, Article number: 22 (2023) Cite this article

1055 Accesses
1 Citations
25 Altmetric
Metrics details

Subjects

Abstract

Researchers debate whether recurrent involuntary autobiographical memories (IAMs; memories of one’s personal past retrieved unintentionally and repetitively) are pathological or ordinary. While some argue that these memories contribute to clinical disorders, recurrent IAMs are also common in everyday life. Here, we examined how the content of recurrent IAMs might distinguish between those that are maladaptive (related to worse mental health) versus benign (unrelated to mental health). Over two years, 6187 undergraduates completed online surveys about recurrent IAMs; those who experienced recurrent IAMs within the past year were asked to describe their memories, resulting in 3624 text descriptions. Using a previously validated computational approach (structural topic modeling), we identified coherent topics (e.g., “Conversations”, “Experiences with family members”) in recurrent IAMs. Specific topics (e.g., “Negative past relationships”, “Abuse and trauma”) were uniquely related to symptoms of mental health disorders (e.g., depression, PTSD), above and beyond the self-reported valence of these memories. Importantly, we also found that content in recurrent IAMs was distinct across symptom types (e.g., “Communication and miscommunication” was related to social anxiety, but not symptoms of other disorders), suggesting that while negative recurrent IAMs are transdiagnostic, their content remains unique across different types of mental health concerns. Our work shows that topics in recurrent IAMs—and their links to mental health—are identifiable, distinguishable, and quantifiable.

Disentangling boredom from depression using the phenomenology and content of involuntary autobiographical memories

Article Open access 24 January 2024

Facing the pandemic and lockdown: an insight on mental health from a longitudinal study using diaries

Article Open access 15 March 2022

Transdiagnostic distortions in autobiographical memory recollection

Article 06 February 2023

Introduction

Memories of the personal past that are retrieved unintentionally and effortlessly have been termed involuntary autobiographical memories (IAMs¹). Recent evidence suggests that some IAMs are experienced recurrently—that is, episodes of the same event can be retrieved repetitively and involuntarily². Past studies indicate that recurrent IAMs are commonly experienced in everyday life: large proportions (52–55%) of general populations (e.g., undergraduates, nationally representative samples, community-dwelling older adults) have endorsed experiencing at least one recurrent IAM within the past year^2,3,4. However, recurrent IAMs have also been conceptualized as harmful or characteristic of psychiatric disorders^5,6, despite their prevalence among the public. For instance, these memories have been described as a transdiagnostic component of many clinical disorders (e.g., depression, anxiety, PTSD), acting as a mechanism by which psychopathology emerges or is maintained^5,7. As such, recurrent IAMs have been simultaneously characterized as maladaptive or clinically relevant on one hand⁵ and benign or pleasant on the other².

This discrepancy has been acknowledged by some researchers, who have suggested that while many people experience recurrent IAMs, it may be the case that only some subset of IAMs are dysfunctional or related to poor mental health^8,9,10,11. Indeed, our recent work has supported these hypotheses, finding that the maladaptive subset of recurrent IAMs could be those that are negative in valence: participants who experienced recurrent IAMs with self-reported negative valence had elevated symptoms of depression, PTSD, social anxiety, and general anxiety compared to those who experienced neutral, positive, or no recurrent IAMs^3,4. However, solely relying on self-reported ratings of memories’ phenomenological properties (e.g., valence) is a major limitation of this field to date. For one, valence ratings are confounded with content (i.e., what people report remembering) since certain events are likely to be more negative or positive than others. Without examining the events that participants are remembering, we cannot conclude as to whether it is negative valence per se that characterizes maladaptive recurrent IAMs or that maladaptive recurrent IAMs tend to involve certain types of content. Prior attempts to characterize recurrent IAMs have been done without knowing what these memories are actually about, prompting us to ask if content analysis could help explain differences between maladaptive and benign recurrent memories.

Past content analyses of autobiographical memories have been fruitful at describing the events that participants remember, revealing common topics such as accidents, holidays, and interpersonal relationships^12,13,14,15. Importantly, some suggest that content in recurrent IAMs might change as a function of mental health status, and could even provide insights into how recurrent IAMs might diverge across different disorders^5,6,16,17,18. In other words, researchers have hypothesized that content in AMs (especially recurrent IAMs) differs between (1) those experiencing high versus low levels of psychopathology, as well as between (2) those with different mental health disorders (disorder-specific content). For example, early work has reported that recurrent IAMs about abuses or assaults were significantly related to greater depression severity, whereas recurrent IAMs about other topics (e.g., “illness or death”, “relationships/family”) were not significantly related to psychopathology¹⁹. More generally, some studies have found that AM content differed between those high versus low in symptoms, or with versus without diagnoses: patients with severe health anxiety have been more likely to report AMs about disease, illness, or death compared to participants without diagnoses²⁰, and participants with high social anxiety have been more likely to report social anxiety-related AMs than participants with low social anxiety²¹. Researchers have also observed linguistic differences in AMs produced by individuals with versus without psychiatric diagnoses. For instance, participants with social anxiety disorder have been found to use more self-referential, anxiety-related, and sensory language compared to nonpsychiatric controls²². Beyond distinguishing between high versus low levels of psychopathology, these content differences in AMs have also been theorized to be disorder-congruent, in that recurrent IAMs should differ across disorders in terms of their themes or topics⁵. Indeed, linguistic variables (e.g., sensory-somatic and self-referential language) have been shown to differentiate AMs produced by those with bipolar disorder, unipolar depression, or no diagnoses²³.

Though these results have supported that there are associations between AM content and mental health, numerous studies have not reached such conclusions. Other researchers have observed no significant differences in IAM content as a function of psychopathology, including between dysphoric versus nondysphoric participants²⁴ and between participants with high versus low social anxiety²⁵. Previous work has also failed to find evidence of disorder-congruent content: while AM content from participants with anxiety disorders was significantly different from nonclinical controls, content did not significantly differ across various disorders (i.e., social anxiety disorder versus panic disorder²⁶). Taken together, it remains inconclusive as to whether AM content varies across symptom severity or across disorders. It is possible that some of these mixed findings could have arisen due to limitations such as (1) relatively small sample sizes, (2) modeling content as pure categories (e.g., labeling each memory as containing a single topic), and (3) valence being entangled with content.

First, sample sizes have typically been relatively small (<100) due to the populations being studied (e.g., people with psychiatric diagnoses), which limits recruitment. Sample sizes also have an upper limit when conducting traditional, manual content analyses, since large volumes of text can quickly become unfeasible to code manually. While data from relatively small sample sizes have provided valuable foundations for this research area, it is yet to be seen whether past findings generalize to larger, nonclinical samples. Second, the prevailing method of content analysis—single, mutually exclusive content labels being assigned to each document (i.e., single-membership models)—is a rather coarse measure of content. Evidence suggests that mixed-membership models (i.e., assuming that each document is a mixture of topics) offer a more granular measure of content that can illustrate topic structures distinct from those produced by single-membership models^27,28. Third, past studies have not typically disentangled content and valence. We believe examining both content and valence simultaneously is an important open research question because while some types of content likely involve congruent valence (e.g., deaths and negative valence), content and valence can also be relatively independent. A participant might recall a relatively unpleasant event yet ascribe neutral or positive valence to the memory (e.g., failing a test, which subsequently led to improvements in study habits); conversely, a memory involving a relatively mundane event can feel highly distressing (e.g., a family dinner, which evokes feelings of homesickness after having moved out). Some have suggested that content is relatively unimportant compared to feelings and thoughts evoked by one’s IAMs. For example, individuals’ negative appraisals of their IAMs have been found to be better predictors of depression symptoms than experimenter-rated severity of the recalled event²⁹. Given this, we asked whether recalled details of the event still matter after accounting for the valence ascribed to the memory. While many previous studies have measured content and valence in AMs and incorporated these variables into their analyses (e.g., matching AMs for emotional intensity²²), to the best of our knowledge, studies to date have not simultaneously examined the relationships of IAM content and valence with psychopathology.

Here, we tested the hypothesis that content in recurrent IAMs would be associated with symptoms of mental health disorders, unique from previously observed links between self-reported valence and psychopathology^3,4. Further, we also asked whether any relationships between content and symptoms would be distinct across different disorders (e.g., disorder-specific content). To address these questions, we analyzed both content and self-reported valence in recurrent IAMs experienced by a large nonclinical sample.

Methods

In a previous study, we conducted the first large-scale content analysis of recurrent IAMs using computational techniques (e.g., machine learning, natural language processing) and highlighted the validity of using semi-automated methods for content analysis¹⁵. Here, we used the same approach to assess how symptoms of mental health disorders (i.e., depression, posttraumatic stress, social anxiety, general anxiety) might uniquely predict the use of different content categories (i.e., topics) within recurrent IAMs. By using computational methods, the current study analyzed data from a much larger sample size (N = 6187) than previous work and allowed us to ask more nuanced questions about content in recurrent IAMs (e.g., modeling topics as continuous variables rather than categorizing memories as containing single topics).

Participants

As part of a previous study¹⁵, a convenience sample of undergraduate students was recruited at the University of Waterloo, who participated in return for course credit. Data were collected in five waves between September 2018 and February 2020, with each wave occurring at the start of an academic term (i.e., Fall/September, Winter/January, Spring/May). In total, 6187 unique individuals participated, and they produced 3624 text responses (not all participants experience recurrent IAMs, so not all participants can produce text responses; see Recurrent Memory Scale under Materials). Of these participants, 71% were women, 28% were men, and 1% were nonbinary, genderqueer, or gender nonconforming. Participants were mostly White/Caucasian (39%), East Asian (23%), or South Asian (19%), and were primarily born in Canada (66%), China (9%), or India (5%). Mean age was 19.9 (SD = 3.3, range = 16–49).

Materials

Recurrent Memory Scale

The Recurrent Memory Scale³ was used to assess participants’ recurrent IAMs. Participants indicated if they had experienced at least one recurrent IAM within the past year, not within the past year, or never². If they had experienced at least one within the past year, they wrote a brief description of their one most frequently recurring IAM and rated it on ten 5-point Likert scales (e.g., frequency of recurring, valence³). For instance, valence of their most frequently recurring IAM was assessed using the item “Is the recollection emotionally very positive, positive, neutral, negative, or very negative?” (−2 = very negative, 0 = neutral, 2 = very positive). While participants were administered the full scale, here we focus on participants’ text descriptions of their recurrent IAMs (content) and self-reported valence ratings.

Depression Anxiety Stress Scales

The Depression Anxiety Stress Scales-21 (DASS-21³⁰) consists of 21 items with three subscales: depression (DASS-D), anxiety (DASS-A), and stress (DASS-S). Internal consistency was high in the current sample for the full scale (α = .95) and the subscales for depression (α = .91), anxiety (α = .87), and stress (α = .89).

Posttraumatic Stress Disorder Checklist for DSM-5

The Posttraumatic Stress Disorder Checklist for DSM-5 (PCL-5³¹) consists of 20 items assessing symptoms of PTSD. Participants indicated the degree to which they experienced symptoms in the past month following any very stressful event of their choosing. Internal consistency was high in the current sample (α = .96).

Social Phobia Inventory

The Social Phobia Inventory (SPIN³²) consists of 17 items assessing fear, anxiety, and physical discomfort experienced during social situations. Internal consistency was high in the current sample (α = .95).

State-Trait Inventory of Cognitive and Somatic Anxiety—Trait Version

The State-Trait Inventory of Cognitive and Somatic Anxiety—Trait Version (STICSA-T³³) consists of 21 items assessing cognitive and somatic aspects of trait anxiety. Internal consistency was high in the current sample (α = .94).

Procedure

Participants opted into completing this study, for which they received course credit towards undergraduate psychology courses. It consisted of a 60-minute online survey completed in a single session. This survey was used by the University of Waterloo’s Department of Psychology to characterize students volunteering to participate in psychology-related studies. After providing informed consent, participants voluntarily completed a battery of questionnaires in a randomized order, including the Recurrent Memory Scale and mental health indices (DASS-21, PCL-5, SPIN, and STICSA-T). All other measures in the online survey were unrelated to the current study (e.g., administered by other researchers at the University of Waterloo). All procedures were approved by the University of Waterloo’s Office of Research Ethics (Protocol #40049).

Data preparation

Prior to analysis, we first used supervised machine learning (ML) to detect and remove invalid texts; these include “don’t know” responses, incomprehensible or nonsensical responses, or responses that are irrelevant to the question (e.g., describing dreams when the question asked about memories³⁴). Removing invalid texts is a recommended step in text analysis^35,36 because invalid texts are unrelated to the construct in question (here, recurrent IAMs), and excluding them reduces noise in the data. Previous work has confirmed that ML-based methods can be more effective at identifying invalid text responses compared to other existing approaches, such as response length or time³⁴. Here, the ML-based approach identified 202 texts as invalid (71 human labeled, 131 model predicted¹⁵), all of which were excluded from further analyses.

Valid texts were then preprocessed following current recommendations^35,36,37, including tokenization, cleaning, stop word removal³⁸, vocabulary pruning³⁹^, and lemmatization⁴⁰. Texts were represented using a bag-of-words, unigram approach⁴¹, which decomposes texts into singular words without retaining information about word order.

Topic modeling

We discovered topics in participants’ descriptions of their recurrent memories using structural topic modeling (STM^42,43). STM is a method of unsupervised machine learning that estimates hidden topic structures that could have plausibly produced the observed set of documents (i.e., corpus). By using texts as the input, topic modeling can output topics, or groups of words that can be interpreted as themes in the input texts^42,44,45. Given the success of past work highlighting STM as a valid method of semi-automated content analysis with AM texts (see¹⁵ for details about preprocessing, model selection, and validation with human judgment), we extended our prior approach to answer novel questions about recurrent IAMs and psychopathology.

In previous work, we constructed topic models based on this dataset using only one covariate: participants’ self-reported ratings of the memory’s valence¹⁵. Here, we analyzed this dataset in conjunction with mental health-related covariates: participants’ current symptoms of depression (DASS-D), PTSD (PCL-5), social anxiety (SPIN), and general anxiety (STICSA-T). Data and code supporting the findings of this study are openly available on the Open Science Framework (https://doi.org/10.17605/OSF.IO/GUR5V).

Results

Recurrent IAM valence predicts symptoms of mental health disorders

We replicated our previous findings in that negative valence in recurrent memories was significantly related to greater symptoms of depression, PTSD, social anxiety, and general anxiety^3,4 (Fig. 1). Elevated symptoms were associated with more negative valence in recurrent IAMs. Correlations between recurrent IAM valence and symptoms of mental health disorders were all significant (ps < .001, rs < −.16).

**Fig. 1: Self-reported valence ratings of recurrent IAMs and symptoms of mental health disorders.**

As an exploratory analysis, we also examined whether these patterns held when excluding participants who scored above clinically relevant cutoffs on any of the mental health indices^31,46,47,48 (Fig. S1 in Supplementary materials). Even following these exclusions (n_excluded = 1562; 63%), all patterns remained significant (ps < .03, rs < −.07).

Topic structure and modeling

To examine content, we implemented structural topic modeling (STM) using the {stm} package in R⁴³. Model selection and validation procedures are reported elsewhere in detail¹⁵. In brief, researchers must select an a priori number of topics to be identified when using STM⁴³. To select an appropriate number of topics, we simulated and inspected models with the same parameters (e.g., covariates) across a varying number of topics (5–25 topics^36,49). We then selected a final number of topics using a two-stage approach^41,50. First, internal validation (based on computed metrics derived from the data) guided the initial selection of three candidate models out of the twenty simulated models. Second, external validation (based on human judgment and performance measures) guided the selection of the final model out of the three candidate models¹⁵. Previous work has indicated consistency between the current topics (discovered via semi-automated methods) and topics typically found in AMs (using manual methods¹⁵). The final topic structure obtained is shown in Table 1. Inclusion of the additional mental health-related covariates (participants’ scores on the DASS-D, PCL-5, STICSA-T, and SPIN) did not alter the topic structure obtained during the original study¹⁵, which only included valence as a covariate. Correlations between topics are also described in this previous study¹⁵.

Table 1 Topics in recurrent IAMs.

Full size table

Predicting topic prevalence using symptoms of mental health disorders

Symptoms of mental health disorders significantly accounted for unique variance in topic prevalence, even when controlling for valence ratings and symptoms of all other disorders (see https://doi.org/10.17605/OSF.IO/GUR5V for details). We found unique relationships between specific topics and specific symptoms of disorders, above and beyond how positive or negative a memory was rated (Table 2).

Table 2 Significant predictors of topic prevalence in recurrent IAMs

Full size table

Depression

Depression symptoms were significantly and uniquely predictive of greater use of topic 8 (“Abuse and trauma”; Fig. 2).

PTSD

PTSD symptoms were significantly and uniquely predictive of greater use of topic 2 (“Negative past relationships”). Further, PTSD symptoms were significantly and uniquely predictive of less use of topic 4 (“Embarrassing events”), topic 9 (“Conversations”), topic 11 (“Interactions with friends”; Fig. 3).

Social anxiety

Social anxiety symptoms were significantly and uniquely predictive of greater use of topic 12 (“Communication and miscommunication”) and topic 16 (“Reflections on decisions”). Further, social anxiety symptoms were significantly and uniquely predictive of less use of topic 2 (“Negative past relationships”), topic 8 (“Abuse and trauma”; Fig. 4).

General anxiety

General anxiety symptoms were significantly and uniquely predictive of greater use of topic 9 (“Conversations”; Fig. 5).

Discussion

Controversy surrounds the basic nature of recurrent IAMs. What are they typically about? Which of them—if any—are dysfunctional (i.e., related to worse mental health)? Some authors have speculated that only a subset of recurrent IAMs is maladaptive^8,9. Evidence suggests that this maladaptive subset could be characterized by negative valence^{3,4,51,52,53,54}. However, valence is potentially entangled with content (i.e., what people report remembering), since valence is related but dissociable from content. In other words, while some events or topics might typically be associated with certain valences (e.g., “Experiences with family members” and positive valence), content can also be relatively independent from valence. It remains relatively unexplored whether content (one’s reconstruction of the event) is related to psychopathology, after accounting for valence (emotional responses to IAMs). In the current study, we examined whether the content and valence of recurrent IAMs could differentiate between memories that are related to worse mental health status versus those that are not. By analyzing content in large samples of recurrent IAMs using structural topic modeling^15,42,43, our work indicates that both the valence and content of one’s recurrent IAMs are linked to symptoms of mental health disorders.

Specifically, we replicated the relationship between negative IAM valence and greater symptoms of all disorders^3,4. This result is consistent with past work showing more negative AMs in individuals with major depressive disorder⁵², PTSD⁵⁴, social anxiety disorder⁵³, or generalized anxiety disorder⁵¹. It also replicates a significant association between more negative AM valence and greater depression symptoms⁵⁵. Interestingly, Rubin et al.¹⁵ also report a nonsignificant correlation between AM valence and PTSD symptoms (r = −.12). Here, we found significant negative correlations between recurrent IAM valence and depression, as well as PTSD symptoms, potentially because of the much larger sample size (and statistical power to detect effects). Alternatively, the current work may have found these effects because it focused on recurrent IAMs whereas Rubin et al.¹⁵ examined a variety of AMs (e.g., both voluntary and involuntary ones); it may be that involuntary or recurrent memories have stronger relationships to mental health than voluntary memories. The current results provide empirical support for theoretical models wherein recurrent IAMs, and the emotions they evoke, are a transdiagnostic process involved in psychopathology^5,7, even in large, nonclinical samples.

In addition, our topic model allowed us to test hypotheses about whether one’s level of psychopathology is associated with recurrent memory content^2,3,5. We used participants’ symptoms of mental health disorders (i.e., depression, PTSD, social anxiety, general anxiety) to predict the content (i.e., topic prevalence) within recurrent IAMs. What we have shown here is that some topics were seemingly benign: some topics such as “Physical activities and performance” and “Environment and locations” were not significantly related to any mental health index. Critically, specific topics were significantly associated with symptoms of some mental health disorders, but not others; no topics were universally related to symptoms of all mental health disorders (i.e., depression, PTSD, social anxiety, general anxiety), suggesting that recurrent IAM content is disorder-specific.

One of our key findings was that each mental health index uniquely predicted the prevalence of distinct topics. For example, while PTSD symptoms were significant positive predictors of the “Negative past relationships” topic (e.g., “relationship”, “negative”, “situation”, “traumatic”), depressive symptoms were significant positive predictors of the “Assaults and abuse” topic (e.g., “assault”, “abuse”, “trauma”, “fail”). Furthermore, social anxiety symptoms were significant positive predictors of the “Communication and miscommunication” topic (e.g., “question”, “ask”, “teacher”, “class”), and general anxiety symptoms were significant positive predictors of the “Conversations” topic (e.g., “someone”, “conversation”, “say”, “person”). These findings support the idea that there is indeed disorder-specific content in recurrent IAMs^6,16, and that recurrent IAMs containing certain types of content are more likely to reflect psychopathology than other types of content^5,19. In fact, we replicated a significant positive relationship between use of the “Assaults and abuse” topic and depression symptoms^19,56, but with a much larger sample size and in a nonclinical sample. Our current evidence also lends support to hypotheses about the nature of emotional memory processes in PTSD. In particular, models of PTSD have suggested that difficulties in retrieving positive memories could underlie the development or maintenance of PTSD^57,58,59,60. In line with these ideas, our results indicated that greater PTSD symptoms were related to less use of a positive topic (“Interactions with friends”) in recurrent IAMs. Topics significantly related to social anxiety (e.g., topic 12: “Communication and miscommunication”, e.g., incorrectly answering a question aloud during a class) seem to reflect ideas that recurrent IAMs in social anxiety disorder might focus on specific, aversive social events that individuals have experienced^53,61. Similarly, topics significantly related to general anxiety (topic 9: “Conversations”) might reflect the high prevalence of social/interpersonal concerns as a worry topic in generalized anxiety disorder^62,63.

Overall, our results suggest that while it is accurate to say that negative recurrent IAMs are consistently related to increased symptoms of psychopathology^3,4, this statement can now be refined. Here, we show that both valence and specific types of content in recurrent IAMs are related to symptoms; self-reported valence as well as the use of specific topics in recurrent IAMs were significantly related to mental health indices. Moreover, many negative topics were not significantly related to symptoms of any disorder (e.g., “Stressful events”, “Confrontations, fights, and arguments”). Though emotions evoked by recurrent IAMs were an important component in the relationship between these memories and psychopathology, our study suggests that content (e.g., types of events described, how the individual reconstructs them) is also vital to consider and provides unique insight into mental health status. This is notable because it suggests that the emotional valence ascribed to specific memories is not entirely sufficient to distinguish maladaptive recurrent memories from benign ones. Based on our analyses, even if participants attributed the same level of valence to their memories (e.g., “very negative”), the way they reconstructed the memory (i.e., content) was still significantly related to their current symptoms of mental health disorders.

In terms of limitations, we only investigated recurrent IAMs—a specific type of AM—in the present study. While there is theoretical precedent claiming that these memories are particularly relevant to psychopathology^5,7, the current results could be compared against voluntary AMs to test these previous hypotheses. For instance, it would be valuable to examine if there are dissociations or similarities between voluntary and involuntary AM content in relation to PTSD symptoms (as in⁵⁵). Also due to the current study’s focus on recurrent IAMs, our data were largely limited to a past-focused temporal orientation, as participants were instructed to only describe/rate memories. While participants were free to describe future-oriented content, this would have been incidental (at the discretion of the participant) and related to their past-oriented memories. Future studies could compare memories to other forms of spontaneous or clinically relevant thought (such as episodic future thoughts, ruminations, or worries) to investigate how content and/or valence might have varying relationships with psychopathology depending on the type of thought. Alternatively, studies could qualitatively classify topics in terms of whether they are more past- or future-oriented.

Another relevant design choice was to focus on participants’ one most frequently recurring IAM. This approach is consistent with past literature^2,3,4, which facilitates comparison with prior studies. However, past work has also found that participants report experiencing many different recurrent IAMs (M = 7.3³). As such, future studies could ask participants to list all (or at least multiple) recurrent IAMs they have experienced recently. By asking participants to describe and/or rate these less frequent recurrent IAMs, one could potentially better capture the variety of memories that individuals experience in their daily lives.

Additionally, it is worth noting that the cross-sectional nature of the current study also introduces limitations. Participants completed all measures in a single, asynchronous, online session. Potentially, any given participant could have been in a negative mood during the study, which could have led to greater accessibility of negative recurrent IAMs, or more negative ratings of these memories’ valence¹. At the same time, negative mood could have inflated participants’ scores on mental health indices⁶⁴. While mood is indeed part of theoretical models of why recurrent IAMs persist and correlate with mental health outcomes^29,65, the current work cannot conclusively discern the directionality of effects observed. Future work could consider methods such as ecological momentary assessment⁶⁶ to unravel temporal dynamics of recurrent IAMs and the emotions preceding/following them.

We also used a large, nonclinical convenience sample of undergraduates in the current study. While it can be assumed that these participants were overall nonclinical (e.g., not patients with psychological/psychiatric diagnoses), based on general base rates of psychopathology, it is probable that some number of our participants were experiencing clinically relevant mental health challenges. Future work could recruit large samples of clinical and/or nonclinical participants and potentially examine whether the current observations are consistent across populations. In a similar vein, the current study examined symptoms as a dimensional measure of psychopathology, rather than a categorical distinction (e.g., individuals with disorders vs. individuals without). It remains possible that psychopathology could have nonlinear relationships with recurrent IAMs that emerge when comparing those with or without diagnosed mental health disorders.

Finally, future work could extend the computational approach taken in the current study. Here, we used a unigram, bag-of-words approach of topic modeling, which is common in the natural language processing literature⁶⁷. While this approach typically performs well and effectively captures content^41,42, it has a few noteworthy drawbacks. Unigram approaches can miss multiword phrases (e.g., “family” and “member” being highly representative of topic 6, rather than “family member”), which could potentially make a topic difficult for researchers to interpret (e.g., seeing “member” without “family”). Moreover, the bag-of-words approach does not have access to information about word order⁴¹, which can sometimes change the meaning of a document (e.g., “is it good?” vs. “it is good”). Developments in natural language processing and associated software could enable future studies to incorporate these additional features into topic models.

In conclusion, our current study is the first to provide a comprehensive description of how both recurrent IAM valence and content predict symptoms of mental health disorders in a large, nonclinical sample. While recurrent IAM valence seems to have relatively homogenous relationships with symptoms across various disorders, recurrent IAM content appears to differentiate between disorders. By using machine learning and natural language processing techniques in a novel application (i.e., autobiographical memory), we present a robust and reproducible topic model that reveals unique relationships between specific topics and specific mental health indices. Our work shows that AM phenomenology (e.g., valence) and content can be analyzed in tandem, and at much larger scales than previously thought possible, to answer critical questions about the fundamental nature of recurrent IAMs. Topics in recurrent IAMs—and their links to mental health—are identifiable, distinguishable, and quantifiable.

Data availability

The datasets generated and/or analyzed during the current study are available on the Open Science Framework (https://doi.org/10.17605/OSF.IO/GUR5V).

Code availability

The underlying code for this study is available on the Open Science Framework (https://doi.org/10.17605/OSF.IO/GUR5V).

References

Berntsen, D. Involuntary autobiographical memories. Appl. Cognit. Psychol. 10, 435–454 (1996).
Article Google Scholar
Berntsen, D. & Rubin, D. C. The reappearance hypothesis revisited: recurrent involuntary memories after traumatic events and in everyday life. Mem. Cognit. 36, 449–460 (2008).
Article PubMed PubMed Central Google Scholar
Yeung, R. C. & Fernandes, M. A. Recurrent involuntary autobiographical memories: Characteristics and links to mental health status. Memory 28, 753–765 (2020).
Article PubMed Google Scholar
Yeung, R. C. & Fernandes, M. A. Recurrent involuntary memories are modulated by age and linked to mental health. Psychol. Aging 36, 883–890 (2021).
Article PubMed Google Scholar
Brewin, C. R., Gregory, J. D., Lipton, M. & Burgess, N. Intrusive images in psychological disorders: characteristics, neural mechanisms, and treatment implications. Psychol. Rev. 117, 210–232 (2010).
Article PubMed PubMed Central Google Scholar
Bryant, R. A., O’Donnell, M. L., Creamer, M., McFarlane, A. C. & Silove, D. Posttraumatic intrusive symptoms across psychiatric disorders. J. Psychiatr. Res. 45, 842–847 (2011).
Article PubMed Google Scholar
Harvey, A. G., Watkins, E., Mansell, W. & Shafran, R. Cognitive Behavioural Processes Across Psychological Disorders: A Transdiagnostic Approach To Research and Treatment. (Oxford University Press, 2004).
Berntsen, D. The unbidden past: Involuntary autobiographical memories as a basic mode of remembering. Curr. Dir. Psychol. Sci. 19, 138–142 (2010).
Article Google Scholar
Clark, D. A. & Rhyno, S. Unwanted intrusive thoughts in nonclinical individuals: implications for clinical disorders. In Clark, D. A. (Ed.), Intrusive thoughts in clinical disorders: Theory, Research, and Treatment 1–29. (Guilford Press, New York, NY, USA, 2005).
Kvavilashvili, L. Solving the mystery of intrusive flashbacks in posttraumatic stress disorder: comment on Brewin (2014). Psychol. Bull. 140, 98–104 (2014).
Article PubMed Google Scholar
Iyadurai, L. et al. Intrusive memories of trauma: A target for research bridging cognitive science and its clinical application. Clin. Psychol. Rev. 69, 67–82 (2019).
Article PubMed PubMed Central Google Scholar
Schlagman, S., Schulz, J. & Kvavilashvili, L. A content analysis of involuntary autobiographical memories: examining the positivity effect in old age. Memory 14, 161–175 (2006).
Article PubMed Google Scholar
Grysman, A. Collecting narrative data on Amazon’s Mechanical Turk. Appl. Cognit. Psychol. 29, 573–583 (2015).
Article Google Scholar
Williams, A. D. & Moulds, M. L. An investigation of the cognitive and experiential features of intrusive memories in depression. Memory 15, 912–920 (2007).
Article PubMed Google Scholar
Yeung, R. C., Stastna, M. & Fernandes, M. A. Understanding autobiographical memory content using computational text analysis. Memory 30, 1267–1287 (2022).
Article PubMed Google Scholar
Gehrt, T. B., Frostholm, L., Obermann, M.-L. & Berntsen, D. Thought characteristics in patients with severe health anxiety: A comparison with obsessive–compulsive disorder and healthy controls. Psychol. Conscious. Theory Res. Pract. 10, 76–87 (2022).
Google Scholar
Reynolds, M. & Brewin, C. R. Intrusive memories in depression and posttraumatic stress disorder. Behav. Res. Ther. 37, 201–215 (1999).
Article CAS PubMed Google Scholar
Wenzel, A., Pinna, K. & Rubin, D. C. Autobiographical memories of anxiety-related experiences. Behav. Res. Ther. 42, 329–341 (2004).
Article PubMed Google Scholar
Brewin, C. R., Hunter, E., Carroll, F. & Tata, P. Intrusive memories in depression: an index of schema activation. Psychol. Med. 26, 1271–1276 (1996).
Article CAS PubMed Google Scholar
Gehrt, T. B., Frostholm, L., Obermann, M.-L. & Berntsen, D. Autobiographical memory and episodic future thinking in severe health anxiety: A comparison with obsessive–compulsive disorder. Cognit. Ther. Res. 44, 89–107 (2020).
Article Google Scholar
Krans, J., de Bree, J. & Bryant, R. A. Autobiographical memory bias in social anxiety. Memory 22, 890–897 (2014).
Article PubMed Google Scholar
Anderson, B., Goldin, P. R., Kurita, K. & Gross, J. J. Self-representation in social anxiety disorder: Linguistic analysis of autobiographical narratives. Behav. Res. Ther. 46, 1119–1125 (2008).
Article PubMed PubMed Central Google Scholar
Mariani, R., Di Trani, M., Negri, A. & Tambelli, R. Linguistic analysis of autobiographical narratives in unipolar and bipolar mood disorders in light of multiple code theory. J. Affect. Disord. 273, 24–31 (2020).
Article CAS PubMed Google Scholar
Kvavilashvili, L. & Schlagman, S. Involuntary autobiographical memories in dysphoric mood: a laboratory study. Memory 19, 331–345 (2011).
Article PubMed Google Scholar
Ashbaugh, A. R., Fishman, K. N. & Houle-Johnson, S. A. Intrusive social images in individuals with high and low social anxiety: a multi-method analysis. Behav. Cognit. Psychother. 47, 594–610 (2019).
Article Google Scholar
O’Toole, M. S., Watson, L. A., Rosenberg, N. K. & Berntsen, D. Negative autobiographical memories in social anxiety disorder: A comparison with panic disorder and healthy controls. J. Behav. Ther. Exp. Psychiatry 50, 223–230 (2016).
Article PubMed Google Scholar
Erosheva, E., Fienberg, S. & Lafferty, J. Mixed-membership models of scientific publications. Proc. Natl Acad. Sci. 101, 5220–5227 (2004).
Article CAS PubMed PubMed Central Google Scholar
Griffiths, T. L. & Steyvers, M. Finding scientific topics. Proc. Natl Acad. Sci. 101, 5228–5235 (2004).
Article CAS PubMed PubMed Central Google Scholar
Starr, S. & Moulds, M. L. The role of negative interpretations of intrusive memories in depression. J. Affect. Disord. 93, 125–132 (2006).
Article PubMed Google Scholar
Lovibond, P. F. & Lovibond, S. H. The structure of negative emotional states: comparison of the Depression Anxiety Stress Scales (DASS) with the beck depression and anxiety inventories. Behav. Res. Ther. 33, 335–343 (1995a).
Article CAS PubMed Google Scholar
Weathers, F. W. et al. The PTSD Checklist for DSM-5 (PCL-5). Scale available from the National Center for PTSD. www.ptsd.va.gov (2013).
Connor, K. M. et al. Psychometric properties of the Social Phobia Inventory (SPIN). Br. J. Psychiatry 176, 379–386 (2000).
Article CAS PubMed Google Scholar
Grös, D. F., Antony, M. M., Simms, L. J. & McCabe, R. E. Psychometric properties of the State-Trait Inventory for Cognitive and Somatic Anxiety (STICSA): comparison to the State-Trait Anxiety Inventory (STAI). Psychol. Assess. 19, 369–381 (2007).
Article PubMed Google Scholar
Yeung, R. C. & Fernandes, M. A. Machine learning to detect invalid text responses: validation and comparison to existing detection methods. Behav. Res. Methods 54, 3055–3070 (2022).
Article PubMed Google Scholar
Banks, G. C., Woznyj, H. M., Wesslen, R. S. & Ross, R. L. A review of best practice recommendations for text analysis in R (and a user-friendly app).J. Bus. Psychol. 33, 445–459 (2018).
Article Google Scholar
Maier, D. et al. Applying LDA topic modeling in communication research: toward a valid and reliable methodology. Commun. Methods. Measures 12, 93–118 (2018).
Article Google Scholar
Kobayashi, V. B., Mol, S. T., Berkers, H. A., Kismihók, G. & Den Hartog, D. N. Text mining in organizational research. Org. Res. Methods 21, 733–765 (2018).
Article Google Scholar
Porter, M. F. Snowball: A language for stemming algorithms. https://snowballstem.org/texts/introduction.html (2001).
Maier, D., Niekler, A., Wiedemann, G. & Stoltenberg, D. How document sampling and vocabulary pruning affect the results of topic models. Comput. Commun. Res. 2, 139–152 (2020).
Article Google Scholar
Benoit, K. & Matsuo, A. spacyr: Wrapper to the spaCy NLP library. R package version 1.2.1. https://spacyr.quanteda.io (2022).
Grimmer, J. & Stewart, B. M. Text as data: the promise and pitfalls of automatic content analysis methods for political texts. Political Anal. 21, 267–297 (2013).
Article Google Scholar
Roberts, M. E. et al. Structural topic models for open‐ended survey responses. Am. J. Political Sci. 58, 1064–1082 (2014).
Article Google Scholar
Roberts, M. E., Stewart, B. M. & Tingley, D. stm: an R package for structural topic models. J. Stat. Softw. 91, 1–40 (2019).
Article Google Scholar
Blei, D. M., Ng, A. Y. & Jordan, M. I. Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003).
Google Scholar
DiMaggio, P., Nag, M. & Blei, D. Exploiting affinities between topic modeling and the sociological perspective on culture: application to newspaper coverage of US government arts funding. Poetics 41, 570–606 (2013).
Article Google Scholar
Lovibond, S. H., & Lovibond, P. F. Manual for the Depression Anxiety Stress Scales 2nd edn. (Psychology Foundation of Australia, 1995b).
Moscovitch, D. A., Rodebaugh, T. L. & Hesch, B. D. How awkward! Social anxiety and the perceived consequences of social blunders. Behav. Res. Ther. 50, 142–149 (2012).
Article PubMed Google Scholar
Van Dam, N. T., Grös, D. F., Earleywine, M. & Antony, M. M. Establishing a trait anxiety threshold that signals likelihood of anxiety disorders. Anxiety Stress Coping 26, 70–86 (2013).
Article PubMed Google Scholar
Blei, D. M., & Lafferty, J. D. Topic models. In Srivastava, A. N. & Sahami, M. (Eds.), Text Mining: Classification, Clustering, and Applications 71–94. (Taylor and Francis, New York, NY, USA, 2009).
Quinn, K. M., Monroe, B. L., Colaresi, M., Crespin, M. H. & Radev, D. R. How to analyze political attention with minimal assumptions and costs. Am. J. Political Sci. 54, 209–228 (2010).
Article Google Scholar
Burke, M. & Mathews, A. Autobiographical memory and clinical anxiety. Cognit. Emot. 6, 23–35 (1992).
Article Google Scholar
Gotlib, I. H. & Joormann, J. Cognition and depression: current status and future directions. Annu. Rev. Clin. Psychol. 6, 285–312 (2010).
Article PubMed PubMed Central Google Scholar
Moscovitch, D. A. et al. Autobiographical memory retrieval and appraisal in social anxiety disorder. Behav. Res. Ther. 107, 106–116 (2018).
Article PubMed Google Scholar
Rubin, D. C., Dennis, M. F. & Beckham, J. C. Autobiographical memory for stressful events: the role of autobiographical memory in posttraumatic stress disorder. Conscious. Cognit 20, 840–856 (2011).
Article Google Scholar
Rubin, D. C., Boals, A. & Berntsen, D. Memory in posttraumatic stress disorder: properties of voluntary and involuntary, traumatic and nontraumatic autobiographical memories in people with and without posttraumatic stress disorder symptoms. J. Exp. Psychol. Gen. 137, 591–614 (2008).
Article PubMed PubMed Central Google Scholar
Kuyken, W. & Brewin, C. R. Intrusive memories of childhood abuse during depressive episodes. Behav. Res. Ther. 32. https://doi.org/10.1016/0005-7967(94)90140-6 (1994).
Contractor, A. A., Banducci, A. N. & Weiss, N. H. Critical considerations for the positive memory‐posttraumatic stress disorder model. Clin. Psychol. Psychother.29, 81–91 (2022).
Article PubMed Google Scholar
Contractor, A. A. et al. Posttraumatic stress disorder and positive memories: clinical considerations. J. Anxiety Disord. 58, 23–32 (2018).
Article PubMed Google Scholar
McNally, R. J., Lasko, N. B., Macklin, M. L. & Pitman, R. K. Autobiographical memory disturbance in combat-related posttraumatic stress disorder. Behav. Res. Ther. 33, 619–630 (1995).
Article CAS PubMed Google Scholar
Williams, J. M. G. et al. Autobiographical memory specificity and emotional disorder. Psychol. Bull. 133, 122–148 (2007).
Article PubMed PubMed Central Google Scholar
Hackmann, A., Clark, D. M. & McManus, F. Recurrent images and early memories in social phobia. Behav. Res. Ther. 38, 601–610 (2000).
Article CAS PubMed Google Scholar
Becker, E. S., Goodwin, R., Hölting, C., Hoyer, J. & Margraf, J. Content of worry in the community: what do people with generalized anxiety disorder or other disorders worry about. J. Nerv. Ment. Dis. 191, 688–691 (2003).
Article PubMed Google Scholar
Roemer, L., Molina, S. & Borkovec, T. D. An investigation of worry content among generally anxious individuals. J. Nerv. Ment. Dis. 185, 314–319 (1997).
Article CAS PubMed Google Scholar
Pavlova, B. & Uher, R. Assessment of psychopathology: Is asking questions good enough. JAMA Psychiatry 77, 557–558 (2020).
Article PubMed Google Scholar
Ehlers, A. & Steil, R. Maintenance of intrusive memories in posttraumatic stress disorder: a cognitive approach. Behav. Cognit. Psychother. 23, 217–249 (1995).
Article CAS Google Scholar
Shiffman, S., Stone, A. A. & Hufford, M. R. Ecological momentary assessment. Annu. Rev. Clin. Psychol. 4, 1–32 (2008).
Article PubMed Google Scholar
Wallach, H. M. Topic modeling: Beyond bag-of-words. In Proc. of the 23rd International Conference on Machine Learning. 977–984. (2006).

Download references

Acknowledgements

This work was supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) through an Alexander Graham Bell Canada Graduate Scholarship (CGSD3-535024-2019) and Postdoctoral Fellowship (PDF-578187-2023) awarded to author R.C.Y. and an NSERC Discovery Grant (2020-03917) awarded to author M.A.F. The funder played no role in the study design, data collection, analysis, interpretation of data, or the writing of this manuscript.

Author information

Authors and Affiliations

Department of Psychology, University of Waterloo, Waterloo, ON, Canada
Ryan C. Yeung & Myra A. Fernandes
Rotman Research Institute, Baycrest Health Sciences, Toronto, ON, Canada
Ryan C. Yeung

Authors

Ryan C. Yeung
View author publications
You can also search for this author in PubMed Google Scholar
Myra A. Fernandes
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.C.Y.: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Software, Validation, Visualization, Writing—original draft, and Writing—review & editing. M.A.F.: Conceptualization, Funding acquisition, Project administration, Resources, Supervision, and Writing—review & editing. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Ryan C. Yeung.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Figure S1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yeung, R.C., Fernandes, M.A. Specific topics, specific symptoms: linking the content of recurrent involuntary memories to mental health using computational text analysis. npj Mental Health Res 2, 22 (2023). https://doi.org/10.1038/s44184-023-00042-x

Download citation

Received: 16 February 2023
Accepted: 19 October 2023
Published: 18 December 2023
DOI: https://doi.org/10.1038/s44184-023-00042-x

This article is cited by

Disentangling boredom from depression using the phenomenology and content of involuntary autobiographical memories
- Ryan C. Yeung
- James Danckert
- Myra A. Fernandes
Scientific Reports (2024)

Subjects

Abstract

Similar content being viewed by others

Disentangling boredom from depression using the phenomenology and content of involuntary autobiographical memories

Facing the pandemic and lockdown: an insight on mental health from a longitudinal study using diaries

Transdiagnostic distortions in autobiographical memory recollection

Introduction

Methods

Participants

Materials

Recurrent Memory Scale

Depression Anxiety Stress Scales

Posttraumatic Stress Disorder Checklist for DSM-5

Social Phobia Inventory

State-Trait Inventory of Cognitive and Somatic Anxiety—Trait Version

Procedure

Data preparation

Topic modeling

Results

Recurrent IAM valence predicts symptoms of mental health disorders

Topic structure and modeling

Predicting topic prevalence using symptoms of mental health disorders

Depression

PTSD

Social anxiety

General anxiety

Discussion

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary Information

Figure S1

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Disentangling boredom from depression using the phenomenology and content of involuntary autobiographical memories

Search

Quick links