Contribution of Chronic Disease in Predicting Depression and Suicidal Ideation Among the Older Adult Population

Article information

Psychiatry Investig. 2025;22(9):1068-1076
Publication date (electronic) : 2025 August 21
doi : https://doi.org/10.30773/pi.2024.0106
1Department of Health Care Sciences, Graduate School, Transdisciplinary Major in Learning Health Systems, Korea University, Seoul, Republic of Korea
2Department of Public Health Sciences, Graduate School, Transdisciplinary Major in Learning Health Systems, Korea University, Seoul, Republic of Korea
3Department of Health Policy and Management, College of Health Science, Korea University, Seoul, Republic of Korea
4Department of Health and Environmental Science, College of Health Science, Korea University, Seoul, Republic of Korea
5Department of Physical Therapy, College of Health Science, Korea University, Seoul, Republic of Korea
Correspondence: Hae-Young Kim, PhD, DDS Department of Health Policy and Management, College of Health Science, Korea University, 145 Anam-ro, Seongbuk-gu, Seoul 02841, Republic of Korea Tel: +82-2-3290-5667, Fax: +82-2-940-2879, E-mail: kimhaey@korea.ac.kr
Received 2024 March 25; Revised 2025 March 24; Accepted 2025 July 13.

Abstract

Objective

This study aimed to clarify how chronic diseases (CDs) contribute to depression and suicidal ideation (SI) prediction using machine learning (ML) techniques among the older adult population.

Methods

National representative data of 5,419 older adults from the Korea National Health and Nutrition Examination Survey conducted in 2013, 2015, 2017, and 2019 were used in this study. The number and type of CDs were incorporated into Models 1 and 2, respectively, using five ML methods.

Results

The average age of the participants was 72.7 years, with 43.2% males, 15.2% reporting depression, and 7.3% reporting SI. The number of CDs was correlated with increased depression and SI. The ML models showed moderate-to-good performance in the prediction of depression and SI. The area under the curve (AUC) values for Model 1 ranged from 0.729 to 0.772 for depression, and from 0.754 to 0.793 for SI. In Model 2, the AUC ranged from 0.704 to 0.768 for depression and from 0.750 to 0.785 for SI. More depression and SI were expected when the number of CDs was one or more and two or more, respectively. The top predictors of depression were osteoarthritis, myocardial infarction, diabetes, asthma, and stroke, whereas those predicting SI were stroke, hypertension, asthma, myocardial infarction, and rheumatoid arthritis.

Conclusion

The number and specific types of CDs predicted depression and SI among Korean older adults. These results may help enhance cooperation with clinicians treating CDs and promote the early detection and prevention of further SI and behaviors.

INTRODUCTION

Suicide among older adults is a critical public health concern worldwide, particularly as the average global population age continues to increase. As of 2021, individuals aged 65 and over account for approximately 10% of the global population and this proportion is expected to increase to around 16% by 2050 [1]. Suicide rates among older adults are the highest, 27.45 per 100,000 people, compared to other age groups worldwide [2]. Compared to other countries in OECD, Korea has the highest suicide rate, 41.7 per 100,000 people, among older adults. This rate is more than double and four times the Korean and global population average, respectively [3]. Various factors contribute to increased suicide risk among older adults, encompassing mental health issues such as depression and cognitive impairments; physical health challenges, including declining physical functions and chronic diseases (CDs); and socioeconomic factors such as economic poverty, social isolation, loneliness, and the loss of close relationships [2].

Suicide is a complex phenomenon that includes ideation, planning, and attempts, and individuals with suicidal ideation (SI) are at a higher risk of dying by suicide [4,5]. Early identification of individuals with SI is crucial for timely intervention to prevent both suicide and suicide attempts effectively. CDs have been reported as strong predictors of both depression and suicide, manifesting through increased symptom burden, pain, disability, and reduced quality of life [6,7]. Specifically, older adults with one or more physical CDs have shown an increased risk of depression and SI compared to those without them [8,9]. Meta-analytic studies comprising 40 and 19 studies revealed that, for individuals with two or more CDs, the risk of depression is two and three times higher, respectively, compared to those with one and no CD [10]. Various chronic conditions, including cancer, stroke, neurological disorders, and metabolic syndrome, have been reported as potential risk factors for depression [11]. Additionally, arthritis, heart disease, and chronic obstructive pulmonary disease have been associated with an increased risk of SI [12,13].

Machine learning (ML) techniques have emerged as efficient tools for predicting health outcomes, including mental health conditions such as depression and SI [14-16]. ML has demonstrated superior performance in predicting mental health outcomes compared to traditional statistical methods [6,17]. ML is increasingly applied to detect, predict, and support the treatment of suicide-related behaviors. Hekler et al. [18] found 4,002 articles that applied ML methods to predict psychological outcomes up to 2021. Long-term conditions such as diabetes, asthma, epilepsy, and chronic pain have been incorporated as one of the predictors of depression using ML models [19]. However, regarding SI prediction, only one ML-based study has included CDs as one of the 26 predictors, and no previous ML-based study has specifically examined the contribution of CDs in predicting SI [20]. Given the association between CDs and mental health outcomes and the role of CDs in increasing vulnerability to mental disorders through disease related distress, functional impairments, and medication adherence [11,12,14-16], incorporating CDs into ML-based predictive models may provide additional insights into the risk factors related to depression and SI.

The existing complex ML models are often criticized for not providing well-explained and interpretable predictions, which are essential for decision-making in healthcare [21]. Explainable artificial intelligence methods have been introduced to help make complex ML-based results more understandable for humans. The ranking of feature importance and Shapley Additive explanation (SHAP) have recently been adopted to enhance explainability [22].

Therefore, we developed predictive models focusing on CDs to identify depression and SI in community-dwelling older adults using five popular ML methods. Additionally, we assessed the independent predictive ability of the number and specific types of CDs by ranking feature importance and using SHAP results to predict depression and SI among older South Korean adults under the control of depressive symptoms.

METHODS

Data source

This study utilized cross-sectional survey data from the Korea National Health and Nutrition Examination Survey (KNHANES), conducted by the Korea Centers for Disease Control and Prevention. The KNHANES produced nationally representative data and comprised health interviews, health behavior surveys, nutritional surveys, and health examinations. To select a representative sample, a multistage cluster probability design based on administrative regions was adopted. We used the KNHANES data from 2013, 2015, 2017, and 2019, which exclusively maintained a consistent questionnaire about SI, asking, “Have you seriously considered suicide in the past year?”

Study population

Initially, 6,329 participants aged 65 and older who participated in 2013, 2015, 2017, and 2019 were selected. Subsequently, 863 individuals with missing values for depression and SI and 47 individuals with missing values for predictors such as cohabitation status, perceived daily stress, hypertension, and lifetime general cigarette use were excluded. Finally, data from 5,419 individuals were used for the analysis (Figure 1). This study was approved by the Korea University Institutional Review Board (KUIRB-2023-0197-01).

Figure 1.

Selection procedure of participants from the data of Korea National Health and Nutrition Examination Survey (KNHANES).

Dependent variables

Depression was employed as one of the dependent variables, measured using the question, “Have you experienced feeling sad or hopeless for 2 weeks or more continuously in the past year to the extent that it affected your daily life?” The depression group (n=825, 15.2%) and non-depression group (n=4,594, 84.8%) were assigned based on whether the responses were affirmative or negative.

SI experience served as another dependent variable, measured by the question, “Have you seriously considered suicide in the past year?” The SI (n=396, 7.3%) and non-SI (n=5,023, 92.7%) groups were allocated based on their responses to the questions on SI (Figure 1).

Predictive variables

Ten potential predictors were chosen from sociodemographic, socioeconomic, mental health, and health-related behavioral factors. The three sociodemographic factors were age, sex, and living arrangements (living alone vs. living with others). Three socioeconomic factors were education (less than elementary school, middle school, high school, college, or higher), household income (low, middle-low, middle-high, or high), and employment status. The two mental health factors were depression (experiencing depression for 2 or more consecutive weeks) and perceived daily stress (hardly noticeable, feeling a little bit, feeling a lot, or feeling very much). In addition, the two health-related behavioral factors were alcohol consumption (frequency of alcohol consumption in the past year) and lifetime cigarette smoking. Depression status was only used as a potential predictor of SI.

The CD statuses were determined based on self-reported answers about having specific CDs or having been diagnosed with specific CDs by a physician [23]. The originally chosen 25 CDs from KNHANES data were collapsed into 17 types, including hypertension, dyslipidemia, stroke, myocardial infarction, angina pectoris, osteoarthritis, rheumatoid arthritis, pulmonary tuberculosis, asthma, kidney failure, atopic dermatitis, diabetes, thyroid disease, hepatitis B, hepatitis C, liver cirrhosis, and cancers (stomach, liver, colorectal, breast, cervical, lung, and thyroid cancers, etc.).

The number of CDs among the 17 CDs, categorized as none, one, two, three, and four or more CDs, was included in Model 1 to predict depression and SI. In total, 13 and 14 predictors were used to predict depression and SI, respectively, including four dummy variables, based on the number of CDs and nine other predictors. Model 2 included 17 specific CDs and 27 and 26 predictor variables were used to predict SI and depression, respectively.

ML methods

For the prediction models, we employed logistic regression (LR) with ridge regularization and a support vector machine (SVM) with a linear kernel. Additionally, three tree-based and ensemble models, decision tree (DT), random forest (RF), and extreme gradient boosting (XGB) were employed.

The prediction models were developed according to the following procedure. The entire dataset was randomly split into training and test sets at a ratio of 8:2. A 10-fold cross-validation method was used to prevent overfitting of the prediction models. To avoid duplicates of the same observations, the training set and validation set were divided, and synthetic minority oversampling technique (SMOTE) was applied exclusively to the training set [24]. The class imbalance problem occurs when the size of the majority (negative class) is larger than that of the minority (positive class) [25]. Given the rarity of the minority class examples, which can impact ML prediction performance, SMOTE was applied to balance the datasets in the training set [26]. SMOTE is an oversampling technique that fills the gaps between minority class examples by creating “synthetic” examples, and it has a lower risk of overfitting compared to random oversampling, which simply duplicates minority class examples [27,28].

The optimal parameters for each ML method were tuned using a grid search. The best model was selected based on the area under the curve (AUC) value. Performance with an AUC below 0.6 was interpreted as poor, from 0.6 to 0.75 was moderate, from 0.75 to 0.9 was good, and above 0.9 was excellent [29]. In addition, model performance was evaluated using accuracy, sensitivity (recall), specificity, precision (or positive predictive value, PPV), Matthew’s correlation coefficient (MCC) and F1-score. For the appropriate evaluation of imbalanced data, balanced accuracy (BA), recommended sensitivity and specificity averages, and precision-recall AUC (PR-AUC) were calculated [30].

To assess the importance of the predictor variables, the permutation feature import (PFI) method was consistently applied to all the ML models. The PFI method involves randomly shuffling a specific value, breaking the relationship between the features and the target, and defining the decrease in model performance scores [31].

The SHAP value was utilized to interpret the impact of features on the predicted value, and the contribution of each feature was calculated and arranged by the magnitude of contribution [32]. The x-axis represents the SHAP value, with features that are marked with a red dot to the right of the x-axis indicating a positive impact on model predictions, while decreasing features are colored blue [22,32]. All ML prediction models were analyzed using Python (version 3.7.10; Python Software Foundation).

Basic statistical analysis was performed to assess the bivariate association between depression, SI, and predictors using the chi-square test in SAS 9.4 (SAS Institute). Statistical significance was set at p<0.05.

RESULTS

Characteristics of participants

The participants comprised 43.2% males and 56.8% females with an average age of 72.7 years (standard deviation: 5.0). Among the participants, 15.3% reported experiencing depression, and 7.3% reported a history of SI. Specifically, 10.8% of males and 18.6% of females reported experiencing depression, whereas 6.7% of males and 7.8% of females reported experiencing SI.

Regarding the number of CDs from the 17 CD types, the percentages of participants who had none, one, two, three, or four or more CDs were 22.5%, 33.4%, 27.8%, 13.0%, and 3.3%, respectively. The presence of both depression and SI was significantly and positively associated with the number of CDs, indicating that participants with more CDs had higher percentages of depression and SI (Supplementary Table 1).

Among the individual CDs, dyslipidemia, osteoarthritis, rheumatoid arthritis, asthma, and hepatitis C were significantly associated with depression (p<0.05). SI was associated with osteoarthritis, asthma, atopic dermatitis, and hepatitis B and C (p<0.05) (Supplementary Table 2).

Performance analysis for Model 1

In Model 1, which included the number of CDs as the main dependent variable, the prediction of depression using five different algorithms showed moderate to good performance, with an AUC ranging from 0.726 (XGB) to 0.772 (DT) and a PR-AUC ranging from 0.358 (SVM) to 0.391 (DT) (Table 1). The performances of five models in terms of AUC did not significantly differ. Sensitivity ranged from 0.394 (LR) to 0.618 (SVM), indicating that the algorithms accurately identified approximately 39.4% to 61.8% of cases with depression. The PPV ranged from 0.290 (XGB) to 0.471 (LR), resulting in a 1.91 to 3.1 times increase in predictive power compared with the baseline rate of 15.2% for depression. MCC ranged from 0.245 (RF) to 0.341 (DT), indicating correlation between predicted and actual depression cases; the F1-score ranged from 0.385 (XGB) to 0.680 (RF).

Performance of five machine learning models with number of chronic diseases (Model 1)

To predict SI in Model 1, all five algorithms showed similarly good performance, with AUC ranging from 0.754 (XGB) to 0.793 (DT) and PR-AUC ranging from 0.179 (SVM) to 0.294 (DT) (Table 1). Sensitivity ranged from 0.380 (RF) to 0.582 (LR), indicating that the algorithms accurately identified approximately 38.0% to 58.2% of cases with SI. The PPV ranged from 0.184 (LR) to 0.317 (DT), resulting in a 2.52 to 4.34 times increase in predictive power compared with the baseline rate of 7.3% for SI. MCC ranged from 0.213 (XGB) to 0.274 (DT), indicating correlation between predicted and actual SI cases; the F1-score ranged from 0.272 (XGB) to 0.327 (DT).

Feature importance and interpretability of depression and suicide ideation risk prediction

To predict depression in Model 1, the top five important variables and their respective frequencies in parentheses across the five models were stress (5), education (4), income (3), one CD (3), and age (3) (Supplementary Table 3). The rankings of the CDs ranged from 2nd (LR) to 10th (DT) for one CD, 6th (SVM) to 11th (RF) for two CDs, 9th (LR) to 13th (DT and XGB) for three CDs, and 10th (XGB) to 13th (RF) for four or more CDs.

To predict SI in Model 1, the top five important variables and their respective frequencies in parentheses across the five models were stress (5), depression (5), income (4), one CD (4), age (3), and education (2) (Supplementary Table 3). The rankings of the CDs ranged from 3rd (SVM) to 9th (DT) for one CD, 6th (SVM) to 14th (DT) for two CDs, 10th (LR) to 13th (RT) for three CDs, and 12th (LR) to 14th (RF and XGB) for four or more CDs.

The SHAP summary plots leverage individualized feature attributions to convey all aspects of a feature’s importance while remaining visually concise (Figure 2) [32,33]. Figure 2 illustrates that higher stress levels positively contribute to increased depression and SI. In Figure 2A, the variables that correlated positively with depression included stress, age, living arrangements, one CD, sex, two CDs, three CDs, lifetime cigarette smoking, and four CDs. In Figure 2B, the variables with a positive correlation with SI include stress, depression, age, lifetime cigarette smoking, living arrangements, three CDs, work status, two CDs, and four or more CDs.

Figure 2.

Shapley Additive explanation (SHAP) summary plot using a decision tree model: (A) prediction for depression and (B) prediction for suicidal ideation.

Performance analysis for Model 2

For the prediction of depression in Model 2, with 17 specific types of CDs, the five ML models showed moderate to good performance, with AUC ranging from 0.704 (XGB) to 0.768 (DT) and PR-AUC ranging from 0.329 (XGB) to 0.378 (DT) (Table 2). The sensitivity across the five algorithms ranged from 0.430 (DT) to 0.588 (LR), indicating that the algorithms accurately identified approximately 43.0% to 58.8% of depression cases. PPV ranged from 0.280 (XGB) to 0.452 (DT), resulting in a 1.84 to 2.97 times increase in the predictive power for depression. Regarding depression prediction in Model 2, the DT model exhibited better performance in predicting depression than did the other models, with the highest values for accuracy, BA, specificity, PPV, MCC, F1-score, and PR-AUC at 0.834, 0.668, 0.906, 0.452, 0.344, 0.441, and 0.378, respectively. The LR model exhibited the highest sensitivity (0.588). Based on the model performance metrics, the DT model was selected as the best-performing model for predicting depression in Model 2.

Performance of five machine learning models with specific types of chronic diseases (Model 2)

For the prediction of SI in Model 2, the AUC ranged from 0.750 (SVM) to 0.785 (DT) and the PR-AUC ranged from 0.212 (RF) to 0.277 (DT), demonstrating good performance (Table 2). The sensitivity across the five algorithms ranged from 0.418 (RF) to 0.595 (LR), indicating that the algorithms accurately identified approximately 41.8% to 59.5% of SI cases. The PPV ranged from 0.187 (SVM) to 0.296 (DT), resulting in a 2.56 to 4.05 times increase in predictive power for SI. To predict SI in Model 2, the DT model showed superior performance compared with the other models, with the highest values for accuracy, specificity, PPV, MCC, F1-score and PR-AUC of 0.880, 0.912, 0.296, 0.310, 0.363 and 0.277, respectively. The LR model exhibited the highest BA (0.701) and sensitivity (0.595). Based on the model performance metrics, the DT model was selected as the best-performing model for predicting the SI in Model 2.

Feature importance and interpretability of types of CDs in Model 2

To provide a clear explanation of the influence of CD type on prediction in Model 2, we present the ranking of 17 CD types. For the top five important variables among both CDs and other predictors of depression, the order of predictors was similar to that in Model 1. The variables with permutation importance values in parentheses are stress (0.219), household income (0.044), age (0.007), osteoarthritis (0.005), and sex (0.004). Among CD types, the top five CDs predicting depression were osteoarthritis (0.005), myocardial infarction (0.002), diabetes (0.001), asthma (0.001), and stroke (0.001) (Table 3).

Rank of important predictors in Model 2

Similarly, for the top five important SI variables, the rankings were comparable to those of Model 1. The variables with permutation importance values in parentheses are stress (0.227), depression (0.139), income (0.072), age (0.063), and alcohol consumption status (0.023). Among the types of CDs, the top five CDs predicting SI most effectively were stroke (0.011), hypertension (0.010), asthma (0.009), myocardial infarction (0.009), and rheumatoid arthritis (0.008) (Table 3).

DISCUSSION

This study compared the applicability of five ML predictive models, namely DT, RF, XGB, LR, and SVM, in predicting depression and SI among older Korean adults from a population-based survey. Prediction models for depression and SI were successfully constructed, and the model revealed that the number of CDs and CD types contributed to the risk of depression and SI in late life. Despite the importance of CDs in old age, ML prediction studies on SI are scarce. To the best of our knowledge, this is the first study to emphasize the impact of the number and types of CDs on the prediction of SI using ML models among older adults.

The five ML prediction models performed well, with AUC values ranging up to 0.772 and 0.793 for depression and SI, respectively, in the model including the number of CDs (Model 1), and with AUC values ranging up to 0.768 and 0.785 for depression and SI, respectively, in the model including types of CDs (Model 2). The model performance in this study is comparable to that in previous studies. A meta-study reported an average AUC of 0.78 in the prediction of depression, encompassing 19 ML studies of adequate quality out of the initial 744 studies published from 2010 to 2022. Of the 19 studies, 13 included CD variables in predicting depression [19]. Bernert et al. [14] performed a meta-analysis that included 87 studies on suicide and reported AUC values ranging from 0.61 to 0.97 among 19 ML studies predicting SI. However, none of the studies included CDs predicting SI. To predict SI, we found only one study considering CD; Na et al. [20] established a prediction model of SI with the experience of CD diagnosis as a predictor and reported an AUC value of 0.87 in South Korean population.

In this study, DT showed equal or even better performance than the other ML methods, although ensemble methods such as RF or XGB were expected to provide superior performance compared to a single DT model. Mohammadian et al.’s results [34] showing that DT had a greater AUC than LR and RF in the training sample are in concordance with this study.

Numerous CDs demonstrated a positive increase in the prediction of both depression and SI, as assessed using SHAP values. Having at least one CD contributed positively to predicting depression, whereas having at least two CDs showed a positive increase in the prediction of SI in this study. Hwang et al. [35 ]used traditional analytic methods to highlight a significant relationship between having two or more CDs and depression among Korean older adults. Compared to the population without CDs, men with CDs exhibited a 2.1-fold higher risk of depression, while women with CDs had a 1.9-fold higher risk [35]. Similar results were observed in the SI model, in which Smith et al. [36] reported significantly higher odds of SI associated with having one, two, three, and four or more CDs compared to having no CD. However, we did not find any positive correlation between one CD and SI by SHAP value analysis. The distinction between the risks posed by a single CD and multiple CDs in predicting SI remains unclear and warrants further research. A path analysis conducted by Zhu et al. [37] indicated that CDs directly or indirectly influence SI in older adults, either through direct pathways or via psychological distress. This suggests that the relationship between CD and SI is complex and dependent on the physical, emotional, and social contexts of older adults [37].

Among the 17 CD types, the top five CD predictors of depression were osteoarthritis, myocardial infarction, diabetes, asthma, and stroke. Consistent with our study, Jiang et al. [38] reported that individuals with a history of stroke had a 2.81 times higher probability of developing depression compared to those without CDs, and the coexistence of arthritis and stroke significantly elevated the risk of depression. Additionally, asthma showed a strong association with depression [38]. For the prediction of SI, the top five CD predictors were stroke, hypertension, asthma, myocardial infarction, and rheumatoid arthritis. This is consistent with Smith et al. [36], which associated arthritis, angina, asthma, and stroke with significantly higher odds for SI. Additionally, Joshi et al. [39] supported our findings, indicating that patients with cardiovascular disease, stroke, ischemic heart disease, and diabetes were more likely to develop SI. Furthermore, arthritis, angina, asthma, and stroke were reported to increase the likelihood of SI significantly [36].

In both depression and SI prediction models, stress was consistently identified as the most important predictor, followed by depression in the SI prediction model. Stress contributes to depression, a major factor in promoting SI in older adults, and the interaction between high levels of stress and depression increases the likelihood of SI significantly [40]. Furthermore, experiences of psychological stress are linked to the onset of depression in the general population, and individuals with depression are more vulnerable to adverse health impacts from stress [41,42]. Additionally, a statistically significant correlation was found between depression and SI [43].

This study has several limitations. First, the cross-sectional design inherently limits the ability of ML analysis to predict future depression and SI. Schafer et al. [44] argued that ML models using longitudinal data performed better in classifying suicide-related outcomes. Second, the use of self-reported single-item questions may introduce recall bias and potential misclassification [45]. Third, our study did not consider the severity of CDs, which can impact the accuracy level, as prior research has shown that both the number and severity of CDs influence depression [46]. Finally, because of the predetermined scope of the existing assessments, we could not perform comprehensive analyses to explore the contribution of realistic combinations of CDs and other health-related factors.

This study evaluated how CDs contribute to the prediction of depression and SI using various ML models in a representative sample of the Korean older adult population. While previous studies have included CDs as secondary predictors, our findings highlight their substantial contribution to predictive accuracy, particularly among older adults with multimorbidity. The number and specific types of CDs contribute to the prediction of depression and SI among older Korean adults. These results may help enhance the prevention, early detection, and treatment strategies for older adults with multimorbidity that tends to involve both CD conditions and mental disorders.

Supplementary Materials

The Supplement is available with this article at https://doi.org/10.30773/pi.2024.0106.

Supplementary Table 1.

General characteristics of study participants according to depression and SI

pi-2024-0106-Supplementary-Table-1.pdf
Supplementary Table 2.

Chronic disease characteristics of study participants according to depression and suicidal ideation (SI)

pi-2024-0106-Supplementary-Table-2.pdf
Supplementary Table 3.

Rank of important predictors in Model 1

pi-2024-0106-Supplementary-Table-3.pdf

Notes

Availability of Data and Material

The datasets generated or analyzed during the current study are available in the Korea National Health and Nutrition Examination Survey (KNHANES) repository (https://knhanes.kdca.go.kr/knhanes-%20/main.do).

Conflicts of Interest

The authors have no potential conflicts of interest to disclose.

Author Contributions

Conceptualization: Youngbin Seo, Hae-Young Kim. Methodology: Youngbin Seo, Hae-Young Kim, KiBong Choi. Formal analysis: KiBong Choi. Funding acquisition: Junesun Kim. Writing—original draft: Youngbin Seo. Writing—review & editing: Hae-Young Kim, Sunmi Song, Junesun Kim.

Funding Statement

This study was supported by a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (H21C0572).

Acknowledgments

None

References

1. United Nations. Department of Economic and Social Affairs, Population Division. World Population Prospects 2022 [Internet]; Available at: https://www.un.org/development/desa/pd/sites/www.un.org.development.desa.pd/files/wpp2022_summary_of_results.pdf. Accessed July 11, 2022.
2. De Leo D. Late-life suicide in an aging world. Nat Aging 2022;2:7–12.
3. Korea Foundation for Suicide Prevention. White paper on suicide prevention. Ministry of Health and Welfare & Korea Foundation for Suicide Prevention Seoul: Korea Foundation for Suicide Prevention; 2021.
4. Oh B, Yun JY, Yeo EC, Kim DH, Kim J, Cho BJ. Prediction of suicidal ideation among Korean adults using machine learning: a cross-sectional study. Psychiatry Investig 2020;17:331–340.
5. Neeleman J, de Graaf R, Vollebergh W. The suicidal process; prospective comparison between early and later stages. J Affect Disord 2004;82:43–52.
6. Erlangsen A, Stenager E, Conwell Y. Physical diseases as predictors of suicide in older adults: a nationwide, register-based cohort study. Soc Psychiatry Psychiatr Epidemiol 2015;50:1427–1439.
7. Noël PH, Williams JW Jr, Unützer J, Worchel J, Lee S, Cornell J, et al. Depression and comorbid illness in elderly primary care patients: impact on multiple domains of health status and well-being. Ann Fam Med 2004;2:555–562.
8. Bair MJ, Robinson RL, Katon W, Kroenke K. Depression and pain comorbidity: a literature review. Arch Intern Med 2003;163:2433–2445.
9. Xiong F, Wang L, Shen L, Guo W, Li S, Guan Q. The relationship between multimorbidity and suicidal ideation: a meta-analysis. J Psychosom Res 2020;138:110257.
10. Read JR, Sharpe L, Modini M, Dear BF. Multimorbidity and depression: a systematic review and meta-analysis. J Affect Disord 2017;221:36–46.
11. Han KM, Kim MS, Kim A, Paik JW, Lee J, Ham BJ. Chronic medical conditions and metabolic syndrome as risk factors for incidence of major depressive disorder: a longitudinal study based on 4.7 million adults in South Korea. J Affect Disord 2019;257:486–494.
12. Scott KM, Hwang I, Chiu WT, Kessler RC, Sampson NA, Angermeyer M, et al. Chronic physical conditions and their association with first onset of suicidal behavior in the world mental health surveys. Psychosom Med 2010;72:712–719.
13. Goodwin RD. Is COPD associated with suicide behavior? J Psychiatr Res 2011;45:1269–1271.
14. Bernert RA, Hilberg AM, Melia R, Kim JP, Shah NH, Abnousi F. Artificial intelligence and suicide prevention: a systematic review of machine learning investigations. Int J Environ Res Public Health 2020;17:5929.
15. Dwyer DB, Falkai P, Koutsouleris N. Machine learning approaches for clinical psychology and psychiatry. Annu Rev Clin Psychol 2018;14:91–118.
16. Helgeson VS, Zajdel M. Adjusting to chronic health conditions. Annu Rev Psychol 2017;68:545–571.
17. Jiang F, Jiang Y, Zhi H, Dong Y, Li H, Ma S, et al. Artificial intelligence in healthcare: past, present and future. Stroke Vasc Neurol 2017;2:230–243.
18. Heckler WF, de Carvalho JV, Barbosa JLV. Machine learning for suicidal ideation identification: a systematic literature review. Comput Human Behav 2022;128:107095.
19. Nickson D, Meyer C, Walasek L, Toro C. Prediction and diagnosis of depression using machine learning with electronic health records data: a systematic review. BMC Med Inform Decis Mak 2023;23:271.
20. Na KS, Geem ZW, Cho SE. The development of a suicidal ideation predictive model for community-dwelling elderly aged >55 years. Neuropsychiatr Dis Treat 2022;18:163–172.
21. Knapič S, Malhi A, Saluja R, Främling K. Explainable artificial intelligence for human decision support system in the medical domain. Mach Learn Knowl Extr 2021;3:740–770.
22. Nordin N, Zainol Z, Mohd Noor MH, Chan LF. An explainable predictive model for suicide attempt risk using an ensemble learning and Shapley Additive Explanations (SHAP) approach. Asian J Psychiatr 2023;79:103316.
23. Huh Y, Nam GE, Kim YH, Lee JH. Relationships between multimorbidity and suicidal thoughts and plans among Korean adults. J Clin Med 2019;8:1094.
24. Santos MS, Soares JP, Abreu PH, Araujo H, Santos J. Cross-validation for imbalanced datasets: Avoiding overoptimistic and overfitting approaches [research frontier]. IEEE Comput Intell Mag 2018;13:59–76.
25. Amin A, Anwar S, Adnan A, Nawaz M, Howard N, Qadir J, et al. Comparing oversampling techniques to handle the class imbalance problem: a customer churn prediction case study. IEEE Access 2016;4:7940–7957.
26. Anis M, Ali M. Investigating the performance of smote for class imbalanced learning: a case study of credit scoring datasets. Eur Sci J 2017;13:340–353.
27. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 2002;16:321–357.
28. Batista GEAPA, Prati RC, Monard MC. A study of the behavior of several methods for balancing machine learning training data. SIGKDD Explor 2004;6:20–29.
29. de Hond AA, Steyerberg EW, van Calster B. Interpreting area under the receiver operating characteristic curve. Lancet Digit Health 2022;4:e853–e855.
30. Tharwat A. Classification assessment methods. Applied Computing and Informatics 2020;17:168–192.
31. Brownlee J. Data preparation for machine learning: data cleaning, feature selection, and data transforms in Python Machine Learning Mastery; 2020.
32. Lundberg SM, Lee SI. A unified approach to interpreting model predictions. arXiv:1705.07874 [Preprint] May 22, 2017. Available at: https://doi.org/10.48550/arXiv.1705.07874. Accessed December 18, 2023.
33. Lundberg SM, Erion GG, Lee SI. Consistent individualized feature attribution for tree ensembles. arXiv:1802.03888 [Preprint] February 12, 2018. Available at: https://doi.org/10.48550/arXiv.1802.03888. Accessed September 24, 2023.
34. Mohammadian MK, Omidi T, Faradmal J, Poorolajal J. A comparison of random forest and decision tree for suicide ideation classification. Research Square [Preprint] September 15, 2020. Available at: https://doi.org/10.21203/rs.3.rs-66839/v1. Accessed January 29, 2024.
35. Hwang S, Nam JY, Ahn JH, Park S. Gender differences in the association between multimorbidity and depression in older Korean adults: an analysis of data from the National Survey of Older Koreans (2011-2017). Epidemiol Health 2022;44:e2022049.
36. Smith L, Shin JI, López Sánchez GF, Kostev K, Jacob L, Tully MA, et al. Physical multimorbidity, suicidal ideation, and suicide attempts among adults aged ≥50 years from low- and middle-income countries. Int J Geriatr Psychiatry 2023;38:e5873.
37. Zhu J, Xu L, Sun L, Li J, Qin W, Ding G, et al. Chronic disease, disability, psychological distress and suicide ideation among rural elderly: results from a population survey in Shandong. Int J Environ Res Public Health 2018;15:1604.
38. Jiang Ch, Zhu F, Qin T. Relationships between chronic diseases and depression among middle-aged and elderly people in China: a prospective study from CHARLS. Curr Med Sci 2020;40:858–870.
39. Joshi P, Song HB, Lee SA. Association of chronic disease prevalence and quality of life with suicide-related ideation and suicide attempt among Korean adults. Indian J Psychiatry 2017;59:352–358.
40. Kim HK, Kim JY, Kim JH, Hyoung HK. Decision tree identified risk groups with high suicidal ideation in South Korea: a population-based study. Public Health Nurs 2016;33:99–106.
41. Mazurka R, Wynne-Edwards KE, Harkness KL. Stressful life events prior to depression onset and the cortisol response to stress in youth with first onset versus recurrent depression. J Abnorm Child Psychol 2016;44:1173–1184.
42. Hammen CL. Stress and depression: old questions, new approaches. Current Opin Psychol 2015;4:80–85.
43. Kim BJ, Kihl T. Suicidal ideation associated with depression and social support: a survey-based analysis of older adults in South Korea. BMC Psychiatry 2021;21:409.
44. Schafer KM, Kennedy G, Gallyer A, Resnik P. A direct comparison of theory-driven and machine learning prediction of suicide: a meta-analysis. PLoS One 2021;16:e0249833.
45. Lee K. Relative handgrip strength in relation to depressive mood and suicidal ideation in Koreans using the 2015 KNHANES data. J Musculoskelet Neuronal Interact 2018;18:333–338.
46. Eisner MD, Katz PP, Lactao G, Iribarren C. Impact of depressive symptoms on adult asthma outcomes. Ann Allergy Asthma Immunol 2005;94:566–574.

Article information Continued

Figure 1.

Selection procedure of participants from the data of Korea National Health and Nutrition Examination Survey (KNHANES).

Figure 2.

Shapley Additive explanation (SHAP) summary plot using a decision tree model: (A) prediction for depression and (B) prediction for suicidal ideation.

Table 1.

Performance of five machine learning models with number of chronic diseases (Model 1)

DT RF LR SVM XGB
Prediction for depression
 Accuracy 0.827 0.741 0.840 0.715 0.726
 Balanced accuracy 0.692 0.651 0.658 0.675 0.661
 Sensitivity 0.497 0.521 0.394 0.618 0.576
 Specificity 0.886 0.780 0.921 0.732 0.746
 Precision (PPV) 0.439 0.299 0.471 0.293 0.290
 AUC 0.772 0.729 0.770 0.732 0.726
 MCC 0.341 0.245 0.267 0.270 0.252
 F1-score 0.430 0.680 0.396 0.398 0.385
 PR-AUC 0.391 0.370 0.378 0.358 0.371
Prediction for suicidal ideation
 Accuracy 0.887 0.869 0.781 0.795 0.811
 Balanced accuracy 0.700 0.644 0.690 0.680 0.653
 Sensitivity 0.481 0.380 0.582 0.544 0.468
 Specificity 0.918 0.907 0.797 0.815 0.838
 Precision (PPV) 0.317 0.244 0.184 0.188 0.265
 AUC 0.793 0.760 0.787 0.764 0.754
 MCC 0.274 0.229 0.234 0.225 0.213
 F1-score 0.327 0.291 0.280 0.276 0.272
 PR-AUC 0.294 0.184 0.223 0.179 0.199

DT, decision tree; RF, random forest; LR, logistic regression; SVM, support vector machine; XGB, extreme gradient boosting; PPV, positive predictive value; AUC, area under the curve; MCC, Matthew’s correlation coefficient; PR-AUC, precision-recall AUC.

Table 2.

Performance of five machine learning models with specific types of chronic diseases (Model 2)

DT RF LR SVM XGB
Prediction for depression
 Accuracy 0.834 0.744 0.712 0.715 0.728
 Balanced accuracy 0.668 0.648 0.661 0.656 0.636
 Sensitivity 0.430 0.509 0.588 0.570 0.503
 Specificity 0.906 0.786 0.734 0.741 0.768
 Precision (PPV) 0.452 0.299 0.284 0.283 0.280
 AUC 0.768 0.725 0.717 0.714 0.704
 MCC 0.344 0.252 0.249 0.242 0.241
 F1-score 0.441 0.387 0.383 0.378 0.370
 PR-AUC 0.378 0.345 0.356 0.350 0.329
Prediction for suicidal ideation
 Accuracy 0.880 0.848 0.792 0.781 0.814
 Balanced accuracy 0.690 0.650 0.701 0.696 0.667
 Sensitivity 0.468 0.418 0.595 0.595 0.494
 Specificity 0.912 0.882 0.807 0.796 0.839
 Precision (PPV) 0.296 0.217 0.195 0.187 0.194
 AUC 0.785 0.776 0.760 0.750 0.755
 MCC 0.310 0.224 0.251 0.241 0.222
 F1-score 0.363 0.286 0.294 0.284 0.279
 PR-AUC 0.277 0.212 0.237 0.221 0.215

DT, decision tree; RF, random forest; LR, logistic regression; SVM, support vector machine; XGB, extreme gradient boosting; PPV, positive predictive value; AUC, area under the curve; MCC, Matthew’s correlation coefficient; PR-AUC, precision-recall AUC.

Table 3.

Rank of important predictors in Model 2

DT RF LR SVM XGB
Prediction for depression
 Osteoarthritis 1 4 9 11 13
 Myocardial infarction 2 11 13 9 3
 Diabetes 3 2 7 16 6
 Asthma 4 10 6 14 11
 Stroke 5 3 1 1 1
 Cancer* 6 6 17 17 17
 Thyroid disease 7 12 8 15 10
 Liver cirrhosis 8 15 5 5 9
 Hypertension 9 1 15 6 14
 Dyslipidemia 10 5 14 7 15
 Hepatitis C 11 17 4 8 16
 Hepatitis B 12 14 2 13 4
 Angina pectoris 13 8 12 10 8
 Atopic dermatitis 14 13 16 3 5
 Rheumatoid arthritis 15 9 10 12 7
 Tuberculosis 16 7 3 2 2
 Kidney failure 17 16 11 4 12
Prediction for suicidal ideation
 Stroke 1 11 7 6 4
 Hypertension 2 1 16 16 6
 Asthma 3 9 11 11 5
 Myocardial infarction 4 13 3 7 7
 Rheumatoid arthritis 5 8 2 2 1
 Atopic dermatitis 6 14 10 10 3
 Angina pectoris 7 6 1 1 2
 Dyslipidemia 8 4 12 12 14
 Hepatitis C 9 16 14 14 10
 Cancer* 10 5 15 15 8
 Kidney failure 11 15 8 8 9
 Hepatitis B 12 12 4 4 11
 Osteoarthritis 13 3 9 9 12
 Tuberculosis 14 7 13 13 13
 Thyroid disease 15 10 6 5 15
 Diabetes 16 2 5 3 16
 Liver cirrhosis 17 17 17 17 17
*

cancer includes gastric, colon, liver, lung, thyroid, breast, cervical, and others.

DT, decision tree; RF, random forest; LR, logistic regression; SVM, support vector machine; XGB, extreme gradient boosting.