Interpretable artificial intelligence model for predicting heart failure severity after acute myocardial infarction

Guo, Chenglong; Gao, Binyu; Han, Xuexue; Zhang, Tianxing; Tao, Tianqi; Xia, Jinggang; Liu, Honglei

doi:10.1186/s12872-025-04818-1

Research
Open access
Published: 12 May 2025

Interpretable artificial intelligence model for predicting heart failure severity after acute myocardial infarction

Chenglong Guo¹^na1,
Binyu Gao^2,3^na1,
Xuexue Han⁴,
Tianxing Zhang⁴,
Tianqi Tao⁵,
Jinggang Xia⁴ &
…
Honglei Liu^3,6

BMC Cardiovascular Disorders volume 25, Article number: 362 (2025) Cite this article

274 Accesses
Metrics details

Abstract

Background

Heart failure (HF) after acute myocardial infarction (AMI) is a leading cause of mortality and morbidity worldwide. Accurate prediction and early identification of HF severity are crucial for initiating preventive measures and optimizing treatment strategies. This study aimed to develop an interpretable artificial intelligence (AI) model for HF severity prediction using multidimensional clinical data.

Methods

This study included data from 1574 AMI patients, including medical history, clinical features, physiological parameters, laboratory test, coronary angiography and echocardiography results. Both deep learning (TabNet, Multi-Layer Perceptron) and machine learning (Random Forest, XGboost) models were employed in constructing model. Additionally, the Shapley Additive Explanation (SHAP) method was used to elucidate clinical factors importance and enhance model interpretability. A web platform (https://prediction-killip-gby.streamlit.app/) was also developed to facilitate clinical application.

Results

Among the models, TabNet demonstrated the best performance, achieving an AUROC of 0.827 for KILLIP four-class classification and 0.831 for KILLIP binary classification. Key clinical factors such as GRACE score, NT-pro BNP, and TIMI score were highly correlated with KILLIP classification, aligning with established clinical knowledge.

Conclusions

By leveraging easily accessible multidimensional data, this model enables accurate early prediction and personalized diagnosis of HF risk and severity following AMI. It supports early clinical intervention and improves patient outcomes, offering significant clinical application value.

Clinical trial number

Not applicable.

Peer Review reports

Introduction

Acute myocardial infarction (AMI), commonly known as a heart attack, remains a leading cause of mortality and morbidity worldwide [1]. AMI often leads to the development of heart failure (HF), a debilitating condition with high morbidity and mortality rates [2]. Early identification of patients at risk of developing HF after AMI is crucial for initiating preventive measures and optimizing treatment strategies. The Killip classification, a widely used bedside tool, assesses clinical signs of HF after AMI to stratify patients into different risk categories [3]. However, this classification relies on subjective assessment and may not fully capture the complex interplay of factors contributing to HF development. Leveraging multidimensional data for HF severity prediction could enhance early identification of high-risk patients and guide timely clinical interventions.

In recent years, artificial intelligence (AI) methods, particularly deep learning models, have emerged as powerful tools for analyzing large and complex clinical datasets. These models can identify subtle patterns and generate highly accurate predictions, often surpassing traditional methods in precision [4,5,6]. In the context of AMI, deep learning can leverage diverse patient data (e.g., clinical features, biomarkers, imaging data) to enhance outcome prediction accuracy [6,7,8,9]. By capturing intricate relationships within the data, deep learning approaches, such as artificial neural networks, offer insights that conventional statistical methods might overlook [10]. Training these models on large AMI cohorts allows them to uncover hidden patterns and interactions associated with HF risk [11]. Moreover, deep learning’s ability to handle high-dimensional data allows for the incorporation of a wide range of variables, potentially leading to more comprehensive and accurate risk prediction models.

While recent studies have shown promising results in using machine learning to predict HF after AMI [7, 12], few studies have focused on the specific task of predicting HF combined in conjunction with Killip classifications. Such a focus could provide valuable insights into the early detection of HF after AMI and support personalized treatment strategies. Additionally, existing HF prediction models often lack interpretability, making it challenging to explain predictions and analyze feature importance. The recent introduction of TabNet [13], a deep learning algorithm specifically designed for tabular data, offers new possibilities for improving the performance of clinical tabular data processing.

In this study, we utilized easily accessible multidimensional data obtained during hospitalization-including medical history, clinical features, physiological parameters, and results from laboratory test, coronary angiography combined with echocardiography. We leveraged both deep learning (TabNet, Multi-Layer Perceptron [14,15,16](MLP)) and machine learning models (Random Forest (RF) [17], XGboost [18]) to predict HF severity in patient after AMI, enabling accurate and personalized identification of HF severity. Additionally, we employed Shapley Additive Explanation [19] (SHAP) to elucidate risk factor importance and improve model interpretability. We also developed a web platform to facilitate clinical application.

Method

Dataset

A retrospective study was conducted on 2993 patients diagnosed with type I AMI at Xuanwu Hospital, Capital Medical University, between January 2017 and December 2022. It was authorized by the Ethics Committee of Xuanwu Hospital, Capital Medical University with the approval document number (2022–129) and was processed according to the principles of the Declaration of Helsinki. All enrolled patients signed informed consent forms.

We selected several factors that could potentially influence the development of heart failure in AMI patients. These factors included demographic characteristics such as age, sex, and body mass index (BMI); clinical scores like the GRACE and TIMI risk scores; and medical history, including hypertension, atrial fibrillation (AF), diabetes, hyperlipidemia, cerebrovascular disease (CVD), peptic ulcer (PU), previous myocardial infarction, stent implantation, and coronary artery bypass grafting (CABG). Smoking status was also considered, including whether patients had quit smoking. In addition, we analyzed post-admission blood test results, which included white blood cell count, neutrophils, lymphocytes, monocytes, hemoglobin, platelet count, blood glucose, alanine aminotransferase (ALT), aspartate aminotransferase (AST), creatinine clearance rate (CCR), total cholesterol (TC), low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), triglycerides (TG), uric acid (UA), hemoglobin A1c (HbA1c), high-sensitivity C-reactive protein (hs-CRP), interleukin-6 (IL-6), N-terminal pro-brain natriuretic peptide (NT-pro BNP), and peak troponin I (TNI). Echocardiographic indicators such as left ventricular ejection fraction (LVEF), left atrial diameter, and left ventricular end-diastolic diameter (LVEDD) were also assessed. Additionally, the length of hospitalization was recorded. Patients missing critical features such as NT-proBNP, LVEF, or GRACE/TIMI scores were excluded from the database, resulting in a final cohort of 1,574 patients.

Based on the Killip classification for HF in AMI, patients were categorized into four groups. This classification system is widely used in clinical cardiology and provides a rapid bedside assessment of HF severity. The Killip class definitions used in this study were summarized in Table 1.

Table 1 The Killip class definitions in AMI patients

Full size table

Model construction and comparison

Both machine learning and deep learning models were applied to the AMI data. Considering the property of medical tabular data, the machine learning models included RF and XGBoost, while the deep learning models comprised a MLP and TabNet. The RF and MLP models were implemented using the Python scikit-learn package, XGBoost using the XGBoost package, and TabNet using PyTorch library. The flow chart of the study design was shown in Fig. 1.

RF and XGBoost are commonly used tree-based machine learning models. MLP is a type of feedforward artificial neural network consisting of multiple layers of nodes, where each node is fully connected to the next layer, allowing for non-linear modeling of complex relationships. The Attentive Interpretable Tabular Learning (TabNet) model is a multi-stage deep learning model introduced by Google Cloud AI and applies a sequential instance-wise attention mechanism allowing it to inherently select the most salient set of radiomic features at different decision steps within its architecture. TabNet specifically designed for tabular data, combining interpretability with state-of-the-art performance through sequential attention mechanisms that decide which features to utilize at each decision step.

A total of 41 features were utilized to develop the prediction models. To build a standardized feature space across all models, continuous variables were normalized using a standard scaler such that they have a mean of 0 and a variance of 1. Given the imbalance in KILLIP classes, we employed a stratified partitioning method to split the dataset into two subsets: a training dataset (80%) and a test dataset (20%). The distribution of KILLIP classes in each subset was consistent with that in the original dataset. During training, to enhance the model’s ability to identify minority class samples, we applied the synthetic minority over-sampling technique (SMOTE) to interpolate the minority KILLIP classes (KILLIP 2, KILLIP 3, KILLIP 4).

For each model, we employed a grid search approach for parameter tuning until the optimal parameter combination was identified. At each tuning step, fivefold cross-validation was performed using stratified shuffles of the training dataset to estimate the parameters of each model and evaluate their predictive performance. To consider class imbalance and classification accuracy, with specific justification, the evaluation metrics included the area under the receiver operating characteristic curve (AUROC), the area under the precision-recall curve (AUPRC), precision, and F1 scores. For each complication, the model with the highest mean AUROC was selected as the best-performing model.

Model interpretation

Model predictions were interpreted using SHAP, a model-agnostic explanation technique. SHAP values for each feature were calculated to represent the contribution of each feature to the predicted risk of a complication. The SHAP method provided both global and local explanations. Global explanations offered consistent and accurate attribution values for each feature, demonstrating the relationships between input features and KILLIP classification. Local explanations provided insights into specific predictions for individual cases by inputting the corresponding data.

Results

Patient characteristics

A total of 1574 patients were identified during the study period. Among them, 1005 patients (63.8%) were classified as KILLIP 1, 468 patients (29.7%) as KILLIP 2, 72 patients (4.6%) as KILLIP 3 and 29 patients (1.8%) as KILLIP 4. The demographic and clinical characteristics across KILLIP classes are summarized in Table 2. The results indicated that compared to patients in KILLIP class 1, those in KILLIP classes 2–4 were more likely to be female, older, and have a history of hypertension, diabetes, atrial fibrillation (AF), myocardial infarction, and stent placement. Additionally, these patients exhibited higher GRACE and TIMI risk scores, elevated white blood cell, neutrophil, and monocyte counts, as well as increased levels of HbA1c, hs-CRP, IL-6, ALT, AST, UA, NT-pro BNP, left atrial diameter, and LVEDD. Conversely, patients in KILLIP classes 2–4 had lower CCR, LVEF, TC, TG, and hemoglobin levels. These findings demonstrate that female gender, advanced age, elevated inflammation markers, pre-existing cardiac conditions, diabetes, and renal dysfunction are significantly associated with an increased risk of HF following AMI. Moreover, relatively low levels of TC, TG, and hemoglobin were also found to contribute to the increased risk of HF after AMI.

Table 2 Comparison of demographic, clinical characteristics, and outcomes across KILLIP classes in the dataset

Full size table

Model performance

To gain a more comprehensive understanding and improve the prediction of HF severity following AMI, we employed machine learning models (RF and XGBoost) and deep learning models (TabNet and MLP) for two tasks: four-class classification (KILLIP 1, 2, 3, 4) and binary classification (KILLIP 1 vs. KILLIP 2, 3, 4). The discriminative performance results are presented in Tables 3 and 4, respectively, with ROC curves illustrated in Fig. 1. Performance metrics of different models for each KILLIP class in four-class classification and binary classification were presented in Appendix Tables 2, 3, 4 and 5 and Appendix Tables 6, 7, 8 and 9, respectively. Appendix Table 10 shown the final hyperparameters of all the models.

As shown in Table 3; Fig. 2, among the models considered, the TabNet model achieved the highest predictive performance for KILLIP classification (four-class classification) with an AUROC of 0.827, followed by the MLP model and the RF model. The XGBoost model demonstrated the lowest performance. Overall, the deep learning models (MLP, TabNet) outperformed the machine learning models (RF, XGBoost).

Similarly, in the binary classification task (Table 4), the same trend was observed, with the TabNet model again delivering the best performance with an AUROC of 0.831.

Table 3 Performance of machine learning and deep learning models in predicting KILLIP class (four-class classification). Fivefold cross-validation was performed in all the 1574 patients

Full size table

Table 4 Performance of machine learning and deep learning models in predicting KILLIP class (binary classification). Fivefold cross-validation was performed in all the 1574 patients

Full size table

Model interpretation

As demonstrated in the SHAP summary plots of the TabNet model for four-class KILLIP classification (Fig. 3A and B), feature contributions were evaluated based on average SHAP values, presented in descending order. SHAP summary plots of other models were presented in Appendix Fig. 3 – Fig. 3. The GRACE and TIMI risk scores, NT-pro BNP, creatinine, and length of hospitalization had a negative impact on predicting “KILLIP 1,” indicating that higher values of these features decreased the likelihood of a patient being classified as KILLIP 1. Conversely, LVEF and CCR exhibited a positive effect, increasing the probability of KILLIP 1 classification.

In addition, the SHAP dependence plot provides insight into how individual features influence model predictions. The relationship between actual values and SHAP values for these features is illustrated in Fig. 4, where SHAP values greater than zero correspond to a positive class prediction, indicating a higher KILLIP grade. For example, patients with a GRACE score ≤ 150 or LVEF ≥ 48% had SHAP values above zero, pushing the model’s decision toward the “KILLIP 1” class. Similarly, low actual values of NT-pro BNP (≤ 2500) and TIMI scores (≤ 3) also contributed to the prediction of KILLIP 1. For binary KILLIP classification, the corresponding SHAP summary plots for the TabNet model are provided in Appendix Figs. 4 and 5.

Furthermore, local explanations analyzed how specific predictions were made for individual patients using personalized input data. The raw data for one patient is presented in Appendix Table 1. Figure 5A, B, C, and D depict predictions for a patient with KILLIP classifications 1–4, respectively. For instance, Fig. 5A1–A4 shows this patient’s probabilities for KILLIP classes 1, 2, 3, and 4, as predicted by the TabNet model: 92.4% (Fig. 5A1), 0.07% (Fig. 5A2), 0.005% (Fig. 5A3), and 0.001% (Fig. 5A4). For this patient, factors such as GRACE risk score, LVEF, and CCR strongly influenced the prediction toward the “KILLIP 1” class. Figure 6. Shown the force SHAP value plot for the test set. The corresponding SHAP local explanations for binary KILLIP classification are shown in Appendix Table 1 and Appendix Figs. 6, 7 and 8.

Convenient application for clinical utility

As illustrated in Fig. 7, we have integrated the KILLIP prediction model into a web platform to improve its clinical utility. By entering the actual values for all required features, the application can automatically predict the KILLIP classification for patients after AMI. The web platform is available online at https://prediction-killip-gby.streamlit.app/.

Discussion

In this study, we employed four machine learning and deep learning algorithms to predict the risk of HF after AMI using multidimensional clinical data. These computational methods are well-suited to managing complex and extensive datasets, making them highly effective for developing clinical prediction models. Their ability to handle diverse data types and identify intricate relationships between variables allows for improved accuracy in clinical risk predictions. By integrating easily accessible multidimensional clinical data with advanced machine learning and deep learning algorithms, we have enhanced the potential of clinical prediction tools in identifying patients at risk for post-AMI HF.

Indeed, several established scoring systems such as the MAGGIC risk score, Framingham criteria, and ADHERE risk tree have been developed to predict the onset or outcomes of heart failure. However, these tools are typically designed for use in broader heart failure populations (including both de novo HF and chronic HF), rather than specifically for acute heart failure prediction in the context of acute myocardial infarction. In contrast, our study aimed to predict the severity of heart failure during hospitalization following AMI, using the Killip classification as the outcome metric—a standard prognostic tool in AMI settings. While we did incorporate well-validated cardiovascular scores such as the GRACE and TIMI scores as input features, both of which have shown predictive value in post-AMI prognosis, including heart failure risk, we acknowledge the value of referencing other HF-specific risk tools for broader contextualization.

Among the models tested, the TabNet model achieved the highest AUROC value for KILLIP classification. TabNet effectively combines the strengths of deep learning and tree-based models, employing a sequential attention mechanism to select crucial features at each decision step. Prior research has demonstrated the TabNet method’s excellent predictive value in medical contexts.

In medical data, the issue of class imbalance is common. To address this, we utilized the SMOTE over-sampling method to generate synthetic samples similar to the original ones, thereby increasing data diversity and enhancing the model’s performance and generalizability. This approach better reflects real-world medical scenarios and provides more reliable support for research and clinical decision-making.

While machine learning and deep learning models are often regarded as “black boxes”, their lack of interpretability can be a challenge in clinical settings. To improve transparency, we applied the SHAP method, which offers both global and local explanations of model predictions. SHAP helps elucidates the model’s overall functionality and details how specific predictions are made for individual patients. By highlighting key clinical variables contributing to the risk of HF, SHAP visualizations can assist practitioners in identifying key factors early.

Our SHAP analysis revealed that higher GRACE risk score, TIMI risk score, age, and elevated levels of NT-pro BNP, creatinine, hs-CRP, and IL-6 were associated with an increased risk of HF after AMI. Conversely, higher CCR and LVEF were linked to a decreased risk. Although GRACE and TIMI scores were originally designed to predict mortality and recurrent myocardial infarction in acute coronary syndrome patients, they can also serve as indirect indicators of HF risk, as shown in previous studies [20, 21]. HF is a common complication following AMI, especially in cases with significant myocardial damage or additional cardiovascular risk factors. Age, a well-established risk factor of HF, is likely related to age-related changes in cardiac and vascular function [22, 23]. NT-pro BNP, a biomarker reflecting cardiac workload and function, is essential for diagnosing and prognosticating HF [24]. Additionally, elevated levels of inflammatory markers such as IL-6 and hs-CRP, have been linked to increased HF risk after AMI [25, 26]. Impaired kidney function, as indicated by elevated creatinine levels and decreased CCR, is closely related to HF development after AMI [27]. Finally, a decreased LVEF, which indicates impaired heart pumping function, is strongly associated with the development of HF and poor prognosis post-AMI. In summary, monitoring these factors is essential for early identification of high-risk patients and timely intervention to improve outcomes. However, it is critical to acknowledge that SHAP is a post hoc interpretability method. Its values reflect feature importance within the model’s decision-making framework rather than genuine causal relationships or clinical pathophysiological mechanisms [28, 29].

KILLIP grading is traditionally based on changes in blood pressure and lung auscultation during AMI hospitalization. However, our study incorporated multidimensional clinical indicators into a predictive model, providing a more accurate assessment of HF risk than the KILLIP grading alone. The development of predictive models like ours enhances the comprehensive clinical thinking of cardiovascular physicians. To further facilitate clinical use, we are making the predictive models accessible through a web platform, aiming to promote widespread clinical application and adoption.

There are several limitations to this study. First, the data were derived from a single-center dataset, and external validation with multi-center data is lacking. Second, the reliance on SHAP for interpretability introduces methodological constraints. As a post hoc explanation tool, SHAP values do not establish causality and may prioritize variables that are statistically predictive within the model rather than clinically actionable targets. This could lead to over-reliance on model-derived associations without rigorous biological validation. Third, the data modalities included in the study were somewhat limited. Specifically, our analysis focused primarily on structured clinical variables (e.g., laboratory biomarkers, risk scores, and demographic features), while omitting unstructured data modalities such as imaging data, longitudinal follow-up records, and genomic or proteomic biomarkers. This may restrict the model’s ability to capture subtle pathophysiological interactions that could further refine HF risk stratification. Future research will focus on expanding the dataset by collecting more comprehensive clinical data from AMI patients across multiple institutions to refine and improve the accuracy of the prediction model.

Conclusion

By harnessing the power of artificial intelligence, we have developed a KILLIP classification prediction model to assess the risk of HF after AMI. This model enhances risk stratification, optimizes treatment strategies, guides early clinical interventions, reduces the incidence of post-AMI heart failure, and improves patient outcomes, demonstrating significant clinical utility. Its clinical utility is further demonstrated through its integration into a user-friendly web platform, accessible to both remote and local healthcare settings. The platform’s visual design framework ensures that the predictive tool is both practical and actionable across a range of clinical environments.

Data availability

The data and materials can be obtained from the authors upon reasonable request.

Abbreviations

HF:: Heart failure
AMI:: Acute myocardial infarction
AI:: Artificial intelligence
SHAP:: Shapley Additive Explanation
RF:: Random Forest
MLP:: Multi-Layer Perceptron

References

Samsky MD, Morrow DA, Proudfoot AG, Hochman JS, Thiele H, Rao SV. Cardiogenic shock after acute myocardial infarction. JAMA. 2021;1840. https://doiorg.publicaciones.saludcastillayleon.es/10.1056/NEJM199110173251601.
Hernandez AF, Udell JA, Jones WS, Anker SD, Petrie MC, Harrington J, Mattheus M, Seide S, Zwiener I, Amir O, Bahit MC, Bauersachs J, Bayes-Genis A, Chen Y, Chopra VK, Figtree A, Ge G, Goodman JG, Gotcheva S, Goto N, Gasior S, Jamal T, Januzzi W, Jeong JL, Lopatin MH, Lopes Y, Merkely RD, Parikh B, Parkhomenko PB, Ponikowski A, Rossello P, Schou X, Simic M, Steg D, Szachniewicz PG, van der Meer J, Vinereanu P, Zieroth D, Brueckmann S, Sumin M, Bhatt M, Butler DL. Effect of empagliflozin on heart failure outcomes after acute myocardial infarction: insights from the EMPACT-MI trial. Circulation. 2024;149(21):1627–38. https://doiorg.publicaciones.saludcastillayleon.es/10.1161/CIRCULATIONAHA.124.069217.
Article CAS PubMed PubMed Central Google Scholar
Hori Y, Sakakura K, Jinnouchi H, Taniguchi Y, Tsukui T, Hatori M, Kasahara T, Watanabe Y, Yamamoto K, Seguchi M, Fujita H. Determinants of serious in-hospital complications in patients with Killip class 1/2 ST-segment elevation myocardial infarction who underwent primary percutaneous coronary intervention. Heart Vessels. 2024 Mar;18. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00380-024-02382-w.
Deo RC. Machine learning in medicine. Circulation. 2015;1920–30. https://doiorg.publicaciones.saludcastillayleon.es/10.1161/CIRCULATIONAHA.115.001593.
Choi RY, Coyner AS, Kalpathy-Cramer J, Chiang MF, Campbell JP. Introduction to machine learning, neural networks, and deep learning. Transl Vis Sci Technol. 2020;9(2):14. https://doiorg.publicaciones.saludcastillayleon.es/10.1167/tvst.9.2.14.
Article PubMed PubMed Central Google Scholar
Gao Z, Liu X, Kang Y, et al. Improving the prognostic evaluation precision of hospital outcomes for heart failure using admission notes and clinical tabular data: multimodal deep learning model. J Med Internet Res. 2024;26:e54363. https://doiorg.publicaciones.saludcastillayleon.es/10.2196/54363.
Article PubMed PubMed Central Google Scholar
Bat-Erdene BI, Zheng H, Son SH, Lee JY. Deep learning-based prediction of heart failure rehospitalization during 6, 12, 24-month follow-ups in patients with acute myocardial infarction. Health Inf J. 2022;28(2):14604582221101529. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/14604582221101529.
Article Google Scholar
Li Y, Hu Y, Jiang F, Chen H, Xue Y, Yu Y. Combining WGCNA and machine learning to identify mechanisms and biomarkers of ischemic heart failure development after acute myocardial infarction. Heliyon. 2024;10(5):e27165. https://doiorg.publicaciones.saludcastillayleon.es/S2405-8440(24)03196-7.
Article CAS PubMed PubMed Central Google Scholar
Li X, Shang C, Xu C, Wang Y, Xu J, Zhou Q. Development and comparison of machine learning-based models for predicting heart failure after acute myocardial infarction. BMC Med Inf Decis Mak. 2023;23(1):165. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12911-023-02240-1.
Article Google Scholar
Nagendran M, Chen Y, Lovejoy CA, Gordon AC, Komorowski M, Harvey H, Topol EJ, Ioannidis JPA, Collins GS, Maruthappu M. Artificial intelligence versus clinicians: systematic review of design, reporting standards, and claims of deep learning studies. BMJ. 2020;368:m689. https://doiorg.publicaciones.saludcastillayleon.es/10.1136/bmj.m689.
Article PubMed PubMed Central Google Scholar
Popat A, Yadav S, Patel SK, Baddevolu S, Adusumilli S, Rao Dasari N, Sundarasetty M, Anand S, Sankar J, Jagtap YG. Artificial intelligence in the early prediction of cardiogenic shock in acute heart failure or myocardial infarction patients: A systematic review and Meta-Analysis. Cureus. 2023;15(12):e50395. https://doiorg.publicaciones.saludcastillayleon.es/10.7759/cureus.50395.
Article PubMed PubMed Central Google Scholar
Mohammad M, Olesen K, Koul S, et al. Development and validation of an artificial neural network algorithm to predict mortality and admission to hospital for heart failure after myocardial infarction: a nationwide population-based study. Lancet Digit Health. 2022;4:e37–45. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/S2589-7500(21)00228-4.
Article CAS PubMed Google Scholar
Arik SÖ. and Tomas Pfister. Tabnet: Attentive interpretable tabular learning. Proceedings of the AAAI conference on artificial intelligence. Vol. 35. No. 8. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.1609/aaai.v35i8.16826
Warren SMC, Walter Pitts. A logical calculus of the ideas immanent in nervous activity. Bull Math Biophys. 1943;5:115–33. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/BF02478259.
Article Google Scholar
Frank Rosenblatt. The perceptron: a probabilistic model for information storage and organization in the brain. Psychol Rev. 1958;65(6):386. https://doiorg.publicaciones.saludcastillayleon.es/10.1037/h0042519.
Article Google Scholar
David E, Rumelhart GE, Hinton, Ronald J, Williams. Learning representations by back propagating errors. Nature. 1986;323(6088):533–6. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/323533a0.
Article Google Scholar
Breiman L. Random forests. Machine learning 45 (2001): 5–32. https://doiorg.publicaciones.saludcastillayleon.es/10.1023/A:1010933404324
Chen T. and Carlos Guestrin. Xgboost: A scalable tree boosting system. Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. 2016. https://doiorg.publicaciones.saludcastillayleon.es/10.1145/2939672.2939785
Lundberg SM, Su-In, Lee. A unified approach to interpreting model predictions. Adv Neural Inf Process Syst 30 (2017).
Guo C, Han X, Zhang T, Zhang H, Li X, Zhou X, Feng S, Tao T, Yin C, Xia J. Lipidomic analyses reveal potential biomarkers for predicting death and heart failure after acute myocardial infarction. Clin Chim Acta. 2024;562:119892. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.cca.2024.119892.
Article CAS PubMed Google Scholar
Zhang T, Han X, Zhang H, Li X, Zhou X, Feng S, Guo C, Song F, Tao T, Yin C, Xia J. Identification of molecular markers for predicting the severity of heart failure after AMI: an Olink precision proteomic study. Clin Chim Acta. 2024;555:117825. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.cca.2024.117825.
Article CAS PubMed Google Scholar
Peikert A, Martinez FA, Vaduganathan M, Claggett BL, Kulac IJ, Desai AS, Jhund PS, de Boer RA, DeMets D, Hernandez AF, Inzucchi SE, Kosiborod MN, Lam CSP, Shah SJ, Katova T, Merkely B, Vardeny O, Wilderäng U, Lindholm D, Petersson M, Langkilde AM, McMurray JJV, Solomon SD. Efficacy and safety of Dapagliflozin in heart failure with mildly reduced or preserved ejection fraction according to age: the DELIVER trial. Circ Heart Fail. 2022;15(10):e010080. https://doiorg.publicaciones.saludcastillayleon.es/10.1161/CIRCHEARTFAILURE.122.010080.
Article CAS PubMed Google Scholar
Redfield MM, Borlaug BA. Heart failure with preserved ejection fraction: A review. JAMA. 2023;329(10):827–38. https://doiorg.publicaciones.saludcastillayleon.es/10.1001/jama.2023.2020.
Article PubMed Google Scholar
Luo H, Xiang C, Zeng L, Li S, Mei X, Xiong L, Liu Y, Wen C, Cui Y, Du L, Zhou Y, Wang K, Li L, Liu Z, Wu Q, Pu J, Yue R. SHAP based predictive modeling for 1 year all-cause readmission risk in elderly heart failure patients: feature selection and model interpretation. Sci Rep. 2024;14(1):17728. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41598-024-67844-7.
Article CAS PubMed PubMed Central Google Scholar
Alogna A, Koepp KE, Sabbah M, Espindola Netto JM, Jensen MD, Kirkland JL, Lam CSP, Obokata M, Petrie MC, Ridker PM, Sorimachi H, Tchkonia T, Voors A, Redfield MM, Borlaug BA. Interleukin-6 in patients with heart failure and preserved ejection fraction. JACC Heart Fail. 2023;11(11):1549–61. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jchf.2023.06.031.
Article CAS PubMed PubMed Central Google Scholar
Gui XY, Rabkin SW, C-Reactive Protein. Interleukin-6, Trimethylamine-N-Oxide, Syndecan-1, nitric oxide, and tumor necrosis factor Receptor-1 in heart failure with preserved versus reduced ejection fraction: a Meta-Analysis. Curr Heart Fail Rep. 2023;20(1):1–11. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s11897-022-00584-9.
Article CAS PubMed Google Scholar
Bart BA, Goldsmith SR, Lee KL, Givertz MM, O’Connor CM, Bull DA, Redfield MM, Deswal A, Rouleau JL, LeWinter MM, Ofili EO, Stevenson LW, Semigran MJ, Felker GM, Chen HH, Hernandez AF, Anstrom KJ, McNulty SE, Velazquez EJ, Ibarra JC, Mascette AM, Braunwald E. Heart failure clinical research network. Ultrafiltration in decompensated heart failure with cardiorenal syndrome. N Engl J Med. 2012;367(24):2296–304. https://doiorg.publicaciones.saludcastillayleon.es/10.1056/NEJMoa1210357.
Article CAS PubMed PubMed Central Google Scholar
Slack D et al. Fooling lime and shap: Adversarial attacks on post hoc explanation methods. Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society. 2020. https://doiorg.publicaciones.saludcastillayleon.es/10.1145/3375627.3375830
Rudin C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat Mach Intell. 2019;1(5):206–15. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s42256-019-0048-x.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

This work was supported by the Beijing Natural Science Foundation (No. L246059, Z240021, 7242264), the National Natural Science Foundation of China (No.82100265) and the R&D Program of Beijing Municipal Education Commission (No. KM202310025020).

Author information

Chenglong Guo and Binyu Gao contributed equally to this work.

Authors and Affiliations

Pulmonary Vascular Disease Center, Beijing Anzhen Hospital, Capital Medical University, Beijing, China
Chenglong Guo
Biological Science & Medical Engineering, Southeast University, Nanjing, 518000, China
Binyu Gao
School of Biomedical Engineering, Capital Medical University, Beijing, 100069, China
Binyu Gao & Honglei Liu
Department of Cardiology, Xuanwu Hospital, Capital Medical University, Beijing, 100053, China
Xuexue Han, Tianxing Zhang & Jinggang Xia
Department of Geriatrics, The Second Medical Center, National Clinical Research Center for Geriatric Diseases, Chinese PLA General Hospital, Beijing, 100853, China
Tianqi Tao
Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, 100069, China
Honglei Liu

Authors

Chenglong Guo
View author publications
You can also search for this author inPubMed Google Scholar
Binyu Gao
View author publications
You can also search for this author inPubMed Google Scholar
Xuexue Han
View author publications
You can also search for this author inPubMed Google Scholar
Tianxing Zhang
View author publications
You can also search for this author inPubMed Google Scholar
Tianqi Tao
View author publications
You can also search for this author inPubMed Google Scholar
Jinggang Xia
View author publications
You can also search for this author inPubMed Google Scholar
Honglei Liu
View author publications
You can also search for this author inPubMed Google Scholar

Contributions

C.G. and B.G. drafted the manuscript, performed the machine learning models, and interpreted the data; X.H. and T.Z. conducted data curation, and preliminary analysis; T.T. contributed to analytical methodology development; H.L. and J.X. conceived and designed the study, supervised the research, and critically revised the manuscript. All authors have read and approved the final submitted manuscript.

Corresponding authors

Correspondence to Tianqi Tao, Jinggang Xia or Honglei Liu.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Ethics Committee of Xuanwu Hospital, Capital Medical University with the approval document number (2022–129) and was processed according to the principles of the Declaration of Helsinki. All enrolled patients signed informed consent forms.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Guo, C., Gao, B., Han, X. et al. Interpretable artificial intelligence model for predicting heart failure severity after acute myocardial infarction. BMC Cardiovasc Disord 25, 362 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12872-025-04818-1

Download citation

Received: 11 February 2025
Accepted: 02 May 2025
Published: 12 May 2025
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12872-025-04818-1

Interpretable artificial intelligence model for predicting heart failure severity after acute myocardial infarction

Abstract

Background

Methods

Results

Conclusions

Clinical trial number

Introduction

Method

Dataset

Model construction and comparison

Model interpretation

Results

Patient characteristics

Model performance

Model interpretation

Convenient application for clinical utility

Discussion

Conclusion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Cardiovascular Disorders

Contact us