Background The thrombolytic effect for ischemic stroke (IS) is affected by complex factors, such as acute onset of stroke, short therapeutic time window, various individual patient factors, treatment model, types and doses of medicines as well as mode of administration. To identify the influencing factors of thrombolytic effect, most existing studies adopt statistical methods, while rare studies use artificial intelligence (AI) -based algorithms.Objective To establish models using AI-based algorithms for IS patients based on the real-world data including general patient characteristics, medication model and recovery effects, to achieve precise individualized thrombolytic treatment and provide data support for clinical prescription decisions.Methods A retrospective design was used. The clinical information of IS patients (
n=55 621) was extracted from the Yidu Cloud scientific research big data server system of the Second Affiliated Hospital of Dalian Medical University from January 1, 2001 to December 31, 2021, among whom 1 855 with complete information were enrolled according to the inclusion criteria. Thrombolysis effect was evaluated by comparing the National Institutes of Health Stroke Scale (NIHSS) score measured at admission and discharge, and those with an improvement in the NIHSS score by ≥4 points and <4 points were assigned to neurological improvement group (
n=1 236) , and control group (
n=619) , respectively. Factors possibly associated with post-IS thrombolytic effect (including general patient characteristics, medication indicators, examination indicators, test indicators, and treatment methods) were obtained by summarizing the factors suggested separately by three neurology experts with a senior title, and reviewing relevant guidelines and literature, then were screened using univariate analysis, and the identified ones were treated by dimensionality reduction using principal component analysis (PCA) . Models of Logistic, support vector machine (SVM) , C5.0 decision tree arithmetic, classification and regression tree (CART) , deep neural network (DNN) , and Wide&Deep, were built and compared to find the one with the best performance in predicting thrombolytic effect, then to determine its parameters. Then by use of two randomly generated two numbers, 7 and 11, the 1 855 patients were randomly assigned to three datasets, training (
n=1 113, for building and practicing models to discover rules) , validation (
n=371, for adjusting model parameters) , and test (
n=371, for evaluating the generalization ability of the final model) . Feature engineering was used to construct a simplified model and evaluate its accuracy. The clinical information of IS patients (
n=3 925) was extracted from the Yidu Cloud scientific research big data server system of Dalian Central Hospital for external verification of the model.
Results Twenty-six patients characteristics associated with thrombolytic effect were included for establishing models. The dimensionalities were reduced to two principal components by PCA, explaining 93.1% of the total variance. Comparison analysis revealed that the Wide&Deep model had the best predictive performance with an accuracy of 0.815, and an F-index of 0.871. Furthermore, the values of the area under the receiver operating characteristic (AUC) curve of the Wide&Deep model in predicting the thrombolytic effect in patients in the training set and test set were 0.753 and 0.793, respectively. The number of hidden layers and neurons in each layer of the model was 7 and 15, respectively. Using sigmoid as the activation function showed that the model parameters were optimal. The feature-engineering analysis of factors influencing the improvement of neurological function showed that the importance of medication type, administration mode and dosage ranked high, and the importance ranking in a descending order was: cerebrovascular disease history, type of medication, mode of administration, single dose, atherosclerosis, therapeutic time window of thrombolytic therapy, prevalence of use of anticoagulant drugs and drugs for promoting blood circulation and removing blood stasis. After simplifying the independent variables of the model, the accuracy of the Wide&Deep model was 0.819, and its accuracy was 0.801 suggested by the external verification after model simplification, indicating good predictive performance and generalizability.Conclusion The Wide&Deep model has proven to have excellent evaluation indicators. The importance of influencing factors of thrombolytic effect in a descending order is: cerebrovascular disease history, type of medication, administration mode, single dose, atherosclerosis, therapeutic time window of thrombolytic therapy, prevalence of use of anticoagulants and blood-activating and stasis-removing drugs. It provides clinicians with timely and effective thrombolysis treatment support involving thrombolysis related factors and individualized administration using AI-based algorithms.
Table and Figures |
Reference |
Related Articles |
Metrics