Design Features and Methodological Quality of Studies of Machine Learning-Based Prediction Models in Primary Care: A Scoping Review

doi:10.12114/j.issn.1007-9572.2023.0561

Abstract

Background

Studies of machine learning-based prediction models in primary care have grown rapidly in recent years, but few reports have examined their design features and methodological quality.

Objective

To systematically summarize and analyze the design features and methodological quality of studies of machine learning-based prediction models in primary care.

Methods

PubMed, Embase, CNKI, and Wanfang Data were searched for studies of machine learning-based prediction models in primary care published from database inception to 2023-02-21. Narrative summary and descriptive methods were used to analyze the basic characteristics of the included studies, prediction model types, sample size, missing-value handling methods, machine learning algorithm types, model performance metrics and predictive performance, and model validation methods.

Results

A total of 30 studies involving 106 prediction models were included, of which 17 were published between 2021 and 2023. Research topics included respiratory diseases (6 studies), tumours (4 studies), and outpatient appointments (3 studies). Twenty-six studies had a sample size >1 000 (86.67%, 95%CI=68.36%-95.64%). Seven studies used machine learning methods to handle missing values. Sixty-five prediction models used tree-based machine learning algorithms, among which random forest was the most frequently used (32.08% of the 106 models, 95%CI=23.53%-41.95%). Sixty-one prediction models used the area under the receiver operating characteristic (ROC) curve (AUC) or the concordance (C) statistic as the discrimination metric (57.55%, 95%CI=47.57%-66.97%), but only 14 prediction models reported calibration metrics (13.21%, 95%CI=7.67%-21.50%). Most of the 106 prediction models showed good discrimination, but 92 were rated at high risk of bias (86.79%, 95%CI=78.50%-92.33%). The prediction models in only 7 studies underwent external validation.

Conclusion

Studies of machine learning-based prediction models in primary care have increased gradually over the past 3 years, with topics mainly covering respiratory diseases, tumours, and outpatient appointments. The prediction models differ considerably in sample size and missing-value handling methods. Most show good discrimination, but most have not been externally validated, and the overall risk of bias is relatively high.

Key words: Primary care, Machine learning, Research design, Prediction model, Methodological evaluation, Scoping review
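
The abstract distinguishes discrimination (AUC or C statistic) from calibration, which far fewer models reported. The sketch below illustrates the difference on made-up data; it is not taken from any included study. The function names `c_statistic` and `observed_expected_ratio` and the example values are hypothetical, and the observed/expected ratio is only one simple calibration summary among several.

```python
def c_statistic(y_true, y_prob):
    """Discrimination: share of (event, non-event) pairs ranked correctly.

    Ties in predicted risk count as half a concordant pair. For a binary
    outcome this equals the area under the ROC curve (AUC).
    """
    events = [p for y, p in zip(y_true, y_prob) if y == 1]
    non_events = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not events or not non_events:
        raise ValueError("need at least one event and one non-event")
    score = 0.0
    for p_event in events:
        for p_non in non_events:
            if p_event > p_non:
                score += 1.0
            elif p_event == p_non:
                score += 0.5
    return score / (len(events) * len(non_events))


def observed_expected_ratio(y_true, y_prob):
    """Calibration-in-the-large: observed events / sum of predicted risks."""
    expected = sum(y_prob)
    if expected <= 0:
        raise ValueError("sum of predicted risks must be positive")
    return sum(y_true) / expected


# Hypothetical outcomes and predicted risks, for illustration only.
outcomes = [0, 0, 1, 1]
risks = [0.10, 0.40, 0.35, 0.80]

print(f"C statistic = {c_statistic(outcomes, risks):.2f}")        # 0.75
print(f"O/E ratio   = {observed_expected_ratio(outcomes, risks):.2f}")  # 1.21
```

A model can rank patients well (high C statistic) while still under- or over-estimating absolute risk (O/E ratio far from 1), which is why reporting both kinds of metric matters.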

ZHONG Jinjia, LI Wentao, HUANG Yafang, et al. Design Features and Methodological Quality of Researches about Prediction Models Based on Machine Learning in Primary Care: a Scoping Review[J]. Chinese General Practice, 2024, 27(10): 1271-1276. DOI: 10.12114/j.issn.1007-9572.2023.0561.

Figures/Tables: 4

Basic characteristic	Studies (n)	Percentage (%)	95%CI (%)
Publication year
  2010-2020	13	43.33	(25.97-62.34)
  2021-2023	17	56.67	(37.66-74.03)
Region
  USA	8	26.67	(12.98-46.18)
  UK	7	23.33	(10.63-42.70)
  Sweden	3	10.00	(2.62-27.68)
  Germany	3	10.00	(2.62-27.68)
  Other	9	30.00	(15.41-49.56)
Research topic
  Respiratory diseases	6	20.00	(8.40-39.13)
  Tumours	4	13.33	(4.36-31.64)
  Outpatient appointments	3	10.00	(2.62-27.68)
  Other	17	56.67	(37.66-74.03)
Prediction model type
  Development and internal validation	20	66.67	(47.14-82.06)
  Development with internal and external validation	5	16.67	(6.31-35.45)
  Development only	3	10.00	(2.62-27.68)
  External validation only	2	6.67	(1.16-23.51)
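
The text does not state how the 95% confidence intervals were calculated. As a hedged sketch, the code below assumes a Wilson score interval with continuity correction. That assumption reproduces the interval reported for studies with a sample size >1 000 (26 of 30), but other rows may differ in the last decimal depending on how intermediate values were rounded. The function name `wilson_cc_interval` is hypothetical.

```python
from statistics import NormalDist


def wilson_cc_interval(successes, total, level=0.95):
    """Wilson score interval with continuity correction for a proportion.

    Assumption: this is one plausible method for the table's intervals;
    the source does not name the method actually used.
    """
    if total <= 0 or not 0 <= successes <= total:
        raise ValueError("need total > 0 and 0 <= successes <= total")
    z = NormalDist().inv_cdf(1 - (1 - level) / 2)  # about 1.96 for 95%
    p = successes / total
    q = 1 - p
    denom = 2 * (total + z * z)
    lower = (2 * total * p + z * z - 1
             - z * (z * z - 2 - 1 / total + 4 * p * (total * q + 1)) ** 0.5) / denom
    upper = (2 * total * p + z * z + 1
             + z * (z * z + 2 - 1 / total + 4 * p * (total * q - 1)) ** 0.5) / denom
    # By convention the bounds are pinned at 0 and 1 for extreme counts.
    if successes == 0:
        lower = 0.0
    if successes == total:
        upper = 1.0
    return max(0.0, lower), min(1.0, upper)


# 26 of 30 studies had a sample size >1 000.
low, high = wilson_cc_interval(26, 30)
print(f"{26 / 30:.2%} ({low:.2%}-{high:.2%})")  # 86.67% (68.36%-95.64%)
```

With only 30 studies the intervals are wide, so small differences between percentages in the table should be read with caution.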

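Machine learning algorithm type	Prediction models (n)	Percentage (%)	95%CI (%)
Tree-based machine learning algorithms	65	61.32	(51.33-70.48)
  Random forest	34	32.08	(23.53-41.95)
  Gradient boosting machine	17	16.04	(9.89-24.72)
  Decision tree	10	9.43	(4.86-17.06)
  Extreme gradient boosting (XGBoost)	4	3.77	(1.21-9.94)
Regression models	20	18.87	(12.17-27.88)
  Maximum likelihood logistic regression	14	13.21	(7.67-21.50)
  Lasso regression	2	1.89	(0.33-7.32)
  Least squares regression	1	0.94	(0.05-5.90)
  Cox regression	1	0.94	(0.05-5.90)
  Elastic net regression	1	0.94	(0.05-5.90)
  Ridge regression	1	0.94	(0.05-5.90)
Neural network	9	8.49	(4.20-15.93)
Support vector machine	6	5.66	(2.32-12.41)
Bayesian network	2	1.89	(0.33-7.32)
k-nearest neighbours	2	1.89	(0.33-7.32)
Naive Bayes	1	0.94	(0.05-5.90)
Super learner ensemble	1	0.94	(0.05-5.90)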
