论文标题
COVID-19通过常规血液测试使用机器学习诊断
COVID-19 diagnosis by routine blood tests using machine learning
论文作者
论文摘要
照顾冠状病毒疾病患者(COVID-19)的医生描述了常规血液参数的不同变化。但是,这些变化阻碍了他们进行共同诊断。我们为COVID-19诊断构建了机器学习预测模型。该模型基于5,333例患有各种细菌和病毒感染的患者的常规血液检查和交叉验证,以及160 COVID-19-19-19-阳性患者。我们以81.9%的敏感性选择了操作ROC点,特异性为97.9%。曲线下(AUC)下的交叉验证区域为0.97。根据XGBoost算法的特征评分,用于COVID19诊断的五个最有用的常规血液参数是MCHC,嗜酸性粒细胞计数,白蛋白,INR和凝血酶原活性百分比。 TSNE可视化表明,严重的COVID-19病程患者的血液参数更像是细菌的参数,而不是病毒感染。报道的诊断准确性至少具有可比性,并且可能与RT-PCR和胸部CT研究互补。发烧,咳嗽,肌痛和其他症状的患者现在可以通过我们的诊断工具评估最初的常规血液检查。所有Covid-19预测阳性的患者将进行标准的RT-PCR研究以确认诊断。我们认为,我们的结果对COVID-19诊断的改善做出了重大贡献。
Physicians taking care of patients with coronavirus disease (COVID-19) have described different changes in routine blood parameters. However, these changes, hinder them from performing COVID-19 diagnosis. We constructed a machine learning predictive model for COVID-19 diagnosis. The model was based and cross-validated on the routine blood tests of 5,333 patients with various bacterial and viral infections, and 160 COVID-19-positive patients. We selected operational ROC point at a sensitivity of 81.9% and specificity of 97.9%. The cross-validated area under the curve (AUC) was 0.97. The five most useful routine blood parameters for COVID19 diagnosis according to the feature importance scoring of the XGBoost algorithm were MCHC, eosinophil count, albumin, INR, and prothrombin activity percentage. tSNE visualization showed that the blood parameters of the patients with severe COVID-19 course are more like the parameters of bacterial than viral infection. The reported diagnostic accuracy is at least comparable and probably complementary to RT-PCR and chest CT studies. Patients with fever, cough, myalgia, and other symptoms can now have initial routine blood tests assessed by our diagnostic tool. All patients with a positive COVID-19 prediction would then undergo standard RT-PCR studies to confirm the diagnosis. We believe that our results present a significant contribution to improvements in COVID-19 diagnosis.