扩散加权脑梗塞病变的分割和检测敏感性具有合成的深度学习

论文标题

扩散加权脑梗塞病变的分割和检测敏感性具有合成的深度学习

Improved Segmentation and Detection Sensitivity of Diffusion-Weighted Brain Infarct Lesions with Synthetically Enhanced Deep Learning

论文作者

Federau, Christian, Christensen, Soren, Scherrer, Nino, Ospel, Johanna, Schulze-Zachau, Victor, Schmidt, Noemi, Breit, Hanns-Christian, Maclaren, Julian, Lansberg, Maarten, Kozerke, Sebastian

论文摘要

目的：将在人类标记的临床扩散加权（DW）中风病变数据库中训练的深度学习模型的分割和检测性能与在同一数据库中训练的模型增强了合成DW中风病变。方法：在这项机构审查委员会批准的研究中，中风数据库为962例（平均年龄65岁+/- 17岁，255例男性，449例患有DW阳性中风病变的扫描）和2,027例患者的正常数据库（平均年龄38 +/- 24岁，24岁，1088名女性）。通过将真实中风的相对信号增加到正常脑体积的相对信号增加而产生了带有合成DW中风病变的脑量。在四个不同的数据库上训练了一个通用的3D U-NET，以生成四个不同的模型：（a）375个神经放射原生物学家标记的临床DW阳性中风病例（CDB）;（b）2,000个合成病例（S2DB）;（c）CDB+2,000 CDB+2,000个合成病例（CS2DB）;或（d）CDB+40,000个合成病例（CS40DB）。对模型的中风数据库病例的20％（n = 192）进行了测试，该模型被排除在训练集之外。使用骰子评分和中风分割的病变体积来表征分割精度，并使用配对的两尾学生的t检验对统计显着性进行了测试。将检测敏感性和特异性与三位神经放射学家进行了比较。结果：在CS40DB上训练的3D U-NET模型的性能优于在CS2DB（0.70，p <0.001）或CDB（0.65，p <0.001）上训练的模型。深度学习模型也比三个人类读者中的每个（84％[81％-87％]，78％[75％-81％]和79％[76％-82％]）更敏感（91％[89％-93％]），但特异性（75％[72％-78％] vs）（96％[94％-97％]，92％[90％-94％]和89％[86％-91％]。

Purpose: To compare the segmentation and detection performance of a deep learning model trained on a database of human-labelled clinical diffusion-weighted (DW) stroke lesions to a model trained on the same database enhanced with synthetic DW stroke lesions. Methods: In this institutional review board approved study, a stroke database of 962 cases (mean age 65+/-17 years, 255 males, 449 scans with DW positive stroke lesions) and a normal database of 2,027 patients (mean age 38+/-24 years,1088 females) were obtained. Brain volumes with synthetic DW stroke lesions were produced by warping the relative signal increase of real strokes to normal brain volumes. A generic 3D U-Net was trained on four different databases to generate four different models: (a) 375 neuroradiologist-labeled clinical DW positive stroke cases(CDB);(b) 2,000 synthetic cases(S2DB);(c) CDB+2,000 synthetic cases(CS2DB); or (d) CDB+40,000 synthetic cases(CS40DB). The models were tested on 20%(n=192) of the cases of the stroke database, which were excluded from the training set. Segmentation accuracy was characterized using Dice score and lesion volume of the stroke segmentation, and statistical significance was tested using a paired, two-tailed, Student's t-test. Detection sensitivity and specificity was compared to three neuroradiologists. Results: The performance of the 3D U-Net model trained on the CS40DB(mean Dice 0.72) was better than models trained on the CS2DB (0.70,P <0.001) or the CDB(0.65,P<0.001). The deep learning model was also more sensitive (91%[89%-93%]) than each of the three human readers(84%[81%-87%],78%[75%-81%],and 79%[76%-82%]), but less specific(75%[72%-78%] vs for the three human readers (96%[94%-97%],92%[90%-94%] and 89%[86%-91%]). Conclusion: Deep learning training for segmentation and detection of DW stroke lesions was significantly improved by enhancing the training set with synthetic lesions.

下载PDF全文

下载文献需遵守相关版权规定

论文标题