在光学相干断层扫描上检测和量化drusen和网状伪曲霉的深度学习框架

论文标题

在光学相干断层扫描上检测和量化drusen和网状伪曲霉的深度学习框架

A deep learning framework for the detection and quantification of drusen and reticular pseudodrusen on optical coherence tomography

论文作者

Schwartz, Roy, Khalid, Hagar, Liakopoulos, Sandra, Ouyang, Yanling, de Vente, Coen, González-Gonzalo, Cristina, Lee, Aaron Y., Guymer, Robyn, Chew, Emily Y., Egan, Catherine, Wu, Zhichao, Kumar, Himeesh, Farrington, Joseph, Sánchez, Clara I., Tufail, Adnan

论文摘要

目的 - 开发和验证一个深度学习（DL）框架，用于在光学相干断层扫描中检测和定量drusen和网状假牙（RPD）。设计 - 开发和验证分类和特征分割的深度学习模型。方法 - 开发了一个DL框架，该框架由分类模型和用于识别无法解决的扫描的分布（OOD）检测模型组成；使用DRUSEN或RPD识别扫描的分类模型；图像分割模型以独立将病变分为RPD或DRUSEN。数据是从英国生物库（UKBB）的1284名参与者中获得的，并自我报告了与年龄相关的黄斑变性（AMD）和250个UKBB对照组的诊断。 Drusen和RPD由五位视网膜专家手动划定。主要结果度量是ROC曲线（AUC），KAPPA，准确性和类内相关系数（ICC）下的灵敏度，特异性，面积。结果 - 分类模型在其各自的任务上执行强烈（分别为0.95、0.93和0.99 AUC，对于不可行的扫描分类器，OOD模型以及DRUSEN和RPD分类模型）。 DRUSEN和RPD面积的平均ICC与毕业生的平均ICC分别为0.74和0.61，而Intergrader一致性为0.69和0.68。 Froc曲线表明该模型的敏感性接近人类的表现。结论 - 模型实现了与人类绩效相似的高分类和细分性能。在研究和临床环境中，这种强大的框架的应用将进一步了解我们对RPD作为与Drusen的独立实体的理解。

Purpose - To develop and validate a deep learning (DL) framework for the detection and quantification of drusen and reticular pseudodrusen (RPD) on optical coherence tomography scans. Design - Development and validation of deep learning models for classification and feature segmentation. Methods - A DL framework was developed consisting of a classification model and an out-of-distribution (OOD) detection model for the identification of ungradable scans; a classification model to identify scans with drusen or RPD; and an image segmentation model to independently segment lesions as RPD or drusen. Data were obtained from 1284 participants in the UK Biobank (UKBB) with a self-reported diagnosis of age-related macular degeneration (AMD) and 250 UKBB controls. Drusen and RPD were manually delineated by five retina specialists. The main outcome measures were sensitivity, specificity, area under the ROC curve (AUC), kappa, accuracy and intraclass correlation coefficient (ICC). Results - The classification models performed strongly at their respective tasks (0.95, 0.93, and 0.99 AUC, respectively, for the ungradable scans classifier, the OOD model, and the drusen and RPD classification model). The mean ICC for drusen and RPD area vs. graders was 0.74 and 0.61, respectively, compared with 0.69 and 0.68 for intergrader agreement. FROC curves showed that the model's sensitivity was close to human performance. Conclusions - The models achieved high classification and segmentation performance, similar to human performance. Application of this robust framework will further our understanding of RPD as a separate entity from drusen in both research and clinical settings.

下载PDF全文

下载文献需遵守相关版权规定

论文标题