论文标题
使用两步转移学习在内窥镜图像中增强肾结石识别
Boosting Kidney Stone Identification in Endoscopic Images Using Two-Step Transfer Learning
论文作者
论文摘要
了解肾结石形成的原因对于建立防止复发的治疗至关重要。当前有不同的方法来确定肾结石类型。但是,参考前体内识别程序可能需要长达几周的时间,而视觉识别需要训练有素的专家。已经开发了机器学习模型,以便在输尿管镜检查过程中为泌尿科医生提供肾结石的自动分类。但是,在培训数据和方法的质量方面普遍缺乏。在这项工作中,使用两步转移学习方法来训练肾脏石材分类器。所提出的方法将在用CCD摄像头(EX-VIVO数据集)获取的一组肾结石图像上学习的知识转移到了最终模型,该模型从内窥镜图像(Ex-Vivo DataSet)分类。结果表明,来自不同域的学习特征有相似信息有助于提高在实际条件下执行分类的模型的性能(例如,不受控制的照明条件和模糊)。最后,与从头开始训练的模型或通过初始化Imagenet权重的模型相比,获得的结果表明,两步方法提取物的特征是改善内窥镜图像中肾结石的识别。
Knowing the cause of kidney stone formation is crucial to establish treatments that prevent recurrence. There are currently different approaches for determining the kidney stone type. However, the reference ex-vivo identification procedure can take up to several weeks, while an in-vivo visual recognition requires highly trained specialists. Machine learning models have been developed to provide urologists with an automated classification of kidney stones during an ureteroscopy; however, there is a general lack in terms of quality of the training data and methods. In this work, a two-step transfer learning approach is used to train the kidney stone classifier. The proposed approach transfers knowledge learned on a set of images of kidney stones acquired with a CCD camera (ex-vivo dataset) to a final model that classifies images from endoscopic images (ex-vivo dataset). The results show that learning features from different domains with similar information helps to improve the performance of a model that performs classification in real conditions (for instance, uncontrolled lighting conditions and blur). Finally, in comparison to models that are trained from scratch or by initializing ImageNet weights, the obtained results suggest that the two-step approach extracts features improving the identification of kidney stones in endoscopic images.