论文标题
通过机器学习,来自全天空相机数据的云标识
Cloud Identification from All-sky Camera Data with Machine Learning
论文作者
论文摘要
大多数地面观测值配备了广角全天空摄像头,以监视夜空条件。这样的相机系统可用于提供传入的云层预警,这些云可能会通过降水以及天空质量监测对望远镜设备构成危险。我们研究了使用不同机器学习方法来自动化全天摄像机数据中主要是不透明云作为云警告系统的使用。在深入学习的方法中,我们在预先标记的相机图像上训练残留的神经网络(RESNET)。我们的第二种方法从相机图像中提取相关和局部图像特征,并使用这些数据来训练基于梯度的基于树的模型(LightGBM)。我们在位于洛厄尔天文台的Discovery通道望远镜上拍摄的一组大约2,000张图像上训练了这两种模型方法,其中云的存在已被手动标记。在图像的给定区域中检测云的精度为85%,但需要大量的计算资源。我们的LightGBM方法通过约1,000张图像和相当适度的计算资源的培训样本实现了95%的精度。基于不同的性能指标,我们建议后一种基于功能的自动化云检测方法。为这项工作构建的代码可在线提供。
Most ground-based observatories are equipped with wide-angle all-sky cameras to monitor the night sky conditions. Such camera systems can be used to provide early warning of incoming clouds that can pose a danger to the telescope equipment through precipitation, as well as for sky quality monitoring. We investigate the use of different machine learning approaches for automating the identification of mostly opaque clouds in all-sky camera data as a cloud warning system. In a deep-learning approach, we train a Residual Neural Network (ResNet) on pre-labeled camera images. Our second approach extracts relevant and localized image features from camera images and uses these data to train a gradient-boosted tree-based model (lightGBM). We train both model approaches on a set of roughly 2,000 images taken by the all-sky camera located at Lowell Observatory's Discovery Channel Telescope, in which the presence of clouds has been labeled manually. The ResNet approach reaches an accuracy of 85% in detecting clouds in a given region of an image, but requires a significant amount of computing resources. Our lightGBM approach achieves an accuracy of 95% with a training sample of ~1,000 images and rather modest computing resources. Based on different performance metrics, we recommend the latter feature-based approach for automated cloud detection. Code that was built for this work is available online.