论文标题

使用线性支持向量机的类星体检测,从错误方法中学习

Quasar Detection using Linear Support Vector Machine with Learning From Mistakes Methodology

论文作者

Herle, Aniruddh, Channegowda, Janamejaya, Prabhu, Dinakar

论文摘要

天文学领域需要收集和同化大量数据。随着科学仪器每晚产生的大量数据呈指数增长,数据处理和处理问题变得严重。对于处理数据的常规方法,该问题变得广泛,这主要是手动的,但是使用机器学习方法的理想设置。虽然为天文学构建分类器,但鉴于这些物体的稀有性和科学价值,丢失诸如超新星或类星体以检测损失的稀有物体的成本要严重得多。在本文中,探索了线性支撑向量机(LSVM)以检测类星体,这是极亮的物体,其中超质量黑洞被发光的积聚盘包围。在天文学中,正确识别类星体至关重要,因为它们本质上非常罕见。他们的稀有性创造了一个平衡问题,需要考虑到这一点。在设计分类器时,考虑了班级不平衡问题和高分类成本。为了实现此检测,探索了新型分类器,并评估了其性能。据观察,LSVM与整体袋(EBT)一起使用错误方法学的学习,从而降低了假负率的10倍。

The field of Astronomy requires the collection and assimilation of vast volumes of data. The data handling and processing problem has become severe as the sheer volume of data produced by scientific instruments each night grows exponentially. This problem becomes extensive for conventional methods of processing the data, which was mostly manual, but is the perfect setting for the use of Machine Learning approaches. While building classifiers for Astronomy, the cost of losing a rare object like supernovae or quasars to detection losses is far more severe than having many false positives, given the rarity and scientific value of these objects. In this paper, a Linear Support Vector Machine (LSVM) is explored to detect Quasars, which are extremely bright objects in which a supermassive black hole is surrounded by a luminous accretion disk. In Astronomy, it is vital to correctly identify quasars, as they are very rare in nature. Their rarity creates a class-imbalance problem that needs to be taken into consideration. The class-imbalance problem and high cost of misclassification are taken into account while designing the classifier. To achieve this detection, a novel classifier is explored, and its performance is evaluated. It was observed that LSVM along with Ensemble Bagged Trees (EBT) achieved a 10x reduction in the False Negative Rate, using the Learning from Mistakes methodology.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源