论文标题
基于先前概率的轨迹语义位置注释的无监督方法
An unsupervised approach for semantic place annotation of trajectories based on the prior probability
论文作者
论文摘要
语义位置注释可以提供单个语义,这在轨迹数据挖掘领域可能有很大帮助。大多数现有方法依赖于注释或外部数据,并且需要在更改区域后进行重新培训,从而阻止其大规模应用。本文中,我们提出了一种无监督的方法,该方法用时空信息表示为轨迹的语义位置注释的UPAPP。贝叶斯标准专门用于将候选位置的时空概率分解为空间概率,持续时间概率和访问时间概率。随后采用了ROI和POI数据中的空间信息来计算空间概率。就时间概率而言,术语频率逆文档频率加权算法用于计算轨迹中对不同位置类型的潜在访问,并生成访问时间和持续时间的先前概率。然后将候选地点的时空概率与该地点类别注释访问的地方的重要性相结合。用北京709名志愿者收集的轨迹数据集的验证表明,我们的方法的总体和平均准确性分别为0.712和0.720,表明可以准确地注释访问的地方而没有任何外部数据。
Semantic place annotation can provide individual semantics, which can be of great help in the field of trajectory data mining. Most existing methods rely on annotated or external data and require retraining following a change of region, thus preventing their large-scale applications. Herein, we propose an unsupervised method denoted as UPAPP for the semantic place annotation of trajectories using spatiotemporal information. The Bayesian Criterion is specifically employed to decompose the spatiotemporal probability of the candidate place into spatial probability, duration probability, and visiting time probability. Spatial information in ROI and POI data is subsequently adopted to calculate the spatial probability. In terms of the temporal probabilities, the Term Frequency Inverse Document Frequency weighting algorithm is used to count the potential visits to different place types in the trajectories, and generates the prior probabilities of the visiting time and duration. The spatiotemporal probability of the candidate place is then combined with the importance of the place category to annotate the visited places. Validation with a trajectory dataset collected by 709 volunteers in Beijing showed that our method achieved an overall and average accuracy of 0.712 and 0.720, respectively, indicating that the visited places can be annotated accurately without any external data.