论文标题
lrcn-retailnet:一种重复的神经网络架构,用于准确的人计数
LRCN-RetailNet: A recurrent neural network architecture for accurate people counting
论文作者
论文摘要
测量和分析零售商店客户流的流量对于零售商更好地理解客户的行为和支持决策至关重要。然而,对于自动人士计数的新技术的发展并没有太多关注。我们介绍了LRCN-Retailnet:一种能够学习非线性回归模型的经常性神经网络体系结构,并准确地预测了人们从低成本监视摄像机捕获的视频中的人数。输入视频格式遵循最近提出的RGBP图像格式,该格式由颜色和人(前景)信息组成。我们的体系结构能够考虑两个相关方面:通过RGBP图像中卷积层提取的空间特征;以及问题的时间连贯性,该问题是由经常性层所利用的。我们表明,通过有监督的学习方法,训练有素的模型能够以高准确性来预测人们的数量。此外,我们介绍并证明该方法的直接修改是有效地将销售人员排除在人民数量之外的有效修改。进行了全面的实验,以验证,评估和比较所提出的体系结构。结果证实,LRCN-Retailnet明显优于以前的零售结构,这仅限于评估每个迭代的单个图像。以及用于对象检测的最新神经网络。最后,计算绩效实验证实了整个方法可以有效地估计人们实时计数。
Measuring and analyzing the flow of customers in retail stores is essential for a retailer to better comprehend customers' behavior and support decision-making. Nevertheless, not much attention has been given to the development of novel technologies for automatic people counting. We introduce LRCN-RetailNet: a recurrent neural network architecture capable of learning a non-linear regression model and accurately predicting the people count from videos captured by low-cost surveillance cameras. The input video format follows the recently proposed RGBP image format, which is comprised of color and people (foreground) information. Our architecture is capable of considering two relevant aspects: spatial features extracted through convolutional layers from the RGBP images; and the temporal coherence of the problem, which is exploited by recurrent layers. We show that, through a supervised learning approach, the trained models are capable of predicting the people count with high accuracy. Additionally, we present and demonstrate that a straightforward modification of the methodology is effective to exclude salespeople from the people count. Comprehensive experiments were conducted to validate, evaluate and compare the proposed architecture. Results corroborated that LRCN-RetailNet remarkably outperforms both the previous RetailNet architecture, which was limited to evaluating a single image per iteration; and a state-of-the-art neural network for object detection. Finally, computational performance experiments confirmed that the entire methodology is effective to estimate people count in real-time.