论文标题
通过文本相似性在搜索上进行搜索,朝着可推广的语义产品搜索搜索单击日志
Towards Generalizable Semantic Product Search by Text Similarity Pre-training on Search Click Logs
论文作者
论文摘要
最近,语义搜索已成功地应用于电子商务产品搜索,并且有望用于查询的学习语义空间和产品编码将概括为看不见的查询或产品。但是,到目前为止,尚未在该域中对概括进行方便地出现。在本文中,我们研究了几种通用域和特定于域的预训练的罗伯塔变体,发现通用域微调无助于概括,这与发现先前的艺术相吻合。基于对公共可用的手动注释Query-Query-rododododuct da da DA
Recently, semantic search has been successfully applied to e-commerce product search and the learned semantic space(s) for query and product encoding are expected to generalize to unseen queries or products. Yet, whether generalization can conveniently emerge has not been thoroughly studied in the domain thus far. In this paper, we examine several general-domain and domain-specific pre-trained Roberta variants and discover that general-domain fine-tuning does not help generalization, which aligns with the discovery of prior art. Proper domain-specific fine-tuning with clickstream data can lead to better model generalization, based on a bucketed analysis of a publicly available manual annotated query-product pair da