论文标题
cookiegraph:理解和检测第一方跟踪饼干
COOKIEGRAPH: Understanding and Detecting First-Party Tracking Cookies
论文作者
论文摘要
随着第三方Cookie Blocking已成为浏览器的常态,广告商和跟踪器已经开始使用第一方饼干进行跟踪。我们对允许并阻止的第三方cookie的10K网站进行了差异测量研究。这项研究表明,即使第三方饼干被阻止,第一方饼干也用于将已知跟踪器的标识符存储给已知跟踪器。 与第三方Cookie阻塞相反,彻底的第一方曲奇阻止是不切实际的,因为这会导致重大功能损坏。我们提出了CookieGraph,这是一种基于机器学习的方法,可以准确,稳健地检测第一方跟踪cookie。 CookieGraph以90.20%的精度检测到第一方跟踪Cookie,表现优于最先进的cookieblock方法17.75%。我们表明,cookiegraph对cookie名称操纵完全强大,而cookieblock的cookieblock却下降了15.68%。尽管阻止所有第一方饼干会导致32%的SSO登录站点的重大破裂,而CookieBlock将其降低到10%,但我们表明CookieGraph不会在这些站点上引起任何重大破裂。 我们对CookieGraph的部署表明,在93.43%的10K网站上使用了第一方跟踪cookie。我们还发现,第一方跟踪饼干是通过指纹脚本设置的。最普遍的第一方跟踪饼干是由Google,Facebook和Tiktok等主要广告实体设定的。
As third-party cookie blocking is becoming the norm in browsers, advertisers and trackers have started to use first-party cookies for tracking. We conduct a differential measurement study on 10K websites with third-party cookies allowed and blocked. This study reveals that first-party cookies are used to store and exfiltrate identifiers to known trackers even when third-party cookies are blocked. As opposed to third-party cookie blocking, outright first-party cookie blocking is not practical because it would result in major functionality breakage. We propose CookieGraph, a machine learning-based approach that can accurately and robustly detect first-party tracking cookies. CookieGraph detects first-party tracking cookies with 90.20% accuracy, outperforming the state-of-the-art CookieBlock approach by 17.75%. We show that CookieGraph is fully robust against cookie name manipulation while CookieBlock's acuracy drops by 15.68%. While blocking all first-party cookies results in major breakage on 32% of the sites with SSO logins, and CookieBlock reduces it to 10%, we show that CookieGraph does not cause any major breakage on these sites. Our deployment of CookieGraph shows that first-party tracking cookies are used on 93.43% of the 10K websites. We also find that first-party tracking cookies are set by fingerprinting scripts. The most prevalent first-party tracking cookies are set by major advertising entities such as Google, Facebook, and TikTok.