关键词:用户追踪器;网络追踪;数据驱动
摘 要:In this work, we built a tool that processes users' network traces and outputs tracker strings such as usernames, cookies, IMEI numbers and the like, that uniquely identify a machine/device/browser. The key challenge in automatically capturing trackers from raw traces is dealing with enterprise-sized data. We tackle this problem by applying data-driven multi-stage filtering, thereby pruning the size of network traces to be analyzed. Each filtering step has a trade-off between between false positive rate and potentially interesting information lost (false negatives).