zikele

zikele

人生如此自可乐

以强大的隐私保证发布维基百科使用数据

2308.16298v3

中文标题#

以强大的隐私保证发布维基百科使用数据

英文标题#

Publishing Wikipedia usage data with strong privacy guarantees

中文摘要#

近 20 年来,维基媒体基金会一直在发布关于每天每个维基百科页面有多少人访问的统计数据。 这些数据帮助 维基百科编辑确定他们应该将精力集中在哪些地方以改进在线百科全书,并促进了学术研究。 2023 年 6 月,维基媒体 基金会,在 Tumult Labs 的帮助下,回应了维基百科编辑和学术研究人员长期以来的请求:它开始以更细粒度的方式发布这些统计数据,包括在每日页面浏览次数中加入来源国家。 这项新的数据发布使用差分隐私来为浏览或编辑维基百科的人提供强大的保障。 本文描述了这一数据发布:其目标,从开始到部署的过程,用于生成数据的算法,以及数据发布的成果。

英文摘要#

For almost 20 years, the Wikimedia Foundation has been publishing statistics about how many people visited each Wikipedia page on each day. This data helps Wikipedia editors determine where to focus their efforts to improve the online encyclopedia, and enables academic research. In June 2023, the Wikimedia Foundation, helped by Tumult Labs, addressed a long-standing request from Wikipedia editors and academic researchers: it started publishing these statistics with finer granularity, including the country of origin in the daily counts of page views. This new data publication uses differential privacy to provide robust guarantees to people browsing or editing Wikipedia. This paper describes this data publication: its goals, the process followed from its inception to its deployment, the algorithms used to produce the data, and the outcomes of the data release.

PDF 获取#

查看中文 PDF - 2308.16298v3

智能达人抖店二维码

抖音扫码查看更多精彩内容

Loading...
Ownership of this post data is guaranteed by blockchain and smart contracts to the creator alone.