论文标题
合成数据 - 什么,为什么和如何?
Synthetic Data -- what, why and how?
论文作者
论文摘要
该解释器文档旨在概述合成数据技术快速扩展的工作的当前状态,特别关注隐私。该文章旨在为非技术受众提供,尽管已经给出了一些正式的定义,以便为专家提供清晰度。本文旨在使读者能够快速熟悉合成数据的概念,并了解随之而来的一些微妙的复杂性。我们确实认为,合成数据是一个非常有用的工具,我们的希望是,该报告强调了这一点,同时引起了人们对部署中很容易忽视的细微差别的关注。
This explainer document aims to provide an overview of the current state of the rapidly expanding work on synthetic data technologies, with a particular focus on privacy. The article is intended for a non-technical audience, though some formal definitions have been given to provide clarity to specialists. This article is intended to enable the reader to quickly become familiar with the notion of synthetic data, as well as understand some of the subtle intricacies that come with it. We do believe that synthetic data is a very useful tool, and our hope is that this report highlights that, while drawing attention to nuances that can easily be overlooked in its deployment.