Differential Privacy
Problem Setting
When distributing some data calculated from a dataset publicly, we need to ensure that any malicious party cannot recover from the released data any information about personal data in the original dataset. Mathematically speaking, we need to ensure that our algorithm has Differential Privacy.
First, we need to define a special pair of sets called neighbouring set:
Definition 1 (Neighbouring Sets) Two sets
With this definition, we could define DP:
Definition 1 (Differential Privacy) Let A be a randomized algorithm,