Abstract
Differential privacy (DP) has become the mainstream privacy standard due to its strong protection of individual users’ information. It guarantees that for any neighboring datasets, the output of the mechanism is practically identical. There are two common definitions of neighborhoods in differential privacy. The first is record-level DP, where neighboring datasets differ by exactly one record; the second is user-level DP, where two datasets are neighbors if they differ by a single user’s data. Some commonly used aggregation functions include SUM, MAX, AVERAGE, etc.In our first work, we note that many frequently encountered aggregation functions are monotonic or can be derived from monotonic functions. Based on this insight, we design a general user-level DP mechanism that estimates monotonic functions with strong optimality guarantees. While our general mechanism may run in super-polynomial time, we show how to instantiate an approximate version that runs in polynomial time, making it efficient for certain common monotonic functions.
From our first work, we observe that existing user-level DP mechanisms still have high computational costs due to the complex relationships between users and records. To address this challenge, random sampling has proven to be an effective approach for reducing computational costs. However, all existing sampling-based mechanisms are designed for record-level DP. Therefore, in our second work, we take the first step towards investigating the application of random sampling under user-level DP and propose an accurate and efficient sampling-based mechanism for the sum estimation function.
A limitation of the first two works is that they focus on the central DP model, where all users trust the data curator. However, this assumption is often impractical. Thus, in our third work, we shift to the distributed setting and focus on the shuffle DP model. We explore the subgraph counting problem under edge DP, which is a specific case of aggregation function estimation in the context of record-level DP. We propose a general sampling-based mechanism, apply it to several common subgraphs, and analyze the performance guarantees.
| Date of Award | 2025 |
|---|---|
| Original language | English |
| Awarding Institution |
|
| Supervisor | Ke YI (Supervisor) |
Cite this
- Standard