Influencers and the giant component: The fundamental hardness in privacy protection for socially contagious attributes

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

The presence of correlation is known to make privacy protection more difficult. We investigate the privacy of socially contagious attributes on a network of individuals, where each individual possessing that attribute may influence a number of others into adopting it. We show that for contagions following the Independent Cascade model there exists a giant connected component of infected nodes, containing a constant fraction of all the nodes who all receive the contagion from the same set of sources. We further show that it is extremely hard to hide the existence of this giant connected component if we want to obtain an estimate of the activated users at an acceptable level. Moreover, an adversary possessing this knowledge can predict the real status (“active” or “inactive”) with decent probability for many of the individuals regardless of the privacy (perturbation) mechanism used. As a case study, we show that the Wasserstein mechanism, a state-of-the-art privacy mechanism designed specifically for correlated data, introduces a noise with magnitude of order Ω(n) in the count estimation in our setting. We provide theoretical guarantees for two classes of random networks: Erdös-Rényi graphs and Chung-Lu power-law graphs under the Independent Cascade model. Experiments demonstrate that a giant connected component of infected nodes can indeed appear in real-world networks and a simple inference attack can reveal the status of a good fraction of nodes.

Original languageEnglish (US)
Title of host publicationSIAM International Conference on Data Mining, SDM 2021
PublisherSiam Society
Pages217-225
Number of pages9
ISBN (Electronic)9781611976700
StatePublished - 2021
Event2021 SIAM International Conference on Data Mining, SDM 2021 - Virtual, Online
Duration: Apr 29 2021May 1 2021

Publication series

NameSIAM International Conference on Data Mining, SDM 2021

Conference

Conference2021 SIAM International Conference on Data Mining, SDM 2021
CityVirtual, Online
Period4/29/215/1/21

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Software

Fingerprint

Dive into the research topics of 'Influencers and the giant component: The fundamental hardness in privacy protection for socially contagious attributes'. Together they form a unique fingerprint.

Cite this