TY - GEN
T1 - Tight bounds on the round complexity of the distributed maximum coverage problem
AU - Assadi, Sepehr
AU - Khanna, Sanjeev
N1 - Publisher Copyright:
© Copyright 2018 by SIAM.
PY - 2018
Y1 - 2018
N2 - We study the maximum k-set coverage problem in the follow-ing distributed setting. A collection of input sets S1; : : : ; Sm over a universe [n] is partitioned across p machines and the goal is to find k sets whose union covers the most number of elements. The computation proceeds in rounds where in each round machines communicate information to each other. Specifically, in each round, all machines simultane-ously send a message to a central coordinator who then com-municates back to all machines a summary to guide the com-putation for the next round. At the end of the last round, the coordinator outputs the answer. The main measures of efficiency in this setting are the approximation ratio of the returned solution, the communication cost of each machine, and the number of rounds of computation. Our main result is an asymptotically tight bound on the tradeoff between these three measures for the distributed maximum coverage problem. We first show that any r-round protocol for this problem either incurs a communication cost of k/m(1=r) or only achieves an approximation factor of k(1=r). This in particular implies that any protocol that simultaneously achieves good approximation ratio (O(1) ap-proximation) and good communication cost (eO(n) commu-nication per machine), essentially requires logarithmic (in k) number of rounds. We complement our lower bound result by showing that there exist an r-round protocol that achieves an e e-1 -approximation (essentially best possible) with a com-munication cost of k/mO(1=r) as well as an r-round protocol that achieves a kO(1=r)-approximation with only eO(n) com-munication per each machine (essentially best possible). We further use our results in this distributed setting to obtain new bounds for maximum coverage in two other main models of computation for massive datasets, namely, the dynamic streaming model and the MapReduce model.
AB - We study the maximum k-set coverage problem in the follow-ing distributed setting. A collection of input sets S1; : : : ; Sm over a universe [n] is partitioned across p machines and the goal is to find k sets whose union covers the most number of elements. The computation proceeds in rounds where in each round machines communicate information to each other. Specifically, in each round, all machines simultane-ously send a message to a central coordinator who then com-municates back to all machines a summary to guide the com-putation for the next round. At the end of the last round, the coordinator outputs the answer. The main measures of efficiency in this setting are the approximation ratio of the returned solution, the communication cost of each machine, and the number of rounds of computation. Our main result is an asymptotically tight bound on the tradeoff between these three measures for the distributed maximum coverage problem. We first show that any r-round protocol for this problem either incurs a communication cost of k/m(1=r) or only achieves an approximation factor of k(1=r). This in particular implies that any protocol that simultaneously achieves good approximation ratio (O(1) ap-proximation) and good communication cost (eO(n) commu-nication per machine), essentially requires logarithmic (in k) number of rounds. We complement our lower bound result by showing that there exist an r-round protocol that achieves an e e-1 -approximation (essentially best possible) with a com-munication cost of k/mO(1=r) as well as an r-round protocol that achieves a kO(1=r)-approximation with only eO(n) com-munication per each machine (essentially best possible). We further use our results in this distributed setting to obtain new bounds for maximum coverage in two other main models of computation for massive datasets, namely, the dynamic streaming model and the MapReduce model.
UR - https://www.scopus.com/pages/publications/85045558137
UR - https://www.scopus.com/pages/publications/85045558137#tab=citedBy
U2 - 10.1137/1.9781611975031.155
DO - 10.1137/1.9781611975031.155
M3 - Conference contribution
AN - SCOPUS:85045558137
T3 - Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms
SP - 2412
EP - 2431
BT - 29th Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2018
A2 - Czumaj, Artur
PB - Association for Computing Machinery
T2 - 29th Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2018
Y2 - 7 January 2018 through 10 January 2018
ER -