TY - GEN
T1 - Load Balancing Performance in Distributed Storage with Regular Balanced Redundancy
AU - Aktas, Mehmet Fatih
AU - Behrouzi-Far, Amir
AU - Soljanin, Emina
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/10
Y1 - 2019/10
N2 - Contention at the storage nodes is the main cause of long and variable data access times in distributed storage systems. Offered load on the system must be balanced across the storage nodes in order to minimize contention, and load balancing should be robust against the skews and fluctuations in content popularities. Data objects are replicated across multiple nodes in practice to allow for load balancing. However redundancy increases the storage requirement and should be used efficiently. We evaluate load balancing performance of natural storage schemes in which each data object is stored at d different nodes and each node stores the same number of objects. We find that load balance in a system of n nodes improves multiplicatively with d as long as d = o (log(n)), and improves exponentially as soon as d = Θ(log(n)). We show that load balance improves the same way with d when the service choices are created with XORs of r objects rather than object replicas, which also reduces the storage overhead multiplicatively by r. However, unlike accessing an object replica, access through a recovery set composed by an XOR'ed copy requires downloading content from r nodes, which increases load imbalance additively by r.
AB - Contention at the storage nodes is the main cause of long and variable data access times in distributed storage systems. Offered load on the system must be balanced across the storage nodes in order to minimize contention, and load balancing should be robust against the skews and fluctuations in content popularities. Data objects are replicated across multiple nodes in practice to allow for load balancing. However redundancy increases the storage requirement and should be used efficiently. We evaluate load balancing performance of natural storage schemes in which each data object is stored at d different nodes and each node stores the same number of objects. We find that load balance in a system of n nodes improves multiplicatively with d as long as d = o (log(n)), and improves exponentially as soon as d = Θ(log(n)). We show that load balance improves the same way with d when the service choices are created with XORs of r objects rather than object replicas, which also reduces the storage overhead multiplicatively by r. However, unlike accessing an object replica, access through a recovery set composed by an XOR'ed copy requires downloading content from r nodes, which increases load imbalance additively by r.
UR - http://www.scopus.com/inward/record.url?scp=85081578181&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85081578181&partnerID=8YFLogxK
U2 - 10.1109/REDUNDANCY48165.2019.9003321
DO - 10.1109/REDUNDANCY48165.2019.9003321
M3 - Conference contribution
AN - SCOPUS:85081578181
T3 - 2019 16th International Symposium "Problems of Redundancy in Information and Control Systems", REDUNDANCY 2019
SP - 75
EP - 80
BT - 2019 16th International Symposium "Problems of Redundancy in Information and Control Systems", REDUNDANCY 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 16th International Symposium "Problems of Redundancy in Information and Control Systems", REDUNDANCY 2019
Y2 - 21 October 2019 through 25 October 2019
ER -