TY - GEN
T1 - Why Reading Patterns Matter in Storage Coding & Scheduling Design
AU - Ferner, Ulric J.
AU - Soljanin, Emina
AU - Medard, Muriel
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/8/19
Y1 - 2015/8/19
N2 - Coding techniques for storage systems are gaining traction in data center (DC) applications, owing to their data survivability performance, and more recently, to their ability to mitigate traffic congestion. This paper considers stochastic allocation schedules in networks that admit bulk file requests, across three drive blocking models. We consider a block-based code and a stochastic scheduling algorithm which is beneficial in the case of continuous chunk read patterns. In particular, we demonstrate that in systems with continuous chunk reading patterns, when drive blocking is either independent or from traffic congestion, block coded storage can reduce average download time by 10 - 66%, given modern system parameters. However, a distinction should be made between systems with continuous and those with interrupted chunk read patterns. For interrupted chunk read systems, given our allocation algorithm that performs well for continuous reads, block coded storage performance can be worse than replication, numerical illustrations show relative losses over 66%. These illustrations demonstrate that to harness the full benefits of coded storage and to avoid pitfalls, careful attention must be paid to continuous vs. Interrupted chunk reading patterns, codes other than block codes should be considered, as could joint code-scheduling design.
AB - Coding techniques for storage systems are gaining traction in data center (DC) applications, owing to their data survivability performance, and more recently, to their ability to mitigate traffic congestion. This paper considers stochastic allocation schedules in networks that admit bulk file requests, across three drive blocking models. We consider a block-based code and a stochastic scheduling algorithm which is beneficial in the case of continuous chunk read patterns. In particular, we demonstrate that in systems with continuous chunk reading patterns, when drive blocking is either independent or from traffic congestion, block coded storage can reduce average download time by 10 - 66%, given modern system parameters. However, a distinction should be made between systems with continuous and those with interrupted chunk read patterns. For interrupted chunk read systems, given our allocation algorithm that performs well for continuous reads, block coded storage performance can be worse than replication, numerical illustrations show relative losses over 66%. These illustrations demonstrate that to harness the full benefits of coded storage and to avoid pitfalls, careful attention must be paid to continuous vs. Interrupted chunk reading patterns, codes other than block codes should be considered, as could joint code-scheduling design.
KW - Blocking probability
KW - Bulk requests
KW - Cloud computing architectures
KW - Coded storage
KW - Design patterns
KW - Download time
KW - File chunks
KW - Reading patterns
KW - Storage
UR - http://www.scopus.com/inward/record.url?scp=84960106830&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84960106830&partnerID=8YFLogxK
U2 - 10.1109/CLOUD.2015.55
DO - 10.1109/CLOUD.2015.55
M3 - Conference contribution
AN - SCOPUS:84960106830
T3 - Proceedings - 2015 IEEE 8th International Conference on Cloud Computing, CLOUD 2015
SP - 357
EP - 364
BT - Proceedings - 2015 IEEE 8th International Conference on Cloud Computing, CLOUD 2015
A2 - Pu, Calton
A2 - Mohindra, Ajay
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 8th IEEE International Conference on Cloud Computing, CLOUD 2015
Y2 - 27 June 2015 through 2 July 2015
ER -