Efficient replication of queued tasks for latency reduction in cloud systems

Gauri Joshi, Emina Soljanin, Gregory Wornell

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

30 Scopus citations

Abstract

In cloud computing systems, assigning a job to multiple servers and waiting for the earliest copy to finish is an effective way to combat the variability in the response time of individual servers. Although adding redundant replicas always reduces service time, the total computing time spent per job may be higher, which increases the waiting time in the queue. The total time spent per job is also proportional to the cost of computing resources. We analyze how different redundancy strategies, e.g., the number of replicas and the times at which they are issued and canceled, affect latency and computing cost. We find that the log-concavity of the service time distribution is a key factor in determining whether adding redundancy reduces latency and cost. If the service time distribution is log-convex, adding maximum redundancy reduces both latency and cost. If it is log-concave, having fewer replicas and canceling the redundant requests early is more effective.
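The contrast between log-convex and log-concave service times can be illustrated with a minimal sketch. It is not the paper's model: it ignores queueing entirely, assumes the r replicas have i.i.d. service times, start together, and are canceled instantly when the first one finishes. The function names, the shifted-exponential example (log-concave tail), and the two-branch hyperexponential example (log-convex tail) are illustrative assumptions chosen because their expected minimum has a closed form.

```python
from math import comb


def shifted_exp_latency(r, delta, mu):
    """E[min of r i.i.d. copies] when service time is delta + Exp(mu).

    The tail of this distribution is log-concave (assumed example).
    """
    return delta + 1.0 / (r * mu)


def hyperexp_latency(r, p, mu1, mu2):
    """E[min of r i.i.d. copies] for a two-branch hyperexponential.

    Service time is Exp(mu1) with probability p, else Exp(mu2); its tail is
    log-convex (assumed example). Obtained by integrating S(t)**r with
    S(t) = p*exp(-mu1*t) + (1-p)*exp(-mu2*t) and expanding the binomial.
    """
    return sum(
        comb(r, k) * p**k * (1 - p) ** (r - k) / (k * mu1 + (r - k) * mu2)
        for k in range(r + 1)
    )


def computing_cost(r, latency):
    # All r servers are busy until the first copy finishes, then are canceled.
    return r * latency


if __name__ == "__main__":
    for r in (1, 2, 4):
        lc = shifted_exp_latency(r, delta=1.0, mu=1.0)
        lv = hyperexp_latency(r, p=0.5, mu1=10.0, mu2=0.1)
        print(
            f"r={r}: log-concave latency={lc:.3f} cost={computing_cost(r, lc):.3f} | "
            f"log-convex latency={lv:.3f} cost={computing_cost(r, lv):.3f}"
        )
```

Under these assumptions, latency falls with r in both cases, but the computing cost grows with r for the shifted exponential and shrinks with r for the hyperexponential, which matches the qualitative claim in the abstract.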

Original language: English (US)
Title of host publication: 2015 53rd Annual Allerton Conference on Communication, Control, and Computing, Allerton 2015
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 107-114
Number of pages: 8
ISBN (Electronic): 9781509018239
DOIs
State: Published - Apr 4 2016
Externally published: Yes
Event: 53rd Annual Allerton Conference on Communication, Control, and Computing, Allerton 2015 - Monticello, United States
Duration: Sep 29 2015 to Oct 2 2015

Publication series

Name: 2015 53rd Annual Allerton Conference on Communication, Control, and Computing, Allerton 2015

Other

Other: 53rd Annual Allerton Conference on Communication, Control, and Computing, Allerton 2015
Country/Territory: United States
City: Monticello
Period: 9/29/15 to 10/2/15

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Computer Science Applications
  • Control and Systems Engineering
