Ítem
Solo Metadatos

Evaluating the effectiveness of replication for tail-tolerance

dc.creatorQiu, Zhanspa
dc.creatorPérez, Juan F.
dc.date.accessioned2020-08-28T15:49:15Z
dc.date.available2020-08-28T15:49:15Z
dc.date.created2015-05spa
dc.description.abstractComputing clusters (CC) are a cost-effective high-performance platform for computation-intensive scientific and engineering applications. A key challenge in managing CCs is to consistently achieve low response times. In particular, tail-tolerant methods aim to keep the tail of the response-time distribution short. In this paper we explore concurrent replication with canceling, a tail-tolerant approach that involves processing requests and their replicas concurrently, retrieving the result from the first replica that completes, and canceling all other replicas. We propose a stochastic model that considers any number of replicas, general processing and inter-arrival times, and computes the response time distribution. We show that replication can be very effective in keeping the response-time tail short, but these benefits highly depend on the processing-time distribution, as well as on the CC utilization and the statistical characteristics of the arrival process. We also exploit the model to support the selection of the optimal number of replicas, and a resource provisioning strategy that meets service-level objectives on the response-time percentiles.eng
dc.format.mimetypeapplication/pdf
dc.identifier.doihttps://doi.org/10.1109/CCGrid.2015.22
dc.identifier.issnISBN: 978-1-4799-8006-2
dc.identifier.urihttps://repository.urosario.edu.co/handle/10336/28509
dc.language.isoengspa
dc.publisherIEEEspa
dc.relation.citationEndPage452
dc.relation.citationStartPage443
dc.relation.citationTitleCCGrid `15: 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing Shenzhen China (May, 2015)
dc.relation.ispartofProceedings of the 15th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, ISBN: 978-1-4799-8006-2 (May 2015); pp. 443-452spa
dc.relation.urihttps://dl.acm.org/doi/abs/10.1109/CCGrid.2015.22spa
dc.rights.accesRightsinfo:eu-repo/semantics/restrictedAccess
dc.rights.accesoRestringido (Acceso a grupos específicos)spa
dc.sourceCCGrid '15: 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing Shenzhen China (May, 2015)spa
dc.source.instnameinstname:Universidad del Rosario
dc.source.reponamereponame:Repositorio Institucional EdocUR
dc.subject.keywordComputing clustersspa
dc.subject.keywordConcurrent replicationspa
dc.subject.keywordComputer systemspa
dc.titleEvaluating the effectiveness of replication for tail-tolerancespa
dc.title.TranslatedTitleEvaluar la efectividad de la replicación para la tolerancia a la colaspa
dc.typebookParteng
dc.type.hasVersioninfo:eu-repo/semantics/publishedVersion
dc.type.spaParte de librospa
Archivos
Colecciones