Holistic workload scaling : A new approach to compute acceleration in the cloud

Pérez, Juan F.; Chen, Lydia Y.; Villari, Massimo; Ranjan, Rajiv

doi:10.1109/MCC.2018.011791711

Ítem

Acceso Abierto

Holistic workload scaling : A new approach to compute acceleration in the cloud

Mostrar el registro sencillo de la publicación

dc.creator	Pérez, Juan F.
dc.creator	Chen, Lydia Y.
dc.creator	Villari, Massimo
dc.creator	Ranjan, Rajiv
dc.creator.google	Pérez, Juan F.	spa
dc.creator.google	Chen, Lydia Y.
dc.creator.google	Villari, Massimo
dc.creator.google	Ranjan, Rajiv
dc.date.accessioned	2019-02-15T19:41:21Z
dc.date.available	2019-02-15T19:41:21Z
dc.date.created	2018
dc.date.issued	2018
dc.description.abstract	Workload scaling is an approach to accelerating computation and thus improving response times by replicating the exact same request multiple times and processing it in parallel on multiple nodes and accepting the result from the first node to finish. This is not unlike a TV game show, where the same question is given to multiple contestants and the (correct) answer is accepted from the first to respond. This is different than traditional strategies for parallelization as used in, say, MapReduce workloads, where each node runs a subset of the overall workload. There are a variety of strategies that trade off metrics such as cost, utilization, performance, and interprocessor communication requirements. Performance modeling can help determine optimal approaches for different environments and goals. This is important, because poor performance can lead to application and domain-specific losses, such as e-commerce conversions and sales. Performance modeling and analysis plays an important role in designing and driving the selection of resource scaling mechanisms. Such modeling and analysis is complex due to time-varying workload arrival rates and request sizes, and even more complex in cloud environments due to the additional stochastic variation caused by performance interference due to resource sharing across co-located tenants. Moreover, little is known on how to multi-scale, i.e., dynamically and simultaneously scale resources vertically, horizontally, and through workload scaling. In this article, we first demonstrate the effectiveness of multi-scaling in reducing latency, and then discuss the performance modeling challenges, particularly for workload scaling. © 2014 IEEE.	eng
dc.format.mimetype	application/pdf
dc.identifier.doi	10.1109/MCC.2018.011791711
dc.identifier.issn	2325-6095
dc.identifier.uri	http://repository.urosario.edu.co/handle/10336/19089
dc.language.iso	eng	spa
dc.relation.citationEndPage	30
dc.relation.citationStartPage	20
dc.relation.citationTitle	IEEE Cloud Computing
dc.relation.citationVolume	Vol. 5
dc.relation.ispartof	IEEE Cloud Computing, ISSN:2325-6095, Vol. 5 (2018) pp. 20-30	spa
dc.relation.uri	https://www.computer.org/csdl/mags/cd/2018/01/mcd2018010020.pdf	spa
dc.rights.accesRights	info:eu-repo/semantics/openAccess
dc.rights.acceso	Abierto (Texto Completo)	spa
dc.source.bibliographicCitation	Metrics, K., Blog, , https://blog.kissmetrics.com/loading-time	spa
dc.source.instname	instname:Universidad del Rosario
dc.source.reponame	reponame:Repositorio Institucional EdocUR
dc.subject	Cloud Computing	spa
dc.subject	Stochastic Systems	spa
dc.subject	Cloud Environments	spa
dc.subject	Inter Processor Communication	spa
dc.subject	Mapreudce	spa
dc.subject	Model And Analysis	spa
dc.subject	Optimal Approaches	spa
dc.subject	Parallelilzation	spa
dc.subject	Performance Modeling And Analysis	spa
dc.subject	Stochastic Variation	spa
dc.subject	Economic And Social Effects	spa
dc.subject.ddc	Probabilidades & matemáticas aplicadas	spa
dc.subject.lemb	Sistemas estocásticos	spa
dc.subject.lemb	Comercio electrónico	spa
dc.title	Holistic workload scaling : A new approach to compute acceleration in the cloud	spa
dc.type	article	eng
dc.type.hasVersion	info:eu-repo/semantics/publishedVersion
dc.type.spa	Artículo	spa