Identidad Institucional CRAI
Logo EdocUR
    • English
    • español
    • português
  •  Work Submission
  •  FAQs
  • English 
    • English
    • español
    • português
  • Login

Contacto

Twitter

Facebook

Youtube

View Item 
  •   Repositorio Institucional EdocUR - Universidad del Rosario
  • Investigación
  • Artículos
  • View Item
  •   Repositorio Institucional EdocUR - Universidad del Rosario
  • Investigación
  • Artículos
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Holistic workload scaling : A new approach to compute acceleration in the cloud

  • Exportar citas ▼
    • Exportar a Mendeley
    • Exportar a BibTex
Thumbnail
Date
2018
Author
Pérez, Juan F.Autoridad Universidad de Rosario
Chen, Lydia Y.
Villari, Massimo
Ranjan, Rajiv
Métricas

Share
Citation
URI
10.1109/MCC.2018.011791711
http://repository.urosario.edu.co/handle/10336/19089

Summary

Workload scaling is an approach to accelerating computation and thus improving response times by replicating the exact same request multiple times and processing it in parallel on multiple nodes and accepting the result from the first node to finish. This is not unlike a TV game show, where the same question is given to multiple contestants and the (correct) answer is accepted from the first to respond. This is different than traditional strategies for parallelization as used in, say, MapReduce workloads, where each node runs a subset of the overall workload. There are a variety of strategies that trade off metrics such as cost, utilization, performance, and interprocessor communication requirements. Performance modeling can help determine optimal approaches for different environments and goals. This is important, because poor performance can lead to application and domain-specific losses, such as e-commerce conversions and sales. Performance modeling and analysis plays an important role in designing and driving the selection of resource scaling mechanisms. Such modeling and analysis is complex due to time-varying workload arrival rates and request sizes, and even more complex in cloud environments due to the additional stochastic variation caused by performance interference due to resource sharing across co-located tenants. Moreover, little is known on how to multi-scale, i.e., dynamically and simultaneously scale resources vertically, horizontally, and through workload scaling. In this article, we first demonstrate the effectiveness of multi-scaling in reducing latency, and then discuss the performance modeling challenges, particularly for workload scaling. © 2014 IEEE.
Subject
Cloud Computing ; Stochastic Systems ; Cloud Environments ; Inter Processor Communication ; Mapreudce ; Model And Analysis ; Optimal Approaches ; Parallelilzation ; Performance Modeling And Analysis ; Stochastic Variation ; Economic And Social Effects ;

Subject

Probabilidades & matemáticas aplicadas ;

Source link

https://www.computer.org/csdl/mags/cd/2018/01/mcd2018010020.pdf...

Show full item record

Collections
  • Artículos [6079]
Política de Acceso Abierto URPortal de Revistas URRepositorio de Datos de Investigación URCiencia Abierta UR
 

 

Browse

All of DSpaceCommunities & CollectionsTitlesAuthorsTypeSubjectsAdvisorBy Issue DateThis CollectionTitlesAuthorsTypeSubjectsAdvisorBy Issue Date

My Account

LoginRegister

Statistics

View Usage Statistics
Política de Acceso Abierto URPortal de Revistas URRepositorio de Datos de Investigación URCiencia Abierta UR