Ítem
Solo Metadatos

On the latency-accuracy tradeoff in approximate MapReduce jobs

dc.creatorPérez, Juan F.spa
dc.creatorBirke R.spa
dc.creatorChen L.Y.spa
dc.date.accessioned2020-05-25T23:58:26Z
dc.date.available2020-05-25T23:58:26Z
dc.date.created2017spa
dc.description.abstractTo ensure the scalability of big data analytics, approximate MapReduce platforms emerge to explicitly trade off accuracy for latency. A key step to determine optimal approximation levels is to capture the latency of big data jobs, which is long deemed challenging due to the complex dependency among data inputs and map/reduce tasks. In this paper, we use matrix analytic methods to derive stochastic models that can predict a wide spectrum of latency metrics, e.g., average, tails, and distributions, for approximate MapReduce jobs that are subject to strategies of input sampling and task dropping. In addition to capturing the dependency among waves of map/reduce tasks, our models incorporate two job scheduling policies, namely, exclusive and overlapping, and two task dropping strategies, namely, early and straggler, enabling us to realistically evaluate the potential performance gains of approximate computing. Our numerical analysis shows that the proposed models can guide big data platforms to determine the optimal approximation strategies and degrees of approximation. © 2017 IEEE.eng
dc.format.mimetypeapplication/pdf
dc.identifier.doihttps://doi.org/10.1109/INFOCOM.2017.8057038
dc.identifier.issn00001983
dc.identifier.issn00002011
dc.identifier.urihttps://repository.urosario.edu.co/handle/10336/22862
dc.language.isoengspa
dc.publisherInstitute of Electrical and Electronics Engineers Inc.spa
dc.relation.citationTitleProceedings - IEEE INFOCOM
dc.relation.ispartofProceedings - IEEE INFOCOM, ISSN:00001983, 00002011,(2017)spa
dc.relation.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85034099178&doi=10.1109%2fINFOCOM.2017.8057038&partnerID=40&md5=9160e95ccdd9351c287acea4e1a1f88espa
dc.rights.accesRightsinfo:eu-repo/semantics/openAccess
dc.rights.accesoAbierto (Texto Completo)spa
dc.source.instnameinstname:Universidad del Rosariospa
dc.source.reponamereponame:Repositorio Institucional EdocURspa
dc.subject.keywordApproximation theoryspa
dc.subject.keywordEconomic and social effectsspa
dc.subject.keywordStochastic modelsspa
dc.subject.keywordStochastic systemsspa
dc.subject.keywordData analyticsspa
dc.subject.keywordData platformspa
dc.subject.keywordJob scheduling policiesspa
dc.subject.keywordMap-reducespa
dc.subject.keywordMatrix analytic methodsspa
dc.subject.keywordOptimal approximationspa
dc.subject.keywordPerformance Gainspa
dc.subject.keywordWide spectrumspa
dc.subject.keywordBig dataspa
dc.titleOn the latency-accuracy tradeoff in approximate MapReduce jobsspa
dc.typeconferenceObjecteng
dc.type.hasVersioninfo:eu-repo/semantics/publishedVersion
dc.type.spaDocumento de conferenciaspa
Archivos
Colecciones