Loading [MathJax]/extensions/MathMenu.js
Modeling randomized data streams in caching, data processing, and crawling applications | IEEE Conference Publication | IEEE Xplore

Modeling randomized data streams in caching, data processing, and crawling applications


Abstract:

Many BigData applications (e.g., MapReduce, web caching, search in large graphs) process streams of random key-value records that follow highly skewed frequency distribut...Show More

Abstract:

Many BigData applications (e.g., MapReduce, web caching, search in large graphs) process streams of random key-value records that follow highly skewed frequency distributions. In this work, we first develop stochastic models for the probability to encounter unique keys during exploration of such streams and their growth rate over time. We then apply these models to the analysis of LRU caching, MapReduce overhead, and various crawl properties (e.g., node-degree bias, frontier size) in random graphs.
Date of Conference: 26 April 2015 - 01 May 2015
Date Added to IEEE Xplore: 24 August 2015
Electronic ISBN:978-1-4799-8381-0
Print ISSN: 0743-166X
Conference Location: Hong Kong, China

Contact IEEE to Subscribe

References

References is not available for this document.