Web Caching Evaluation from Wikipedia Request Statistics

Title: Web Caching Evaluation from Wikipedia Request Statistics
Publication Type: Conference Paper
Year of Publication: 2017
Authors: Hasslinger, G., Kunbaz, M., Hasslinger, F., Bauschert, T.
Conference Name: 2017 15th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt)
Keywords: access time reduction, cache storage, Conferences, Electronic publishing, Encyclopedias, hit rate, Internet, Metrics, pubcrawl, resilience, Resiliency, Scalability, Servers, simulation, statistical analysis, Web Caching, Web caching evaluation, web caching strategies, Web sites, Wikipedia caches, Wikipedia daily top-1000 statistics, Wikipedia pages, Wikipedia request statistics, wireless networks, Zipf distributed requests
Abstract

Wikipedia is one of the most popular information platforms on the Internet. The user access pattern to Wikipedia pages depends on their relevance in the current worldwide social discourse. We use publicly available statistics about the top-1000 most popular pages on each day to estimate the efficiency of caches in support of the platform. While the data volumes are moderate, the main goal of Wikipedia caches is to reduce access times for page views and edits. We study the impact of the most popular pages on the achievable cache hit rate in comparison to Zipf request distributions, and we include daily dynamics in popularity.
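
As a rough illustration of the Zipf baseline mentioned in the abstract, the sketch below computes the hit rate of a static cache that holds the k most popular objects when requests are independent and the object of rank r is requested with probability proportional to r^(-beta). It is not taken from the paper: the function name `zipf_top_k_hit_rate`, the catalogue size of one million objects, and the exponent 0.8 are assumptions chosen only for illustration.

```python
def zipf_top_k_hit_rate(n_items: int, cache_size: int, beta: float) -> float:
    """Hit rate of a static cache holding the `cache_size` most popular
    of `n_items` objects under Zipf-distributed, independent requests.

    The request probability of the object with popularity rank r is
    assumed to be proportional to r ** (-beta).
    """
    if not 0 <= cache_size <= n_items:
        raise ValueError("cache_size must be between 0 and n_items")
    # Unnormalised Zipf weights for ranks 1..n_items.
    weights = [rank ** -beta for rank in range(1, n_items + 1)]
    # Fraction of all requests that address the cached top-k objects.
    return sum(weights[:cache_size]) / sum(weights)


if __name__ == "__main__":
    # Hypothetical catalogue size and exponent, for illustration only.
    for k in (10, 100, 1000):
        print(k, round(zipf_top_k_hit_rate(1_000_000, k, 0.8), 4))
```

Under these assumptions the result is an upper bound for any cache of size k when popularity is static. Daily changes in popularity, which the abstract highlights, are not modelled here.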

URL: http://ieeexplore.ieee.org/document/7959873/
DOI: 10.23919/WIOPT.2017.7959873
Citation Key: hasslinger_web_2017