A Request Distribution Algorithm for Web Server Cluster
Abstract
With the explosively increasing of web-based applications’ workloads, Web server cluster encounters challenge in response time for requests. Request distribution among servers in web server cluster is the key to address such challenge, especially under heavy workloads. In this paper, we propose a new request distribution algorithm named llac (least load active cache) for load balancing switch in web server cluster. The goal of llac is to improve the cache hit rate and reduce response time. Packets are parsed in IP level, and back-end servers are notified to cache hot files using link change technology, neither changing URL information nor modifying the service program. This avoids switching overhead between user mode and kernel mode. The load balancing switch directly creates connection with the selected server, avoiding migrating connection overhead. This policy estimates the current composited load of each server and selects the server with the least load to serve the request. It also improves the resource utilization of web servers. Experimental results show that llac achieves better performance for web applications than wrr (weight round robin) which is a popular request distribution.
Keywords
References
A. Chandra, P. Pradhan, R. Tewari, S. Sahu, P. Shenoy. "An observation-based approach towards self-managing web servers", Computer Communications, 2006, pp1174-1188.
V. Cardellini, E. Casalicchio, M. Colajanni, S. Tucci, "Mechanisms for quality of service in web clusters", Computer Networks, vol.37, No.6, 2001, pp761-771.
http://dx.doi.org/10.1016/S1389-1286(01)00252-3
M.E. Crovella, A. Bestavros. "Self-Similarity in World Wide Web Traffic: Evidence and Possible Causes", IEEE/ACM Transactions on Networking, vol.5, No.6, 1997, pp835-846.
http://dx.doi.org/10.1109/90.650143
V. Cardellini, E. Casalicchio, M. Colajanni, and P.S. Yu. "The State of the Art in Locally Distributed Web-Server Systems", ACM Computing Surveys, vol.34, No.2, 2002, pp 263-311.
http://dx.doi.org/10.1145/508352.508355
M. Andreolini, E. Casalicchio. "A cluster-based web system providing differentiated and guaranteed services", Cluster Computing, vol.7, No.1, 2004, pp7-19.
http://dx.doi.org/10.1023/B:CLUS.0000003940.34740.be
E. Choi. "Performance test and analysis for an adaptive load balancing mechanism on distributed server cluster systems", Future Generation Computer Systems, No.20, 2004, pp 237-247.
http://dx.doi.org/10.1016/S0167-739X(03)00138-9
V.S. Pail, M. Aront, G. Bangat. "Locality-Aware Request Distribution in Cluster-based Network Servers", ACM SIGOPS Operating Systems Review, USA:ACM, 1998, pp205-216.
L. Cherkasova, M. Karlsson. "Scalable Web Server Cluster Design with Workload-Aware Request Distribution Strategy WARD", Advanced Issues of E-Commerce and Web-Based Information Systems, Washington:IEEE Computer Society, 2001, pp212-221.
E. Casalicchio, M. Colajanni. "A client-aware dispatching algorithm for Web clusters providing multiple services", The International World Wide Web Conference Committee (IW3C2), 2001, pp535-544.
M. Andreolini, S. Casolari, Michele Colajanni. "Models and framework for supporting runtime decisions in Web-based systems", ACM Transactions on the Web (TWEB), vol.2, No.3, 2008, pp1-43.
http://dx.doi.org/10.1145/1377488.1377491
CHRISTIAN BENVENUTI. Understanding LINUX NETWORK INTERNALS. 2006. http://linux.softpedia.com/get/System/Benchmarks/Web-bench-1378.shtml
M.L. Chiang, Y.C. Lin, L.F. Guo. "Design and implementation of an efficient web cluster with content-based request distribution and file caching", Journal of Systems and Software, vol.81, No.11, 2008, pp 2044-2058
http://dx.doi.org/10.1016/j.jss.2008.02.069
S. Sharifian, S.A. Motamedi, M.K. Akbari. "A content-based load balancing algorithm with admission control for cluster web servers", Future Generation Computer Systems, vol.24, No.8, 2008, pp775-787.
http://dx.doi.org/10.1016/j.future.2008.03.005
M.L. Chiang, C.H. Wu, Y.J. Liao, Y.F. Chen. "New Content-aware Request Distribution Policies in Web Clusters Providing Multiple Services", Proceedings of the 2009 ACM symposium on Applied Computing, USA:ACM, 2009, pp79-83.
Z.Y. Xu, J.Z. Han, L. Bhuyan. "Scalable and Decentralized Content-Aware Dispatching in Web Clusters", IEEE International Performance, Computing, and Communications, Washington:IEEE Computer Society, 2007, pp202-209.
http://dx.doi.org/10.1109/PCCC.2007.358896
Y.K. Chang. "Fully Pre-Splicing TCP for Web Switches", Proceedings of the First International Conference on Innovative Computing, Information and Control, Washington:IEEE Computer Society, 2006, pp737-740.
S. Chase, D.C. Anderson. "Managing energy and server resources in hosting centers", In Proc. of the eighteenth ACM symposium on Operating systems principles, 2001, pp103-116.
Tarek F. Abdelzaher, Kang G. Shin, and Nina Bhatti. "Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach", IEEE Transactions on Parallel and Distributed Systems, June 2001.
Yasushi Saito, Brian N. Bershad, and Henry M. Levy. "An approximation-based load-balancing algorithm with admission control for cluster web servers with dynamic workloads", Journal of Supercomputing, vol.53, No.3, 2010, pp 440-463.
http://dx.doi.org/10.1007/s11227-009-0303-8
Full Text: PDF


