Download Collective Communications

Enhanced Replica Selection Technique for Binding Replica Sites in Data Grids Authors: Mahdi S. Al-Mhanna, Rafah M. Almuttairi University of Babylon, Babylon, Iraq University of Babylon Outlines What is the DataGrid? Replication Overview. Replica Optimizer. What is Association Rules? New Replica Selection Technique for Binding Cheapest Replica Sites in Data Grids. Comparison with traditional methods. Related Work. Conclusion. University of Babylon What is Data Grid? University of Babylon What is Data Grid ?  An infrastructure that enables replicating the required data sets in different distributed sites and then selecting the best replica matching user’s requirements. Data Grid job: It contains a set of files that must be available at the time before or during execution time. Job 1: File 1; File10; File 11; File12; ……… University of Babylon Why do we need it? Worldwide LHC Computing Grid The Large Hadron Collider will produce roughly 15 petabytes (15 million gigabytes) of data annually – enough to fill more than 1.7 million dual-layer DVDs a year! 1 Megabyte (1MB) A digital photo 1 Gigabyte (1GB) = 1000MB A DVD movie 1 Terabyte (1TB) = 1000GB World annual book production 1 Petabyte (1PB) = 1000TB Annual production of one LHC experiment 1 Exabyte (1EB) = 1000 PB World annual information production CMS LHCb ATLAS University of Babylon ALICE LHC data Balloon (30 Km) CD stack with 1 year LHC data! (~ 20 Km) LHC data correspond to about 20 million CDs each year! Where will the experiments store all of these data? Concorde (15 Km) Data for LHC: a problem? The Grid: a possible solution! University of Babylon Mt. Blanc (4.8 Km) Replication Overview University of Babylon Data problem solved by, Storing data in different hosts. SVA Compressed Data University of Babylon Replication Overview • Single server will be flooded when the requests increase. • Latency increases with long queue of waited jobs. Resource Broker and Resource Optimizer University of Babylon Replication Strategies!! Replication Strategies!! …Which file I should be used when it is needed?!! Optimizatio n Strategies!! Resource Optimizer University of Babylon The replica optimiser provides: A run time selection service to get the best file replica. User Interface Job1: f1,f2,...,fz Job2: f7,f8,...,fz . Resource Broker Job1: f7,f8,...,fz Job exe. site Replica Manager Replica Optimizer Jobt: f11,f12,...,fz Job2: f1,f2,...,fz Job exe. site Replica Manager Replica Optimizer Get best sites having: f1,f2,...,fz Computing Element Storage Element Computing Element University of Babylon Storage Element Association rules technique University of Babylon Association rules: Pincer-Search algorithm This algorithm is used to combine different associated sites, having uncongested links at the time of the file(s) transfer. Preliminary Concepts  Association rules: association rules are mostly used in mining transaction data. Crucial terms in association rules terminology are:  Item (in DM terminology corresponds to attribute-value pair).  Transaction (a set of items; corresponds to example).  A Set (data set) of transactions containing more different items. University of Babylon Efficient Set algorithm University of Babylon Get the first Job from job queue Get all related LFN from job description file Get the physical adds of replicas’ sites from RLS Using Ipref, Probing info. are stored in NHD Calculate the score and update the TABLE Apply EST to get Ln which is list of associated sites Use (GridFTP) transfer files from efficient set of sites. Get the next Job from job queue Yes Do we have history for current job in NHD? Figure 4. Execution flows using ESM No Experiments University of Babylon Study case: Let us assume there are eight sites having a requested file.  Ipref, network service is used to get RTT report  RLS, to collect physical names of replicas PRAGMA Grid, 35 institutions in 25 regions University of Babylon University of Babylon 1- Network History File NHF/Using Ipref RTT/STT 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 S1 60 65 303 305 300 313 310 307 310 310 310 307 310 50 316 74 S2 176 211 175 213 210 176 175 216 217 211 175 176 211 175 214 209 S3 202 209 298 203 223 207 298 260 260 224 204 205 227 202 260 206 S4 336 343 338 273 334 335 273 271 342 339 272 271 344 270 336 341 S5 78.7 76.5 64.8 92.6 95 66.7 94.2 94.4 95.2 66.3 92.8 69 92.5 66.3 63.4 94.3 University of Babylon S6 256 256 258 255 292 298 255 256 289 257 262 256 299 299 296 287 S7 76.4 86.8 57.2 85.1 55 55.3 85.8 60 90.1 90.3 91.4 88.7 64.3 90.5 57.8 56.4 S8 213 202 205 202 212 212 202 212 212 212 202 210 212 216 224 222 1. RTT: Round Trip Time 2. STT: Single Trip Time 3. Unit: ms 2- Logical History File (LHF) University of Babylon Affected Factors  Factors affect directly on data transfer time.  Bandwidth  Latency of links Factors affect the performance data transfer.  CPU  I/O slightly University of Babylon Cont… According to the first three factors, in our algorithm, a score function is defined as the following: Scorei = piBW× wiBW+piCPU× wiCPU + piI/O × wiI/O Sum = ∑ Scorej , 1 < J < M AVG = sum / M , M= Number of sites If scorei ≥ AVG then Scorei=1 otherwise Scorei= 0 University of Babylon Cont… Where, - Scorei. the replica selection cost model for server i. - piBW. percentage of Bandwidth available from server i to the computing site. - wiBW. network and Bandwidth ratio of replica server i defined by the administrator of the Data Grid organization piCPU. percentage CPU idle states of replica server i - wiCPU. CPU load ratio of server i defined by the administrator. - piI/O. percentage of memory free space of server i - wiI/O. memory free space ratio defined by the administrator. University of Babylon Rules • "The rule": Rule #1: if Site(s) S4, S7 are selected then this implies that site(s) S3 can also be selected at the same time. This rule has 100% confidence. I(Rule1)= S4, S7  S3 Support(S4,S7,S3)/Support(S4,S7)*Support(S3)= 1.109 The confidence of this rule is 100%. The requested files will be sent by S3 , S4, S7 To compute the correlation of this rule and see how far it is better than choosing the site randomly, we use an Improvement equation. University of Babylon EST vs Traditional Model University of Babylon EST vs Traditional Model Computing Site (CS) is (S0) RS={S14 ,S1, S3 } . Red stars referring to congested routers. Traditional selection S14 less number of Hops EST, (S3 ) uncongested links. Data Grid and theirUniversity associated of Babylonnetwork geometry Comparison with Traditional selection strategy Traditional selection strategy and EST University of Babylon Conclusion University of Babylon Conclusion  Efficient Selection Technique (EST) is a dynamic replica selection strategy.  Aims to adapt at run-time its criteria to flexible QoS binding contracts specified by the service provider and/or the client.  The concept of association rules of data mining approach is used.  To reduce the searching space, response time and network resources consumed. University of Babylon Related work  During 2001-2003, Sudharshan et al. In their work they used the history of previous file transfer information to predict the best site holding a copy of the requested file.  R. Kavitha, and I. Foster, in 2001 they used Network Bandwidth and link’s latency. Corina and M. Mesaac in 2003 and Ceryen and M. Kevin 2005 used different algorithms such as greedy, random, partitioned and weight algorithms in the selection engine. Their selection methods were with respect to some static metric such as the geographical distance in miles and the topological distance in number of hops.  Rashedur et al. in 2005 exploited a replica selection technique with the K-Nearest Neighbor (KNN) rule used to select the best replica from the information gathered locally.  Rashedur et al., In 2008 proposed a Neural Network predictive technique (NN based) to estimate the transfer time between sites.  A. Jaradat et al., In 2009, proposed a new approach that utilizes availability, security and time as selection criteria between different replicas, by adopting k-means clustering algorithm concepts to create a balanced (best) solution.  Omer Aljadaan in 2010, used a Genetic algorithm for clustering replicas and get best replica matching QoS of user’s requirements. Most of these methods have a limitations and some of them have a drawback and others work under a specific conditions. University of Babylon References • • • • • • • • • • • • • M. Rashedur Rahman, Ken Barker, Reda Alhajj, "Replica selection in grid environment:a data-mining approach", Distributed systems and grid computing (DSGC),pp: 695 – 700 , 2005 J. Gwertzman and M. Seltzer, "The case for geographical push cashing", In Proceeding of the 5th Workshop on Hot ZTopic in Operating Systems, 1995. R. Kavitha, I. Foster, "Design and evaluation of replication strategies for a high performance data grid", in, Proceedings of Computing and High Energy and, Nuclear Physics, 2001. S. Vazhkudai, J. Schopf, I. Foster, "Predicting the performance of wide-area data transfers", in: 16th International PDPS, 2002. S. Vazhkudai, J. Schopf, "Using regression techniques to predict large data transfers", in: Computing: Infrastructure and Applications, The International Journal of High Performance Computing Applications, IJHPCA , August, 2003. A. Abbas, Grid Computing:" A Practical Guide to Technology and APPLICATIONS", 2006. http://goc.pragma-grid.net/wiki/index.php/UoHyd. S. Vazhkudai, S Tuecke, I. Foster, "Replica selection in the globus data grid", in: First IEEE/ACM International Conference on Cluster Computing and the Grid, CCGrid 2001. J. Guyton and M. Schwartz."Locating nearby copies of replicated internet servers", In Proceeding of ACM SIGCOMM’ 95, 1995. A. Tirumala, J. Ferguson, Iperf 1.2 - The TCP/UDP Bandwidth Measurement Tool, 2002. R. Wolski, Dynamically forecasting network performance using the Network Weather Service, Cluster Computing (1998). Yunhong Gu, Robert L. Grossman, "UDT: UDP-based data transfer for high-speed wide area networks", Computer Networks, Volume 51, Issue 7, 16 May 2007, Pages 1777 1799. Elsevier. R.M. Rahman, K. Barker, R. Alhajj, "Predicting the performance of GridFTP transfers", in: Proceedings of IEEE Symposium of Parallel and Distributed Systems, 2004, New Mexico, USA, p. 238a. University of Babylon • • • • • • • • • • • • • • • • J. F. Kurose, K.W. Ross, "Computer Networking A Top-Down Approach Featuring the Internet", 3rd edition. S. Venugopal, . R. Buyya,"The Gridbus Toolkit for Service Oriented Grid and Utility Computing: An Overview and Status Report"2004. R. Agrawal, T. Imielinski, A.Swami, "Mining association rules between sets of items in large databases". In: Proc. ACM SIGMOD Intl. Conf. Management Data, 1993 R.M Rahman, K Barker and R Alhajj, "Replica selection strategies in data grid", Journal of Parallel and Distributed Computing, Volume 68, Issue 12, Pages 1561-1574, December 2008. A. Jaradat, R. Salleh and A. Abid, "Imitating K-Means to Enhance Data Selection", Journal of Applied Sciences 9 (19): 3569-3574, 2009, ISSN 1812-5654, Asian Network for Scientific Information-2009. S. Venugopal, . R. Buyya, K. Ramamohanarao, "A taxonomy of Data Grids for distributed data sharing, management, and processing". ACM Comput. Surv. 38, 1 (Jun. 2006), ACM New York, NY, USA http://www.resample.com/xlminer/help/Index.htm A. K Pujari, "Data mining techniques", Hyderabad : Universities Press, 2002. G. Williams, M. Hegland and S. Roberts, "A Data Mining Tutorial", IASTED International Conference on Parallel and Distributed Computing and Networks (PDCN’98) 14 December 1998. T. Ceryen, and M. Kevin, 2005. "Performance characterization of decentralized algorithms for replica selection in dstributed object systems". Proceedngs of 5th International Workshop on Software and Performance, July 11 -14, Palma, de Mallorca, Spain, pp: 257-262. F. Corina, and M. Mesaac, 2003, "A scalable replica selection strategy based on flexible contracts". Proceedings of the 3rd IEEE Workshop on Internet Applications, June 23-24, IEEE Computer Society Washington, DC, USA, pp: 95-99. R. M. almuttari, R. Wankar, A. Negi, C.R. Rao. "Intelligent Replica Selection Strategy for Data Grid", In proceeding of the 10th International conference on Parallel and Distributed Proceeding Techniques and Applications, IEEE Computer Society Washington, DC, WorldComp2010, GCA2010, LasVegas, USA, Volume3, pp: 95-100, July 1215-2010. Cisco Distributed Director, http://www.cisco.com/warp/public/cc/pd/cxsr/dd/index.shtml M. Sayal, Y. Breitbart, P. Scheuermann, R. Vingralek, "Selection algorithms for replicated web servers". In Proceeding of the Workshop on Internet Server Performance,1998. E. Zegura, M. Ammar, Z. Fei, and S. Bhattacharjee, "Application-layer anycasting: a server selection architecture and use in a replicated web service", IEEE/ACM Transactions on Networking, vol. 8, no. 4, pp. 455–466, Aug. 2000. http://www-wanmon.slac.stanford.edu/cgi-wrap/pingtable.pl?from=DZ.ARN.N3&to=WORLD University of Babylon ? ???? [email protected] [email protected]

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Download Collective Communications