Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Accelerating Time to Results KC ZHANG Panasas Technical and Business Development Manager [email protected] Leader in Parallel Storage Systems Agenda Panasas introduction Customer successes Panasas solutions Panasas Availability Slide 2 Panasas, Inc. Panasas Founded by Garth Gibson in 1999. First Customer Ship in 2003 The fastest supercomputer in the world runs Panasas Primary Investors: HQ – Silicon Valley Market Focus: o o o o o o Energy Academia Government Life Sciences Manufacturing Finance Technologies: parallel file system and parallel storage appliance World wide support with over 25 global resellers Slide 3 Panasas, Inc. Partnering to meet customer needs Application ISVs Resellers Standards Development Slide 4 Panasas, Inc. Recognized Product Innovation and Excellence NAS Magic Quadrant “Visionary” Best HPC Storage Product Top 5 Vendors to Watch in 2009 Top Collaboration Between Government and Industry Roadrunner, Top Supercomputing Top Supercomputing Achievement Roadrunner, Los Alamos National Laboratory Achievement Roadrunner, Los Alamos National Laboratory 8 Panasas Customers Win HPCWire Awards in 2008! 6 Panasas Customers Win HPCWire Awards in 2007! 10 Disruptive New Storage Technologies Promise Big Changes Slide 5 Panasas, Inc. Panasas Powers RoadRunner Slide 6 Panasas, Inc. RoadRunner at a Glance Slide 7 Panasas, Inc. Petascale Red Infrastructure Diagram with Roadrunner Accelerated FY08 NFS and other network services, WAN Secure Core switches Nx10GE NxGE Nx10GE Archive I B 4 X Compute Unit FTA’s 10GE Roadruner Phase 3 1.026 PF F a t T r e e IONODES Site wide Shared Global Parallel File System (Panasas) Compute Unit 4 GE per 5-8 TB 10GE IONODES IO Unit M y r i n e t CU Roadrunner Phase 1 70TF CU Lightning/Bolt 35 TF Scalable to 600 GB/sec before adding Lanes Slide 8 1GE IONODES IO Unit M y r i n e t CU Panasas, Inc. Leaders in HPC choose Panasas ENERGY SWIFTCOMPANY Slide 9 Panasas, Inc. The Common Themes A. Very complex problems and simulations B. Very large number of files being used concurrently C. Very large number of concurrent users/servers D. Consolidating Users and Clusters on one storage system E. Any or all of the above Panasas solves the most difficult storage problems while delivering very high reliability in an easy to use appliance-like package. Slide 10 Panasas, Inc. Breaking Through the Bottleneck Clusters = Parallel Compute Parallel Compute needs Parallel IO Linux Compute Cluster Linux Compute Cluster Issues Complex Scaling Limited BW & I/O Islands of storage Inflexible Expensive Single data path to storage Monolithic Storage (NFS servers) Slide 11 Benefits Linear Scaling Extreme BW & I/O Single storage pool Ease of Mgmt Lower Cost Parallel data paths Panasas Parallel Storage Clusters Panasas, Inc. What is Parallel Storage? The architecture for scale-out file storage Clustered NFS NFS File Server File Server File Server NAS: Clustered Storage: Network Attached Storage Multiple NAS file servers managed as one. Good aggregate performance. Slide 12 Parallel NFS Parallel Clustered Storage: File server not in data path. Performance bottleneck eliminated. Panasas, Inc. Panasas Storage Cluster: Built on Industry-Standard Components Integrated 10GE Switch Battery Module (2 Power units) Shelf Front 1 DB, 10 SB Shelf Rear StorageBlade DirectorBlade Midplane routes GE, power Slide 13 Panasas, Inc. Performance and Scaling DirectFLOW client o Standard installable file system o Supports all common Linux flavors o Support up to 12K clients Panasas DirectFLOW® data path DirectorBlade cluster o Divides namespace into virtual volumes o Allows metadata to scale (no bottleneck) Demonstrated scalable performance o Slide 14 30+ GB/sec of sustained throughput from a single filesystem Panasas, Inc. Scalable NAS - NFS/CIFS Scalable NFS/CIFS server o o o o Load automatically distributed across scalable DirectorBlade modules Scale to satisfy growing number of clients Any DirectorBlade module can access any file Slide in a new DB, instantly get more NFS ops/sec into the same data Access same data from any protocol o o Slide 15 Integrates non-Linux devices into system 2+9 configuration typically best for NFS. Balances CPU ops/sec with disk ops/sec Panasas, Inc. Total Time in Hours to complete the job 400 Data Set 350 • 23 Million Traces Hours 300 • 139GB input dataset 250 • 234GB output depth migrated image gathers 200 150 • 247MB per depth slice, 970 depth slices 100 50 0 Panasas Other Vendor A Other Vendor B Throughput of Reads & Writes (MB/sec) 60 Data Set 50 MB / SEC • 23 Million Traces • 139GB input dataset 40 • 234GB output depth migrated image gathers 30 20 • 247MB per depth slice, 970 depth slices 10 0 Panasas Chart Legend Other Vendor A Read Rate Write Rate Other Vendor B Aggregate Throughput for 24 Nodes 1400 Data Set • 23 Million Traces 1200 • 139GB input dataset MB / SEC 1000 • 234GB output depth migrated image gathers 800 600 • 247MB per depth slice, 970 depth slices 400 200 Chart Legend Aggregate Read Throughput Aggregate Write Throughput 0 Panasas Other Vendor A Other Vendor B Job Time Activity Panasas Other Vendor B Other Vendor A Data Set • 23 Million Traces Chart Legend • 139GB input dataset Processor Waiting on Data • 234GB output depth migrated image gathers Computation • 247MB per depth slice, 970 depth slices ActiveScale Operating System DirectFLOW® Protocol o Provides parallel data paths for maximum performance PanFS™ Parallel File System o o o Distributed and parallel file system Block management hidden behind object storage interface File management distributed across metadata managers Designed to be managed by non-storage professionals ActiveScan Predictive Media Management o o Continuous sweeps of all data and disk media in the StorageBlade If discrepancies are detected the system proactively corrects the media defects Predictive Disk Management o Anticipates disk problems with automated, predictive failure analysis; data is moved prior to failure, to avoid reconstruction Real-time monitoring of client load generation o Slide 20 Identify performance bottlenecks among storage users Panasas, Inc. Horizontal Parity: Panasas ObjectRAID Parity calculated and written to disk(s) o Any failed disk can be reconstructed from the remaining disks Panasas ObjectRAID is faster o Uses multiple RAID controllers to run in parallel (“Parallel Reconstruction”) Panasas ObjectRAID is more efficient o Reconstructs only user data versus every sector on disk 800GB Blade reconstructed in 31 minutes at Los Alamos National Laboratory! Horizontal Parity Slide 21 Panasas, Inc. Unique: Vertical Parity Solves media error problem regardless of drive density “RAID” within an individual drive Improves on internal ECC capabilities Independent of horizontal arraybased parity schemes Vertical Parity Seamless recovery from media errors by applying RAID schemes across disk sectors Vertical Parity Horizontal Parity Slide 22 Panasas, Inc. Unique: Network Parity Extends parity capability across the data path to the client or server node Enables end-to-end data integrity validation o Protects from errors introduced by disks, firmware, server hardware, server software, network components and transmission o Client either receives valid data or an error notification Network Parity Vertical Parity Horizontal Parity Slide 23 Panasas, Inc. Manageability: Single Global Namespace Panasas removes artificial, physical and logical boundaries o Eliminates need to maintain mount scripts or move data Cluster 1 Cluster 3 Cluster 1 Cluster 3 Cluster 2 Cluster 2 Single Global Namespace Archived Files Cluster 1 Results Cluster 2 Results Cluster 3 Results Traditional Storage Networks Slide 24 Panasas Storage Cluster Panasas, Inc. Automatic provisioning for easy growth Online Provisioning o o Configure One DirectorBlade and all others obtain their configuration via DHCP on private port New Storage is seamlessly integrated into the system DHCP on Private Port Reading Config Setting IP Addrs Matching Versions Growth without limitations o Terabytes to Petabytes o Single seamless namespace Single Seamless Namespace! Slide 25 Panasas, Inc. Manageability: Automatic RAID configuration Per File RAID o Small File RAID Layout is an Attribute Stored within the Object System assigns RAID level based on file size o < 64 KB RAID 1 for efficient space allocation o > 64 KB RAID 5 for optimum system performance RAID 1 Mirroring Large File Automatic transition from RAID 1 to 5 o No re-striping RAID 5 Striping Two level RAID MAP, Stripe width and depth o Automatically optimizes stripe size Enables optimum system growth and reconstruction Slide 26 Panasas, Inc. Manageability: Dynamic Load Balancing 1 StorageBlade Capacity 2 StorageBlade Performance 3 DirectorBlade Performance Biases new data objects to new blades Dynamically moves data objects from filled blades as needed Data objects striped broadly for performance Dynamically moves objects from “hot” blades Slide 27 Cluster design assigns new clients to least utilized DirectorBlades Panasas, Inc. Proven Panasas Scalability Storage Cluster Sizes Today (e.g.) Slide 28 o Boeing, 50 DirectorBlades, 500 StorageBlades in one system. (plus 25 DirectorBlades and 250 StorageBlades each in two other smaller systems.) o LANL RoadRunner.100 DirectorBlades, 1000 StorageBlades in one system today, planning to increase to 144 shelves next year. o Intel has 5,000 active DF clients against 10-shelf systems, with even more clients mounting DirectorBlades via NFS. Release 3.2 will allow them to deploy up to 12,000 clients against a single system. o BP uses 200 StorageBlade storage pools as their building block o Most customers run systems in the 100 to 200 blade size range Panasas, Inc. Fast Deployment Panasas Appliance Model o Deploy solutions in hours and days vs. weeks and months o Ireland's most powerful computer (#117 in the world) was installed in three hours and powered up in just one day, thanks to a rapidly deployable computing platform from Silicon Graphics and Panasas. http://biz.yahoo.com/prnews/090205/sf67219.html?.v=1 Slide 29 Panasas, Inc. ActiveScale 3.2 Released Sept 2008 Performance 10 GE switch => 50% improvement in shelf performance Multi-core client performance tuning Infiniband connectivity RAID-10 volumes to optimize N-1 workloads Reliability Complete HA feature set with addition of NFS/CIFS Fail over Industry leading data integrity with Vertical Parity and Network Parity Manageability Snapshots NDMP support for easy backups Slide 30 Panasas, Inc. Summary Parallel storage provides high performance for faster survey turnaround and more complex algorithms o 10s of GB/s in production seismic processing data centers o 50% performance increase per shelf with 10Gb Ethernet Scalability to support more complex data acquisition and larger clusters o Deployed on a single shelf on survey vessels o 12,000 core clusters in production today o 4PB+ systems in production today Proven across the E&P industry o All major ISVs: Landmark, Paradigm, Schlumberger o Operating on 6 continents for Service Cos., NOCs, Majors and Independents Panasas is proven to cost effectively increase processing throughput! Slide 31 Panasas, Inc. For more information, call Panasas at: Thank You 1-888-PANASAS (US & Canada) 00 (800) PANASAS2 (UK & France) 00 (800) 787-702 张克诚 (Italy) +001 (510) 608-7790 (All Other Countries) Slide 32 13701026265 Panasas, Inc.