Survey
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project
High-Availability Linux Project http://linux-ha.org/ (Last updated 24 Septemver, 2003) 30th December, 2003 Seo Dong Mahn Contents High Availability level SPOF general system topology takeover High-availability Linux Project Goals Products of Linux-HA Components What Linux-HA can do now Linux-HA task list Related project Friends of Linux-HA Conclusion Kangwon National Univ. -2- Software & System Lab High Availability Availability service time usage for purpose High Availability keep alive application from fault of CPU, network, storage system recovery Kangwon National Univ. -3- Software & System Lab High Availability - Level 100% Fault Tolerance using professional hardware 99.999% 99.99% High Availability 99.5% using professional HA solution to improve system availability 99.0% Enhanced Availability 98.0% improve availability via RAID, redundancy power supply 96.6% Basic Availability 95.0% Kangwon National Univ. simple data backup from server -4- Software & System Lab High Availability - SPOF 1 hardware – 40% software – 30% human – 20% Kangwon National Univ. A p p lic a tio n F a ilu re 40 % H ard w are / O S P o w er N etw o rk D isaster -5- U n tested C han g e M an ag em en t O verlo ad ed W eak P ro b lem D etectio n O th e r 20 % O p e ra to r E rro r 40 % F o rg et so m ethin g Lack o f P ro ced ures B ackup E rro rs S ecurity Software & System Lab High Availability - SPOF 2 Power Power Source Source Human Human Operator Operator Client Client Application Application Client Machine Internal Disk Network Network Interface Interface Card Card Disk Disk Controller Controller CPU CPU Power Power Supply Supply Operating Operating System System Network Human Human Operator Operator Server Server Application Application Database Database Engine Engine Server Machine Network Network Interface Interface Card Card Power Power Supply Supply CPU CPU External Disk External Disk Disk Disk Controller Controller External Disk Power Power Supply Supply Operating Operating System System Kangwon National Univ. Disk Enclosure Power Power Source Source -6- Software & System Lab High Availability - general system topology 1 Kangwon National Univ. -7- Software & System Lab High Availability - general system topology 2 Kangwon National Univ. -8- Software & System Lab High Availability - takeover basic Agent CPU usage, NIC, File system status, Disk usage, Daemons behavior, Network behavior, Process behavior, Swap Space usage custom Agent Process, Environment Variable, Program error option Agent Oracle, Infomix, Sybase … Kangwon National Univ. -9- Software & System Lab High-availability Linux Project - Goals basic goal “Provide a high-availability (clustering) solution for linux which promotes reliability, availability, and serviceability (RAS) through a community development effort.” high availability system Heartbeat, Fake, Heart file system for high availability high availability RAID, GFS, LFS, CODA monitoring for high availability Mon, PIKT, NOCOL/SPINS Kangwon National Univ. -10- Software & System Lab High-availability Linux Project - Products of Linux-HA 1 main Fake check_root ARP spoofing service, standby YES source_rc .fakerc 파일에 설정된 환경 파일의 내용들을 변수이름에 설정한다. parse_arguments clear_remote_arp_c ache 명령행에 입력된 take-over ip 파 일을 확인 하고 remove가 설정되 어 있는지 확인한다. 라우터의 캐시 내용을 지운다. fake remove? YES 본래 호스트의 MAC주소와 ip매핑 NO 자신의 MAC주소와 take-over한 ip값을 매 핑 while(1) fake_on Call Send_arp() fake_off while FOREIGN_ARP >= 0 sleep fo ARP_DELAY shell if FOREIGN_ARP <= 0 Kangwon National Univ. -11- Software & System Lab High-availability Linux Project - Products of Linux-HA 2 Heartbeat function system check, IP takeover eth0 transfer eth0 TCP, UDP, Serial, PPP/UDP LAN(Ethernet, etc) check timeout, cross checking, voting Resource Manager (Nullmodem Cable) IP address, NIC, file system, disk, application … Daemon Control process, Status process, hb channel read process, hb channel write process ttys0 ttys0 eth0 eth0 LAN(Ethernet, etc) Kangwon National Univ. -12- Software & System Lab High-availability Linux Project - Products of Linux-HA 3 fo rk child p aren t P R O C _C O N TR O L fo rk child p aren t P R O C _M S T_S TA TU S m ed ia 1 fo rk child T_S TA TU S 2 3 p aren t P R O C _H B W R ITE m ed ia child p aren t [] p aren t pe pi child fo rk w m ed ia fo rk P R O C _H B R E A D exec() m ach_ dow n R e so u rc e M an ag er sta tu s heartb eat- fifo status_p ip e d in g w rite Kangwon National Univ. -13- Software & System Lab High-availability Linux Project - Products of Linux-HA 4 Mon 각 객체에 대한 정의 작업 monitoring, scheduler function 구성 파일 읽어들임 Monitors 모니터링에 필요한 서비스 세팅 service status check & report independent execution 데몬 프로세스 생성 Alerts 필요에 따라 클라이언트 연결 if service is stopped, send warning message independent execution 모니터링 작업 수행 Client/Server Kangwon National Univ. 무한루프 서비스 중단 발견시 경고작업 수행 -14- Software & System Lab High-availability Linux Project - Components Membership services Communication Services Cluster management Resource (I/O) fencing Resource Monitoring Storage Sharing/Replication Kangwon National Univ. -15- Software & System Lab High-availability Linux Project - What Linux-HA can do now Web servers LVS director servers Mail servers Database servers Firewalls File servers DNS servers DHCP servers Proxy Caching servers etc. Kangwon National Univ. -16- Software & System Lab High-availability Linux Project - Linux-HA task list http://www.linux-ha.org/HATodo.html Kangwon National Univ. -17- Software & System Lab High-availability Linux Project - Related project LAN Mirroring technologies DRBD NBD server software ENBD Volume Mangers LVM, EVMS File system technologies Open GFS, Reiserfs4, Ext3, JFS, XFS, FENRIS, M2CS, Intermezzo, Lustre, LinLogFS Other Linux-HA Management Software Linux FailSafe, Kimberlite, Ultramonkey, VRRP, Failoverd, Heart, Fake Monitoring and Administration Software lm-sensors, SAF-TE, RAS, RAM ECC monitoring, Mon, OpenNMS, Spumoni, Monit, VACM, PIKT, NOCOL/SNIPS, Nagios, MAT, WebRAT, Keepalived Etc. LVS, DLM, SSI Cluster for Linux, Life, HA RAID, OpenMosix, CODA, LNAT, LSP, OSCAR, Amata, Beowulf, Kangwon National Univ. -18- Software & System Lab High-availability Linux Project - Friends of Linux-HA IBM SuSE SGI MSC Linux Conectiva Tummy.com Emageon Intel Kangwon National Univ. -19- Software & System Lab Conclusion High Availability High-availability Linux Project “Provide a high-availability (clustering) solution for linux which promotes reliability, availability, and serviceability (RAS) through a community development effort.” Kangwon National Univ. -20- Software & System Lab