High-Availability Linux Project
http://linux-ha.org/
(Last updated 24 September, 2003)
30th December, 2003
Seo Dong Mahn
Contents
High Availability
level
SPOF
general system topology
takeover
High-availability Linux Project
Goals
Products of Linux-HA
Components
What Linux-HA can do now
Linux-HA task list
Related project
Friends of Linux-HA
Conclusion
Kangwon National Univ.
Software & System Lab
High Availability
Availability
the proportion of time a service is usable for its intended purpose
High Availability
keep applications running despite faults in the CPU, network, or storage
system recovery
High Availability
- Level
Availability scale, from highest to lowest: 100%, 99.999%, 99.99%, 99.5%, 99.0%, 98.0%, 96.6%, 95.0%
Levels, from the top of the scale down:
Fault Tolerance
using specialized hardware
High Availability
using a dedicated HA solution to improve system availability
Enhanced Availability
improve availability via RAID and redundant power supplies
Basic Availability
simple data backup from the server
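To make the availability scale concrete, the following minimal sketch converts each percentage on the scale into the downtime it allows per year. It assumes a 365-day year, and the function name is illustrative rather than taken from the slides.

```python
HOURS_PER_YEAR = 365 * 24  # assumption: non-leap year


def downtime_hours_per_year(availability_percent: float) -> float:
    """Return the yearly downtime implied by an availability percentage."""
    return HOURS_PER_YEAR * (1.0 - availability_percent / 100.0)


if __name__ == "__main__":
    # Percentages taken from the availability scale on the slide.
    for level in (100.0, 99.999, 99.99, 99.5, 99.0, 98.0, 96.6, 95.0):
        hours = downtime_hours_per_year(level)
        print(f"{level:7.3f}% -> {hours:8.2f} hours/year")
```

Under this assumption, 99.0% availability allows about 87.6 hours of downtime per year, while 99.999% allows only about 5 minutes.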
High Availability
- SPOF 1
hardware – 40%
software – 30%
human – 20%
Chart: causes of failure
Application Failure (40%): Untested, Change Management, Overloaded, Weak Problem Detection
Operator Error (40%): Forget something, Lack of Procedures, Backup Errors, Security
Other (20%): Hardware/OS, Power, Network, Disaster
High Availability
- SPOF 2
Figure: single points of failure on the path from client to server storage
Power Source
Client Machine: Human Operator, Client Application, Internal Disk, Network Interface Card, Disk Controller, CPU, Power Supply, Operating System
Network
Server Machine: Human Operator, Server Application, Database Engine, Network Interface Card, Disk Controller, CPU, Power Supply, Operating System
Disk Enclosure: External Disks, Power Supply
Power Source
High Availability
- general system topology 1 (figure)
High Availability
- general system topology 2 (figure)
High Availability
- takeover
basic Agent
CPU usage, NIC, file system status, disk usage, daemon behavior, network behavior, process behavior, swap space usage
custom Agent
process, environment variable, program error
option Agent
Oracle, Informix, Sybase …
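A basic agent of the kind listed above can be sketched as a set of threshold checks whose failures trigger a takeover. The metric names, the limit values, and the policy that any failed check triggers a takeover are all assumptions made for illustration; the slides do not specify them.

```python
from typing import Dict, List

# Hypothetical upper limits (percent) for metrics a basic agent might watch.
DEFAULT_LIMITS: Dict[str, float] = {
    "cpu_usage": 95.0,
    "disk_usage": 90.0,
    "swap_usage": 80.0,
}


def failed_checks(metrics: Dict[str, float],
                  limits: Dict[str, float] = DEFAULT_LIMITS) -> List[str]:
    """Return the names of metrics that are missing or above their limit."""
    failures = []
    for name, limit in limits.items():
        value = metrics.get(name)
        if value is None or value > limit:
            failures.append(name)
    return failures


def should_take_over(metrics: Dict[str, float]) -> bool:
    """Assumed policy: any failed check on the active node triggers takeover."""
    return bool(failed_checks(metrics))
```

A missing metric is treated as a failure here, on the assumption that an agent which cannot report a value should not be considered healthy.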
High-availability Linux Project
- Goals
basic goal
“Provide a high-availability (clustering) solution for Linux
which promotes reliability, availability, and serviceability
(RAS) through a community development effort.”
high availability system
Heartbeat, Fake, Heart
file system for high availability
high availability RAID, GFS, LFS, CODA
monitoring for high availability
Mon, PIKT, NOCOL/SNIPS
High-availability Linux Project
- Products of Linux-HA 1
Fake
ARP spoofing
service, standby
Flowchart of main
check_root: continue if YES
source_rc: sets variables from the contents of the environment file configured in .fakerc
parse_arguments: checks the take-over IP file given on the command line and whether remove is set
clear_remote_arp_cache: clears the router's cache
fake remove?
YES: map the IP to the original host's MAC address
NO: map the taken-over IP to this host's own MAC address
Loop steps, in the order they appear in the flowchart: while(1); fake_on; call Send_arp(); fake_off; while FOREIGN_ARP >= 0; sleep for ARP_DELAY; shell; if FOREIGN_ARP <= 0
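The IP-to-MAC remapping in the flowchart relies on announcing a new address mapping through ARP. The sketch below builds a generic gratuitous ARP reply frame as raw bytes; it is not Fake's own send_arp code, the function name is hypothetical, and actually transmitting the frame (which needs a raw socket and root privileges) is left out.

```python
import socket
import struct


def build_gratuitous_arp(mac: str, ip: str) -> bytes:
    """Build an Ethernet frame carrying an ARP reply that maps ip to mac."""
    hw = bytes.fromhex(mac.replace(":", ""))   # 6-byte hardware address
    addr = socket.inet_aton(ip)                # 4-byte IPv4 address
    broadcast = b"\xff" * 6

    # Ethernet header: destination, source, EtherType 0x0806 (ARP).
    eth_header = broadcast + hw + struct.pack("!H", 0x0806)

    # ARP header: Ethernet (1), IPv4 (0x0800), address lengths 6 and 4,
    # operation 2 (reply).
    arp = struct.pack("!HHBBH", 1, 0x0800, 6, 4, 2)
    # Sender MAC/IP, then target MAC/IP; sender and target IP are the same
    # address, which is what makes the announcement "gratuitous".
    arp += hw + addr + broadcast + addr
    return eth_header + arp


if __name__ == "__main__":
    # Example addresses are placeholders, not values from the slides.
    frame = build_gratuitous_arp("00:11:22:33:44:55", "192.168.0.10")
    print(len(frame), frame.hex())
```

Hosts and routers that receive such a frame typically update their ARP caches, so traffic for the taken-over IP address reaches the new machine.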
High-availability Linux Project
- Products of Linux-HA 2
Heartbeat
function
system check, IP takeover
transfer
TCP, UDP, Serial, PPP/UDP
check
timeout, cross checking, voting
Resource Manager
IP address, NIC, file system, disk, application …
Daemon
Control process, Status process, hb channel read process, hb channel write process
Figure: two nodes linked through their eth0 interfaces over a LAN (Ethernet, etc.) and through their ttys0 serial ports over a null-modem cable
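The timeout check listed above can be sketched as follows: each node records when it last heard from a peer and declares the peer dead after a period of silence. This is a minimal illustration and not Heartbeat's implementation; the class, method, and parameter names are hypothetical, and timestamps are passed in explicitly so the logic is easy to test.

```python
from typing import Dict, List


class HeartbeatMonitor:
    """Tracks the last heartbeat seen from each peer node."""

    def __init__(self, deadtime: float) -> None:
        # Seconds of silence after which a node is declared dead.
        self.deadtime = deadtime
        self.last_seen: Dict[str, float] = {}

    def record(self, node: str, now: float) -> None:
        """Note that a heartbeat from node arrived at time now."""
        self.last_seen[node] = now

    def dead_nodes(self, now: float) -> List[str]:
        """Return nodes whose last heartbeat is older than deadtime."""
        return sorted(
            node for node, seen in self.last_seen.items()
            if now - seen > self.deadtime
        )


if __name__ == "__main__":
    monitor = HeartbeatMonitor(deadtime=10.0)
    monitor.record("node1", now=0.0)
    monitor.record("node2", now=5.0)
    print(monitor.dead_nodes(now=12.0))
```

In a real cluster, a node found dead by this check would be a candidate for IP takeover, ideally after confirmation over a second channel such as the serial link.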
High-availability Linux Project
- Products of Linux-HA 3
Figure: Heartbeat process structure. Successive fork calls split parent and child into PROC_CONTROL, PROC_MST_STATUS and, for each media (1, 2, 3), PROC_HBWRITE and PROC_HBREAD. Other labels in the figure: heartbeat-fifo, status_pipe, status, exec(), mach_down, Resource Manager.
High-availability Linux Project
- Products of Linux-HA 4
Mon
monitoring, scheduler
function
Monitors
service status check & report
independent execution
Alerts
if service is stopped, send warning message
independent execution
Client/Server
Flowchart
define each object
read the configuration file
set up the services required for monitoring
create the daemon process
connect clients as needed
perform monitoring (infinite loop)
on detecting a stopped service, perform the alert action
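The separation between monitors and alerts described above can be sketched as one pass of a monitoring loop. This is an illustration only and does not reflect Mon's actual interfaces; the type aliases and the function name are hypothetical.

```python
from typing import Callable, Dict, List

Monitor = Callable[[], bool]    # returns True when the service is healthy
Alert = Callable[[str], None]   # receives the name of the stopped service


def run_cycle(monitors: Dict[str, Monitor], alert: Alert) -> List[str]:
    """Run every monitor once and alert for each service found stopped."""
    stopped = []
    for service, check in monitors.items():
        if not check():
            stopped.append(service)
            alert(service)
    return stopped


if __name__ == "__main__":
    # Stand-in monitors; real ones would probe the actual services.
    checks = {"http": lambda: True, "smtp": lambda: False}
    run_cycle(checks, lambda name: print(f"warning: {name} is stopped"))
```

A daemon would call run_cycle repeatedly, sleeping between passes, which corresponds to the infinite loop in the flowchart.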
High-availability Linux Project
- Components
Membership services
Communication Services
Cluster management
Resource (I/O) fencing
Resource Monitoring
Storage Sharing/Replication
High-availability Linux Project
- What Linux-HA can do now
Web servers
LVS director servers
Mail servers
Database servers
Firewalls
File servers
DNS servers
DHCP servers
Proxy Caching servers
etc.
High-availability Linux Project
- Linux-HA task list
http://www.linux-ha.org/HATodo.html
High-availability Linux Project
- Related project
LAN Mirroring technologies
DRBD
NBD server software
ENBD
Volume Managers
LVM, EVMS
File system technologies
Open GFS, Reiserfs4, Ext3, JFS, XFS, FENRIS, M2CS, Intermezzo, Lustre, LinLogFS
Other Linux-HA Management Software
Linux FailSafe, Kimberlite, Ultramonkey, VRRP, Failoverd, Heart, Fake
Monitoring and Administration Software
lm-sensors, SAF-TE, RAS, RAM ECC monitoring, Mon, OpenNMS, Spumoni, Monit, VACM, PIKT, NOCOL/SNIPS, Nagios, MAT, WebRAT, Keepalived
Etc.
LVS, DLM, SSI Cluster for Linux, Life, HA RAID, OpenMosix, CODA, LNAT, LSP, OSCAR, Amata, Beowulf
High-availability Linux Project
- Friends of Linux-HA
IBM
SuSE
SGI
MSC Linux
Conectiva
Tummy.com
Emageon
Intel
Conclusion
High Availability
High-availability Linux Project
“Provide a high-availability (clustering) solution for Linux
which promotes reliability, availability, and serviceability
(RAS) through a community development effort.”