Application Transparent Fault Management in Fault Tolerant Match
FTCS(1993)
摘要
Fault detection and fault tolerance has become an increasingly important aspect of all computer system designs, from PC’s
to high- end workstations and embedded critical systems. Since operating systems are common to all computers and it is at
the operating system level where there is maximum system visibility and control, it is appropriate for the operating system
to provide policies which detect, contain and tolerate faults. These policies form an operating system’s “fault management.”
A mechanism to provide support for operating system fault management has been designed and implemented for a UNIX 43 BSD server
running on the Mach 3.0 microkernel. The mechanism, called the sentry mechanism, consists of fault management control placed
at all operating system entry and exit points. The suitability of the mechanism is determined through demonstration of its
ability to support diverse, commonly accepted policies efficiently, where efficiency is measured in terms of implementation
complexity and performance. Several sentry policies have been implemented including monitoring, assertions, checkpoint/checkpoint
recovery and journaling journal replay. This paper presents the sentry mechanism, its implementation and the design and implementation
of the mentioned policies.
更多查看译文
关键词
operating system kernels,Mach 3.0 microkernel,UNIX 4.3 BSD server,application transparent fault management,assertion type policy,checkpoint/restart,checkpoint/restart/journaling,fault tolerant Mach,operating system fault management mechanism,performance cost,sentry
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要