ECSE 422. Fault Tolerant Computing.
Credits: 3
Offered by: Electrical & Computer Engr (Faculty of Engineering)
This course is not offered this catalogue year.
Description
Introduction to fault-tolerant systems. Fault-tolerance techniques through hardware, software, information and time redundancy. Failure classification, failure semantics, failure masking. Exception handling: detection, recovery, masking and propagation, termination vs. resumption. Reliable storage, reliable communication. Process groups, synchronous and asynchronous group membership and broadcast services. Automatic redundancy management. Case studies.
- (3-1-5)
- Prerequisites: ECSE 324 and (ECSE 250 or COMP 250)