Stud.IP Uni Oldenburg
University of Oldenburg
25.10.2020 01:36:28
inf105 - Fault Tolerance in Distributed Systems (Complete module description)
Module label Fault Tolerance in Distributed Systems
Module code inf105
Credit points 6.0 KP
Workload 180 h
Faculty/Institute Department of Computing Science
Used in course of study
  • Master's Programme Computing Science (Master) > Practical Computer Science (Praktische Informatik)
  • Master's Programme Embedded Systems and Microrobotics (Master) > Accentuation modules (Akzentsetzungsmodule)
Contact person
Module responsibility
Authorized examiners
Module counseling
Entry requirements
Skills to be acquired in this module
This module provides knowledge of fault-tolerant distributed systems. It covers their terminology, structure, design, core challenges, and related implementation concepts in detail.

Professional competence
The students:
  • Assess what a fault-tolerant distributed system is and develop awareness of its capabilities
  • Name and discuss common implementations of fault-tolerant distributed systems
Methodological competence
The students:
  • Reflect on the implementation challenges of a distributed system
  • Are able to adapt and evolve implementation concepts of fault-tolerant distributed systems in new contexts
Social competence
The students:
  • Solve problems in small teams
  • Present their solutions to the members of the tutorial
  • Discuss their different solutions with members of the tutorial

Self-competence
The students:
  • Accept criticism
  • Question their initially applied methods for problem solving
  • Question their initial solutions in the light of newly learned methods
Module contents
1) Fault, Error, Failure
2) Failure semantics, Fault tolerance
3) Byzantine agreement protocols
4) Stable storage
5) Fail-stop processors
6) Atomic commit protocols
7) Classification of replication control schemes
  • pessimistic vs. optimistic
  • semantic vs. syntactic
  • static vs. dynamic
8) Consistency notions
9) Quality criteria
10) Survey of replication control schemes
11) Design of replication control schemes
12) Unifying frameworks
13) Replication in practice
Recommended reading
P. Jalote (1994): Fault Tolerance in Distributed Systems. Prentice-Hall.
A. Helal et al. (1996): Replication Techniques in Distributed Systems. Kluwer Academic Publishers.
A. Schiper et al. (2010): Replication: Theory and Practice.
Language of instruction German
Duration (semesters) 1 Semester
Module frequency annually
Module capacity unlimited
Reference text
Connected with:
Operating Systems 1 and 2 (Betriebssysteme 1 und 2)
Distributed Operating Systems (Verteilte Betriebssysteme)
Module level AS (Accentuation)
Module type Elective
Type of program Lecture and seminar, or lecture and exercise
Previous knowledge Distributed Operating Systems (Verteilte Betriebssysteme)
Course type | Comment | SWS | Frequency | Workload attendance
Lecture | | 2.00 | Winter semester | 28 h
Seminar or exercise | | 2.00 | Winter semester | 28 h
Total time of attendance for the module: 56 h
Examination | Time of examination | Type of examination
Final exam of module | End of lecture period | Written exam, oral exam, or practical work