Stud.IP Uni Oldenburg
University of Oldenburg
25.10.2021 08:20:27
inf105 - Fault Tolerance in Distributed Systems (Complete module description)
Original version English Download as PDF
Module label Fault Tolerance in Distributed Systems
Module code inf105
Credit points 6.0 KP
Workload 180 h
Institute directory Department of Computing Science
Applicability of the module
  • Master's Programme Computing Science (Master) > Praktische Informatik
  • Master's Programme Embedded Systems and Microrobotics (Master) > Akzentsetzungsmodule
Responsible persons
Lehrenden, Die im Modul (Module counselling)
Theel, Oliver (Module responsibility)
Modulverantwortlichen, Die (Authorized examiners)
Prerequisites
Skills to be acquired in this module
This module provides knowledge of fault-tolerant distributed systems. The terminology, structure, conception, core challenges and related implementation concepts will be covered in detail.

Professional competence
The students:
  • Assess what a fault-tolerant distributed system is and develop awareness of its capabilities
  • Name and discuss common implementations of fault-tolerant distributed systems
Methodological competence
The students:
  • Reflect the implementation challenges of a distributed system
  • Are able to adapt and evolve implementation concepts of fault-tolerant distributed systems in new contexts
Social competence
The students:
  • Solve problems in small teams
  • Present their solutions to the members of the tutorial
  • Discuss their different solutions with members of the tutorial

Self-competence
The students:
  • Accept criticism
  • Question their initially applied methods for problem solving
  • Question their initial solutions in the light of newly learned methods
Module contents
1) Fault, Error, Failure
2) Failure semantics, Fault tolerance
3) Byzantine agreement protocols
4) Stable storage
5) Fail-stop processors
6) Atomic commit protocols
7) Classification of replication control schemes
  • pessimistic vs. optimistic
  • semantic vs. syntactic
  • static vs. dynamic
8) Consistency notions
9) Quality criteria
10) Survey of replication control schemes
11) Design of replication control schemes
12) Unifying frameworks
13) Replication in practice
Reader's advisory
P. Jalote (1994): Fault Tolerance in Distributed Systems. Prentice-Hall.
A. Helal et. Al (1996): Replication Techniques in Distributed Systems. Kluwer Academics
A. Schiper et. Al (2010): Replication: Theory and Practice
Links
Language of instruction German
Duration (semesters) 1 Semester
Module frequency jährlich
Module capacity unlimited
Reference text
connectet with:
Betriebssysteme 1 und 2
Betriebssysteme-Praktikum
Verteilte Betriebssysteme
Modullevel / module level AS (Akzentsetzung / Accentuation)
Modulart / typ of module Wahlpflicht / Elective
Lehr-/Lernform / Teaching/Learning method V+S bzw V+Ü
Vorkenntnisse / Previous knowledge Verteilte Betriebssysteme
Course type Comment SWS Frequency Workload of compulsory attendance
Lecture
2 WiSe 28
Seminar or exercise
2 WiSe 28
Total time of attendance for the module 56 h
Examination Time of examination Type of examination
Final exam of module
End of lecture period
written exam or oral exam or practical work