Sök i kursutbudet

Använda sökfunktionen för att hitta i Chalmers utbildningsutbud, både vad gäller kurser och program. När det finns en kurshemsida visas en hus-symbol som leder till denna sida. Tänk på att välja det läsår du vill se information om.
Sök program och utbildningsplaner


Institutionernas kurser för doktorander

​​​​​​​​​​​​​​​​​​​​

Kursplan för

Läsår
EDA122 - Fault-tolerant computer systems
 
Kursplanen fastställd 2009-02-24 av programansvarig (eller motsvarande)
Ägare: MPDCS
7,5 Poäng
Betygskala: TH - Fem, Fyra, Tre, Underkänt
Utbildningsnivå: Avancerad nivå
Huvudområde: Datateknik, Informationsteknik
Institution: 37 - DATA- OCH INFORMATIONSTEKNIK


Undervisningsspråk: Engelska

Modul   Poängfördelning   Tentamensdatum
Lp1 Lp2 Lp3 Lp4 Sommarkurs
0107 Tentamen 6,0 hp Betygskala: TH   6,0 hp   20 Okt 2010 em H,  10 Jan 2011 em M,  16 Aug 2011 em V
0207 Laboration 1,5 hp Betygskala: UG   1,5 hp    

I program

MPNET NETWORKS AND DISTRIBUTED SYSTEMS, MSC PROGR - Security, Årskurs 2 
MPSYS SYSTEMS, CONTROL AND MECHATRONICS, MSC PROGR, Årskurs 2 
MPSEN SOFTWARE ENGINEERING AND TECHNOLOGY, MSC PROGR, Årskurs 2 (valbar)
MPSEN SOFTWARE ENGINEERING AND TECHNOLOGY, MSC PROGR, Årskurs 1 (valbar)
MPDCS SECURE AND DEPENDABLE COMPUTER SYSTEMS, MSC PROGR, Årskurs 2 (obligatorisk)
MPIES INTEGRATED ELECTRONIC SYSTEM DESIGN, MSC PROGR, Årskurs 2 (valbar)
TKITE INFORMATIONSTEKNIK, CIVILINGENJÖR, Årskurs 3 (valbar)

Examinator:

Professor  Johan Karlsson


Ersätter

EDA120   Tillförlitliga och feltoleranta datorsystem EDA121   Fault-tolerant computer systems


Behörighet:

För kurser inom Chalmers utbildningsprogram gäller samma behörighetskrav som till de(t) program kursen ingår i.

Kursspecifika förkunskaper

No formal requirements, but the participants are expected to have basic knowledge in computer engineering, programming and probability theory.

Syfte

Fault-tolerant systems are used in applications that require high dependability, such as safety-critical control systems in vehicles and airplanes, or business-critical systems for e-commerce, automatic teller machines and financial transactions. This is an introductory course that covers basic techniques for design and analysis of fault-tolerant systems, as well as project management and development processes for safety-critical systems.

Lärandemål (efter fullgjord kurs ska studenten kunna)

* Formulate requirements for fault-tolerant computer systems used in business, safety and mission critical applications.
* Design system architectures for fault-tolerant computer systems from a given requirements specification.
* Perform probabilistic dependability analysis of fault-tolerant computer system using fault-trees, reliability block diagrams and time-continous Markov chains.
* Describe principles and properties of techniques for error detection, error recovery and errror masking in computer systems.
* Master the terminology of dependable computing and describe major elements of relevant standards.

Innehåll

The course focuses on design and analysis of fault-tolerant computer systems. It also gives an overview of development processes and standards for safety-critical systems.
The content can be divided into five areas:
1. Terminology and definitions: Includes terms such as dependability, reliability, maintainability, availability and safety, taxonomies for dependable systems, fault models, etc.
2. Design techniques for error detection and fault-tolerance: Fault-tolerance is achieved by introducing redundancy. Various redundancy principles are described and analyzed. Hardware redundancy: triple modular redundancy (TMR), dual modular redundancy (DMR), hot and cold standby systems, hybrid redundancy, etc. Software redundancy: N-version programming, recovery blocks and run-time assertions. Information redundancy: error correcting codes and self-checking circuits. Time redundancy: Methods for detecting and tolerating transient and permanent faults. Fault-tolerance in distributed systems: time-triggered systems, forward recovery, backward recovery, redundancy management, consensus, checkpointing, byzantine failures, etc.
3. Analysis of fault-tolerant system: Reliability block diagrams, fault-trees, markov chain models, failure mode and effects analysis (FMEA), fault tree analysis (FTA), fault injection, etc. Includes a laboratory class in which markov chain models are used to analyse a fault-tolerant system. The analysis is done using a special computer program.
4. Project management and development processes: Process models, risk analysis, safety case, the IEC 61508 standard, etc.
5. System examples: Fault-tolerant systems from areas such as space, aviation, automotive, telecommunication and transaction processing are described, some by guest lectures from industry.

Organisation

Lectures, exercises and a series of laboratory classes.

Litteratur

Neil Storey, Safety-Critical Computer Systems, Prentice Hall, ISBN 0-201-42787-7. Compendium, reprints of articles, compendium of exercises.

Examination

Written exam. Compulsory laboratory classes.


Sidansvarig Publicerad: må 13 jul 2020.