Sök i programutbudet

Använd sökfunktionen för att leta efter kurser och program i Chalmers utbildningsutbud. Den programplan och utbildningsplan som avser dina studier är i allmänhet från det läsår du började dina studier.

​​​​​​​​​​​​​

Kursplan för

Läsår
EDA122 - Fault-tolerant computer systems
 
Kursplanen fastställd 2012-02-21 av programansvarig (eller motsvarande)
Ägare: MPCSN
7,5 Poäng
Betygskala: TH - Fem, Fyra, Tre, Underkänt
Utbildningsnivå: Avancerad nivå
Huvudområde: Datateknik, Informationsteknik
Institution: 37 - DATA- OCH INFORMATIONSTEKNIK


Undervisningsspråk: Engelska
Sökbar för utbytesstudenter
Blockschema: C

Modul   Poängfördelning   Tentamensdatum
Lp1 Lp2 Lp3 Lp4 Sommarkurs Ej Lp
0107 Tentamen 6,0 hp Betygskala: TH   6,0 hp   23 Okt 2012 em H,  15 Jan 2013 em M,  22 Aug 2013 em V
0207 Laboration 1,5 hp Betygskala: UG   1,5 hp    

I program

MPCSN COMPUTER SYSTEMS AND NETWORKS, MSC PROGR, Årskurs 2 (valbar)
MPCSN COMPUTER SYSTEMS AND NETWORKS, MSC PROGR, Årskurs 1 (obligatorisk)
TKITE INFORMATIONSTEKNIK, CIVILINGENJÖR, Årskurs 3 (valbar)
MPSOF SOFTWARE ENGINEERING, MSC PROGR, Årskurs 2 (valbar)
MPSYS SYSTEMS, CONTROL AND MECHATRONICS, MSC PROGR, Årskurs 2 (valbar)
MPEES EMBEDDED ELECTRONIC SYSTEM DESIGN, MSC PROGR, Årskurs 1 (obligatoriskt valbar)
MPEES EMBEDDED ELECTRONIC SYSTEM DESIGN, MSC PROGR, Årskurs 2 (valbar)

Examinator:

Professor  Johan Karlsson


Ersätter

DAT270   Dependable computer systems EDA120   Tillförlitliga och feltoleranta datorsystem EDA121   Fault-tolerant computer systems

Kursutvärdering:

http://document.chalmers.se/doc/2503bf02-458c-40c6-93e2-17bf3f7aa083


Behörighet:

För kurser inom Chalmers utbildningsprogram gäller samma behörighetskrav som till de(t) program kursen ingår i.

Kursspecifika förkunskaper

Students are expected to have basic knowledge in computer organization, programming and probability theory.

Syfte

The course gives an introduction to dependable computing with an emphasis on system-level design of fault-tolerant systems. Dependability and fault tolerance are becoming increasingly important in a wide range of computer applications. Examples include safety-critical control systems for road vehicles, airplanes and medical devices, and business-critical systems for e-commerce, financial transactions and factory automation.

Lärandemål (efter fullgjord kurs ska studenten kunna)

* Formulate dependability requirements for computer systems used in business-, safety- and mission-critical applications. (Learning goal ensured by written exam, lab classes and laboratory report)
* Describe the structure and principles of commonly used system architectures for fault-tolerant computers. (Written exam, lab classes and laboratory report)
* Perform probabilistic dependability analysis of computer systems using fault-trees, reliability block diagrams, time-continuous Markov chains and stochastic Petri nets. (Written exam, lab classes and laboratory report)
* Describe principles and properties of techniques for error detection, error masking and system recovery. (Written exam)
* Master the terminology of dependable computing. (Written exam and laboratory report)
* Describe basic concepts in life-cycle models and standards employed in the development of safety-critical systems. (Written exam)
* Write a technical report of good quality on the topic of system-level dependability assessment of computer systems. (Laboratory report)

Innehåll

The course deals with design and analysis of fault-tolerant computer systems.
The content can be divided into five areas:
1. Terminology and definitions: Includes terms such as dependability, reliability, maintainability, availability and safety, taxonomies for dependable systems, fault and failure models, etc.
2. Design techniques for error detection and fault tolerance. Hardware redundancy: triple modular redundancy (TMR), dual modular redundancy (DMR), hot and cold standby systems, hybrid redundancy, forward and backward recovery, etc. Software redundancy: N-version programming, recovery blocks and run-time assertions. Information redundancy: basic principles for error correcting codes. Time redundancy: Methods for detecting and tolerating transient faults. Fault-tolerance in distributed systems: time-triggered systems, consensus and byzantine failures.
3. Dependability analysis of computer systems: Reliability block diagrams, fault-trees, Markov chain models, failure mode and effects analysis (FMEA), fault tree analysis (FTA), fault injection, etc. Includes two laboratory classes in which Markov chain models and stochastic Petri nets are used to analyze fault-tolerant systems. The analysis is done using a special computer program. Students are required to document the results of one laboratory class in a technical report.
4. Development processes: lifecycle models, hazard analysis, risk analysis, safety case, the IEC 61508 and the ISO26262 standards, etc.
5. System examples: Fault-tolerant systems from areas such as space, aviation, automotive, telecommunication and transaction processing are described, some by guest lecturers from industry.

Organisation

Lectures, exercises and two laboratory classes.

Litteratur

Neil Storey, Safety-Critical Computer Systems, Prentice Hall, ISBN 0-201-42787-7. Compendium, reprints of articles, compendium of exercises.

Examination

Written exam. Compulsary participation in two laboratory classes. One laboratory report.


Sidansvarig Publicerad: on 24 jan 2018.