Safety Critical & High Availability Systems



Similar documents
Design of High Availability Systems & Software

Embedded Real-Time Systems (TI-IRTS) Safety and Reliability Patterns B.D. Chapter

Embedded Systems Lecture 9: Reliability & Fault Tolerance. Björn Franke University of Edinburgh

Managing and Maintaining Windows Server 2008 Servers

Westek Technology Snapshot and HA iscsi Replication Suite

ISO Introduction

Red Hat Enterprise linux 5 Continuous Availability

Designing, Optimizing and Maintaining a Database Administrative Solution for Microsoft SQL Server 2008

Safety Issues in Automotive Software

TÓPICOS AVANÇADOS EM REDES ADVANCED TOPICS IN NETWORKS

MS Design, Optimize and Maintain Database for Microsoft SQL Server 2008

Course 20465: Designing a Data Solution with Microsoft SQL Server

Availability Digest. SAP on VMware High Availability Analysis. A Mathematical Approach. December 2012

Designing a Data Solution with Microsoft SQL Server

Design Patterns for Safety-Critical Embedded Systems

STORAGE CENTER WITH NAS STORAGE CENTER DATASHEET

Implementing High-Availability (HA) Solutions for Siebel ebusiness Applications

10215: Implementing and Managing Microsoft Server Virtualization

Controlling Risks Risk Assessment

ASSESSMENT OF THE ISO STANDARD, ROAD VEHICLES FUNCTIONAL SAFETY

A System-safety process for by-wire automotive systems

How To Write A Server On A Flash Memory On A Perforce Server

Value Paper Author: Edgar C. Ramirez. Diverse redundancy used in SIS technology to achieve higher safety integrity

NEC Express Partner Program. Deliver true innovation. Enjoy the rewards.

INDIA September 2011 virtual techdays

Apache S4: A Distributed Stream Computing Platform

TÓPICOS AVANÇADOS EM REDES ADVANCED TOPICS IN NETWORKS

CONTROL LEVEL NETWORK RESILIENCY USING RING TOPOLOGIES. Joseph C. Lee, Product Manager Jessica Forguites, Product Specialist

Key Challenges in Cloud Computing to Enable Future Internet of Things

M6430a Planning and Administering Windows Server 2008 Servers

Server Virtualization with Windows Server Hyper-V and System Center

Vess A2000 Series HA Surveillance with Milestone XProtect VMS Version 1.0

Pervasive PSQL Meets Critical Business Requirements

5054A: Designing a High Availability Messaging Solution Using Microsoft Exchange Server 2007

ELECTROTECHNIQUE IEC INTERNATIONALE INTERNATIONAL ELECTROTECHNICAL

Planning and Administering Windows Server 2008 Servers

FAULT-TOLERANT COMPUTING

Selecting Sensors for Safety Instrumented Systems per IEC (ISA )

HIPEAC Segregation of Subsystems with Different Criticalities on Networked Multi-Core Chips in the DREAMS Architecture

Avaya Aura Virtualized Environment

20465C: Designing a Data Solution with Microsoft SQL Server

Administering a Microsoft SQL Server 2000 Database

Downtime, whether planned or unplanned,

RPO represents the data differential between the source cluster and the replicas.

Server Virtualization with Windows Server Hyper-V and System Center

Planning and Administering Windows Server 2008 Servers

CASE STUDY: Oracle TimesTen In-Memory Database and Shared Disk HA Implementation at Instance level. -ORACLE TIMESTEN 11gR1

The Aspect Unified IP Five 9s Environment

Administering a Microsoft SQL Server 2000 Database

University of Paderborn Software Engineering Group II-25. Dr. Holger Giese. University of Paderborn Software Engineering Group. External facilities

TABLE OF CONTENTS THE SHAREPOINT MVP GUIDE TO ACHIEVING HIGH AVAILABILITY FOR SHAREPOINT DATA. Introduction. Examining Third-Party Replication Models

Designing a Data Solution with Microsoft SQL Server 2014

Course 2788A: Designing High Availability Database Solutions Using Microsoft SQL Server 2005

LinuxWorld Conference & Expo Server Farms and XML Web Services

Designing a Data Solution with Microsoft SQL Server

HIGH AVAILABILITY LINUX ARCHITECTURE FOR MISSION CRITICAL WORKLOADS

Course 20412A: Configuring Advanced Windows Server 2012 Services

MS-10215: Implementing and Managing Microsoft Server Virtualization. Course Objectives. Required Exam(s) Price. Duration. Methods of Delivery

Software Engineering for Real- Time Systems.

Course 20465C: Designing a Data Solution with Microsoft SQL Server

Skelta BPM and High Availability

Designing a Data Solution with Microsoft SQL Server 2014

Designing a Data Solution with Microsoft SQL Server

Administering a Microsoft SQL Server 2000 Database

Hadoop s Entry into the Traditional Analytical DBMS Market. Daniel Abadi Yale University August 3 rd, 2010

Bosch Packaging Academy Essential Training

STORAGE CENTER. The Industry s Only SAN with Automated Tiered Storage STORAGE CENTER

Middleware and Distributed Systems. System Models. Dr. Martin v. Löwis. Freitag, 14. Oktober 11

Cisco Disaster Recovery: Best Practices White Paper

Web DNS Peer-to-peer systems (file sharing, CDNs, cycle sharing)

Mixed-Criticality: Integration of Different Models of Computation. University of Siegen, Roman Obermaisser

System Availability and Data Protection of Infortrend s ESVA Storage Solution

Atlas Emergency Detection System (EDS)

MultiPARTES. Virtualization on Heterogeneous Multicore Platforms. 2012/7/18 Slides by TU Wien, UPV, fentiss, UPM

Testing of safety-critical software some principles

NEC Corporation of America Intro to High Availability / Fault Tolerant Solutions

Oracle EPM Disaster Recovery High Level Overview

Cisco Nexus 1000V and Cisco Nexus 1110 Virtual Services Appliance (VSA) across data centers

Powering Converged Infrastructures

Introduction of ISO/DIS (ISO 26262) Parts of ISO ASIL Levels Part 6 : Product Development Software Level

Government Degree on the Safety of Nuclear Power Plants 717/2013

Linear Motion and Assembly Technologies Pneumatics Service. Industrial Ethernet: The key advantages of SERCOS III

TIBCO StreamBase High Availability Deploy Mission-Critical TIBCO StreamBase Applications in a Fault Tolerant Configuration

High Availability with Windows Server 2012 Release Candidate

Microsoft SQL Server for Oracle DBAs Course 40045; 4 Days, Instructor-led

High Availability Design Patterns

RETIS Lab Real-Time Systems Laboratory

Transcription:

SCHA - Version: 1 21 June 2016 Safety Critical & High Availability Systems

Safety Critical & High Availability Systems SCHA - Version: 1 3 days Course Description: This Masterclass examines the design of embedded systems and software that are to provide services in applications that could, when they fail, threaten the well-being or safety of people. Many, though not all, of these systems must not be stopped under any circumstances, and thus must be designed for high availability. Practical guidance is offered on how to address these concerns when designing systems in fields such as medical, automotive, avionics, nuclear and chemical process control. The Masterclass surveys concepts and alternatives for system and software architectures appropriate for safetycritical and high availability systems. Following an examination of hazard and risk analysis techniques, the seminar goes on to list a number of approaches to software safety that span fault avoidance, fault detection, and fault containment tactics including redundancy, recovery, masking and barriers. A variety of candidate architectural design patterns are examined, including dual/triple modular redundancy, shutdown monitors, dissimilar independent designs, backup parallel patterns and active/monitor parallel patterns. Many realworld examples are presented. Systems which are required to provide high availability must be designed to tolerate faults. Their design is usually based on off-the-shelf hardware and software combined in ways that will achieve

five-nines (99.999%) or greater availability. Basic hardware N-plexing and voting issues are discussed, followed by an in-depth study of a number of backward error recovery fault tolerance techniques including Checkpoint-Rollback, Process Pairs, and Recovery Blocks. The class continues with several forward error recovery techniques. Technical issues such as failover management, data replication, and software design defects, are addressed in depth. This Masterclass is far from a general course about system or software design theory, but rather it is tightly focused on the design of embedded systems and software that are required to provide their intended functions without endangering the safety or life of users or their environment, while at the same time maintaining high availability if required. Intended audience: This Masterclass is intended for practicing real-time and embedded systems engineers, software system architects, project managers and technical consultants who have responsibility for designing, structuring and implementing the hardware and software for real-time and embedded computer systems in applications that could, when they fail, threaten the well-being or life of people. Many of these systems have high availability as an additional design requirement Prerequisites: Course participants are expected to be familiar with general embedded and real-time software design. Objectives: The primary goal of this Masterclass is to give the participant the skills necessary to design systems and software for real-time and embedded computers in which faults and failures could pose a danger to human life. As part of this, participants gain skills in designing systems for high availability. This is very practical, results-oriented

training that provides knowledge and skills that can be applied immediately. Topics: Definitions and Background Hazards and Risks Safety vs. Fault Tolerance Design Issues for Safety Redundancy Approaches to Dependability Examples: Automotive Brake-by-Wire, Steer-by-Wire Preparatory Analyses Hazard Analysis: FMEA Fault & Event Tree Analysis Exercise: Fault Tree Analysis Probabilistic Event Tree Analysis Risk Analysis Approaches to Safety: Fault Avoidance, Fault Detection, Fault Tolerance Fundamental Safety Design Patterns Detection of Sensor Errors Failstop Fault Masking Shutdown Design Patterns Single Channel Patterns Multi-Channel Safety Design Patterns Actuation Monitoring Options Dual Channel Patterns Dual Closed-Loop Patterns

Heterogeneous Peer-Channel Pattern Example: Flight Control Computer Development Dual-Dual Pattern Design Patterns for Resiliency and Safety Monitor-Actuator Pattern Extended Example: Medical Respiratory Ventilator The Safety Executive Extended Example: Automotive Drive-by-Wire Extended Example: Airbus A330/340 Fly-by-Wire A Cookbook for Safety-Critical Design Functionality Learning from System Failures and Accidents Sources of System Accidents Hazard-Based Risk Analysis Calculations Exercise: Spacecraft Risk Analysis Software Factors in Some Famous Accidents High Availability: Underlying Principles Fault Avoidance vs. Tolerance Failure Curves Replication vs. Functional Redundancy vs. Analytic Redundancy Dynamic vs. Static Redundancy Extended Example: Space Shuttle Software Fundamental System-Level Availability Design Patterns Static Hardware Fault Tolerance N-Plex Design Exercise: MTBF, MTTF Calculations in Triple Modular Redundancy Dynamic System Fault Tolerance Redundant Pairs

Clusters Cluster Failover Strategy Choices Concepts for Backward Error Recovery Design Diversity Dynamic System Redundancy Backward Error Recovery Transactions & Checkpointing System and Software Design Patterns for High Availability Checkpoint-Rollback Process Pairs Recovery Blocks Limitations of Backward Error Recovery Patterns Forward Error Recovery Design Patterns Technical Issues in High Availability Design Failover Management Data Replication Dealing with Software Design Faults C Language in Critical Systems Software Robustness: MISRA-C, LINT, Static Code Analyzers Exercise: C-Language Shenanigans

º Final Examination. www.sela.co.il 03-6176066