Operating System Support for Multiprocessor Systems-on-Chip




Operating System Support for Multiprocessor Systems-on-Chip. Dr. Gabriel Marchesan Almeida

Agenda: Introduction; Adaptive System + SHOP Architecture; Preliminary Results; Perspectives & Conclusions.
Dr. Gabriel Marchesan Almeida - Laboratory of Informatics, Robotics and Microelectronics of Montpellier, France November st 0 - Invited Talk

INTRODUCTION: HOMOGENEOUS / HETEROGENEOUS PLATFORMS
Heterogeneous platforms (shared memory), hardly scalable: CPU, Bridge, Memory, USB, UART, GPIO, Bluetooth, Webcam, Video Accelerator, Audio Accelerator, Power Management.
Homogeneous platform (distributed memory), easily scalable: Network Processing Unit with RISC, Router, RAM and Power Management.
Comparison axes, homogeneous versus heterogeneous: Performance, Power, Flexibility, Programmability. Keyword: ADAPTATION.

ARCHITECTURES IN THE MARKET: SCC (Single Chip Cloud Computer), Intel.

CONTEXT
Heterogeneous, centralized, static (domain-specific: telecom, multimedia): specialized ISPs and ASIPs; static task mapping; bus-like interconnect; centralized control (GPP).
Homogeneous, distributed, adaptive: array of tiny scalar ISPs with private RAM, interconnected through a NoC; adaptive DVFS; dynamic task mapping; load balancing.
Homogeneous, centralized, dynamic (general purpose): symmetric multicore; centralized control (OS); dynamic load balancing; high-frequency memory and bus.
Figure labels: Telecom, Multimedia, CPU, DSP, DSP, RAM, I/O, OS.

SHOP: Self-adaptive Homogeneous Platform
Distributed memory, message passing (state of the art).
Network Processing Unit (NPU): Router, RISC, RAM, Power Management.
Each NPU runs an RTOS hosting Task ... Task N.
ADAPTATION: Task Migration, Frequency Scaling.

SHOP OS: Operating System
Tiny, preemptive OS with dynamic loader. Scheduler based on thread priorities.
Features: communication between local and remote threads; use of RAW/UDP/TCP IP connection to ensure reliable communication; possible use of a self-adaptive RAW/UDP/TCP IP protocol; frequency scaling; DMA support.
Routing Table (Master node), columns: NPU, Task, Ports.
Step 1: Registering Tasks
Step 2: Requesting Task Location
Step 3: Receiving Task Location
Step 4: Establishing the Link
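The four steps above can be sketched as a small lookup service. This is only an illustration, not the SHOP OS implementation: the `MasterNode` class, the `open_link` helper, the task names, the NPU names and the port numbers are all hypothetical, and the table layout (task mapped to NPU and port) is an assumption based on the column names on the slide.

```python
from dataclasses import dataclass, field


@dataclass
class MasterNode:
    """Hypothetical master node holding the task routing table."""
    # task name -> (NPU name, port); this layout is an assumption
    routing_table: dict = field(default_factory=dict)

    def register(self, task, npu, port):
        # Step 1: a task announces where it runs
        self.routing_table[task] = (npu, port)

    def locate(self, task):
        # Steps 2-3: answer a location request, None if the task is unknown
        return self.routing_table.get(task)


def open_link(master, sender, receiver):
    """Step 4: build a (sender, npu, port) link descriptor, or raise."""
    location = master.locate(receiver)
    if location is None:
        raise LookupError(f"task {receiver!r} is not registered")
    npu, port = location
    return (sender, npu, port)


# Made-up tasks, NPU names and ports, for illustration only
master = MasterNode()
master.register("ivlc", "npu-a", 7000)
master.register("idct", "npu-b", 7001)
link = open_link(master, "ivlc", "idct")
print(link)
```

Keeping the table on a single master node makes lookups simple, at the cost of one extra round trip before the first packet is sent.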

SHOP OS: Operating System, Task Migration
Routing Table (Master node), columns: NPU, Task, Ports.
Step 1: T is selected to be migrated
Step 2: Sending information to the sender tasks
Step 3: Stopping sending packets to T
Step 4: Sending information to the master node
Step 5: Removing T entry from routing table
Step 6: Migrating T to NPU 9.68..
Step 7: Sending new position to the master node
Step 8: Registering T in the routing table
Step 9: Updating T position in the sender tasks
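The ordering of the nine migration steps matters: senders are paused before the routing entry disappears, and they resume only after the new entry is registered. A minimal sketch of that ordering follows, assuming the routing table and each sender's cached destinations are plain dictionaries. The `migrate` function, the task names and the NPU names are hypothetical, and the actual transfer of task code and state (step 6) is not modelled.

```python
def migrate(routing_table, senders, task, new_npu, log):
    """Move `task` to `new_npu`, following the nine steps in order.

    routing_table: dict task -> NPU (held by the master node)
    senders: dict sender task -> dict of cached destinations (task -> NPU)
    log: list collecting a trace of the steps
    """
    log.append(f"1 select {task}")
    paused = [s for s, dests in senders.items() if task in dests]
    log.append(f"2 notify senders {sorted(paused)}")
    for s in paused:
        senders[s][task] = None          # step 3: stop sending to the task
    log.append("3 senders paused")
    log.append("4 inform master")
    del routing_table[task]              # step 5: remove the routing entry
    log.append("5 entry removed")
    log.append(f"6 move {task} to {new_npu}")   # transfer itself not modelled
    log.append("7 report new position")
    routing_table[task] = new_npu        # step 8: register the new entry
    log.append("8 entry registered")
    for s in paused:
        senders[s][task] = new_npu       # step 9: senders learn the position
    log.append("9 senders updated")


# Made-up task and NPU names, for illustration only
table = {"iquant": "npu-a", "idct": "npu-b"}
senders = {"iquant": {"idct": "npu-b"}}
trace = []
migrate(table, senders, "idct", "npu-c", trace)
print("\n".join(trace))
```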

SHOP OS: Operating System, RAW/UDP/TCP protocol

Protocol   Speed    QoS
Raw        High     No
UDP/IP     Medium   No
TCP/IP     Low      Yes

Figure labels: Packet, Fault, Lost.
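One way to read the table is as input to a selection rule: take the fastest protocol that meets the QoS requirement. The sketch below encodes only the two attributes listed on the slide. The `choose_protocol` function and the numeric speed ranking are assumptions, not the self-adaptive protocol the talk proposes to investigate.

```python
# Properties copied from the table: name -> (speed, offers QoS)
PROTOCOLS = {
    "RAW": ("high", False),
    "UDP/IP": ("medium", False),
    "TCP/IP": ("low", True),
}

# Assumed ordering of the speed labels, used only to compare candidates
SPEED_RANK = {"low": 0, "medium": 1, "high": 2}


def choose_protocol(needs_qos):
    """Return the fastest protocol that satisfies the QoS requirement."""
    candidates = [
        name for name, (_speed, qos) in PROTOCOLS.items()
        if qos or not needs_qos
    ]
    return max(candidates, key=lambda name: SPEED_RANK[PROTOCOLS[name][0]])


print(choose_protocol(needs_qos=False))
print(choose_protocol(needs_qos=True))
```

With only these two attributes UDP/IP is never selected, so a practical policy would need further criteria that the slide does not give.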

SHOP OS. Figure labels: Applications, SHOP OS, Processor (software, hardware), Router (software, hardware), FPGA, SystemC.

Platforms for validation: HS-Scale (Nicolas Saint Jean, LIRMM, France).

Platforms for validation: SystemC (Nicolas Hebert, LIRMM, France).

METRICS AND VALIDATION FLOW
Off-line (design time): application (ANSI C code) -> profiling -> task graph with processing cost (processing requirements snapshot) -> splitting -> split application (ANSI C code + API) -> initial static mapping.
On-line (run-time): performance measurement -> new static mapping (performance SMi); performance measurement -> new dynamic mapping (performance DMi).

ADAPTIVE SYSTEM
Loop labels: Monitoring, O = f(l), Action, SYSTEM.
The user feeds the O = f(l) law; the system then handles decision (action) making accordingly.
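The loop can be pictured as a user-supplied function sitting between monitoring and action. The sketch below treats the law as a mapping from one monitored value to an optional decision, which is only one possible reading of O = f(l). The function names, the FIFO-based law and the 0.8 threshold are hypothetical.

```python
def run_adaptive_loop(readings, law, act):
    """Apply a user-supplied law to each monitored reading.

    readings: iterable of monitored values
    law: callable mapping a reading to a decision (None means no action)
    act: callable that carries out a decision
    """
    decisions = []
    for reading in readings:
        decision = law(reading)
        if decision is not None:
            act(decision)
            decisions.append(decision)
    return decisions


def fifo_law(fill_ratio):
    """Hypothetical law: request a migration when the FIFO is over 80 % full."""
    return "migrate" if fill_ratio > 0.8 else None


performed = []
result = run_adaptive_loop([0.2, 0.5, 0.9, 0.4], fifo_law, performed.append)
print(result)
```

Keeping the law as a plain callable means the policy can change without touching the monitoring or the action code.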

HOW TO MANAGE ADAPTABILITY? Events and Maps
Events represent perturbations. Decisions are taken based on history. Events are stored in the DRET (Distributed Raw Event Table).
The set of all tile states represents the map of the MPSoC: temperature map, power consumption map, working/not working map. Maps are stored in the AIM (Architecture Instant Map).
Three levels of reactiveness, labeled in the figure as the sensor level, the DRET level and the AIM level.
Figure labels: Monitoring, Instrumented System, Sensors, Network Processing Unit, DRET, Diagnosis, AIM, Online Application Remapping.
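As a rough illustration of "the set of all tile states represents the map", the sketch below derives the three maps named above from per-tile records. The `TileState` fields, their units, the grid coordinates and the sample values are assumptions, not the AIM's real format.

```python
from dataclasses import dataclass


@dataclass
class TileState:
    """Hypothetical per-tile record; field names and units are assumed."""
    temperature_c: float
    power_mw: float
    working: bool


def build_maps(tiles):
    """tiles: dict (x, y) -> TileState. Returns one map per quantity."""
    return {
        "temperature": {pos: t.temperature_c for pos, t in tiles.items()},
        "power": {pos: t.power_mw for pos, t in tiles.items()},
        "working": {pos: t.working for pos, t in tiles.items()},
    }


# Made-up values, for illustration only
tiles = {
    (0, 0): TileState(48.0, 120.0, True),
    (0, 1): TileState(71.5, 210.0, True),
    (1, 0): TileState(0.0, 0.0, False),
}
aim = build_maps(tiles)
hottest = max(aim["temperature"], key=aim["temperature"].get)
print(hottest)
```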

HOW TO MANAGE ADAPTABILITY? Figure: four clusters, each with its own AIM.

DRET: DISTRIBUTED RAW EVENT TABLE
Operations: Create Table, Insert, Dump Table, Get First Row, Delete Table, Sort Table.
Each operation is reported as a table of clock cycles by number of rows and columns.
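The operation names above suggest a small table abstraction. The sketch below is a hypothetical in-memory version with one method per operation. The class name, the column names and the policy of dropping the oldest event when the table is full are assumptions, and the slide's cycle counts are not modelled.

```python
class EventTable:
    """Hypothetical fixed-capacity event table with the listed operations."""

    def __init__(self, max_rows, columns):   # Create Table
        self.max_rows = max_rows
        self.columns = tuple(columns)
        self.rows = []

    def insert(self, *values):               # Insert
        if len(values) != len(self.columns):
            raise ValueError("wrong number of columns")
        if len(self.rows) >= self.max_rows:
            self.rows.pop(0)                 # assumption: drop the oldest event
        self.rows.append(tuple(values))

    def sort(self, column):                  # Sort Table
        index = self.columns.index(column)
        self.rows.sort(key=lambda row: row[index])

    def first_row(self):                     # Get First Row
        return self.rows[0] if self.rows else None

    def dump(self):                          # Dump Table
        return list(self.rows)

    def delete(self):                        # Delete Table
        self.rows.clear()


# Made-up events, for illustration only
table = EventTable(max_rows=3, columns=("cycle", "sensor", "value"))
table.insert(900, "fifo", 7)
table.insert(100, "cpu", 60)
table.insert(500, "temp", 70)
table.insert(700, "fifo", 9)   # table is full, so the oldest row is dropped
table.sort("cycle")
first = table.first_row()
print(first)
```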

EXPERIMENTS: MONITORING
Loop labels: Monitoring, O = f(l), Action.
Actual: CPU workload, FIFO usage, frequency.
Future: temperature, NoC usage.

EXPERIMENTS: DIAGNOSIS
Loop labels: Monitoring, O = f(l), Action.
Figure: numbered tiles with N, E, W and S ports.

EXPERIMENTS: CASE STUDY (MJPEG DECODER)
Pipeline stages, each mapped to a task T: Sender, IVLC, IQUANT, IDCT.
Flow: application (ANSI C code) -> profiling -> task graph with processing cost (processing requirements snapshot) -> splitting -> split application (ANSI C code + API).

EXPERIMENTS: ACTION
Loop labels: Monitoring, O = f(l), Action.
Platform: MJPEG multi-task processor (600MHz).
Plots: FIFO filling (number of positions) versus time (ms), showing buffer filling under dynamic mapping; throughput (MB/s) versus time (ms) under dynamic mapping, with the migration trigger point and the migration cost marked; migration time (clock cycles) and number of packets versus task size (KB).

PERSPECTIVES & CONCLUSIONS
Investigate the use of a self-adaptive RAW/UDP/TCP IP protocol.
Develop different monitoring systems: temperature, CPU workload, FIFO usage.
Explore different task migration techniques.
Critical applications to perform the experiments: voice encoding; video encoding; MSA (multiple sequence alignment) with the DIALIGN algorithm; protein energy interaction.

Thank you for your attention! Dr. Gabriel Marchesan Almeida Institute of Information Processing Technology (ITIV) gabriel.almeida@kit.edu