Processes, Protection and the Kernel: Mode, Space, and Context



Similar documents
Chapter 6, The Operating System Machine Level

An Implementation Of Multiprocessor Linux

Processes and Non-Preemptive Scheduling. Otto J. Anshus

SYSTEM ecos Embedded Configurable Operating System

CSC 2405: Computer Systems II

Introduction. What is an Operating System?

Linux Scheduler. Linux Scheduler

Lecture 5. User-Mode Linux. Jeff Dike. November 7, Operating Systems Practical. OSP Lecture 5, UML 1/33

Operating Systems 4 th Class

Real Time Programming: Concepts

ELEC 377. Operating Systems. Week 1 Class 3

How To Write A Windows Operating System (Windows) (For Linux) (Windows 2) (Programming) (Operating System) (Permanent) (Powerbook) (Unix) (Amd64) (Win2) (X

Process Description and Control william stallings, maurizio pizzonia - sistemi operativi

Virtual vs Physical Addresses

RTOS Debugger for ecos

COS 318: Operating Systems

How To Understand How A Process Works In Unix (Shell) (Shell Shell) (Program) (Unix) (For A Non-Program) And (Shell).Orgode) (Powerpoint) (Permanent) (Processes

Operating System Overview. Otto J. Anshus

Operating Systems. 05. Threads. Paul Krzyzanowski. Rutgers University. Spring 2015

Department of Electrical Engineering and Computer Science MASSACHUSETTS INSTITUTE OF TECHNOLOGY Operating System Engineering: Fall 2005

Red Hat Linux Internals

Multiprocessor Scheduling and Scheduling in Linux Kernel 2.6

Overview of Operating Systems Instructor: Dr. Tongping Liu

Exceptions in MIPS. know the exception mechanism in MIPS be able to write a simple exception handler for a MIPS machine

Last Class: OS and Computer Architecture. Last Class: OS and Computer Architecture

CS414 SP 2007 Assignment 1

Virtual Private Systems for FreeBSD

Technical Properties. Mobile Operating Systems. Overview Concepts of Mobile. Functions Processes. Lecture 11. Memory Management.

A Survey of Parallel Processing in Linux

Operating Systems. Lecture 03. February 11, 2013

CS 377: Operating Systems. Outline. A review of what you ve learned, and how it applies to a real operating system. Lecture 25 - Linux Case Study

Deciding which process to run. (Deciding which thread to run) Deciding how long the chosen process can run

CS161: Operating Systems

First-class User Level Threads

Chapter 2: OS Overview

Glossary of Object Oriented Terms

CHAPTER 6 TASK MANAGEMENT

System Calls and Standard I/O

Debugging with TotalView

6.828 Operating System Engineering: Fall Quiz II Solutions THIS IS AN OPEN BOOK, OPEN NOTES QUIZ.

The Linux Kernel: Process Management. CS591 (Spring 2001)

Thomas Fahrig Senior Developer Hypervisor Team. Hypervisor Architecture Terminology Goals Basics Details

Priority Based Implementation in Pintos

Shared Address Space Computing: Programming

Replication on Virtual Machines

Why Threads Are A Bad Idea (for most purposes)

Automatic Logging of Operating System Effects to Guide Application-Level Architecture Simulation

umps software development

Chapter 3 Operating-System Structures

CS5460: Operating Systems

Scheduler Activations: Effective Kernel Support for the User-Level Management of Parallelism

The Microsoft Windows Hypervisor High Level Architecture

Lecture 25 Symbian OS

W4118 Operating Systems. Junfeng Yang

Introduction to Operating Systems. Perspective of the Computer. System Software. Indiana University Chen Yu

Lesson Objectives. To provide a grand tour of the major operating systems components To provide coverage of basic computer system organization

Linux scheduler history. We will be talking about the O(1) scheduler

RTEMS Porting Guide. On-Line Applications Research Corporation. Edition , for RTEMS July 2015

Computer-System Architecture

Process definition Concurrency Process status Process attributes PROCESES 1.3

POSIX. RTOSes Part I. POSIX Versions. POSIX Versions (2)

W4118: segmentation and paging. Instructor: Junfeng Yang

Overview. CISC Developments. RISC Designs. CISC Designs. VAX: Addressing Modes. Digital VAX

CHAPTER 15: Operating Systems: An Overview

Embedded Systems. 6. Real-Time Operating Systems

Computer Systems II. Unix system calls. fork( ) wait( ) exit( ) How To Create New Processes? Creating and Executing Processes

Mutual Exclusion using Monitors

SSC - Concurrency and Multi-threading Java multithreading programming - Synchronisation (I)

Security Overview of the Integrity Virtual Machines Architecture

Windows8 Internals, Sixth Edition, Part 1

A Look through the Android Stack

Linux Kernel Networking. Raoul Rivas

Topics. Producing Production Quality Software. Concurrent Environments. Why Use Concurrency? Models of concurrency Concurrency in Java

Linux Driver Devices. Why, When, Which, How?

Advanced Operating Systems ( L)

Operating System Tutorial

Cloud Computing. Up until now

Operating Systems for Parallel Processing Assistent Lecturer Alecu Felician Economic Informatics Department Academy of Economic Studies Bucharest

This tutorial will take you through step by step approach while learning Operating System concepts.

Keil C51 Cross Compiler

COS 318: Operating Systems. Virtual Memory and Address Translation

FRONT FLYLEAF PAGE. This page has been intentionally left blank

OPERATING SYSTEM SERVICES

CS0206 OPERATING SYSTEMS Prerequisite CS0201, CS0203

Web Server Architectures

Operating System Components and Services

Chapter 6 Concurrent Programming

CSI 402 Lecture 13 (Unix Process Related System Calls) 13 1 / 17

Operating System Structures

Real Time Control Under UNIX for RCCL

I/O Device and Drivers

Example of Standard API

University of Dayton Department of Computer Science Undergraduate Programs Assessment Plan DRAFT September 14, 2011

Tools Page 1 of 13 ON PROGRAM TRANSLATION. A priori, we have two translation mechanisms available:

Review from last time. CS 537 Lecture 3 OS Structure. OS structure. What you should learn from this lecture

CS5460: Operating Systems. Lecture: Virtualization 2. Anton Burtsev March, 2013

Transcription:

Processes, Protection and the Kernel: Mode, Space, and Context

Processes and the Kernel processes in private virtual address spaces data data The kernel sets up process execution contexts to virtualize the machine. system call traps...and upcalls (e.g., signals) shared kernel code and data in shared address space CPU and devices force entry to the kernel to handle exceptional events. Threads or processes enter the kernel for services.

Objectives The nature of the classical kernel, its protection mechanisms, and architectural support for protected kernels. Mode, space, and context. Control transfer from user code into the kernel. System calls (traps) and user program events (faults). Access control: handles, IDs, and Access Control Lists. Control transfer from the kernel to user code. Signals, APCs, syscall return. Kernel synchronization. Process structure and process birth/death, process states. Fork/exec/exit/join/wait and process trees.

The Kernel Today, all real operating systems have protected kernels. The kernel resides in a well-known file: the machine automatically loads it into memory (boots) on power-on/reset. Our kernel is called the executive in some systems (e.g., MS). The kernel is (mostly) a library of service procedures shared by all user programs, but the kernel is protected: User code cannot access internal kernel data structures directly, and it can invoke the the kernel only at well-defined entry points (system calls). Kernel code is like user code, but the kernel is privileged: The kernel has direct access to all hardware functions, and defines the entry points of handlers for interrupts and exceptions (traps and faults).

Kernel Mode CPU mode (a field in some status register) indicates whether the CPU is running in a user program or in the protected kernel. Some instructions or data accesses are only legal when the CPU is executing in kernel mode. CPU mode R0 Rn PC x registers 0 x OS code OS data Program A data Data Program B Data code library 2 n main memory physical address space

Thread/Process States and Transitions running (user) interrupt, exception, return interrupt, exception Sleep running (kernel) Run Yield blocked Wakeup ready

CPU Events: Interrupts and Exceptions An interrupt is caused by an external event. device requests attention, timer expires, etc. An exception is caused by an executing instruction. CPU requires software intervention to handle a fault or trap. unplanned deliberate sync fault syscall trap async interrupt AST AST: Asynchronous System Trap Also called a software interrupt or an Asynchronous or Deferred Procedure Call (APC or DPC) control flow Note: different cultures may use some of these terms (e.g., trap, fault, exception, event, interrupt) slightly differently. exception.cc event handler (e.g., ISR: Interrupt Service Routine)

Protecting Entry to the Kernel Protected events and kernel mode are the architectural foundations of kernel-based OS (Unix, NT+, etc). The machine defines a small set of exceptional event types. The machine defines what conditions raise each event. The kernel installs handlers for each event at boot time. e.g., a table in kernel memory read by the machine The machine transitions to kernel mode only on an exceptional event. The kernel defines the event handlers. Therefore the kernel chooses what code will execute in kernel mode, and when. event/return user kernel event/return

Handling Events, Part I: The Big Picture 1. To deliver the event, the machine saves relevant state in temporary storage, then transfers control to the kernel. Set kernel mode and set PC := handler. 2. Kernel handler examines registers and saved machine state. What happened? What was the machine doing when it happened? How should the kernel respond? 3. Kernel responds to the condition. Execute kernel service, device control code, fault handlers, etc., modify machine state as needed. 4. Kernel restores saved context (registers) and resumes activity. 5. Specific events and mechanisms for saving, examining, or restoring context are machine-dependent.

The Role of Events Once the system is booted, every entry to the kernel occurs as a result of an event. In some sense, the whole kernel is a big event handler. Event handlers are kernel-defined and execute in kernel mode. Events do not change the identity of the executing thread/process. Context: thread/process context, or interrupt context. Loosely, whose stack are you running on. For purposes of this discussion, suppose one thread per process. Events do not change the current space!

The Virtual Address Space 0 text data BSS user stack args/env 0x0 sbrk() jsr A typical process VAS space includes: user regions in the lower half V->P mappings specific to each process accessible to user or kernel code 2 n-1 2 n -1 kernel text and kernel data 0xffffffff kernel regions in upper half shared by all processes accessible only to kernel code Nachos: process virtual address space includes only user portions. A VAS for a private address space system (e.g., Unix) executing on a typical 32-bit architecture.

Example: Process and Kernel Address Spaces 0 0x0 n-bit virtual address space data data 32-bit virtual address space 2 n-1-1 0x7FFFFFFF 2 n-1 0x80000000 2 n -1 0xFFFFFFFF

Introduction to Virtual Addressing User processes address memory through virtual addresses. virtual memory (big) text data physical memory (small) The kernel controls the virtual-physical translations in effect for each space. The kernel and the machine collude to translate virtual addresses to physical addresses. BSS user stack args/env kernel The machine does not allow a user process to access memory unless the kernel says it s OK. virtual-to-physical translations The specific mechanisms for implementing virtual address translation are machine-dependent: we will cover them later.

System Call Traps User code invokes kernel services by initiating system call traps. Programs in C, C++, etc. invoke system calls by linking to a standard library of procedures written in assembly language. The library defines a stub or wrapper routine for each syscall. Stub executes a special trap instruction (e.g., chmk or callsys). Syscall arguments/results passed in registers or user stack. read() in Unix libc.a library (executes in user mode): Alpha CPU architecture #define SYSCALL_READ 27 # number for a read system call move arg0 argn, a0 an # syscall args in registers A0..AN move SYSCALL_READ, v0 # syscall dispatch code in V0 callsys # kernel trap move r1, _errno # errno = return status return

Bullet-Proofing the Kernel System calls must be safe to protect the kernel from buggy or malicious user programs. 1. System calls enter the kernel at a well-known safe point. Enter at the kernel trap handler; control transfers to the middle of the kernel are not permitted. 2. The kernel validates all system call arguments before use. Kernel may reject a request if it is meaningless or if the user process has inadequate privilege for the requested operation. 3. All memory used by the system call handler is in kernel space, so it is protected from interference by user code. What stack does the system call execute on?

Kernel Stacks and Trap/Fault Handling Processes execute user code on a user stack in the user portion of the process virtual address space. data stack stack System calls and faults run in kernel mode on the process kernel stack. Each process has a second kernel stack in kernel space (the kernel portion of the address space). stack syscall dispatch table stack System calls run in the process space, so copyin and copyout can access user memory. The syscall trap handler makes an indirect call through the system call dispatch table to the handler for the specific system call.

Safe Handling of Syscall Args/Results 1. Decode and validate by-value arguments. Process (stub) leaves arguments in registers or on the stack. 2. Validate by-reference (pointer) IN arguments. Validate user pointers and copy data into kernel memory with a special safe copy routine, e.g., copyin(). 3. Validate by-reference (pointer) OUT arguments. Copy OUT results into user memory with special safe copy routine, e.g., copyout(). 4. Set up registers with return value(s); return to user space. Stub may check to see if syscall failed, possibly raising a user program exception or storing the result in a variable.

Kernel Object Handles Instances of kernel abstractions may be viewed as objects named by protected handles held by processes. Handles are obtained by create/open calls, subject to security policies that grant specific rights for each handle. Any process with a handle for an object may operate on the object using operations (system calls). Specific operations are defined by the object s type. The handle is an integer index to a kernel table. Microsoft NT object handles Unix file descriptors Nachos FileID and SpaceID user space object handles kernel file port etc.

Example: Mechanics of an Alpha Syscall Trap 1. Machine saves return address and switches to kernel stack. save user SP, global pointer(gp), PC on kernel stack set kernel mode and transfer to a syscall trap handler (entsys) 2. Trap handler saves software state, and dispatches. save some/all registers/arguments on process kernel stack vector to syscall routine through sysent[v0: dispatchcode] 3. Trap handler returns to user mode. when syscall routine returns, restore user register state execute privileged return-from-syscall instruction (retsys) machine restores SP, GP, PC and sets user mode emerges at user instruction following the callsys

Questions About System Call Handling 1. Why do we need special copyin and copyout routines? validate user addresses before using them 2. What would happen if the kernel did not save all registers? 3. Where should per-process kernel global variables reside? syscall arguments (consider size) and error code 4. What if the kernel executes a callsys instruction? What if user code executes a retsys instruction? 5. How to pass references to kernel objects as arguments or results to/from system calls? pointers? No: use integer object handles or descriptors (also sometimes called capabilities).

Flashback: Virtual Addressing User processes address memory through virtual addresses. virtual memory (big) text data physical memory (small) The kernel controls the virtual-physical translations in effect for each space. The kernel and the machine collude to translate virtual addresses to physical addresses. BSS user stack args/env kernel The machine does not allow a user process to access memory unless the kernel says it s OK. virtual-to-physical translations The specific mechanisms for implementing virtual address translation are machine-dependent: we will cover them later.

A Simple Page Table process page table PFN 0 PFN 1 Each process/vas has its own page table. Virtual addresses are translated relative to the current page table. PFN i page #i PFN i + offset offset user virtual address physical memory page frames In this example, each VPN j maps to PFN j, but in practice any physical frame may be used for any virtual page. The page tables are themselves stored in memory; a protected register holds a pointer to the current page table.

Virtual Address Translation Example: typical 32-bit architecture with 8KB pages. 00 virtual address 29 0 13 VPN offset Virtual address translation maps a virtual page number (VPN) to a physical page frame number (PFN): the rest is easy. Deliver exception to OS if translation is not valid and accessible in requested mode. physical address{ address translation PFN + offset

Faults Faults are similar to system calls in some respects: Faults occur as a result of a process executing an instruction. Fault handlers execute on the process kernel stack; the fault handler may block (sleep) in the kernel. The completed fault handler may return to the faulted context. But faults are different from syscall traps in other respects: Syscalls are deliberate, but faults are accidents. divide-by-zero, dereference invalid pointer, memory page fault Not every execution of the faulting instruction results in a fault. may depend on memory state or register contents

Options for Handling a Fault (1) 1. Some faults are handled by patching things up and returning to the faulted context. Example: the kernel may resolve an address fault (virtual memory fault) by installing a new virtual-physical translation. The fault handler may adjust the saved PC to re-execute the faulting instruction after returning from the fault. 2. Some faults are handled by notifying the process that the fault occurred, so it may recover in its own way. Fault handler munges the saved user context (PC, SP) to transfer control to a registered user-mode handler on return from the fault. Example: Unix signals or Microsoft NT user-mode Asynchronous Procedure Calls (APCs).

Options for Handling a Fault (2) 3. The kernel may handle unrecoverable faults by killing the user process. Program fault with no registered user-mode handler? Destroy the process, release its resources, maybe write the memory image to a file, and find another ready process/thread to run. In Unix this is the default action for many signals (e.g., SEGV). 4. How to handle faults generated by the kernel itself? Kernel follows a bogus pointer? Divides by zero? Executes an instruction that is undefined or reserved to user mode? These are generally fatal operating system errors resulting in a system crash, e.g., panic()!

Thought Questions About Faults 1. How do you suppose ASSERT and panic are implemented? 2. Unix systems allow you to run a program under a debugger. How do you suppose that works? If the program crashes, the debugger regains control and allows you to examine/modify its memory and register values! 3. Some operating systems allow remote debugging. A remote machine may examine/modify a crashed system over the network. How? 4. How can a user-mode fault handler recover from a fault? How does it return to the faulted context? 5. How can a debugger restart a program that has stopped, e.g., due to a fault? How are breakpoints implemented? 6. What stack do signal handlers run on?

Architectural Foundations of OS Kernels One or more privileged execution modes (e.g., kernel mode) protected device control registers privileged instructions to control basic machine functions System call trap instruction and protected fault handling User processes safely enter the kernel to access shared OS services. Virtual memory mapping OS controls virtual-physical translations for each address space. Device interrupts to notify the kernel of I/O completion etc. Includes timer hardware and clock interrupts to periodically return control to the kernel as user code executes. Atomic instructions for coordination on multiprocessors

A Few More Points about Events The machine may actually be implemented by a combination of hardware and special pre-installed software (firmware). PAL (Privileged Architecture Library) on Alpha hides hardware details from even the OS kernel some instructions are really short PAL routines some special machine registers are really in PAL scratch memory, not CPU registers Events illustrate hardware/software tradeoffs: how much of the context should be saved on an event or switch, and by whom (hardware, PAL, or OS) goal: simple hardware and good performance in common cases

Mode, Space, and Context At any time, the state of each processor is defined by: 1. mode: given by the mode bit Is the CPU executing in the protected kernel or a user program? 2. space: defined by V->P translations currently in effect What address space is the CPU running in? Once the system is booted, it always runs in some virtual address space. 3. context: given by register state and execution stream Is the CPU executing a thread/process, or an interrupt handler? Where is the stack? These are important because the mode/space/context determines the meaning and validity of key operations.

Common Mode/Space/Context Combinations 1. User code executes in a process/thread context in a process address space, in user mode. Can address only user code/data defined for the process, with no access to privileged instructions. 2. System services execute in a process/thread context in a process address space, in kernel mode. Can address kernel memory or user process code/data, with access to protected operations: may sleep in the kernel. 3. Interrupts execute in a system interrupt context in the address space of the interrupted process, in kernel mode. Can access kernel memory and use protected operations. no sleeping!

Kernel Concurrency Control 101 Processes/threads running in kernel mode share access to system data structures in the kernel address space. Sleep/wakeup (or equivalent) are the basis for: coordination, e.g., join (exit/wait), timed waits (pause), bounded buffer (pipe read/write), message send/receive synchronization, e.g., long-term mutual exclusion for atomic read*/write* syscalls user kernel interrupt or exception Sleep/wakeup is sufficient for concurrency control among kernel-mode threads on uniprocessors: problems arise from interrupts and multiprocessors.

Dangerous Transitions Interrupt handlers may share data with syscall code, or with other handlers. Kernel-mode threads must restore data to a consistent state before blocking. interrupt kernel interrupt sleep run user trap/fault run kernel suspend/run (suspend) run preempt (ready) Involuntary context switches of threads in user mode have no effect on kernel data. blocked wakeup ready The shared data states observed by an awakening thread may have changed while sleeping. Threads in kernel mode are non-preemptible as a policy in most kernels.

The Problem of Interrupts Interrupts can cause races if the handler (ISR) shares data with the interrupted code. e.g., wakeup call from an ISR may corrupt the sleep queue. Interrupts may be nested. ISRs may race with each other. kernel code (e.g., syscall) high-priority ISR low-priority handler (ISR)

Interrupt Priority Traditional Unix kernels illustrate the basic approach to avoiding interrupt races. Rank interrupt types in N priority classes. When an ISR at priority p runs, CPU blocks interrupts of priority p or lower. How big must the interrupt stack be? Kernel software can query/raise/lower the CPU interrupt priority level (IPL). Avoid races with an ISR of higher priority by raising CPU IPL to that priority. Unix spl*/splx primitives (may need software support on some architectures). spl0 splnet splbio splimp clock splx(s) low high int s; s = splhigh(); /* touch sleep queues */ splx(s);

Multiprocessor Kernels On a shared memory multiprocessor, non-preemptible kernel code and spl*() are no longer sufficient to prevent races. Option 1, asymmetric multiprocessing: limit all handling of traps and interrupts to a single processor. slow and boring Option 2, symmetric multiprocessing ( SMP ): supplement existing synchronization primitives. any CPU may execute kernel code synchronize with spin-waiting requires atomic instructions use spinlocks but still must disable interrupts M P P P P

Example: Unix Sleep sleep (void* event, int sleep_priority) { struct proc *p = curproc; int s; Optional } s = splhigh(); /* disable all interrupts */ p->p_wchan = event; /* what are we waiting for */ p->p_priority -> priority; /* wakeup scheduler priority */ p->p_stat = SSLEEP; /* transition curproc to sleep state */ INSERTQ(&slpque[HASH(event)], p); /* fiddle sleep queue */ splx(s); /* enable interrupts */ mi_switch(); /* context switch */ /* we re back... */

Implementing Sleep on a Multiprocessor sleep (void* event, int sleep_priority) { struct proc *p = curproc; int s; Optional What if another CPU takes an interrupt and calls wakeup? } s = splhigh(); /* disable all interrupts */ p->p_wchan = event; /* what are we waiting for */ p->p_priority -> priority; /* wakeup scheduler priority */ p->p_stat = SSLEEP; /* transition curproc to sleep state */ INSERTQ(&slpque[HASH(event)], p); /* fiddle sleep queue */ splx(s); /* enable interrupts */ mi_switch(); /* context switch */ /* we re back... */ What if another CPU is handling a syscall and calls sleep or wakeup? What if another CPU tries to wakeup curproc before it has completed mi_switch?

Using Spinlocks in Sleep: : First Try sleep (void* event, int sleep_priority) { struct proc *p = curproc; int s; Grab spinlock to prevent another CPU from racing with us. Optional } lock spinlock; p->p_wchan = event; /* what are we waiting for */ p->p_priority -> priority; /* wakeup scheduler priority */ p->p_stat = SSLEEP; /* transition curproc to sleep state */ INSERTQ(&slpque[HASH(event)], p); /* fiddle sleep queue */ unlock spinlock; mi_switch(); /* context switch */ /* we re back */ Wakeup (or any other related critical section code) will use the same spinlock, guaranteeing mutual exclusion.

Sleep with Spinlocks: : What Went Wrong sleep (void* event, int sleep_priority) { struct proc *p = curproc; int s; } Optional Potential deadlock: what if we take an interrupt on this processor, and call wakeup while the lock is held? lock spinlock; p->p_wchan = event; /* what are we waiting for */ p->p_priority -> priority; /* wakeup scheduler priority */ p->p_stat = SSLEEP; /* transition curproc to sleep state */ INSERTQ(&slpque[HASH(event)], p); /* fiddle sleep queue */ unlock spinlock; mi_switch(); /* context switch */ /* we re back */ Potential doubly scheduled thread: what if another CPU calls wakeup to wake us up before we re finished with mi_switch on this CPU?

Using Spinlocks in Sleep: : Second Try sleep (void* event, int sleep_priority) { struct proc *p = curproc; int s; s = splhigh(); lock spinlock; Grab spinlock and disable interrupts. Optional p->p_wchan = event; /* what are we waiting for */ p->p_priority -> priority; /* wakeup scheduler priority */ p->p_stat = SSLEEP; /* transition curproc to sleep state */ INSERTQ(&slpque[HASH(event)], p); /* fiddle sleep queue */ unlock spinlock; splx(s); } mi_switch(); /* context switch */ /* we re back */

Review: Threads vs. Processes 1. The process is a kernel abstraction for an independent executing program. includes at least one thread of control also includes a private address space (VAS) - VAS requires OS kernel support often the unit of resource ownership in kernel - e.g., memory, open files, CPU usage 2. Threads may share an address space. Threads have context just like vanilla processes. - thread context switch vs. process context switch Every thread must exist within some process VAS. Processes may be multithreaded with thread primitives supported by a library or the kernel. data data

Implementing Processes: Questions A process is an execution of a program within a private virtual address space (VAS). 1. What are the system calls to operate on processes? 2. How does the kernel maintain the state of a process? Processes are the basic unit of resource grouping. 3. How is the process virtual address space laid out? What is the relationship between the program and the process? 4. How does the kernel create a new process? How to allocate physical memory for processes? How to create/initialize the virtual address space?

Nachos Exec/Exit/Join Example Exec parent Join Exec child Exit SpaceID pid = Exec( myprogram, 0); Create a new process running the program myprogram. Note: in Unix this is two separate system calls: fork to create the process and exec to execute the program. int status = Join(pid); Called by the parent to wait for a child to exit, and reap its exit status. Note: child may have exited before parent calls Join! Exit(status); Exit with status, destroying process. Note: this is not the only way for a proess to exit!.

Mode Changes for Exec/Exit Syscall traps and returns are not always paired. Exec returns (to child) from a trap that never happened Exit system call trap never returns system may switch processes between trap and return In contrast, interrupts and returns are strictly paired. parent Exec call Exec return Join call Join return Exec enters the child by doctoring up a saved user context to return through. child Exec entry to user space Exit call transition from user to kernel mode (callsys) transition from kernel to user mode (retsys)

Process Internals virtual address space thread process descriptor The address space is represented by page table, a set of translations to physical memory allocated from a kernel memory manager. The kernel must initialize the process memory with the program image to run. + + stack Each process has a thread bound to the VAS. The thread has a saved user context as well as a system context. The kernel can manipulate the user context to start the thread in user mode wherever it wants. user ID process ID parent PID sibling links children resources Process state includes a file descriptor table, links to maintain the process tree, and a place to store the exit status.

The Birth of a Program myprogram.c int j; char* s = hello\n ; assembler myprogram.o data object file int p() { j = write(1, s, 6); return(j); } compiler.. p: store this store that push jsr _write ret etc. myprogram.s linker data program data data data libraries and other objects myprogram (executable file)

What s in an Object File or Executable? Header magic number indicates type of image. Section table an array of (offset, len, startva) program sections header text idata wdata program instructions p immutable data (constants) hello\n writable global/static data j, s Used by linker; may be removed after final link step and strip. symbol table j, s,p,sbuf int j = 327; char* s = hello\n ; char sbuf[512]; relocation records int p() { int k = 0; j = write(1, s, 6); return(j); }

The Program and the Process VAS Process text segment is initialized directly from program text section. sections Process data segment(s) are initialized from idata and wdata sections. Text and idata segments may be write-protected. header text idata wdata symbol table relocation records program text data BSS user stack args/env kernel process VAS Process stack and BSS (e.g., heap) segment(s) are zero-filled. BSS Block Started by Symbol (uninitialized global data) e.g., heap and sbuf go here. segments Process BSS segment may be expanded at runtime with a system call (e.g., Unix sbrk) called by the heap manager routines. Args/env strings copied in by kernel when the process is created.

Nachos: A Peek Under the Hood shell data cp data user space MIPS instructions executed by SPIM SPIM MIPS emulator fetch/execute examine/deposit ExceptionHandler() Machine::Run() Nachos kernel SaveState/RestoreState examine/deposit Machine object Rn SP PC registers memory page table process page tables