430 likes | 451 Vues
Dive into multithreading models, from user threads to kernel threads, and explore thread libraries like Pthreads, Win32, and Java. Learn about the benefits, challenges, and implementation of multithreading. Discover the impact of multicore systems on programming and the nuances of concurrency.
 
                
                E N D
Chapter 4: Threads Topics: • Overview • Multithreading Models • Thread Libraries • Threading Issues • Operating System Examples • Windows XP Threads • Linux Threads Objectives: • To introduce the notion of a thread — a fundamental unit of CPU utilization that forms the basis of multithreaded computer systems • To discuss the APIs for the Pthreads, Win32, and Java thread libraries • To examine issues related to multithreaded programming
Threads Motivation • Threads run within application • Multiple tasks with the application can be implemented by separate threads • Update display • Fetch data • Spell checking • Answer a network request • Process creation is heavy-weight while thread creation is light-weight • Can simplify code, increase efficiency • Kernels are generally multithreaded
Benefits • Responsiveness • Resource Sharing • Economy • Scalability
Question • Which pieces of information from a process can be shared by all threads of a process and which ones need to be unique? Process info: • State • Program counter • Registers • Open files • Stack • Text section (code) • Data
Multicore Programming • Multicore systems putting pressure on programmers, challenges include: • Dividing activities • Balance • Data splitting • Data dependency • Testing and debugging
Concurrency Single-core System Multicore System
User Threads • Thread management done by user-level threads library • Three primary thread libraries: • POSIX Pthreads • Win32 threads • Java threads
Kernel Threads • Supported by the Kernel • Examples • Windows XP/2000 • Solaris • Linux • Tru64 UNIX • Mac OS X
Multithreading Models • Many-to-One • One-to-One • Many-to-Many • Two-level • Main considerations: • Limited number of threads? • Concurrency: What if a thread makes a blocking system call? • Allows for parallel execution on multiprocessor machines?
Many-to-One • Many user-level threads mapped to single kernel thread • Examples: • Solaris Green Threads • GNU Portable Threads
One-to-One • Each user-level thread maps to kernel thread • Examples: Windows NT/XP/2000, Linux, Solaris 9 and later
Many-to-Many Model • Allows many user level threads to be mapped to many kernel threads • Allows the operating system to create a sufficient number of kernel threads • Examples: • Solaris prior to version 9 • Windows NT/2000 with the ThreadFiber package
Two-level Model • Similar to M:M, except that it allows a user thread to be bound to kernel thread • Examples • IRIX • HP-UX • Tru64 UNIX • Solaris 8 and earlier
Thread Libraries • Thread libraryprovides programmer with API for creating and managing threads • Two primary ways of implementing • Library entirely in user space • Kernel-level library supported by the OS
Pthreads • May be provided either as user-level or kernel-level • A POSIX standard (IEEE 1003.1c) API for thread creation and synchronization • API specifies behavior of the thread library, implementation is up to development of the library • Policy vs. mechanism • Common in UNIX operating systems (Solaris, Linux, Mac OS X & in Windows via shareware implementations)
Examples • Pthreads • http://www.cs.mtsu.edu/~hcarroll/3250/private/code/chapter04/pthreadsExample.c • Win32 API • http://www.cs.mtsu.edu/~hcarroll/3250/private/code/chapter04/win32ThreadsExample.c • Java • http://www.cs.mtsu.edu/~hcarroll/3250/private/code/chapter04/Driver.java
Java Threads • Java threads are managed by the JVM • Typically implemented using the threads model provided by underlying OS • Java threads may be created by: • Extending Thread class • Implementing the Runnable interface
Tuesday, January 31, 2012 • lab1 – graded copies pushed out today @ 4:00 PM • lab2 – due tomorrow • hwk04 – due Thursday
Threading Issues • Scheduler activations • Semantics of fork() and exec() system calls • Thread cancellationof target thread • Asynchronous or deferred • Signal handling • Synchronous and asynchronous • Thread pools • Thread-specific data • Create Facility needed for data private to thread
Scheduler Activations • Both many-to-one and many-to-many models require communication to maintain the appropriate number of kernel threads allocated to the application • Scheduler activations provide upcalls- a communication mechanism from the kernel to the thread library • This communication allows an application to maintain the correct number kernel threads
Lightweight Processes • Virtual processor (LWP) between thread library (user thread(s)) and kernel • Not present in all OSes • In some contexts, LWP refers to the kernel thread
Tracking Threads on ranger • ps –eLf command: $ ps -eLf | head -n 1; ps -eLf | grep $USER UID PIDPPIDLWP C NLWP STIME TTY TIME CMD hcarroll 13291 14715 13291 0 2 13:23 pts/22 00:00:00 ./pthreadsExample 55555555555 hcarroll 13291 14715 13292 98 2 13:23 pts/22 00:00:31 ./pthreadsExample 55555555555 hcarroll 13458 14715 13458 0 1 13:24 pts/22 00:00:00 ps -eLf hcarroll 13459 14715 13459 0 1 13:24 pts/22 00:00:00 grep hcarroll root 14712 1 14712 0 1 12:21 ? 00:00:00 sshd: hcarroll [priv] hcarroll 14714 14712 14714 0 1 12:21 ? 00:00:00 sshd: hcarroll@pts/22 hcarroll 14715 14714 14715 0 1 12:21 pts/22 00:00:00 -bash hcarroll 30081 14715 30081 0 1 12:45 pts/22 00:00:00 emacs
Semantics of fork() and exec() • What does fork() do? • What does exec() do? • Does fork() duplicate only the calling thread or all threads for that process? • Discuss with your neighbor(s) how can you test this?
Multi-processes with Multi-threads • http://www.cs.mtsu.edu/~hcarroll/3250/private/code/chapter04/webServerSkeleton.c • http://www.cs.mtsu.edu/~hcarroll/3250/private/code/chapter04/webServerSkeleton-withFork.c • ps -eLf | head -n 1; ps -eLf | grep $USER
Thread Cancellation • Terminating a thread before it has finished • Two general approaches: • Asynchronous cancellation terminates the target thread immediately. • Deferred cancellation allows the target thread to periodically check if it should be cancelled.
Signal Handling • Signals are used in UNIX systems to notify a process that a particular event has occurred. • A signal handleris used to process signals • Signal is generated by particular event • Signal is delivered to a process • Signal is handled • Options for multi-threaded processes: • Deliver the signal to the thread to which the signal applies • Deliver the signal to every thread in the process • Deliver the signal to certain threads in the process • Assign a specific thread to receive all signals for the process
Thread Pools • Create a number of threads in a pool where they await work • Advantages: • Usually slightly faster to service a request with an existing thread than create a new thread • Allows the number of threads in the application(s) to be bound to the size of the pool
Thread Specific Data • Allows each thread to have its own copy of data • Useful when you do not have control over the thread creation process (i.e., when using a thread pool)
Windows XP Threads • Implements the one-to-one mapping, kernel-level • Each thread contains • A thread id • Register set • Separate user and kernel stacks • Private data storage area • The register set, stacks, and private storage area are known as the contextof the threads • The primary data structures of a thread include: • ETHREAD (executive thread block) • KTHREAD (kernel thread block) • TEB (thread environment block)
Linux Threads • fork() and clone() system calls • Doesn’t distinguish between process and thread • Uses term task rather than thread • clone() takes options to determine sharing on process create • struct task_struct points to process data structures (shared or unique)
Recap • Provide 2 programming example in which multithreading provides better performance than a single-threaded solution. • What are at least 2 differences between a kernel thread and a user thread? • What is the difference between creating and managing processes and threads (e.g., what do they share)? • Which of the following components of program state are shared across threads in a multithreaded process? (Exercise 4.10) Register values, Heap memory, Global variables, Stack memory • Consider a multiprocessor system and a multithreaded program written using the many-to-many threading model. Let the number of user-level threads in the program be more than the number of processors in the system. Discuss the performance implications of the following scenarios: (Exercise 4.14) • Number of kernel threads allocated to the program is less than the number of cores. • Number of kernel threads allocated to the program is equal to the number of cores. • Number of kernel threads allocated to the program is greater than the number of cores but less than the number of user-level threads.
Recap • Provide 2 programming example in which multithreading provides better performance than a single-threaded solution. • A Web server that services each request in a separate thread. • A parallelized application such as matrix multiplication where different parts of the matrix may be worked on in parallel. • An interactive GUI program such as a debugger where a thread is used to monitor user input, another thread represents the running application, and a third thread monitors performance. • What are at least 2 differences between a kernel thread and a user thread? • User-level threads are unknown by the kernel, whereas the kernel is aware of kernel threads. • On systems using either M:1 or M:N mapping, user threads are scheduled by the thread library and the kernel schedules kernel threads. • Kernel threads need not be associated with a process whereas every user thread belongs to a process. Kernel threads are generally more expensive to maintain than user threads as they must be represented with a kernel data structure.
Recap • What is the difference between creating and managing processes and threads (e.g., what do they share)? • Process (using fork()): • Makes a COPY of the process (global & local variables) • Shares open files (e.g., pipes) • Execution starts directly after fork() • Threads (using pthread_create() / CreateThread()): • Shares global variables, memory space (e.g., things on the heap), open files, etc. • Execution starts in functions • <Timing example: timing-*.c> • Which of the following components of program state are shared across threads in a multithreaded process? (Exercise 4.10) Register values, Heap memory, Global variables, Stack memory
Recap • Consider a multiprocessor system and a multithreaded program written using the many-to-many threading model. Let the number of user-level threads in the program be more than the number of processors in the system. Discuss the performance implications of the following scenarios: (Exercise 4.10) • The number of kernel threads allocated to the program is less than the number of processors. • Not all of the processors will be utilized • The number of kernel threads allocated to the program is equal to the number of processors. • Whenever a kernel thread is blocked, a processor will go unutilized • The number of kernel threads allocated to the program is greater than the number of processors but less than the number of user-level threads. • Allows for the greatest utilization of processors and the kernel threads will be scheduled on the processors