Revealing the performance aspects in your code. Intel VTune Amplifier XE Generics. Rev.: Sep 1, 2013
- Rebecca May
1 Revealing the performance aspects in your code. Intel VTune Amplifier XE Generics. Rev.: Sep 1, 2013
2 Agenda: Introduction to the Intel VTune Amplifier XE profiler; High-level Features; Types of Analysis; Hotspot analysis (Statistical Call Graph); Lab 1: Find the Performance Hotspot; Concurrency Analysis; Lab 2: Analyzing Parallelism; Locks and Waits Analysis; Lab 3: Identifying Parallelism Issues; User and Synchronization API, Frame/Task Analysis; Lab 4: Instrumenting user source code; Command Line Interface, Installation, Remote Collection; Conclusion.
3 Intel VTune Amplifier XE Performance Profiler. Where is my application spending time? Focus tuning on functions taking time; see call stacks; see time on source. Wasting time? See cache misses on your source; see functions sorted by number of cache misses. Waiting too long? See locks by wait time; red/green for CPU utilization during wait. Windows and Linux; low overhead; no special recompiles. Advanced profiling for scalable multicore performance.
4 Intel VTune Amplifier XE: Tune Applications for Scalable Multicore Performance. Fast, accurate performance profiles: hotspot (statistical call tree), call counts (statistical), hardware-event sampling. Thread profiling: visualize thread interactions on the timeline, balance workloads. Easy set-up: pre-defined performance profiles, use a normal production build. Find answers fast: filter extraneous data, view results on the source / assembly. Compatible: Microsoft, GCC, and Intel compilers; C/C++, Fortran, Assembly, .NET, Java; latest Intel processors and compatible processors (1). Windows or Linux: Visual Studio integration (Windows), standalone user interface and command line, 32- and 64-bit. (1) IA32 and Intel 64 architectures. Many features work with compatible processors. Event-based sampling requires a genuine Intel processor.
5 A set of instruments to identify performance problems: Quick Overview
6 Intel VTune Amplifier XE: Get a quick snapshot. Figure callouts: 4 cores; CPU Usage; Thread Concurrency; Frame Rate.
7 Intel VTune Amplifier XE: Identify hotspots. Figure callouts: Hottest Functions; Hottest Call Stack. Quickly identify what is important.
8 Intel VTune Amplifier XE: Identify threading inefficiency. Figure callouts: Coarse Grain Locks; High Lock Contention; Low Concurrency; Load Imbalance.
9 Intel VTune Amplifier XE: Find Answers Fast. Adjust data grouping (partial list shown); click [+] for call stack; double-click a function to view source; filter by timeline selection (or by grid selection); filter by module and other controls.
10 Intel VTune Amplifier XE: Timeline Visualizes Thread Behavior. Figure callouts: Locks & Waits; Transitions; Hotspots; CPU Time; Lightweight Hotspots; Hovers. Optional: use the API to mark frames and user tasks. Optional: add a mark during collection.
11 Intel VTune Amplifier XE: See Profile Data on Source / Asm. Time on source / asm. Quick asm navigation: select source to highlight asm; click jump to scroll asm. Quickly scroll to hot spots. Right-click for the instruction reference manual.
12 High-level Features
13 Intel VTune Amplifier XE Feature Highlights. Hot Spot Analysis (Statistical Call Graph): locates the time-consuming regions of your application; provides associated call stacks that let you know how you got to these time-consuming regions; the call tree is built using these call stacks. Advanced Hotspot and architecture analysis: based on hardware Event-Based Sampling (EBS); pre-defined tuning experiments. Thread Profiling: visualizes thread activity and lock transitions in the timeline; provides lock profiling capability; shows CPU/core utilization and concurrency information. GPU Compute Performance Analysis: collect GPU data for tuning OpenCL applications; correlate GPU and CPU activities. CPU Power Efficiency Analysis: wake-up rate and frequency measurement per core.
14 Intel VTune Amplifier XE Feature Highlights. Attach to running processes: Hotspot and Concurrency analysis modes can attach to running processes. System-wide data collection: EBS modes allow system-wide data collection, and the tool provides the ability to filter this data. GUI: standalone GUI available on Windows* and Linux; Microsoft* Visual Studio integration. Command line: comprehensive support for regression analysis and remote collection. Platform and application support: Windows* and Linux; Microsoft* .NET/C# applications; Java* and mixed applications; Fortran applications.
15 Intel VTune Amplifier XE Feature Highlights. Event multiplexing: gather more information with each profiling run. Timeline correlation of thread and event data: populates thread active time with event data collected for that thread; ability to filter regions on the timeline. Advanced Source / Assembler View: see event data graphed on the source / assembler; view and analyze assembly as basic blocks; review the quality of vectorization in the assembly code display of your hot spot. Pre-defined tuning experiments: predefined profiles for quick analysis configuration; a user profile can be created on the basis of a predefined profile. User API: a rich set of user APIs for collection control, event highlighting, code instrumentation, and visualization enhancement.
16 Data Collectors and Analysis Types
17 Intel VTune Amplifier XE Analysis Types (based on technology). Software Collector (any x86 processor, any virtual, no driver): Basic Hotspots: which functions use the most time? Concurrency: tune parallelism; colors show the number of cores used. Locks and Waits: tune the #1 cause of slow threaded performance, waiting with idle cores. Hardware Collector (higher resolution, lower overhead, system-wide): Advanced Hotspots: which functions use the most time? Where to inline? Statistical call counts. General Exploration: where is the biggest opportunity? Cache misses? Branch mispredictions? Advanced Analysis: dig deep to tune bandwidth, cache misses, access contention, etc.
18 Intel VTune Amplifier XE Pre-defined Analysis Types. Figure callouts: Advanced Hotspot analysis based on the underlying architecture; user-mode sampling, Threading, I/O, and Signaling API instrumentation; 3rd Generation Core Architecture (a.k.a. SandyBridge) analysis types; 4th Generation Core Architecture (a.k.a. Haswell) analysis types.
19 GUI Layout
20 GUI Layout: Creating a Project
21 GUI Layout: Selecting the type of data collection. Figure callouts: all available analysis types; different ways to start the analysis; help with creating new analysis types; copy the command line to the clipboard.
22 Profile a Running Application. No need to stop and relaunch the application when profiling. Two techniques: Attach to Process (Hotspot, Concurrency, Locks & Waits) and Profile System (Advanced Hotspots and custom EBS; optionally filter by process after collection).
23 GUI Layout: Summary View. Clicking the Summary tab shows a high-level summary of the run: timing for the whole application run, a list of 5 hotspot functions, and CPU usage.
24 GUI Layout: Bottom-Up View. Figure callouts: menu and tool bars; analysis type; current grouping; viewpoint currently being used; tabs within each result; grid area; stack pane; filter area; timeline area.
25 GUI Layout: Top-Down View. Clicking the Top-Down Tree tab changes the stack representation in the grid. Figure callouts: top-level function and its tree; Total Time (self + children's); Self Time.
26 GUI Layout: Caller/Callee View. Select a function in the Bottom-Up view and find its callers and callees. Figure callouts: list of functions sorted by CPU Time; list of callers and their stacks; list of callees and their stacks.
27 GUI Controls: Adding User Marks to the Timeline. Start the application without data collection; resume data collection when needed; observe the paused region on the timeline. Click Mark Timeline during collection; observe the mark on the timeline.
28 Key Result Analysis and GUI Concepts
29 VTune Amplifier XE Key Concepts: Viewpoints. A viewpoint is a pre-defined view that determines what is displayed in the grid and timeline for a given analysis type. An analysis type may support more than one viewpoint. To change viewpoints, select a viewpoint by clicking the control marked "Click" on the slide.
30 Intel VTune Amplifier XE Key Concepts: Groupings. Each analysis type has many viewpoints, and each viewpoint has pre-defined groupings. Groupings allow you to analyze the data in different hierarchies and granularities.
31 Viewpoints and Groupings. For example, pre-defined groupings can be used to determine load imbalance: change the grouping to Function/Thread.
32 Intel VTune Amplifier XE Key Concepts. VTune Amplifier XE allows comparison of two similar runs, which is extremely useful for benchmarking, regression analysis, and testing. During performance optimization work the source code may change. Binary recompiled: compare based on source function. Inside a function: compare based on function level. Functions changed: group by source files and compare. Source files changed: compare by modules.
33 Results Comparison
34 Lab Activities: Analysis Types Revisited
35 Methodology of performance profiling and tuning (reminder). The goal: minimize the time it takes your program / module / function to execute. Identify hotspots and focus on them: it's just a few functions (20% of the code does 80% of the job). Optimize them (with compiler or hand optimizations). Check for hotspots again, and find new ones. How to optimize the hotspots? Maximize CPU utilization and minimize elapsed time. Ensure the CPU is busy all the time: all cores busy (parallelism), busy with useful tasks. Optimize task execution.
36 Intel VTune Amplifier XE Terminology. Elapsed Time: the total time your target application ran, that is, the wall clock time at the end of the application minus the wall clock time at the start of the application. CPU Time: the amount of time a thread spends executing on a logical processor; for multiple threads, the CPU time of the threads is summed. Wait Time: the amount of time that a given thread waited for some event to occur, such as synchronization waits and I/O waits.
37 Intel VTune Amplifier XE CPU Usage. [Figure: timeline of Thread1, Thread2, and Thread3 in 1-second slices, each slice marked as thread running or thread waiting.] Elapsed Time: 6 seconds. CPU Time: T1 (4 s) + T2 (2 s) + T3 (2 s) = 8 seconds. Wait Time: T1 (2 s) + T2 (3 s) + T3 (2 s) = 7 seconds.
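The arithmetic on this slide can be reproduced with a few lines of C. This is only an illustrative sketch: the `thread_time` struct and the function names are hypothetical and are not part of VTune Amplifier XE. The average CPU usage helper assumes the simple ratio of summed CPU time to elapsed time.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical per-thread record: seconds spent running and waiting. */
typedef struct {
    double run_s;
    double wait_s;
} thread_time;

/* CPU Time: running time summed over all threads. */
double total_cpu_time(const thread_time *t, size_t n)
{
    assert(t != NULL || n == 0);
    double sum = 0.0;
    for (size_t i = 0; i < n; ++i)
        sum += t[i].run_s;
    return sum;
}

/* Wait Time: waiting time summed over all threads. */
double total_wait_time(const thread_time *t, size_t n)
{
    assert(t != NULL || n == 0);
    double sum = 0.0;
    for (size_t i = 0; i < n; ++i)
        sum += t[i].wait_s;
    return sum;
}

/* Average number of logical CPUs kept busy over the run
 * (assumption: summed CPU time divided by elapsed time). */
double average_cpu_usage(double cpu_time_s, double elapsed_s)
{
    assert(elapsed_s > 0.0);
    return cpu_time_s / elapsed_s;
}
```

With the slide's values (T1: 4 s running, 2 s waiting; T2: 2 s and 3 s; T3: 2 s and 2 s) the sums are 8 s of CPU time and 7 s of wait time over a 6 s run, so on average fewer than two cores were busy.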
38 Hotspot analysis. Displays hot functions in your application; shows the most time-consuming call sequences (Statistical Call Graph); includes a timeline view of the threads in your application. Start the analysis: Basic Hotspot Analysis.
39 Hotspot analysis: Summary. Note Elapsed Time and CPU Time.
40 Hotspot analysis: Summary (continued). Note the overall CPU Usage; note the number of CPUs available on the platform.
41 Hotspots analysis: Hotspot functions. Figure callouts: adjust data grouping; change viewpoint; hotspot functions; function CPU time (partial list shown); click [+] for call stack; call stack; thread timeline; filter by timeline selection (or by grid selection); filter by module and other controls.
42 Hotspots analysis: Hotspot functions by CPU usage. Figure callouts: double-click a function to view source; coloring of CPU Time by CPU utilization; Overhead and Spin Time; Overhead and Spin on the timeline.
43 Hotspots analysis: Source View. Figure callouts: Source View; Assembly View; Self and Total Time on source / asm; right-click for the instruction reference manual; quick asm navigation (select source to highlight asm; click jump to scroll asm); quickly scroll to hot spots (the scroll bar heat map is an overview of hot spots).
44 Lab 1: Find the Performance Hotspot
45 Methodology of performance profiling and tuning (reminder). How to optimize the hotspots? Maximize CPU utilization and minimize elapsed time. Ensure the CPU is busy all the time: all cores busy (parallelism, high concurrency). [Figure: bar chart of time for three cases, Serial, 4 threads (T1 to T4), and 4T optimal, with Elapsed (Serial), Elapsed (N-threads), and Elapsed (Serial) / N marked, together with the Gain and the Potential Gain.]
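The quantities named in the chart can be written down as simple formulas. The sketch below assumes the usual reading of the figure: the ideal elapsed time with N threads is the serial elapsed time divided by N, the gain is what the threaded run already saved, and the potential gain is what is still left between the measured threaded run and the ideal. The function names are hypothetical.

```c
#include <assert.h>

/* Ideal elapsed time with perfect scaling: Elapsed (Serial) / N. */
double ideal_elapsed(double serial_s, int n_threads)
{
    assert(n_threads > 0);
    return serial_s / n_threads;
}

/* Gain already achieved: Elapsed (Serial) - Elapsed (N-threads). */
double achieved_gain(double serial_s, double threaded_s)
{
    return serial_s - threaded_s;
}

/* Potential gain still available: Elapsed (N-threads) - Elapsed (Serial) / N. */
double potential_gain(double serial_s, double threaded_s, int n_threads)
{
    return threaded_s - ideal_elapsed(serial_s, n_threads);
}

/* Speedup of the threaded run relative to the serial run. */
double speedup(double serial_s, double threaded_s)
{
    assert(threaded_s > 0.0);
    return serial_s / threaded_s;
}
```

For example, a serial run of 8 s that takes 3 s on 4 threads has an ideal time of 2 s, an achieved gain of 5 s, and a potential gain of 1 s.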
46 Intel VTune Amplifier XE Terminology. Concurrency is a measurement of the number of active threads. [Figure: timeline of Thread1, Thread2, and Thread3 in 1-second slices, each slice marked as thread running or thread waiting, with a Concurrency Summary.]
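A minimal sketch of how a concurrency level per time slice could be derived from a running/waiting timeline like the one in the figure. The data layout (a row-major array of 0/1 flags) and the function names are assumptions made for illustration; the tool's own definition of an active thread may be broader than "running".

```c
#include <assert.h>
#include <stddef.h>

/* Count the running threads in each time slice.
 * running[t * n_slices + s] is nonzero when thread t runs in slice s.
 * levels[s] receives the concurrency level of slice s. */
void concurrency_per_slice(const int *running, size_t n_threads,
                           size_t n_slices, int *levels)
{
    assert(running != NULL && levels != NULL);
    for (size_t s = 0; s < n_slices; ++s) {
        levels[s] = 0;
        for (size_t t = 0; t < n_threads; ++t)
            if (running[t * n_slices + s])
                levels[s]++;
    }
}

/* Average concurrency over the whole run. */
double average_concurrency(const int *levels, size_t n_slices)
{
    assert(levels != NULL && n_slices > 0);
    long sum = 0;
    for (size_t s = 0; s < n_slices; ++s)
        sum += levels[s];
    return (double)sum / (double)n_slices;
}
```

A low average relative to the number of cores suggests that threads spend much of the run waiting rather than working in parallel.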
47 Intel VTune Amplifier XE Parallelism/Concurrency Analysis. For Parallelism / Concurrency analysis, stack sampling is done just like in Hotspots analysis. Wait functions are instrumented (e.g. WaitForSingleObject, EnterCriticalSection). Signal functions are instrumented (e.g. SetEvent, LeaveCriticalSection). I/O functions are instrumented (e.g. ReadFile, socket). Start the analysis: Concurrency Analysis.
48 Concurrency Analysis: Summary. Figure callouts: concurrency levels; adjustable metrics.
49 Concurrency Analysis: Summary, Concurrency vs. CPU Usage Histogram. Threads might be in an active state but not using the CPU.
50 Concurrency View. Figure callouts: concurrency level; overhead; wait; thread is running; thread is waiting; thread transitions.
51 Concurrency Timeline. Investigate reasons for transitions: select and zoom; hover over a transition line.
52 Source Code View by Concurrency. Concurrency coloring for CPU Time against source lines.
53 Lab 2: Analyzing Parallelism
54 Waiting on locks. [Figure: timeline of Thread1, Thread2, and Thread3 in 1-second slices between "Start main thread" and "Stop main thread"; threads wait on sync objects until signaled, and slices are marked as thread running, thread waiting, or idle.] Calculating Wait and Idle time.
55 Intel VTune Amplifier XE Locks and Waits Analysis. Identifies the threading items that cause the most thread block time: synchronization locks, threading APIs, I/O. Start the analysis: Locks & Waits Analysis.
56 Locks and Waits View. Figure callouts: grouping by sync object; wait objects; CPU utilization; Waits #; Spinning; stack for the wait object.
57 Locks-and-Waits Source View. Figure callouts: wait count; Critical Section object; waiting time on the Critical Section.
58 Lab 3: Finding Parallelism Issues
59 Intel VTune Amplifier XE User APIs: Collection Control API; Thread Naming API; User-Defined Synchronization API; Task API; User Event API; Frame API; JIT Profiling API.
60 Intel VTune Amplifier XE User APIs. The user APIs enable you to: control collection; set marks during the execution of specific code; specify custom synchronization primitives implemented without standard system APIs. To use the user APIs: include ittnotify.h, located at <install_dir>/include; insert __itt_* notifications in your code; link to the libittnotify.lib file located at <install_dir>/lib.
61 Intel VTune Amplifier XE User APIs
Collection Control APIs:
    void __itt_pause(void)
        Run the application without collecting data. VTune Amplifier XE reduces the overhead of collection by collecting only critical information, such as thread and process creation.
    void __itt_resume(void)
        Resume data collection. VTune Amplifier XE resumes collecting all data.
Thread naming APIs:
    void __itt_thread_set_name(const __itt_char *name)
        Set the thread name using a char or Unicode string, where name is the thread name.
    void __itt_thread_ignore(void)
        Indicate that this thread should be ignored in the analysis. It will not affect the concurrency of the application and will not be visible in the Timeline pane.
62 Intel VTune Amplifier XE Collection Control APIs
    #include <ittnotify.h>

    int main(int argc, char* argv[])
    {
        dosomeinitializationwork();
        __itt_resume();              // start collecting data here
        while (grunning) {
            dosomedataparallelwork();
        }
        __itt_pause();               // stop collecting before finalization
        dosomefinalizationwork();
        return 0;
    }
63 Intel VTune Amplifier XE User-defined synchronization API
    long spin = 1;
    __itt_sync_prepare((void *) &spin);
    while (resourcebusy);   // spin wait
    __itt_sync_acquired((void *) &spin);
    // Use shared resource
    __itt_sync_releasing((void *) &spin);
    // Code here should free the resource
64 Intel VTune Amplifier XE User Event APIs. Useful to observe when certain events occur in your application or to identify how long certain regions of code take to execute. Event APIs enable you to annotate an application when certain events occur:
    __itt_event __itt_event_create(char *, int);
    __itt_event_start(__itt_event);
    __itt_event_end(__itt_event);
65 Intel VTune Amplifier XE User Event APIs
    __itt_event __itt_event_create(const __itt_char *name, int namelen);
        Create a user event type with the specified name. This API returns a handle to the user event type that should be passed into the following APIs as a parameter. The namelen parameter refers to the number of characters, not the number of bytes.
    int __itt_event_start(__itt_event event);
        Call this API with an already created user event handle to register an instance of that event. This event appears in the Timeline pane display as a tick mark.
    int __itt_event_end(__itt_event event);
        Call this API following a call to __itt_event_start() to show the user event as a tick mark with a duration line from start to end. If this API is not called, the user event appears in the Timeline pane as a single tick mark.
66 Intel VTune Amplifier XE User Events: using APIs
    DWORD WINAPI aiwork(LPVOID lparg)
    {
        int tid = *((int*)lparg);
        __itt_event aievent;
        aievent = __itt_event_create("ai Thread Work", 14);   // 14 = characters in the name
        while (grunning) {
            WaitForSingleObject(bSignal[tid], INFINITE);
            __itt_event_start(aievent);
            dosomedataparallelwork();
            __itt_event_end(aievent);
            SetEvent(eSignal[tid]);
        }
        return 0;
    }
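Because the name length passed when creating an event counts characters rather than bytes, it can be safer to compute it than to hard-code it. The helpers below are a hypothetical sketch using only the C standard library; they are not part of the user API.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>
#include <wchar.h>

/* Character count of a narrow (char) event name. */
int event_name_len(const char *name)
{
    assert(name != NULL);
    return (int)strlen(name);
}

/* Character count of a wide (Unicode) event name. */
int event_name_len_w(const wchar_t *name)
{
    assert(name != NULL);
    return (int)wcslen(name);
}

/* Storage size in bytes of a wide name, excluding the terminator.
 * Shown only to contrast with the character count above. */
size_t event_name_bytes_w(const wchar_t *name)
{
    assert(name != NULL);
    return wcslen(name) * sizeof(wchar_t);
}
```

For the name used on the slide both character counts are 14, while the byte size of the wide string depends on the platform's `wchar_t` width.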
67 Intel VTune Amplifier XE Visualizing Events in the Timeline View. Figure callout: user-defined task.
68 Intel VTune Amplifier XE Frame Analysis: Analyze Long Latency Activity. Frame: a region executed repeatedly (non-overlapping). The API marks start and finish; DirectX frames are detected automatically. Examples: Game: compute next graphics frame. Simulator: time step loop. Computation: convergence loop. [Figure: application source with a region (frame) marked by the frame API.]
    void algorithm_1();
    void algorithm_2(int myid);
    double getseconds();
    DWORD WINAPI do_xform(void * lpmyid);
    bool checkresults();

    __itt_domain* pd = __itt_domain_create("mydomain");

    while (grunning) {
        __itt_frame_begin_v3(pd, NULL);
        ... // Do Work ...
        __itt_frame_end_v3(pd, NULL);
    }

    for (int k = 0; k < N; ++k) { int ik = i*n + k; int kj = k*n + j; c2[ij] += a[ik]*b[kj]; }
69 Intel VTune Amplifier XE Frame APIs
    __itt_domain* __itt_domain_create(const __itt_char *name);
        Create a domain with a domain name. Since the domain is expected to be static over the application's execution time, there is no mechanism to destroy a domain. Any domain can be accessed by any thread in the process, regardless of which thread created the domain. This call is thread-safe.
    void __itt_frame_begin_v3(const __itt_domain *domain, __itt_id *id);
        Define the beginning of the frame instance. A __itt_frame_begin_v3 call must be paired with a __itt_frame_end_v3 call. Successive calls to __itt_frame_begin_v3 with the same ID are ignored until a call to __itt_frame_end_v3 with the same ID. domain is the domain for this frame instance. id is the instance ID for this frame instance, or NULL.
    void __itt_frame_end_v3(const __itt_domain *domain, __itt_id *id);
        Define the end of the frame instance. A __itt_frame_end_v3 call must be paired with a __itt_frame_begin_v3 call. The first call to __itt_frame_end_v3 with a given ID ends the frame. Successive calls with the same ID are ignored, as are calls that do not have a matching __itt_frame_begin_v3 call. domain is the domain for this frame instance. id is the instance ID for this frame instance, or NULL for the current instance.
70 Frame Analysis: Using APIs
    __itt_domain* pd = __itt_domain_create("SimDomain");
    while (grunning) {
        __itt_frame_begin_v3(pd, NULL);
        start = clock();
        // Wait for all threads before moving into the next frame
        WaitForMultipleObjects(FUNCTIONAL_DOMAINS, eSignal, TRUE, INFINITE);
        stop = clock();
        // Give all threads the "go" signal
        for (int i = 0; i < FUNCTIONAL_DOMAINS; i++)
            SetEvent(bSignal[i]);
        if (frame % NETWORKCONNETION_FREQ == 0) {
            // Start network thread
            SetEvent(bNetSignal);
        }
        __itt_frame_end_v3(pd, NULL);
    }
71 Summary View / Frame Rate Chart. Adjust the frame rate, then apply changes.
72 Frame Analysis: Find Slow Frames With One Click. (1) Regroup data. Before: list of functions taking time (partial list shown). After: list of slow frames.
73 Just 2 more clicks show where to focus tuning: slow functions in slow frames. (1) Only show slow frames. (2) Regroup: show functions. Result: functions taking a lot of time in slow frames.
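The idea behind "slow frames" can be sketched outside the tool: each frame's duration gives a frame rate, and frames whose rate falls below a chosen threshold are the ones to inspect. The function names and the threshold parameter below are assumptions for illustration and do not describe how VTune Amplifier XE classifies frames internally.

```c
#include <assert.h>
#include <stddef.h>

/* Frame rate (frames per second) implied by one frame's duration. */
double frame_rate(double duration_s)
{
    assert(duration_s > 0.0);
    return 1.0 / duration_s;
}

/* Collect the indices of frames whose rate is below min_fps.
 * slow_idx must have room for n entries; returns the number found. */
size_t find_slow_frames(const double *duration_s, size_t n,
                        double min_fps, size_t *slow_idx)
{
    assert(duration_s != NULL && slow_idx != NULL);
    size_t count = 0;
    for (size_t i = 0; i < n; ++i)
        if (frame_rate(duration_s[i]) < min_fps)
            slow_idx[count++] = i;
    return count;
}
```

Given frame durations of 16 ms, 50 ms, 20 ms, and 100 ms and a 30 fps threshold, the second and fourth frames are reported as slow.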
74 Intel VTune Amplifier XE Task APIs. A task is a logical unit of work performed by a particular thread. Tasks can be nested. You can use task APIs to assign tasks to threads. One thread executes one task at a given time. Tasks may correspond to functions, scopes, or a case block in a switch statement.
75 Task API primitives
    void ITTAPI __itt_task_begin(const __itt_domain *domain, __itt_id taskid, __itt_id parentid, __itt_string_handle *name)
        Create a task instance on a thread. This becomes the current task instance for that thread. A call to __itt_task_end() on the same thread ends the current task instance.
    void ITTAPI __itt_task_begin_fn(const __itt_domain *domain, __itt_id taskid, __itt_id parentid, void *fn)
        Begin a task instance on a thread.
    void ITTAPI __itt_task_end(const __itt_domain *domain)
        End a task instance on a thread.
Parameters:
    __itt_domain: the domain of the task.
    __itt_id taskid: this is a reserved parameter.
    __itt_id parentid: this is a reserved parameter.
    __itt_string_handle: the task string handle.
    void *fn: this is a reserved parameter.
76 Task API usage
    __itt_domain* domain = __itt_domain_create(L"task Domain");
    __itt_string_handle* UserTask = __itt_string_handle_create(L"usertask");
    __itt_string_handle* UserSubTask = __itt_string_handle_create(L"UserSubTask");

    int main(int argc, char* argv[])
    {
        ...
        __itt_task_begin(domain, __itt_null, __itt_null, UserTask);
        // create many threads to call work()
        __itt_task_end(domain);
        ...
    }

    int work()
    {
        __itt_task_begin(domain, __itt_null, __itt_null, UserSubTask);
        do_foo();
        __itt_task_end(domain);
        return 0;
    }
77 Using Task API: Hotspots analysis, Bottom-up pane
78 Using Task API: Hotspots analysis, Tasks pane
79 Lab 4: Instrumenting user source code
80 Windows & Linux Versions Available. Stand-alone GUI, command line, and Visual Studio integration. Microsoft Windows* OS: Windows XP* (1), Windows 7*, Windows 8 Desktop*; Windows Server* 2003, 2008; Microsoft Visual Studio* 2008, 2010, and 2012; standalone GUI and command line; IA32 and Intel 64. Linux* OS: RHEL*, Fedora*, SUSE*, CentOS*, Ubuntu* (additional distributions may also work); standalone GUI and command line; IA32 and Intel 64. Single-user and floating licenses available.
81 Installation. Windows: integrated into Microsoft Visual Studio, or standalone; administrative privileges are required for the full package; GUI and command-line versions are both installed. Linux: standalone GUI and command-line versions; root access is not required, but without it the event-based sampling collectors won't be installed (the hotspot collector with calling sequences will still work); a data collection-only installation option enables collection with no license, with the collection results then copied to a system with a license for viewing; the driver for event-based sampling is built at install time and can be insmod'd at install time and boot time.
82 Intel VTune Amplifier XE Command Line Interface. Command line (CLI) versions exist on Linux* and Windows*. CLI use cases: test code changes for performance regressions; automate execution of performance analyses. CLI features: fine-grained control of all analysis types and options; text-based analysis reports; analysis results can be opened in the graphical user interface.
83 Intel VTune Amplifier XE Command Line Interface: Examples
Display a list of available analysis types and preset configuration levels:
    amplxe-cl -collect-list
Run Hot Spot analysis on target myapp and store the result in a default-named directory, such as r000hs:
    amplxe-cl -c hotspots -- myapp
Run the Parallelism analysis and store the result in directory r001par:
    amplxe-cl -c parallelism -result-dir r001par -- myapp
84 Intel VTune Amplifier XE Command Line Interface: Reporting
    $> amplxe-cl -report summary -r /home/user1/examples/lab2/r003cc
    Summary
    Average Concurrency:
    Elapsed Time:
    CPU Time:
    Wait Time:
    CPU Usage:
    Executing actions 100 % done
85 Intel VTune Amplifier XE Command Line Interface: gprof-like output
86 Intel VTune Amplifier XE Command Line Interface: CSV output
Example:
    $> amplxe-cl -report hotspots -csv-delimiter=comma -format=csv -report-out=testing111 -r r003cc
    Function,Module,CPU Time,Idle:CPU Time,Poor:CPU Time,Ok:CPU Time,Ideal:CPU Time,Over:CPU Time
    CLHEP::RanecuEngine::flat,test40,50.751,0,0.050,0.081,0.080,
    G4UniversalFluctuation::SampleFluctuations,test40,32.730,0,0.030,0.070,0.010,
    sqrt,test40,19.060,0,0.010,0.070,0.030,
    G4Track::GetVelocity,test40,15.330,0,0.030,0.030,0.040,
    G4VoxelNavigation::LevelLocate,test40,14.460,0,0.020,0.010,0.040,
    G4Step::UpdateTrack,test40,14.090,0,0,0.030,0.020,
    G4NavigationLevelRep::G4NavigationLevelRep,test40,13.721,0,0.030,0.020,0.040,
    exp,test40,13.438,0,0.038,0.010,0.060,
    log,test40,13.340,0,0.180,0.020,0.110,
    G4PhysicsVector::GetValue,test40,11.970,0,0.020,0.020,0.050,
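A CSV report like this lends itself to the regression-testing use case mentioned for the CLI. The sketch below pulls the CPU Time column out of one report row and compares it with a baseline. It is a simplified illustration: the function names and the tolerance parameter are hypothetical, and the naive comma splitting assumes the function name itself contains no commas (which would not hold for some C++ template names).

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Return the CPU Time (third column) of one CSV row,
 * or -1.0 when the row is malformed or the field is not numeric. */
double csv_cpu_time(const char *row)
{
    assert(row != NULL);
    const char *p = row;
    /* Skip the Function and Module columns. */
    for (int col = 0; col < 2; ++col) {
        p = strchr(p, ',');
        if (p == NULL)
            return -1.0;
        ++p;
    }
    char *end;
    double value = strtod(p, &end);
    if (end == p)
        return -1.0;            /* nothing numeric, e.g. the header row */
    return value;
}

/* Nonzero when the current time exceeds the baseline by more than
 * the given fraction (0.05 means 5 percent). */
int is_regression(double baseline_s, double current_s, double tolerance)
{
    return current_s > baseline_s * (1.0 + tolerance);
}
```

A script driving such a check would typically run the collection and report commands for each build, then compare the rows of interest against stored baseline values.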
87 Remote Data Collection. Local system: VTune Amplifier XE full user interface. Remote system: lightweight command-line collector. Steps: 1. Set up the experiment using the GUI locally. 2. Copy the command-line instructions to the paste buffer. 3. Open a remote shell on the target system. 4. Paste the command line and run the collection. 5. Copy the result to your system. 6. Open the file using the local GUI. One typical model: collect on Linux, analyze and display on Windows. The Linux machine is the target: collect data on the Linux system using the command-line tool (doesn't require a license); copy the resulting performance data files to a Windows* system; analyze and display the results on the Windows* system (requires a license).
88 Summary. The Intel VTune Amplifier XE can be used to: find the source code responsible for performance bottlenecks; characterize the amount of parallelism in an application; determine which synchronization locks or APIs are limiting the parallelism in an application; understand problems limiting CPU instruction-level parallelism; instrument user code for a better understanding of the execution flow defined by threading runtimes.
89 Questions?
90 Legal Disclaimer & Optimization Notice. INFORMATION IN THIS DOCUMENT IS PROVIDED "AS IS". NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO THIS INFORMATION INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. Copyright, Intel Corporation. All rights reserved. Intel, the Intel logo, Xeon, Core, VTune, and Cilk are trademarks of Intel Corporation in the U.S. and other countries. Optimization Notice: Intel's compilers may or may not optimize to the same degree for non-Intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Notice revision #
An Oracle White Paper September 2013. Advanced Java Diagnostics and Monitoring Without Performance Overhead
An Oracle White Paper September 2013 Advanced Java Diagnostics and Monitoring Without Performance Overhead Introduction... 1 Non-Intrusive Profiling and Diagnostics... 2 JMX Console... 2 Java Flight Recorder...
Power Benefits Using Intel Quick Sync Video H.264 Codec With Sorenson Squeeze
Power Benefits Using Intel Quick Sync Video H.264 Codec With Sorenson Squeeze Whitepaper December 2012 Anita Banerjee Contents Introduction... 3 Sorenson Squeeze... 4 Intel QSV H.264... 5 Power Performance...
Intel Application Software Development Tool Suite 2.2 for Intel Atom processor. In-Depth
Application Software Development Tool Suite 2.2 for Atom processor In-Depth Contents Application Software Development Tool Suite 2.2 for Atom processor............................... 3 Features and Benefits...................................
Intel 810 and 815 Chipset Family Dynamic Video Memory Technology
Intel 810 and 815 Chipset Family Dynamic Video Technology Revision 3.0 March 2002 March 2002 1 Information in this document is provided in connection with Intel products. No license, express or implied,
MAQAO Performance Analysis and Optimization Tool
MAQAO Performance Analysis and Optimization Tool Andres S. CHARIF-RUBIAL [email protected] Performance Evaluation Team, University of Versailles S-Q-Y http://www.maqao.org VI-HPS 18 th Grenoble 18/22
Intel System Event Log (SEL) Viewer Utility
Intel System Event Log (SEL) Viewer Utility User Guide Document No. E12461-005 Legal Statements INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS FOR THE GENERAL PURPOSE OF SUPPORTING
Intel Service Assurance Administrator. Product Overview
Intel Service Assurance Administrator Product Overview Running Enterprise Workloads in the Cloud Enterprise IT wants to Start a private cloud initiative to service internal enterprise customers Find an
ALTIRIS Software Delivery Solution for Windows 6.1 SP3 Product Guide
ALTIRIS Software Delivery Solution for Windows 6.1 SP3 Product Guide Notice Altiris Software Delivery Solution for Windows 6.1 SP3 Product Guide 2007 Altiris, Inc. All rights reserved. Document Date: February
NVIDIA GeForce Experience
NVIDIA GeForce Experience DU-05620-001_v02 October 9, 2012 User Guide TABLE OF CONTENTS 1 NVIDIA GeForce Experience User Guide... 1 About GeForce Experience... 1 Installing and Setting Up GeForce Experience...
Intel Many Integrated Core Architecture: An Overview and Programming Models
Intel Many Integrated Core Architecture: An Overview and Programming Models Jim Jeffers SW Product Application Engineer Technical Computing Group Agenda An Overview of Intel Many Integrated Core Architecture
Intel System Event Log (SEL) Viewer Utility
Intel System Event Log (SEL) Viewer Utility User Guide Document No. E12461-007 Legal Statements INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS FOR THE GENERAL PURPOSE OF SUPPORTING
DELL. Virtual Desktop Infrastructure Study END-TO-END COMPUTING. Dell Enterprise Solutions Engineering
DELL Virtual Desktop Infrastructure Study END-TO-END COMPUTING Dell Enterprise Solutions Engineering 1 THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND TECHNICAL
Accomplish Optimal I/O Performance on SAS 9.3 with
Accomplish Optimal I/O Performance on SAS 9.3 with Intel Cache Acceleration Software and Intel DC S3700 Solid State Drive ABSTRACT Ying-ping (Marie) Zhang, Jeff Curry, Frank Roxas, Benjamin Donie Intel
How to Configure Intel Ethernet Converged Network Adapter-Enabled Virtual Functions on VMware* ESXi* 5.1
How to Configure Intel Ethernet Converged Network Adapter-Enabled Virtual Functions on VMware* ESXi* 5.1 Technical Brief v1.0 February 2013 Legal Lines and Disclaimers INFORMATION IN THIS DOCUMENT IS PROVIDED
GPU Usage. Requirements
GPU Usage Use the GPU Usage tool in the Performance and Diagnostics Hub to better understand the high-level hardware utilization of your Direct3D app. You can use it to determine whether the performance
Vendor Update Intel 49 th IDC HPC User Forum. Mike Lafferty HPC Marketing Intel Americas Corp.
Vendor Update Intel 49 th IDC HPC User Forum Mike Lafferty HPC Marketing Intel Americas Corp. Legal Information Today s presentations contain forward-looking statements. All statements made that are not
Intel System Event Log (SEL) Viewer Utility. User Guide SELViewer Version 10.0 /11.0 December 2012 Document number: G88216-001
Intel System Event Log (SEL) Viewer Utility User Guide SELViewer Version 10.0 /11.0 December 2012 Document number: G88216-001 Legal Statements INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH
Zing Vision. Answering your toughest production Java performance questions
Zing Vision Answering your toughest production Java performance questions Outline What is Zing Vision? Where does Zing Vision fit in your Java environment? Key features How it works Using ZVRobot Q & A
NetBeans Profiler is an
NetBeans Profiler Exploring the NetBeans Profiler From Installation to a Practical Profiling Example* Gregg Sporar* NetBeans Profiler is an optional feature of the NetBeans IDE. It is a powerful tool that
Debugging with TotalView
Tim Cramer 17.03.2015 IT Center der RWTH Aachen University Why to use a Debugger? If your program goes haywire, you may... ( wand (... buy a magic... read the source code again and again and...... enrich
PTC System Monitor Solution Training
PTC System Monitor Solution Training Patrick Kulenkamp June 2012 Agenda What is PTC System Monitor (PSM)? How does it work? Terminology PSM Configuration The PTC Integrity Implementation Drilling Down
SQL Server 2008 Performance and Scale
SQL Server 2008 Performance and Scale White Paper Published: February 2008 Updated: July 2008 Summary: Microsoft SQL Server 2008 incorporates the tools and technologies that are necessary to implement
Intel Data Direct I/O Technology (Intel DDIO): A Primer >
Intel Data Direct I/O Technology (Intel DDIO): A Primer > Technical Brief February 2012 Revision 1.0 Legal Statements INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE,
The Foundation for Better Business Intelligence
Product Brief Intel Xeon Processor E7-8800/4800/2800 v2 Product Families Data Center The Foundation for Big data is changing the way organizations make business decisions. To transform petabytes of data
An Oracle White Paper March 2013. Load Testing Best Practices for Oracle E- Business Suite using Oracle Application Testing Suite
An Oracle White Paper March 2013 Load Testing Best Practices for Oracle E- Business Suite using Oracle Application Testing Suite Executive Overview... 1 Introduction... 1 Oracle Load Testing Setup... 2
VTune Performance Analyzer Essentials
VTune Performance Analyzer Essentials The complete book is available from shopintel.com at a special discount for VTune analyzer users. Click the book cover and enter the promotional code vta2006 at the
Intel 845G/GL Chipset Dynamic Video Memory Technology
R Intel 845G/GL Chipset Dynamic Video Memory Technology Revision 1.2 June 2002 May 2002 1 Information in this document is provided in connection with Intel products. No license, express or implied, by
Intel Media Server Studio Professional Edition for Windows* Server
Intel Media Server Studio 2015 R3 Professional Edition for Windows* Server Release Notes Overview What's New System Requirements Installation Installation Folders Known Limitations Legal Information Overview
Performance Tuning Guide for ECM 2.0
Performance Tuning Guide for ECM 2.0 Rev: 20 December 2012 Sitecore ECM 2.0 Performance Tuning Guide for ECM 2.0 A developer's guide to optimizing the performance of Sitecore ECM The information contained
Accelerating High-Speed Networking with Intel I/O Acceleration Technology
White Paper Intel I/O Acceleration Technology Accelerating High-Speed Networking with Intel I/O Acceleration Technology The emergence of multi-gigabit Ethernet allows data centers to adapt to the increasing
Remote Control 5.1.2. Tivoli Endpoint Manager - TRC User's Guide
Tivoli Remote Control 5.1.2 Tivoli Endpoint Manager - TRC User's Guide Tivoli Remote Control 5.1.2 Tivoli Endpoint Manager - TRC User's Guide Note Before using this information and the product it supports,
Intel Integrated Native Developer Experience (INDE): IDE Integration for Android*
Intel Integrated Native Developer Experience (INDE): IDE Integration for Android* 1.5.8 Overview IDE Integration for Android provides productivity-oriented design, coding, and debugging tools for applications
Quick Start Guide for VMware and Windows 7
PROPALMS VDI Version 2.1 Quick Start Guide for VMware and Windows 7 Rev. 1.1 Published: JULY-2011 1999-2011 Propalms Ltd. All rights reserved. The information contained in this document represents the
Analyzing Network Servers. Disk Space Utilization Analysis. DiskBoss - Data Management Solution
DiskBoss - Data Management Solution DiskBoss provides a large number of advanced data management and analysis operations including disk space usage analysis, file search, file classification and policy-based
VMware vrealize Operations for Horizon Administration
VMware vrealize Operations for Horizon Administration vrealize Operations for Horizon 6.1 This document supports the version of each product listed and supports all subsequent versions until the document
Evaluating Intel Virtualization Technology FlexMigration with Multi-generation Intel Multi-core and Intel Dual-core Xeon Processors.
Evaluating Intel Virtualization Technology FlexMigration with Multi-generation Intel Multi-core and Intel Dual-core Xeon Processors. Executive Summary: In today s data centers, live migration is a required
Measuring Cache and Memory Latency and CPU to Memory Bandwidth
White Paper Joshua Ruggiero Computer Systems Engineer Intel Corporation Measuring Cache and Memory Latency and CPU to Memory Bandwidth For use with Intel Architecture December 2008 1 321074 Executive Summary
How to Configure Intel X520 Ethernet Server Adapter Based Virtual Functions on Citrix* XenServer 6.0*
How to Configure Intel X520 Ethernet Server Adapter Based Virtual Functions on Citrix* XenServer 6.0* Technical Brief v1.0 December 2011 Legal Lines and Disclaimers INFORMATION IN THIS DOCUMENT IS PROVIDED
Running a Workflow on a PowerCenter Grid
Running a Workflow on a PowerCenter Grid 2010-2014 Informatica Corporation. No part of this document may be reproduced or transmitted in any form, by any means (electronic, photocopying, recording or otherwise)
VERITAS Backup Exec TM 10.0 for Windows Servers
VERITAS Backup Exec TM 10.0 for Windows Servers Quick Installation Guide N134418 July 2004 Disclaimer The information contained in this publication is subject to change without notice. VERITAS Software
Intel Rapid Storage Technology
Intel Rapid Storage Technology User Guide August 2011 Revision 1.0 1 Document Number: XXXXXX INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED,
VirtualCenter Database Performance for Microsoft SQL Server 2005 VirtualCenter 2.5
Performance Study VirtualCenter Database Performance for Microsoft SQL Server 2005 VirtualCenter 2.5 VMware VirtualCenter uses a database to store metadata on the state of a VMware Infrastructure environment.
OpenMP* 4.0 for HPC in a Nutshell
OpenMP* 4.0 for HPC in a Nutshell Dr.-Ing. Michael Klemm Senior Application Engineer Software and Services Group ([email protected]) *Other brands and names are the property of their respective owners.
Intel Platform and Big Data: Making big data work for you.
Intel Platform and Big Data: Making big data work for you. 1 From data comes insight New technologies are enabling enterprises to transform opportunity into reality by turning big data into actionable
Contents -------- Overview and Product Contents -----------------------------
------------------------------------------------------------------------ Intel(R) Threading Building Blocks - Release Notes Version 2.0 ------------------------------------------------------------------------
Intel vpro Technology Module for Microsoft* Windows PowerShell*
Intel vpro Technology Module for Microsoft* Windows PowerShell* 1 Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL
WebSphere Business Monitor
WebSphere Business Monitor Dashboards 2010 IBM Corporation This presentation should provide an overview of the dashboard widgets for use with WebSphere Business Monitor. WBPM_Monitor_Dashboards.ppt Page
MPI Application Tune-Up Four Steps to Performance
MPI Application Tune-Up Four Steps to Performance Abstract Cluster systems continue to grow in complexity and capability. Getting optimal performance can be challenging. Making sense of the MPI communications,
Remote Application Server Version 14. Last updated: 06-02-15
Remote Application Server Version 14 Last updated: 06-02-15 Information in this document is subject to change without notice. Companies, names, and data used in examples herein are fictitious unless otherwise
Netwrix Auditor for Active Directory
Netwrix Auditor for Active Directory Quick-Start Guide Version: 7.1 10/26/2015 Legal Notice The information in this publication is furnished for information use only, and does not constitute a commitment
Binary search tree with SIMD bandwidth optimization using SSE
Binary search tree with SIMD bandwidth optimization using SSE Bowen Zhang, Xinwei Li 1.ABSTRACT In-memory tree structured index search is a fundamental database operation. Modern processors provide tremendous
INTEL PARALLEL STUDIO EVALUATION GUIDE. Intel Cilk Plus: A Simple Path to Parallelism
Intel Cilk Plus: A Simple Path to Parallelism Compiler extensions to simplify task and data parallelism Intel Cilk Plus adds simple language extensions to express data and task parallelism to the C and
Netwrix Auditor for Windows Server
Netwrix Auditor for Windows Server Quick-Start Guide Version: 7.0 7/7/2015 Legal Notice The information in this publication is furnished for information use only, and does not constitute a commitment from
RSM Web Gateway RSM Web Client INSTALLATION AND ADMINISTRATION GUIDE
RSM Web Gateway RSM Web Client INSTALLATION AND ADMINISTRATION GUIDE Installation and Administration Guide RSM Web Client and RSM Web Gateway 17 August, 2004 Page 1 Copyright Notice 2004 Sony Corporation.
PART IV Performance oriented design, Performance testing, Performance tuning & Performance solutions. Outline. Performance oriented design
PART IV Performance oriented design, Performance testing, Performance tuning & Performance solutions Slide 1 Outline Principles for performance oriented design Performance testing Performance tuning General
Mobile App Monitoring. Release Notes. Release 8.0
Mobile App Monitoring Release 8.0 Mobile App Monitoring 8.0 Mobile App Monitoring 8.0 July 2014 Copyright Notice Copyright 1995-2014 Keynote Systems, Inc. All rights reserved. THE INFORMATION CONTAINED
PARALLELS SERVER BARE METAL 5.0 README
PARALLELS SERVER BARE METAL 5.0 README 1999-2011 Parallels Holdings, Ltd. and its affiliates. All rights reserved. This document provides the first-priority information on the Parallels Server Bare Metal
Netwrix Auditor for SQL Server
Netwrix Auditor for SQL Server Quick-Start Guide Version: 7.1 10/26/2015 Legal Notice The information in this publication is furnished for information use only, and does not constitute a commitment from
Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging
Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging In some markets and scenarios where competitive advantage is all about speed, speed is measured in micro- and even nano-seconds.
Kaspersky Security Center Web-Console
Kaspersky Security Center Web-Console User Guide CONTENTS ABOUT THIS GUIDE... 5 In this document... 5 Document conventions... 7 KASPERSKY SECURITY CENTER WEB-CONSOLE... 8 SOFTWARE REQUIREMENTS... 10 APPLICATION
StarWind iscsi SAN: Configuring Global Deduplication May 2012
StarWind iscsi SAN: Configuring Global Deduplication May 2012 TRADEMARKS StarWind, StarWind Software, and the StarWind and StarWind Software logos are trademarks of StarWind Software that may be registered
Command Line Interface User Guide for Intel Server Management Software
Command Line Interface User Guide for Intel Server Management Software Legal Information Information in this document is provided in connection with Intel products. No license, express or implied, by estoppel
Three Paths to Faster Simulations Using ANSYS Mechanical 16.0 and Intel Architecture
White Paper Intel Xeon processor E5 v3 family Intel Xeon Phi coprocessor family Digital Design and Engineering Three Paths to Faster Simulations Using ANSYS Mechanical 16.0 and Intel Architecture Executive
Elemental functions: Writing data-parallel code in C/C++ using Intel Cilk Plus
Elemental functions: Writing data-parallel code in C/C++ using Intel Cilk Plus A simple C/C++ language extension construct for data parallel operations Robert Geva [email protected] Introduction Intel
Software Evaluation Guide for Autodesk 3ds Max 2009* and Enemy Territory: Quake Wars* Render a 3D character while playing a game
Software Evaluation Guide for Autodesk 3ds Max 2009* and Enemy Territory: Quake Wars* Render a 3D character while playing a game http://www.intel.com/performance/resources Version 2008-09 Rev. 1.0 Information
Intro to GPU computing. Spring 2015 Mark Silberstein, 048661, Technion 1
Intro to GPU computing Spring 2015 Mark Silberstein, 048661, Technion 1 Serial vs. parallel program One instruction at a time Multiple instructions in parallel Spring 2015 Mark Silberstein, 048661, Technion
Resource Utilization of Middleware Components in Embedded Systems
Resource Utilization of Middleware Components in Embedded Systems 3 Introduction System memory, CPU, and network resources are critical to the operation and performance of any software system. These system
Quick Start Guide for Parallels Virtuozzo
PROPALMS VDI Version 2.1 Quick Start Guide for Parallels Virtuozzo Rev. 1.1 Published: JULY-2011 1999-2011 Propalms Ltd. All rights reserved. The information contained in this document represents the current
Enterprise Manager. Version 6.2. Installation Guide
Enterprise Manager Version 6.2 Installation Guide Enterprise Manager 6.2 Installation Guide Document Number 680-028-014 Revision Date Description A August 2012 Initial release to support version 6.2.1
Also on the Performance tab, you will find a button labeled Resource Monitor. You can invoke Resource Monitor for additional analysis of the system.
1348 CHAPTER 33 Logging and Debugging Monitoring Performance The Performance tab enables you to view the CPU and physical memory usage in graphical form. This information is especially useful when you
User's Guide FairCom Performance Monitor
User's Guide FairCom Performance Monitor User's Guide FairCom Performance Monitor Contents 1. c-treeace Performance Monitor... 4 2. Startup... 5 3. Using Main Window... 6 4. Menus... 8 5. Icon Row... 11
