Computer Vision on GPU with OpenCV Anton Obukhov, NVIDIA
|
|
|
- Aubrie Dennis
- 9 years ago
- Views:
Transcription
1 Computer Vision on GPU with OpenCV Anton Obukhov, NVIDIA
2 Outline Introduction into OpenCV OpenCV GPU module Face Detection on GPU Pedestrian detection on GPU 2
3 OpenCV History OpenCV started Alpha Release CVPR 00 Beta 1 Beta 2 Beta 3 Beta 4 Beta 5 Release 1.0 Release 2.0 Release 1.1 Release 2.2 (GPU Beta) Release 2.3 R 2.1 (GPU) Intel Willow Garage NVIDIA Original goal: Accelerate the field by lowering the bar to computer vision Find compelling uses for the increasing MIPS out in the market Staffing: Climbed in 1999 to average 7 first couple of years Little development from Willow entered in 2008 to accelerate development, NVIDIA joined in full time professional developers, 3 of them dedicated to GPU 3
4 OpenCV Functionality Overview Image processing General Image Processing Segmentation Machine Learning, Detection Image Pyramids Transforms Fitting Video, Stereo, and 3D Camera Calibration Features Depth Maps Optical Flow Inpainting Tracking 4
5 OpenCV Architecture and Development Languages: C C++ Python CUDA JAVA (plans) Technologies: CUDA SSE TBB 3 rd party libs: Eigen IPP Jasper JPEG, PNG OpenNI QT TBB VideoInput Development: Maintainers Contributors QA: Buildbot Google Tests Modules: Core ImgProc HighGUI GPU ML ObjDetect Video Calib3D Features2D FLANN Target archs: X86 X64 ARM CUDA Target OS: Windows Linux Mac OS Android 5
6 OpenCV License Based on BSD license Free for commercial and research use Does not force your code to be open You need not contribute back 6
7 Projects Using OpenCV Academic and Research Large community Industry Google street view Structure from motion in movies Robotics Over downloads 7
8 Recent OpenCV release (2.3rc, June 2011) Added patent-free feature descriptors Improved camera calibration Added panorama stitching module GPU module release Android port Fine documentation (online too: bugfixes Full list on Change Logs 8
9 Outline Introduction into OpenCV OpenCV GPU module Face Detection on GPU Pedestrian detection on GPU 9
10 OpenCV GPU Module Goals: Provide developers with a convenient computer vision framework on the GPU Maintain conceptual consistency with the current CPU functionality Achieve the best performance with GPUs Efficient kernels tuned for modern architectures Optimized dataflows (asynchronous execution, copy overlaps, zero-copy) 10
11 OpenCV GPU Module Contents Image processing building blocks: Color conversions Geometrical transforms Per-element operations Integrals, reductions Template matching Filtering engine Feature detectors High-level algorithms: Stereo matching Face detection SURF 11
12 OpenCV GPU Module Usage Prerequisites: Get sources from SourceForge or SVN CMake NVIDIA Display Driver NVIDIA GPU Computing Toolkit (for CUDA) Build OpenCV with CUDA support #include <opencv2/gpu/gpu.hpp> 12
13 OpenCV GPU Data Structures Class GpuMat For storing 2D image in GPU memory, just like class cv::mat Reference counting Can point to data allocated by user Class CudaMem For pinned memory support Can be transformed into cv::mat or cv::gpu::gpumat Class Stream Overloads with extra Stream parameter // class GpuMat GpuMat(Size size, int type); GpuMat(const GpuMat& m); explicit GpuMat (const Mat& m); GpuMat& operator = (const GpuMat& m); GpuMat& operator = (const Mat& m); void upload(const Mat& m); void upload(const CudaMem& m, Stream& stream); void download(mat& m) const; void download(cudamem& m, Stream& stream) const; // class Stream bool queryifcomplete(); void waitforcompletion(); void enqueuedownload(const GpuMat& src, Mat& dst); void enqueueupload(const Mat& src, GpuMat& dst); void enqueuecopy(const GpuMat& src, GpuMat& dst); 13
14 OpenCV GPU Module Example Mat frame; VideoCapture capture(camera); cv::hogdescriptor hog; hog.setsvmdetector(cv::hogdescriptor:: getdefaultpeopledetectorector()); capture >> frame; Mat frame; VideoCapture capture(camera); cv::gpu::hogdescriptor hog; hog.setsvmdetector(cv::hogdescriptor:: getdefaultpeopledetectorector()); capture >> frame; GpuMat gpu_frame; gpu_frame.upload(frame); vector<rect> found; hog.detectmultiscale(frame, found, 1.4, Size(8, 8), Size(0, 0), 1.05, 8); vector<rect> found; hog.detectmultiscale(gpu_frame, found, 1.4, Size(8, 8), Size(0, 0), 1.05, 8); Designed very similar! 14
15 OpenCV and NPP NPP is NVIDIA Performance Primitives library of signal and image processing functions (similar to Intel IPP) NVIDIA will continue adding new primitives and optimizing for future architectures GPU module uses NPP whenever possible Highly optimized implementations for all supported NVIDIA architectures and OS Part of CUDA Toolkit no additional dependencies OpenCV extends NPP and uses it to build higher level CV 15
16 OpenCV GPU Module Performance Tesla C2050 (Fermi) vs. Core i GHz (4 cores, TBB, SSE) Average speedup for primitives: 33 For good data (large images are better) Without copying to GPU What can you get from your computer? opencv\samples\gpu\perfomance 16
17 OpenCV GPU: Histogram of Oriented Gradients Used for pedestrian detection Speed-up ~ 8 17
18 OpenCV GPU: Speeded Up Robust Features SURF (12 ) Bruteforce matcher K-Nearest search (20-30 ) In radius search (3-5 ) 18
19 FULL HD FULL HD OpenCV GPU: Stereo Vision Stereo Block Matching (7 ) Can run Full HD real-time on Dual-GPU GPU BM Hierarchical Dense Stereo Belief Propagation (20 ) Constant space BP ( ) GPU CSBP 19
20 OpenCV GPU: Viola-Jones Cascade Classifier Used for face detection Speed-up ~ 6 Based on NCV classes (NVIDIA implementation) 20
21 OpenCV with Multiple GPUs Algorithms designed with single GPU in mind You can split workload manually in slices: Stereo Block Matching (dual-gpu speedup ~ 1.8 ) Multi-scale pedestrian detection: linear speed-up (scale-parallel) 21
22 OpenCV Needs Your Feedback! Help us set development priorities Which OpenCV functions do you use? Which are the most painful and time-consuming today? The more information you can provide about your end application, the better Feature request/feedback form on OpenCV Wiki: 22
23 Outline Introduction into OpenCV OpenCV GPU module Face Detection on GPU Pedestrian detection on GPU 23
24 GPU Face Detection: Motivation One of the first Computer Vision problems Soul of Human-Computer interaction Smart applications in real life 24
25 GPU Face Detection: Problem Locate all upright frontal faces: Where face detection does not work: 25
26 GPU Face Detection: Approaches Viola-Jones Haar classifiers framework: Y 1 2 N Y Y Y Face N N N Not face N Basic idea: reject most non-face regions on early stages 26
27 Classifiers Cascade Explained 20 pix White points represent face windows passed through the 1,2,3,6, and 20 classifier stages Time for CUDA to step in! (Parallel windows processing) 27
28 GPU Face Detection: Haar Classifier Each stage comprises a strong classifier: 1 h 1 h 2 h S T φ > f (X) Yes No α 1 α 2 28
29 Haar Features Explained Most representative Haar features for Face Detection 29
30 Integral Image Explained Each Integral Image pixel contains the sum of all pixels of the original image to the left and top Calculation of sum of pixels in a rectangle can be done in 4 accesses to the integral image 30
31 Integral Images with CUDA Algorithm: Integrate image rows Integrate image columns Known as Parallel Scan (one CUDA thread per element): Input: Output:
32 GPU Face Detection Performance 32
33 OpenCV NCV Framework Features: Native and Stack GPU memory allocators Protected allocations (fail-safety) Containers: NCVMatrix, NCVVector Runtime C++ template dispatcher NPP_staging a place for missing NPP functions Integral images Mean and StdDev calculation Vector compaction 33
34 OpenCV NCV Haar Cascade Classifiers Haar Object Detection from OpenCV GPU module: Implemented on top of NCV Uses NPP with extensions (NPP_staging) Not only faces! Suitable for production applications Reliable (fail-safe) Largest Object mode (up to 200 fps) All Objects mode 34
35 Outline Introduction into OpenCV OpenCV GPU module Face Detection on GPU Pedestrian detection on GPU 35
36 Pedestrian Detection HOG descriptor Introduced by Navneet Dalal and Bill Triggs Feature vectors are compatible with the INRIA Object Detection and Localization Toolkit 36
37 Pedestrian Detection: HOG Descriptor Object shape is characterized by distributions of: Gradient magnitude Edge/Gradient orientation Grid of orientation histograms Magnitude Edge Orientation 37
38 Pedestrian Detection: Working on Image Gamma correction Gradients calculation Sliding window algorithm Multi-scale 38
39 Pedestrian Detection: Inside Window Compute histograms inside cells Normalize blocks of cells One cell may belong to >1 block Apply linear SVM classifier Window descriptor Blocks of cells Cells 39
40 Pedestrian Detection: Step 1 Gradients computation Block histograms calculation Histograms normalization Linear SVM Gamma correction improves quality Sobel filter 3x3 by columns and rows Output: magnitude and angle 40
41 Pedestrian Detection: Step 2 Gradients computation Block histograms calculation Histograms normalization Linear SVM Big intersection in close positions Require window stride to be multiple of cell size Histograms of blocks are computed independently Image 41
42 Pedestrian Detection: Step 2 Gradients computation Block histograms calculation Histograms normalization Linear SVM Pixels vote in proportion to gradient magnitude Tri-linear interpolation 2 orientation bins 4 cells Gaussian Decreases weight of pixels near block boundary 42
43 Pedestrian Detection: Step 3 Gradients computation Block histograms calculation Histograms normalization Linear SVM Normalization L2-Hys norm L2 norm, clipping, normalization 2 parallel reductions in shared memory 43
44 Pedestrian Detection: Step 4 Gradients computation Block histograms calculation Histograms normalization Linear SVM Linear SVM GPU time, % Classification is just a dot product 6% 5% 20% Gamma + Gradients 1 thread block per window position 30% Histograms SVM Normalize 39% Other 44
45 Pedestrian Detection Performance 8 times faster! Detection rate Same as CPU FPS Core i5 2.8 GHz TBB, 4 cores Tesla C x x
46 GPU Technology Conference Spring 2012 San Francisco Bay Area The one event you can t afford to miss Learn about leading-edge advances in GPU computing Explore the research as well as the commercial applications Discover advances in computational visualization Take a deep dive into parallel programming Ways to participate Speak share your work and gain exposure as a thought leader Register learn from the experts and network with your peers Exhibit/Sponsor promote your company as a key player in the GPU ecosystem
47 Thank you 47
Introduction to OpenCV for Tegra. Shalini Gupta, Nvidia
Introduction to OpenCV for Tegra Shalini Gupta, Nvidia Computer Vision = Mobile differentiator Applications Smart photography Augmented reality, gesture recognition, visual search Vehicle safety Lucky
Local features and matching. Image classification & object localization
Overview Instance level search Local features and matching Efficient visual recognition Image classification & object localization Category recognition Image classification: assigning a class label to
A Comparison of Keypoint Descriptors in the Context of Pedestrian Detection: FREAK vs. SURF vs. BRISK
A Comparison of Keypoint Descriptors in the Context of Pedestrian Detection: FREAK vs. SURF vs. BRISK Cameron Schaeffer Stanford University CS Department [email protected] Abstract The subject of keypoint
How To Train A Face Recognition In Python And Opencv
TRAINING DETECTORS AND RECOGNIZERS IN PYTHON AND OPENCV Sept. 9, 2014 ISMAR 2014 Joseph Howse GOALS Build apps that learn from p h o to s & f r o m real-time camera input. D e te c t & recognize the faces
CS231M Project Report - Automated Real-Time Face Tracking and Blending
CS231M Project Report - Automated Real-Time Face Tracking and Blending Steven Lee, [email protected] June 6, 2015 1 Introduction Summary statement: The goal of this project is to create an Android
PARALLEL JAVASCRIPT. Norm Rubin (NVIDIA) Jin Wang (Georgia School of Technology)
PARALLEL JAVASCRIPT Norm Rubin (NVIDIA) Jin Wang (Georgia School of Technology) JAVASCRIPT Not connected with Java Scheme and self (dressed in c clothing) Lots of design errors (like automatic semicolon
Computer Vision Technology. Dave Bolme and Steve O Hara
Computer Vision Technology Dave Bolme and Steve O Hara Today we ll discuss... The OpenCV Computer Vision Library Python scripting for Computer Vision Python OpenCV bindings SciPy / Matlab-like Python capabilities
HPC with Multicore and GPUs
HPC with Multicore and GPUs Stan Tomov Electrical Engineering and Computer Science Department University of Tennessee, Knoxville CS 594 Lecture Notes March 4, 2015 1/18 Outline! Introduction - Hardware
A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow
, pp.233-237 http://dx.doi.org/10.14257/astl.2014.51.53 A Study on SURF Algorithm and Real-Time Tracking Objects Using Optical Flow Giwoo Kim 1, Hye-Youn Lim 1 and Dae-Seong Kang 1, 1 Department of electronices
Android and OpenCV Tutorial
Android and OpenCV Tutorial Computer Vision Lab Tutorial 26 September 2013 Lorenz Meier, Amaël Delaunoy, Kalin Kolev Institute of Visual Computing Tutorial Content Strengths / Weaknesses of Android Java
Optimizing a 3D-FWT code in a cluster of CPUs+GPUs
Optimizing a 3D-FWT code in a cluster of CPUs+GPUs Gregorio Bernabé Javier Cuenca Domingo Giménez Universidad de Murcia Scientific Computing and Parallel Programming Group XXIX Simposium Nacional de la
Texture Cache Approximation on GPUs
Texture Cache Approximation on GPUs Mark Sutherland Joshua San Miguel Natalie Enright Jerger {suther68,enright}@ece.utoronto.ca, [email protected] 1 Our Contribution GPU Core Cache Cache
How To Program With Adaptive Vision Studio
Studio 4 intuitive powerful adaptable software for machine vision engineers Introduction Adaptive Vision Studio Adaptive Vision Studio software is the most powerful graphical environment for machine vision
A Computer Vision System on a Chip: a case study from the automotive domain
A Computer Vision System on a Chip: a case study from the automotive domain Gideon P. Stein Elchanan Rushinek Gaby Hayun Amnon Shashua Mobileye Vision Technologies Ltd. Hebrew University Jerusalem, Israel
Advanced analytics at your hands
2.3 Advanced analytics at your hands Neural Designer is the most powerful predictive analytics software. It uses innovative neural networks techniques to provide data scientists with results in a way previously
Applying Deep Learning to Car Data Logging (CDL) and Driver Assessor (DA) October 22-Oct-15
Applying Deep Learning to Car Data Logging (CDL) and Driver Assessor (DA) October 22-Oct-15 GENIVI is a registered trademark of the GENIVI Alliance in the USA and other countries Copyright GENIVI Alliance
GPU Point List Generation through Histogram Pyramids
VMV 26, GPU Programming GPU Point List Generation through Histogram Pyramids Gernot Ziegler, Art Tevs, Christian Theobalt, Hans-Peter Seidel Agenda Overall task Problems Solution principle Algorithm: Discriminator
Performance monitoring at CERN openlab. July 20 th 2012 Andrzej Nowak, CERN openlab
Performance monitoring at CERN openlab July 20 th 2012 Andrzej Nowak, CERN openlab Data flow Reconstruction Selection and reconstruction Online triggering and filtering in detectors Raw Data (100%) Event
VALAR: A BENCHMARK SUITE TO STUDY THE DYNAMIC BEHAVIOR OF HETEROGENEOUS SYSTEMS
VALAR: A BENCHMARK SUITE TO STUDY THE DYNAMIC BEHAVIOR OF HETEROGENEOUS SYSTEMS Perhaad Mistry, Yash Ukidave, Dana Schaa, David Kaeli Department of Electrical and Computer Engineering Northeastern University,
GPU Usage. Requirements
GPU Usage Use the GPU Usage tool in the Performance and Diagnostics Hub to better understand the high-level hardware utilization of your Direct3D app. You can use it to determine whether the performance
A Prototype For Eye-Gaze Corrected
A Prototype For Eye-Gaze Corrected Video Chat on Graphics Hardware Maarten Dumont, Steven Maesen, Sammy Rogmans and Philippe Bekaert Introduction Traditional webcam video chat: No eye contact. No extensive
Parallel Image Processing with CUDA A case study with the Canny Edge Detection Filter
Parallel Image Processing with CUDA A case study with the Canny Edge Detection Filter Daniel Weingaertner Informatics Department Federal University of Paraná - Brazil Hochschule Regensburg 02.05.2011 Daniel
ANDROID DEVELOPER TOOLS TRAINING GTC 2014. Sébastien Dominé, NVIDIA
ANDROID DEVELOPER TOOLS TRAINING GTC 2014 Sébastien Dominé, NVIDIA AGENDA NVIDIA Developer Tools Introduction Multi-core CPU tools Graphics Developer Tools Compute Developer Tools NVIDIA Developer Tools
OpenCL Optimization. San Jose 10/2/2009 Peng Wang, NVIDIA
OpenCL Optimization San Jose 10/2/2009 Peng Wang, NVIDIA Outline Overview The CUDA architecture Memory optimization Execution configuration optimization Instruction optimization Summary Overall Optimization
Computer Graphics Hardware An Overview
Computer Graphics Hardware An Overview Graphics System Monitor Input devices CPU/Memory GPU Raster Graphics System Raster: An array of picture elements Based on raster-scan TV technology The screen (and
Next Generation GPU Architecture Code-named Fermi
Next Generation GPU Architecture Code-named Fermi The Soul of a Supercomputer in the Body of a GPU Why is NVIDIA at Super Computing? Graphics is a throughput problem paint every pixel within frame time
VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS
VEHICLE LOCALISATION AND CLASSIFICATION IN URBAN CCTV STREAMS Norbert Buch 1, Mark Cracknell 2, James Orwell 1 and Sergio A. Velastin 1 1. Kingston University, Penrhyn Road, Kingston upon Thames, KT1 2EE,
Interactive Level-Set Deformation On the GPU
Interactive Level-Set Deformation On the GPU Institute for Data Analysis and Visualization University of California, Davis Problem Statement Goal Interactive system for deformable surface manipulation
Real Time Industrial Application of Single Board Computer Based Color Detection System
Real Time Industrial Application of Single Board Computer Based Color Detection System Serdar SOLAK 1 and Emine DO RU BOLAT 2 1 Department of Informatics, Kocaeli University, 41380, Umuttepe Campus, Kocaeli,Turkey
GETTING STARTED WITH ANDROID DEVELOPMENT FOR EMBEDDED SYSTEMS
Embedded Systems White Paper GETTING STARTED WITH ANDROID DEVELOPMENT FOR EMBEDDED SYSTEMS September 2009 ABSTRACT Android is an open source platform built by Google that includes an operating system,
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches
Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic
Equalizer. Parallel OpenGL Application Framework. Stefan Eilemann, Eyescale Software GmbH
Equalizer Parallel OpenGL Application Framework Stefan Eilemann, Eyescale Software GmbH Outline Overview High-Performance Visualization Equalizer Competitive Environment Equalizer Features Scalability
Real-time Visual Tracker by Stream Processing
Real-time Visual Tracker by Stream Processing Simultaneous and Fast 3D Tracking of Multiple Faces in Video Sequences by Using a Particle Filter Oscar Mateo Lozano & Kuzahiro Otsuka presented by Piotr Rudol
Distributed Vision Processing in Smart Camera Networks
Distributed Vision Processing in Smart Camera Networks CVPR-07 Hamid Aghajan, Stanford University, USA François Berry, Univ. Blaise Pascal, France Horst Bischof, TU Graz, Austria Richard Kleihorst, NXP
Programming models for heterogeneous computing. Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga
Programming models for heterogeneous computing Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga Talk outline [30 slides] 1. Introduction [5 slides] 2.
Graphics Cards and Graphics Processing Units. Ben Johnstone Russ Martin November 15, 2011
Graphics Cards and Graphics Processing Units Ben Johnstone Russ Martin November 15, 2011 Contents Graphics Processing Units (GPUs) Graphics Pipeline Architectures 8800-GTX200 Fermi Cayman Performance Analysis
Get an Easy Performance Boost Even with Unthreaded Apps. with Intel Parallel Studio XE for Windows*
Get an Easy Performance Boost Even with Unthreaded Apps for Windows* Can recompiling just one file make a difference? Yes, in many cases it can! Often, you can achieve a major performance boost by recompiling
Applications to Computational Financial and GPU Computing. May 16th. Dr. Daniel Egloff +41 44 520 01 17 +41 79 430 03 61
F# Applications to Computational Financial and GPU Computing May 16th Dr. Daniel Egloff +41 44 520 01 17 +41 79 430 03 61 Today! Why care about F#? Just another fashion?! Three success stories! How Alea.cuBase
Getting Started with RemoteFX in Windows Embedded Compact 7
Getting Started with RemoteFX in Windows Embedded Compact 7 Writers: Randy Ocheltree, Ryan Wike Technical Reviewer: Windows Embedded Compact RDP Team Applies To: Windows Embedded Compact 7 Published: January
The use of computer vision technologies to augment human monitoring of secure computing facilities
The use of computer vision technologies to augment human monitoring of secure computing facilities Marius Potgieter School of Information and Communication Technology Nelson Mandela Metropolitan University
IP Video Rendering Basics
CohuHD offers a broad line of High Definition network based cameras, positioning systems and VMS solutions designed for the performance requirements associated with critical infrastructure applications.
Go Faster - Preprocessing Using FPGA, CPU, GPU. Dipl.-Ing. (FH) Bjoern Rudde Image Acquisition Development STEMMER IMAGING
Go Faster - Preprocessing Using FPGA, CPU, GPU Dipl.-Ing. (FH) Bjoern Rudde Image Acquisition Development STEMMER IMAGING WHO ARE STEMMER IMAGING? STEMMER IMAGING is: Europe's leading independent provider
Lazy OpenCV installation and use with Visual Studio
Lazy OpenCV installation and use with Visual Studio Overview This tutorial will walk you through: How to install OpenCV on Windows, both: The pre-built version (useful if you won t be modifying the OpenCV
Basler. Line Scan Cameras
Basler Line Scan Cameras High-quality line scan technology meets a cost-effective GigE interface Real color support in a compact housing size Shading correction compensates for difficult lighting conditions
Real-time Person Detection and Tracking in Panoramic Video
2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops Real-time Person Detection and Tracking in Panoramic Video Marcus Thaler, Werner Bailer JOANNEUM RESEARCH, DIGITAL Institute for
RIPL. An Image Processing DSL. Rob Stewart & Deepayan Bhowmik. 1st May, 2014. Heriot Watt University
RIPL An Image Processing DSL Rob Stewart & Deepayan Bhowmik Heriot Watt University 1st May, 2014 Image processing language Rathlin = + FPGA Motivation Application scenario FPGAs good fit for remote image
Implementation of Canny Edge Detector of color images on CELL/B.E. Architecture.
Implementation of Canny Edge Detector of color images on CELL/B.E. Architecture. Chirag Gupta,Sumod Mohan K [email protected], [email protected] Abstract In this project we propose a method to improve
ultra fast SOM using CUDA
ultra fast SOM using CUDA SOM (Self-Organizing Map) is one of the most popular artificial neural network algorithms in the unsupervised learning category. Sijo Mathew Preetha Joy Sibi Rajendra Manoj A
Lecture 2: The SVM classifier
Lecture 2: The SVM classifier C19 Machine Learning Hilary 2015 A. Zisserman Review of linear classifiers Linear separability Perceptron Support Vector Machine (SVM) classifier Wide margin Cost function
Learn CUDA in an Afternoon: Hands-on Practical Exercises
Learn CUDA in an Afternoon: Hands-on Practical Exercises Alan Gray and James Perry, EPCC, The University of Edinburgh Introduction This document forms the hands-on practical component of the Learn CUDA
Using Mobile Processors for Cost Effective Live Video Streaming to the Internet
Using Mobile Processors for Cost Effective Live Video Streaming to the Internet Hans-Joachim Gelke Tobias Kammacher Institute of Embedded Systems Source: Apple Inc. Agenda 1. Typical Application 2. Available
In: Proceedings of RECPAD 2002-12th Portuguese Conference on Pattern Recognition June 27th- 28th, 2002 Aveiro, Portugal
Paper Title: Generic Framework for Video Analysis Authors: Luís Filipe Tavares INESC Porto [email protected] Luís Teixeira INESC Porto, Universidade Católica Portuguesa [email protected] Luís Corte-Real
The Evolution of Computer Graphics. SVP, Content & Technology, NVIDIA
The Evolution of Computer Graphics Tony Tamasi SVP, Content & Technology, NVIDIA Graphics Make great images intricate shapes complex optical effects seamless motion Make them fast invent clever techniques
Revision history: New comments added to: m)! Universal Driver
Revision history: 1.0! 17/07/2006! Initial draft 1.1! 18/07/2006! Added screen shot for picture slides. 1.2! 07/09/2006! New features added: s)! Support for QuickTime VR movies. t)! Support for PDF files
If you are working with the H4D-60 or multi-shot cameras we recommend 8GB of RAM on a 64 bit Windows and 1GB of video RAM.
Phocus 2.7.6 Windows read-me December 5 2013 Installation To install Phocus, run the installation bundle called Phocus 2.7.6 Setup.exe. This bundle contains Phocus, Hasselblad Device Drivers, Microsoft.NET
CUDAMat: a CUDA-based matrix class for Python
Department of Computer Science 6 King s College Rd, Toronto University of Toronto M5S 3G4, Canada http://learning.cs.toronto.edu fax: +1 416 978 1455 November 25, 2009 UTML TR 2009 004 CUDAMat: a CUDA-based
Silverlight for Windows Embedded Graphics and Rendering Pipeline 1
Silverlight for Windows Embedded Graphics and Rendering Pipeline 1 Silverlight for Windows Embedded Graphics and Rendering Pipeline Windows Embedded Compact 7 Technical Article Writers: David Franklin,
The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System
The Uintah Framework: A Unified Heterogeneous Task Scheduling and Runtime System Qingyu Meng, Alan Humphrey, Martin Berzins Thanks to: John Schmidt and J. Davison de St. Germain, SCI Institute Justin Luitjens
The Visual Internet of Things System Based on Depth Camera
The Visual Internet of Things System Based on Depth Camera Xucong Zhang 1, Xiaoyun Wang and Yingmin Jia Abstract The Visual Internet of Things is an important part of information technology. It is proposed
Network Traffic Monitoring and Analysis with GPUs
Network Traffic Monitoring and Analysis with GPUs Wenji Wu, Phil DeMar [email protected], [email protected] GPU Technology Conference 2013 March 18-21, 2013 SAN JOSE, CALIFORNIA Background Main uses for network
Medical Image Processing on the GPU. Past, Present and Future. Anders Eklund, PhD Virginia Tech Carilion Research Institute [email protected].
Medical Image Processing on the GPU Past, Present and Future Anders Eklund, PhD Virginia Tech Carilion Research Institute [email protected] Outline Motivation why do we need GPUs? Past - how was GPU programming
GPU-Based Network Traffic Monitoring & Analysis Tools
GPU-Based Network Traffic Monitoring & Analysis Tools Wenji Wu; Phil DeMar [email protected], [email protected] CHEP 2013 October 17, 2013 Coarse Detailed Background Main uses for network traffic monitoring
Analecta Vol. 8, No. 2 ISSN 2064-7964
EXPERIMENTAL APPLICATIONS OF ARTIFICIAL NEURAL NETWORKS IN ENGINEERING PROCESSING SYSTEM S. Dadvandipour Institute of Information Engineering, University of Miskolc, Egyetemváros, 3515, Miskolc, Hungary,
NVIDIA CUDA GETTING STARTED GUIDE FOR MAC OS X
NVIDIA CUDA GETTING STARTED GUIDE FOR MAC OS X DU-05348-001_v6.5 August 2014 Installation and Verification on Mac OS X TABLE OF CONTENTS Chapter 1. Introduction...1 1.1. System Requirements... 1 1.2. About
Anime Studio Debut vs. Pro
vs. Animation Length 2 minutes (3000 frames) Unlimited Motion Tracking 3 Points Unlimited Audio Tracks 2 Tracks Unlimited Video Tracks 1 Track Unlimited Physics No Yes Poser scene import No Yes 3D layer
Managing Adaptability in Heterogeneous Architectures through Performance Monitoring and Prediction
Managing Adaptability in Heterogeneous Architectures through Performance Monitoring and Prediction Cristina Silvano [email protected] Politecnico di Milano HiPEAC CSW Athens 2014 Motivations System
Intro to GPU computing. Spring 2015 Mark Silberstein, 048661, Technion 1
Intro to GPU computing Spring 2015 Mark Silberstein, 048661, Technion 1 Serial vs. parallel program One instruction at a time Multiple instructions in parallel Spring 2015 Mark Silberstein, 048661, Technion
NVIDIA Performance Primitives & Video Codecs on GPU. Gold Room Thursday 1 st October 2009 Anton Obukhov & Frank Jargstorff
NVIDIA Performance Primitives & Video Codecs on GPU Gold Room Thursday 1 st October 2009 Anton Obukhov & Frank Jargstorff Overview Two presentations: NPP (Frank Jargstorff) Video Codes on NVIDIA GPUs (Anton
EXPLORING IMAGE-BASED CLASSIFICATION TO DETECT VEHICLE MAKE AND MODEL FINAL REPORT
EXPLORING IMAGE-BASED CLASSIFICATION TO DETECT VEHICLE MAKE AND MODEL FINAL REPORT Jeffrey B. Flora, Mahbubul Alam, Amr H. Yousef, and Khan M. Iftekharuddin December 2013 DISCLAIMER The contents of this
Example of Standard API
16 Example of Standard API System Call Implementation Typically, a number associated with each system call System call interface maintains a table indexed according to these numbers The system call interface
ROBOTRACKER A SYSTEM FOR TRACKING MULTIPLE ROBOTS IN REAL TIME. by Alex Sirota, [email protected]
ROBOTRACKER A SYSTEM FOR TRACKING MULTIPLE ROBOTS IN REAL TIME by Alex Sirota, [email protected] Project in intelligent systems Computer Science Department Technion Israel Institute of Technology Under the
Robust Algorithms for Current Deposition and Dynamic Load-balancing in a GPU Particle-in-Cell Code
Robust Algorithms for Current Deposition and Dynamic Load-balancing in a GPU Particle-in-Cell Code F. Rossi, S. Sinigardi, P. Londrillo & G. Turchetti University of Bologna & INFN GPU2014, Rome, Sept 17th
Multi-GPU Load Balancing for Simulation and Rendering
Multi- Load Balancing for Simulation and Rendering Yong Cao Computer Science Department, Virginia Tech, USA In-situ ualization and ual Analytics Instant visualization and interaction of computing tasks
An Active Head Tracking System for Distance Education and Videoconferencing Applications
An Active Head Tracking System for Distance Education and Videoconferencing Applications Sami Huttunen and Janne Heikkilä Machine Vision Group Infotech Oulu and Department of Electrical and Information
Cees Snoek. Machine. Humans. Multimedia Archives. Euvision Technologies The Netherlands. University of Amsterdam The Netherlands. Tree.
Visual search: what's next? Cees Snoek University of Amsterdam The Netherlands Euvision Technologies The Netherlands Problem statement US flag Tree Aircraft Humans Dog Smoking Building Basketball Table
Introducing PgOpenCL A New PostgreSQL Procedural Language Unlocking the Power of the GPU! By Tim Child
Introducing A New PostgreSQL Procedural Language Unlocking the Power of the GPU! By Tim Child Bio Tim Child 35 years experience of software development Formerly VP Oracle Corporation VP BEA Systems Inc.
Stream Processing on GPUs Using Distributed Multimedia Middleware
Stream Processing on GPUs Using Distributed Multimedia Middleware Michael Repplinger 1,2, and Philipp Slusallek 1,2 1 Computer Graphics Lab, Saarland University, Saarbrücken, Germany 2 German Research
Research and Design of Universal and Open Software Development Platform for Digital Home
Research and Design of Universal and Open Software Development Platform for Digital Home CaiFeng Cao School of Computer Wuyi University, Jiangmen 529020, China [email protected] Abstract. With the development
OpenPOWER Outlook AXEL KOEHLER SR. SOLUTION ARCHITECT HPC
OpenPOWER Outlook AXEL KOEHLER SR. SOLUTION ARCHITECT HPC Driving industry innovation The goal of the OpenPOWER Foundation is to create an open ecosystem, using the POWER Architecture to share expertise,
Writing Applications for the GPU Using the RapidMind Development Platform
Writing Applications for the GPU Using the RapidMind Development Platform Contents Introduction... 1 Graphics Processing Units... 1 RapidMind Development Platform... 2 Writing RapidMind Enabled Applications...
Network Traffic Monitoring & Analysis with GPUs
Network Traffic Monitoring & Analysis with GPUs Wenji Wu, Phil DeMar [email protected], [email protected] GPU Technology Conference 2013 March 18-21, 2013 SAN JOSE, CALIFORNIA Background Main uses for network
DEFERRED IMAGE PROCESSING IN INTEL IPP LIBRARY
DEFERRED IMAGE PROCESSING IN INTEL IPP LIBRARY Alexander Kibkalo ([email protected]), Michael Lotkov ([email protected]), Ignat Rogozhkin ([email protected]), Alexander Turovets
Overview. Proven Image Quality and Easy to Use Without a Frame Grabber. Your benefits include:
Basler runner Line Scan Cameras High-quality line scan technology meets a cost-effective GigE interface Real color support in a compact housing size Shading correction compensates for difficult lighting
Packet-based Network Traffic Monitoring and Analysis with GPUs
Packet-based Network Traffic Monitoring and Analysis with GPUs Wenji Wu, Phil DeMar [email protected], [email protected] GPU Technology Conference 2014 March 24-27, 2014 SAN JOSE, CALIFORNIA Background Main
Basler. Area Scan Cameras
Basler Area Scan Cameras VGA to 5 megapixels and up to 210 fps Selected high quality Sony and Kodak CCD sensors Powerful Gigabit Ethernet interface Superb image quality at all resolutions and frame rates
Lecture Notes, CEng 477
Computer Graphics Hardware and Software Lecture Notes, CEng 477 What is Computer Graphics? Different things in different contexts: pictures, scenes that are generated by a computer. tools used to make
Affdex SDK for Windows!
Affdex SDK for Windows SDK Developer Guide 1 Introduction Affdex SDK is the culmination of years of scientific research into emotion detection, validated across thousands of tests worldwide on PC platforms,
Turbomachinery CFD on many-core platforms experiences and strategies
Turbomachinery CFD on many-core platforms experiences and strategies Graham Pullan Whittle Laboratory, Department of Engineering, University of Cambridge MUSAF Colloquium, CERFACS, Toulouse September 27-29
Sense Making in an IOT World: Sensor Data Analysis with Deep Learning
Sense Making in an IOT World: Sensor Data Analysis with Deep Learning Natalia Vassilieva, PhD Senior Research Manager GTC 2016 Deep learning proof points as of today Vision Speech Text Other Search & information
The High Performance Internet of Things: using GVirtuS for gluing cloud computing and ubiquitous connected devices
WS on Models, Algorithms and Methodologies for Hierarchical Parallelism in new HPC Systems The High Performance Internet of Things: using GVirtuS for gluing cloud computing and ubiquitous connected devices
CS1112 Spring 2014 Project 4. Objectives. 3 Pixelation for Identity Protection. due Thursday, 3/27, at 11pm
CS1112 Spring 2014 Project 4 due Thursday, 3/27, at 11pm You must work either on your own or with one partner. If you work with a partner you must first register as a group in CMS and then submit your
Recognition. Sanja Fidler CSC420: Intro to Image Understanding 1 / 28
Recognition Topics that we will try to cover: Indexing for fast retrieval (we still owe this one) History of recognition techniques Object classification Bag-of-words Spatial pyramids Neural Networks Object
Crosswalk: build world class hybrid mobile apps
Crosswalk: build world class hybrid mobile apps Ningxin Hu Intel Today s Hybrid Mobile Apps Application HTML CSS JS Extensions WebView of Operating System (Tizen, Android, etc.,) 2 State of Art HTML5 performance
Capacities Overview: 9.7 MultiTouch Screen with IPS technology Access to AndroidTM apps HD Multimedia playback
Arnova introduces a new HD multimedia tablet: The ARNOVA 9 G2 Equipped with a sharp, bright 9.7 (1024x768) LCD touchscreen, it delivers amazing display quality. The combination between Android TM 2.3 Gingerbread
Introduction to GPU hardware and to CUDA
Introduction to GPU hardware and to CUDA Philip Blakely Laboratory for Scientific Computing, University of Cambridge Philip Blakely (LSC) GPU introduction 1 / 37 Course outline Introduction to GPU hardware
Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU
Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU Heshan Li, Shaopeng Wang The Johns Hopkins University 3400 N. Charles Street Baltimore, Maryland 21218 {heshanli, shaopeng}@cs.jhu.edu 1 Overview
EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM
EFFICIENT VEHICLE TRACKING AND CLASSIFICATION FOR AN AUTOMATED TRAFFIC SURVEILLANCE SYSTEM Amol Ambardekar, Mircea Nicolescu, and George Bebis Department of Computer Science and Engineering University
