Vision meets Robotics: The KITTI Dataset
Andreas Geiger, Philip Lenz, Christoph Stiller and Raquel Urtasun

Abstract— We present a novel dataset captured from a VW station wagon for use in mobile robotics and autonomous driving research. In total, we recorded 6 hours of traffic scenarios at 10-100 Hz using a variety of sensor modalities such as high-resolution color and grayscale stereo cameras, a Velodyne 3D laser scanner and a high-precision GPS/IMU inertial navigation system. The scenarios are diverse, capturing real-world traffic situations, and range from freeways over rural areas to inner-city scenes with many static and dynamic objects. Our data is calibrated, synchronized and timestamped, and we provide the rectified and raw image sequences. Our dataset also contains object labels in the form of 3D tracklets, and we provide online benchmarks for stereo, optical flow, object detection and other tasks. This paper describes our recording platform, the data format and the utilities that we provide.

Index Terms— dataset, autonomous driving, mobile robotics, field robotics, computer vision, cameras, laser, GPS, benchmarks, stereo, optical flow, SLAM, object detection, tracking, KITTI

I. INTRODUCTION

The KITTI dataset has been recorded from a moving platform (Fig. 1) while driving in and around Karlsruhe, Germany (Fig. 2). It includes camera images, laser scans, high-precision GPS measurements and IMU accelerations from a combined GPS/IMU system. The main purpose of this dataset is to push forward the development of computer vision and robotic algorithms targeted to autonomous driving [1]-[7]. While our introductory paper [8] mainly focuses on the benchmarks, their creation and their use for evaluating state-of-the-art computer vision methods, here we complement this information by providing technical details on the raw data itself. We give precise instructions on how to access the data and comment on sensor limitations and common pitfalls. The dataset can be downloaded from http://www.cvlibs.net/datasets/kitti. For a review of related work, we refer the reader to [8].

Fig. 1. Recording Platform. Our VW Passat station wagon is equipped with four video cameras (two color and two grayscale Point Gray Flea 2 cameras), a rotating 3D laser scanner (Velodyne HDL-64E) and a combined GPS/IMU inertial navigation system (OXTS RT 3003).

II. SENSOR SETUP

Our sensor setup is illustrated in Fig. 3:

- 2 x PointGray Flea2 grayscale cameras (FL2-14S3M-C), 1.4 Megapixels, 1/2" Sony ICX267 CCD, global shutter
- 2 x PointGray Flea2 color cameras (FL2-14S3C-C), 1.4 Megapixels, 1/2" Sony ICX267 CCD, global shutter
- 4 x Edmund Optics lenses, 4 mm, opening angle ~90°, vertical opening angle of region of interest (ROI) ~35°
- 1 x Velodyne HDL-64E rotating 3D laser scanner, 10 Hz, 64 beams, 0.09 degree angular resolution, 2 cm distance accuracy, collecting ~1.3 million points/second, field of view: 360° horizontal, 26.8° vertical, range: 120 m
- 1 x OXTS RT 3003 inertial and GPS navigation system, 6 axis, 100 Hz, L1/L2 RTK, resolution: 0.02 m / 0.1°

Note that the color cameras lack in terms of resolution due to the Bayer pattern interpolation process and are less sensitive to light. This is the reason why we use two stereo camera rigs, one for grayscale and one for color. The baseline of both stereo camera rigs is approximately 54 cm. The trunk of our vehicle houses a PC with two six-core Intel XEON X5650 processors and a shock-absorbed RAID 5 hard disk storage with a capacity of 4 Terabytes. Our computer runs Ubuntu Linux (64 bit) and a real-time database [9] to store the incoming data streams.

(A. Geiger, P. Lenz and C. Stiller are with the Department of Measurement and Control Systems, Karlsruhe Institute of Technology, Germany: {geiger,lenz,stiller}@kit.edu. R. Urtasun is with the Toyota Technological Institute at Chicago, USA: [email protected].)

III. DATASET

The raw data described in this paper can be accessed from http://www.cvlibs.net/datasets/kitti/raw_data.php and contains 25% of our overall recordings. The reason for this is that primarily data with 3D tracklet annotations has been put online, though we will make more data available upon request. Furthermore, we have removed all sequences which are part of our benchmark test sets. The raw data set is divided into the categories 'Road', 'City', 'Residential', 'Campus' and 'Person'. Example frames are illustrated in Fig. 5. For each sequence, we provide the raw data, object annotations in form of 3D bounding box tracklets and a calibration file, as illustrated in Fig. 4. Our recordings have taken place on the 26th, 28th, 29th and 30th of September and on the 3rd of October 2011 during daytime. The total size of the provided data is 180 GB.
A. Data Description

All sensor readings of a sequence are zipped into a single file named date_drive.zip, where 'date' and 'drive' are placeholders for the recording date and the sequence number. The directory structure is shown in Fig. 4. Besides the raw recordings ('raw data'), we also provide post-processed data ('synced data'), i.e., rectified and synchronized video streams, on the dataset website.

The provided zip files unpack into the following global structure, where 'date' and 'drive' are placeholders and image_0x refers to the 4 video camera streams:

    date/
      date_drive/
        date_drive.zip
          image_0x/              (x = {0,..,3})
            data/frame_number.png
            timestamps.txt
          oxts/
            data/frame_number.txt
            dataformat.txt
            timestamps.txt
          velodyne_points/
            data/frame_number.bin
            timestamps.txt
            timestamps_start.txt
            timestamps_end.txt
        date_drive_tracklets.zip
          tracklet_labels.xml
      date_calib.zip
        calib_cam_to_cam.txt
        calib_imu_to_velo.txt
        calib_velo_to_cam.txt

Fig. 4. Structure of the provided Zip-Files and their location within a global file structure that stores all KITTI sequences.

Timestamps are stored in timestamps.txt and per-frame sensor readings are provided in the corresponding data sub-folders. Each line in timestamps.txt is composed of the date and time in hours, minutes and seconds. As the Velodyne laser scanner has a 'rolling shutter', three timestamp files are provided for this sensor: one for the start position (timestamps_start.txt) of a spin, one for the end position (timestamps_end.txt) of a spin, and one for the time where the laser scanner is facing forward and triggering the cameras (timestamps.txt).

Fig. 2. Recording Zone. This figure shows the GPS traces of our recordings in the metropolitan area of Karlsruhe, Germany. Colors encode the GPS signal quality: red tracks have been recorded with highest precision using RTK corrections, blue denotes the absence of correction signals. The black runs have been excluded from our data set as no GPS signal has been available.

Fig. 3. Sensor Setup. This figure illustrates the dimensions and mounting positions of the sensors (red) with respect to the vehicle body. Heights above ground are marked in green and measured with respect to the road surface (all camera heights: 1.65 m). Transformations between sensors are shown in blue.

The data format in which each sensor stream is stored is as follows:

a) Images: Both color and grayscale images are stored with loss-less compression using 8-bit PNG files. The engine hood and the sky region have been cropped. To simplify working with the data, we also provide rectified images. The size of the images after rectification depends on the calibration parameters and is ~0.5 Mpx on average. The original images before rectification are available as well.

b) OXTS (GPS/IMU): For each frame, we store 30 different GPS/IMU values in a text file: the geographic coordinates including altitude, global orientation, velocities, accelerations, angular rates, accuracies and satellite information. Accelerations and angular rates are both specified using two coordinate systems, one which is attached to the vehicle body (x, y, z) and one that is mapped to the tangent plane of the earth surface at that location (f, l, u).
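As an illustration, here is a minimal Python sketch (not part of the official MATLAB/C++ development kit) for parsing one such OXTS frame file; the field order follows dataformat.txt, and only the first three fields are named explicitly:

```python
# Hypothetical helper for reading one oxts/data/<frame>.txt file.
# The full list of the 30 field names is given in dataformat.txt; we only
# name the leading geographic coordinates here.
from pathlib import Path

def read_oxts_frame(path):
    """Return the 30 OXTS values of one frame, with the coordinates named."""
    values = [float(v) for v in Path(path).read_text().split()]
    return {
        "lat": values[0],   # latitude [deg]
        "lon": values[1],   # longitude [deg]
        "alt": values[2],   # altitude [m]
        "raw": values,      # all values, in dataformat.txt order
    }

# Example (hypothetical path):
# frame = read_oxts_frame("oxts/data/0000000000.txt")
# print(frame["lat"], frame["lon"], frame["alt"])
```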
From time to time we encountered short (~1 second) communication outages with the OXTS device, for which we interpolated all values linearly and set the last 3 entries to '-1' to indicate the missing information. More details are provided in dataformat.txt. Conversion utilities are provided in the development kit.

c) Velodyne: For efficiency, the Velodyne scans are stored as floating point binaries that are easy to parse using the C++ or MATLAB code provided; a minimal sketch follows below. Each point is stored with its (x, y, z) coordinate and an additional reflectance value (r). While the number of points per scan is not constant, on average each file/frame has a size of ~1.9 MB, which corresponds to ~120,000 3D points and reflectance values. Note that the Velodyne laser scanner rotates continuously around its vertical axis (counter-clockwise), which can be taken into account using the timestamp files.
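For readers working outside MATLAB/C++, a short NumPy sketch (our assumption, not part of the official development kit) that exploits the flat (x, y, z, r) float layout described above:

```python
# Each scan is a flat binary sequence of float32 values in (x, y, z, r) order.
import numpy as np

def read_velodyne_scan(path):
    """Load one velodyne_points/data/<frame>.bin file as an (N, 4) array."""
    scan = np.fromfile(path, dtype=np.float32)
    return scan.reshape(-1, 4)  # columns: x, y, z, reflectance

# Example (hypothetical path):
# points = read_velodyne_scan("velodyne_points/data/0000000000.bin")
# points.shape is roughly (120000, 4) on average
```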
B. Annotations

For each dynamic object within the reference camera's field of view, we provide annotations in the form of 3D bounding box tracklets, represented in Velodyne coordinates. We define the classes 'Car', 'Van', 'Truck', 'Pedestrian', 'Person (sitting)', 'Cyclist', 'Tram' and 'Misc' (e.g., Trailers, Segways). The tracklets are stored in date_drive_tracklets.xml. Each object is assigned a class and its 3D size (height, width, length). For each frame, we provide the object's translation and rotation in 3D, as illustrated in Fig. 7. Note that we only provide the yaw angle, while the other two angles are assumed to be close to zero. Furthermore, the level of occlusion and truncation is specified. The development kit contains C++/MATLAB code for reading and writing tracklets using the boost::serialization library.

Fig. 6. Development kit. Working with tracklets (top), Velodyne point clouds (bottom) and their projections onto the image plane is demonstrated in the MATLAB development kit, which is available from the KITTI website.

Fig. 7. Object Coordinates. This figure illustrates the coordinate system of the annotated 3D bounding boxes (with length l and width w) with respect to the coordinate system of the 3D Velodyne laser scanner. In z-direction, the object coordinate system is located at the bottom of the object (contact point with the supporting surface).

To give further insights into the properties of our dataset, we provide statistics for all sequences that contain annotated objects. The total number of objects and the object orientations for the two predominant classes 'Car' and 'Pedestrian' are shown in Fig. 8. For each object class, the number of object labels per image and the length of the captured sequences is shown in Fig. 9. The egomotion of our platform recorded by the GPS/IMU system as well as statistics about the sequence length and the number of objects are shown in Fig. 10 for the whole dataset and in Fig. 11 per street category.

C. Development Kit

The raw data development kit provided on the KITTI website contains MATLAB demonstration code with C++ wrappers and a readme.txt file which gives further details. Here, we will briefly discuss the most important features. Before running the scripts, the mex wrapper readTrackletsMex.cpp for reading tracklets into MATLAB structures and cell arrays needs to be built using the script make.m. It wraps the file tracklets.h from the cpp folder, which holds the tracklet object for serialization. This file can also be directly interfaced with when working in a C++ environment.

The script run_demoTracklets.m demonstrates how 3D bounding box tracklets can be read from the XML files and projected onto the image plane of the cameras. The projection of 3D Velodyne point clouds into the image plane is demonstrated in run_demoVelodyne.m. See Fig. 6 for an illustration.

The script run_demoVehiclePath.m shows how to read and display the 3D vehicle trajectory using the GPS/IMU data. It makes use of convertOxtsToPose(), which takes as input GPS/IMU measurements and outputs the 6D pose of the vehicle in Euclidean space. For this conversion we make use of the Mercator projection [10]:

$$x = s \, r \, \frac{\pi \, \mathrm{lon}}{180} \qquad (1)$$

$$y = s \, r \, \log\!\left(\tan\left(\frac{\pi \, (90 + \mathrm{lat})}{360}\right)\right) \qquad (2)$$

with earth radius r ≈ 6378137 meters, scale $s = \cos\left(\mathrm{lat}_0 \cdot \frac{\pi}{180}\right)$, and (lat, lon) the geographic coordinates. Here, lat_0 denotes the latitude of the first frame's coordinates and uniquely determines the Mercator scale.
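The following Python sketch mirrors Eqs. (1)-(2); the development kit's convertOxtsToPose() is the reference implementation, so treat this as an illustrative re-statement only:

```python
import math

R_EARTH = 6378137.0  # earth radius r in meters (see above)

def mercator_scale(lat0):
    """Scale s = cos(lat0 * pi / 180), fixed by the latitude of the first frame."""
    return math.cos(lat0 * math.pi / 180.0)

def latlon_to_mercator(lat, lon, s):
    """Eqs. (1)-(2): map geographic coordinates to planar Mercator coordinates."""
    x = s * R_EARTH * math.pi * lon / 180.0
    y = s * R_EARTH * math.log(math.tan(math.pi * (90.0 + lat) / 360.0))
    return x, y
```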
The function loadCalibrationCamToCam() can be used to read the intrinsic and extrinsic calibration parameters of the four video sensors. The other 3D rigid body transformations can be parsed with loadCalibrationRigid().

D. Benchmarks

In addition to the raw data, our KITTI website hosts evaluation benchmarks for several computer vision and robotic tasks such as stereo, optical flow, visual odometry, SLAM, 3D object detection and 3D object tracking. For details about the benchmarks and evaluation metrics we refer the reader to [8].

IV. SENSOR CALIBRATION

We took care that all sensors are carefully synchronized and calibrated. To avoid drift over time, we calibrated the sensors every day after our recordings. Note that even though the sensor setup has not been altered in between, numerical differences are possible. The coordinate systems are defined as illustrated in Fig. 1 and Fig. 3, i.e.:

- Camera: x = right, y = down, z = forward
- Velodyne: x = forward, y = left, z = up
- GPS/IMU: x = forward, y = left, z = up

Fig. 5. Examples from the KITTI dataset ('City', 'Residential', 'Road', 'Campus' and 'Person'). This figure demonstrates the diversity in our dataset. The left color camera image is shown.

Notation: In the following, we write scalars in lower-case letters (a), vectors in bold lower-case (a) and matrices using bold-face capitals (A). 3D rigid-body transformations which take points from coordinate system a to coordinate system b will be denoted by $T^b_a$, with T for 'transformation'.

A. Synchronization

In order to synchronize the sensors, we use the timestamps of the Velodyne 3D laser scanner as a reference and consider each spin as a frame. We mounted a reed contact at the bottom of the continuously rotating scanner, triggering the cameras when facing forward. This minimizes the differences in the range and image observations caused by dynamic objects. Unfortunately, the GPS/IMU system cannot be synchronized that way. Instead, as it provides updates at 100 Hz, we collect the information with the closest timestamp to the laser scanner timestamp for a particular frame, resulting in a worst-case time difference of 5 ms between a GPS/IMU and a camera/Velodyne data package. Note that all timestamps are provided such that positioning information at any time can be easily obtained via interpolation. All timestamps have been recorded on our host computer using the system clock.

B. Camera Calibration

For calibrating the cameras intrinsically and extrinsically, we use the approach proposed in [11]. Note that all camera centers are aligned, i.e., they lie on the same x/y-plane. This is important as it allows us to rectify all images jointly. The calibration parameters for each day are stored in row-major order in calib_cam_to_cam.txt using the following notation (a parsing sketch is given after this list):

- $s^{(i)} \in \mathbb{N}^2$ ... original image size
- $K^{(i)} \in \mathbb{R}^{3 \times 3}$ ... calibration matrices (unrectified)
- $d^{(i)} \in \mathbb{R}^{5}$ ... distortion coefficients (unrectified)
- $R^{(i)} \in \mathbb{R}^{3 \times 3}$ ... rotation from camera 0 to camera i
- $t^{(i)} \in \mathbb{R}^{1 \times 3}$ ... translation from camera 0 to camera i
- $s_{rect}^{(i)} \in \mathbb{N}^2$ ... image size after rectification
- $R_{rect}^{(i)} \in \mathbb{R}^{3 \times 3}$ ... rectifying rotation matrix
- $P_{rect}^{(i)} \in \mathbb{R}^{3 \times 4}$ ... projection matrix after rectification

Here, i ∈ {0, 1, 2, 3} is the camera index, where 0 represents the left grayscale, 1 the right grayscale, 2 the left color and 3 the right color camera. Note that the variable definitions are compliant with the OpenCV library, which we used for warping the images. When working with the synchronized and rectified datasets, only the variables with rect-subscript are relevant. Note that due to the pincushion distortion effect the images have been cropped such that the size of the rectified images is smaller than the original size of 1392 x 512 pixels.
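As a convenience, here is a minimal Python sketch for reading this file. It assumes each line has the form "key: value value ..." with keys such as P_rect_02 (P_rect of camera 2); the development kit's loadCalibrationCamToCam() is the authoritative reader:

```python
# Sketch under the assumption that calib_cam_to_cam.txt stores one
# whitespace-separated value list per line, prefixed by a key.
import numpy as np

def load_cam_to_cam(path):
    calib = {}
    with open(path) as f:
        for line in f:
            key, _, vals = line.partition(":")
            try:
                calib[key.strip()] = np.array([float(v) for v in vals.split()])
            except ValueError:
                calib[key.strip()] = vals.strip()  # non-numeric entries, e.g. a date
    return calib

# Example (hypothetical key naming):
# P2 = load_cam_to_cam("calib_cam_to_cam.txt")["P_rect_02"].reshape(3, 4)
```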
The projection of a 3D point $\mathbf{x} = (x, y, z, 1)^T$ in rectified (rotated) camera coordinates to a point $\mathbf{y} = (u, v, 1)^T$ in the i-th camera image is given as

$$\mathbf{y} = P_{rect}^{(i)} \, \mathbf{x} \qquad (3)$$

with

$$P_{rect}^{(i)} = \begin{pmatrix} f_u^{(i)} & 0 & c_u^{(i)} & -f_u^{(i)} b^{(i)} \\ 0 & f_v^{(i)} & c_v^{(i)} & 0 \\ 0 & 0 & 1 & 0 \end{pmatrix} \qquad (4)$$

the i-th projection matrix. Here, $b^{(i)}$ denotes the baseline (in meters) with respect to reference camera 0. Note that in order to project a 3D point $\mathbf{x}$ in reference camera coordinates to a point $\mathbf{y}$ on the i-th image plane, the rectifying rotation matrix of the reference camera $R_{rect}^{(0)}$ must be considered as well:

$$\mathbf{y} = P_{rect}^{(i)} R_{rect}^{(0)} \mathbf{x} \qquad (5)$$

Here, $R_{rect}^{(0)}$ has been expanded into a 4 x 4 matrix by appending a fourth zero-row and column, and setting $R_{rect}^{(0)}(4,4) = 1$.

C. Velodyne and IMU Calibration

We have registered the Velodyne laser scanner with respect to the reference camera coordinate system (camera 0) by initializing the rigid body transformation using [11]. Next, we optimized an error criterion based on the Euclidean distance of 50 manually selected correspondences and a robust measure on the disparity error with respect to the 3 top performing stereo methods in the KITTI stereo benchmark [8]. The optimization was carried out using Metropolis-Hastings sampling.

The rigid body transformation from Velodyne coordinates to camera coordinates is given in calib_velo_to_cam.txt:

- $R_{velo}^{cam} \in \mathbb{R}^{3 \times 3}$ ... rotation matrix: velodyne -> camera
- $t_{velo}^{cam} \in \mathbb{R}^{1 \times 3}$ ... translation vector: velodyne -> camera

Using

$$T_{velo}^{cam} = \begin{pmatrix} R_{velo}^{cam} & t_{velo}^{cam} \\ 0 & 1 \end{pmatrix} \qquad (6)$$

a 3D point $\mathbf{x}$ in Velodyne coordinates gets projected to a point $\mathbf{y}$ in the i-th camera image as

$$\mathbf{y} = P_{rect}^{(i)} R_{rect}^{(0)} T_{velo}^{cam} \mathbf{x} \qquad (7)$$

For registering the IMU/GPS with respect to the Velodyne laser scanner, we first recorded a sequence with an '∞'-loop and registered the (untwisted) point clouds using the Point-to-Plane ICP algorithm. Given two trajectories, this problem corresponds to the well-known hand-eye calibration problem, which can be solved using standard tools [12]. The rotation matrix $R_{imu}^{velo}$ and the translation vector $t_{imu}^{velo}$ are stored in calib_imu_to_velo.txt. A 3D point $\mathbf{x}$ in IMU/GPS coordinates gets projected to a point $\mathbf{y}$ in the i-th image as

$$\mathbf{y} = P_{rect}^{(i)} R_{rect}^{(0)} T_{velo}^{cam} T_{imu}^{velo} \mathbf{x} \qquad (8)$$
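Putting the pieces together, here is a minimal Python sketch of Eq. (7); this is our illustration only, with run_demoVelodyne.m in the development kit as the reference:

```python
# Project Velodyne points into the i-th rectified camera image, Eq. (7).
import numpy as np

def project_velo_to_image(pts_velo, P_rect_i, R_rect_0, T_cam_velo):
    """pts_velo: (N, 3) Velodyne coords -> (N, 2) pixel coords in camera i."""
    n = pts_velo.shape[0]
    x = np.hstack([pts_velo, np.ones((n, 1))])   # homogeneous coordinates, (N, 4)

    R = np.eye(4)                                # expand R_rect^(0) to 4 x 4
    R[:3, :3] = R_rect_0                         # with R(4,4) = 1, as in Eq. (5)

    y = (P_rect_i @ R @ T_cam_velo @ x.T).T      # Eq. (7), gives (N, 3)
    # Points with non-positive depth lie behind the image plane and should be
    # discarded before normalizing the homogeneous coordinates:
    valid = y[:, 2] > 0
    return y[valid, :2] / y[valid, 2:3]
```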
V. SUMMARY AND FUTURE WORK

In this paper, we have presented a calibrated, synchronized and rectified autonomous driving dataset capturing a wide range of interesting scenarios. We believe that this dataset will be highly useful in many areas of robotics and computer vision. In the future, we plan on expanding the set of available sequences by adding additional 3D object labels for currently unlabeled sequences and recording new sequences, for example in difficult lighting situations such as at night, in tunnels, or in the presence of fog or rain. Furthermore, we plan on extending our benchmark suite by novel challenges. In particular, we will provide pixel-accurate semantic labels for many of the sequences.

Fig. 8. Object Occurrence and Orientation Statistics of our Dataset. This figure shows the different types of objects occurring in our sequences (top) and the orientation histograms (bottom) for the two most predominant categories 'Car' and 'Pedestrian'.

REFERENCES

[1] G. Singh and J. Kosecka, "Acquiring semantics induced topology in urban environments," in ICRA, 2012.
[2] R. Paul and P. Newman, "FAB-MAP 3D: Topological mapping with spatial and visual appearance," in ICRA, 2010.
[3] C. Wojek, S. Walk, S. Roth, K. Schindler, and B. Schiele, "Monocular visual scene understanding: Understanding multi-object traffic scenes," PAMI, 2012.
[4] D. Pfeiffer and U. Franke, "Efficient representation of traffic scenes by means of dynamic stixels," in IV, 2010.
[5] A. Geiger, M. Lauer, and R. Urtasun, "A generative model for 3d urban scene understanding from movable platforms," in CVPR, 2011.
[6] A. Geiger, C. Wojek, and R. Urtasun, "Joint 3d estimation of objects and scene layout," in NIPS, 2011.
[7] M. A. Brubaker, A. Geiger, and R. Urtasun, "Lost! Leveraging the crowd for probabilistic visual self-localization," in CVPR, 2013.
[8] A. Geiger, P. Lenz, and R. Urtasun, "Are we ready for autonomous driving? The KITTI vision benchmark suite," in CVPR, 2012.
[9] M. Goebl and G. Faerber, "A real-time-capable hard- and software architecture for joint image and knowledge processing in cognitive automobiles," in IV, 2007.
[10] P. Osborne, "The Mercator projections," 2008.
[11] A. Geiger, F. Moosmann, Ö. Car, and B. Schuster, "A toolbox for automatic calibration of range and camera sensors using a single shot," in ICRA, 2012.
[12] R. Horaud and F. Dornaika, "Hand-eye calibration," IJRR, vol. 14, no. 3, 1995.
Fig. 9. Number of Object Labels per Class and Image. This figure shows how often an object occurs in an image. Since our labeling efforts focused on cars and pedestrians, these are the most predominant classes here.

Fig. 10. Egomotion, Sequence Count and Length. This figure shows (from left to right) the egomotion (velocity and acceleration) of our recording platform for the whole dataset. Note that we excluded sequences with a purely static observer from these statistics. The length of the available sequences is shown as a histogram counting the number of frames per sequence. The rightmost figure shows the number of frames/images per scene category.

Fig. 11. Velocities, Accelerations and Object Counts per Street Category. For each scene category we show the acceleration and velocity of the mobile platform as well as the number of labels and objects per class. Note that sequences with a purely static observer have been excluded from the velocity and acceleration histograms as they don't represent natural driving behavior. The category 'Person' has been recorded from a static observer.
