Intelligent Mobile Vehicle Navigator

Similar documents
Sensor Based Control of Autonomous Wheeled Mobile Robots

About the NeuroFuzzy Module of the FuzzyTECH5.5 Software

Adaptive Cruise Control of a Passenger Car Using Hybrid of Sliding Mode Control and Fuzzy Logic Control

HITACHI INVERTER SJ/L100/300 SERIES PID CONTROL USERS GUIDE

Problems often have a certain amount of uncertainty, possibly due to: Incompleteness of information about the environment,

Optimized Fuzzy Control by Particle Swarm Optimization Technique for Control of CSTR

Electric Power Steering Automation for Autonomous Driving

Visual Servoing using Fuzzy Controllers on an Unmanned Aerial Vehicle

Chapter 9. particle is increased.

Introduction to Robotics Analysis, Systems, Applications

3D Vision An enabling Technology for Advanced Driver Assistance and Autonomous Offroad Driving

Learning Module 4 - Thermal Fluid Analysis Note: LM4 is still in progress. This version contains only 3 tutorials.

3D Vision An enabling Technology for Advanced Driver Assistance and Autonomous Offroad Driving

The Easy-to-use Steering Design Tool SimuLENK as an Application of SIMPACK Code Export

MOBILE ROBOT TRACKING OF PRE-PLANNED PATHS. Department of Computer Science, York University, Heslington, York, Y010 5DD, UK

A STUDY ON WARNING TIMING FOR LANE CHANGE DECISION AID SYSTEMS BASED ON DRIVER S LANE CHANGE MANEUVER

A Fuzzy System Approach of Feed Rate Determination for CNC Milling

Intelligent Mobile Robot Motion Control in Unstructured Environments

A PHOTOGRAMMETRIC APPRAOCH FOR AUTOMATIC TRAFFIC ASSESSMENT USING CONVENTIONAL CCTV CAMERA

CE801: Intelligent Systems and Robotics Lecture 3: Actuators and Localisation. Prof. Dr. Hani Hagras

Path Tracking for a Miniature Robot

Automatic Train Control based on the Multi-Agent Control of Cooperative Systems

Detection of DDoS Attack Scheme

DESIGN AND STRUCTURE OF FUZZY LOGIC USING ADAPTIVE ONLINE LEARNING SYSTEMS

Robotics. Lecture 3: Sensors. See course website for up to date information.

Adaptive Cruise Control

Sensor-Based Robotic Model for Vehicle Accident Avoidance

Obstacle Avoidance Design for Humanoid Robot Based on Four Infrared Sensors

Parameter identification of a linear single track vehicle model

Multi-ultrasonic sensor fusion for autonomous mobile robots

Type-2 fuzzy logic control for a mobile robot tracking a moving target

BENEFIT OF DYNAMIC USE CASES TO EARLY DESIGN A DRIVING ASSISTANCE SYSTEM FOR PEDESTRIAN/TRUCK COLLISION AVOIDANCE

Roots of Equations (Chapters 5 and 6)

Internet based manipulator telepresence

Intelligent Control Design Using S12 Microcontroller: A Student Design Workshop

Understanding and Applying Kalman Filtering

PID Control. Proportional Integral Derivative (PID) Control. Matrix Multimedia 2011 MX009 - PID Control. by Ben Rowland, April 2011

How To Understand General Relativity

Proceeding of 5th International Mechanical Engineering Forum 2012 June 20th 2012 June 22nd 2012, Prague, Czech Republic

Adaptive Cruise Control System Overview

The Basics of Robot Mazes Teacher Notes

Using Fuzzy Logic Control to Provide Intelligent Traffic Management Service for High-Speed Networks ABSTRACT:

Power Electronics. Prof. K. Gopakumar. Centre for Electronics Design and Technology. Indian Institute of Science, Bangalore.

2. TEST PITCH REQUIREMENT

ANTI LOCK BRAKING SYSTEM MODELLING AND DEVELOPMENT

A.Giusti, C.Zocchi, A.Adami, F.Scaramellini, A.Rovetta Politecnico di Milano Robotics Laboratory

Robotic motion planning for 8- DOF motion stage

Experimental Uncertainties (Errors)

Figure CPT Equipment

2008 FXA DERIVING THE EQUATIONS OF MOTION 1. Candidates should be able to :

Sensor-Based Intelligent Mobile Robot Navigation in Unknown Environments

The Vertical Handoff Algorithm using Fuzzy Decisions in Cellular Phone Networks

Explore 3: Crash Test Dummies

Adaptive cruise control (ACC)

EE 402 RECITATION #13 REPORT

A FUZZY LOGIC APPROACH FOR SALES FORECASTING

Project Management Efficiency A Fuzzy Logic Approach

A Study of Classification for Driver Conditions using Driving Behaviors

TwinCAT NC Configuration

Rick Galdos, Forensic Engineering 1

Driver - Vehicle Environment simulation. Mauro Marchitto Kite Solutions

Projectile motion simulator.

Physics Notes Class 11 CHAPTER 3 MOTION IN A STRAIGHT LINE

LMS is a simple but powerful algorithm and can be implemented to take advantage of the Lattice FPGA architecture.

Last Mile Intelligent Driving in Urban Mobility

Chapter 3 Falling Objects and Projectile Motion

University Physics 226N/231N Old Dominion University. Getting Loopy and Friction

INTERFERENCE OF SOUND WAVES

The dynamic equation for the angular motion of the wheel is R w F t R w F w ]/ J w

Transmission Line Terminations It s The End That Counts!

Engineering Feasibility Study: Vehicle Shock Absorption System

Work-Energy Bar Charts

CHAPTER 5 PREDICTIVE MODELING STUDIES TO DETERMINE THE CONVEYING VELOCITY OF PARTS ON VIBRATORY FEEDER

POWER TRIM 5 F AUTO TRIM AND AUTO TRIM

Digital Systems Based on Principles and Applications of Electrical Engineering/Rizzoni (McGraw Hill

Active Vibration Isolation of an Unbalanced Machine Spindle

Section 4: The Basics of Satellite Orbits

The Telematics Application Innovation Based On the Big Data. China Telecom Transportation ICT Application Base(Shanghai)

Calculation of Source-detector Solid Angle, Using Monte Carlo Method, for Radioactive Sources with Various Geometries and Cylindrical Detector

Field and Service Robotics. Odometry sensors

STATIC AND KINETIC FRICTION

P211 Midterm 2 Spring 2004 Form D

Uniformly Accelerated Motion

C B A T 3 T 2 T What is the magnitude of the force T 1? A) 37.5 N B) 75.0 N C) 113 N D) 157 N E) 192 N

ServoPAL (#28824): Servo Pulser and Timer

Stability Analysis of Small Satellite Formation Flying and Reconfiguration Missions in Deep Space

Lab 8: Ballistic Pendulum

Operating Vehicle Control Devices

degrees of freedom and are able to adapt to the task they are supposed to do [Gupta].

Grant agreement no: FP SPENCER: Project start: April 1, 2013 Duration: 3 years XXXXXXXXXXDELIVERABLE 6.6XXXXXXXXXX

SIMERO Software System Design and Implementation

Energy transformations

VBA Macro for construction of an EM 3D model of a tyre and part of the vehicle

Transmission Line and Back Loaded Horn Physics

Autonomous Advertising Mobile Robot for Exhibitions, Developed at BMF

Force/position control of a robotic system for transcranial magnetic stimulation

Design of fuzzy systems

Time Domain and Frequency Domain Techniques For Multi Shaker Time Waveform Replication

Solving Simultaneous Equations and Matrices

ACCIDENTS AND NEAR-MISSES ANALYSIS BY USING VIDEO DRIVE-RECORDERS IN A FLEET TEST

Transcription:

Intelligent Mobile Vehicle Navigator Based on Fuzzy Logic and Reinforcement Learning Aashay Harlalka [100050035] Ankush Das [100050042] Pulkit Maheshwari [100050043]

Introduction The navigation of a mobile vehicle can be considered as a task of determining a collision free path that enables the vehicle to travel through an obstacle course from an initial configuration to a goal configuration. (Path Planning Problem) Path Planning Problem could be classified into: global path planning and local path planning. Global Path Planning : an exact environment model is used for planning the path. Local Path Planning : uses obstacle avoidance methods. Potential Field Method ( proposed by Khatib )

Motivation Drawbacks of Global Planning Method They can be conducted only in a completely known environment. Their time complexity grows with the geometry complexity and grows exponentially with the number of degrees of freedom in the vehicle s motion Drawbacks of Local Planning Method (Potential Field Method) Local minimum could occur and cause the vehicle to be stuck. It tends to cause unstable motion in the presence of obstacles. It is difficult to find the force coefficients influencing the vehicle s velocity and direction in an unknown environment.

Motivation Fuzzy Logic Approach Efficient in tackling the problem of obstacle avoidance, without requiring to construct an analytical model of the environment. Each rule of the rule base has a physical meaning making it possible to tune the rules by using expert s knowledge. Drawbacks of Fuzzy logic Difficult to consistently construct the rules. Tuning of the constructed rules is time consuming.

Motivation EEM ( Environment Exploration Method ) Training Method Uses reinforcement learning to associate an appropriate action to a situation. Drawbacks of EEM Slow and uncertain convergence of the learning process Insufficiently learnt rule-base

Overview of the Navigator Vehicle Model and Sensor arrangement Uses a cylindrical mobile platform driven by three active wheels. Equipped with an ultrasonic sensor ring having N sensors evenly distributed along the ring.

Overview of the Navigator Coordinate Systems and Navigation Task Each navigation task is specified in the world coordinate, where the vehicle configuration is represented by S = (X 0 Y 0 θ), where (X 0, Y 0 ) are co-ordinates of vehicle s center and θ is heading angle of vehicle. A navigation task is to obtain the environment information, d i and P g (X g,y g ), and the vehicle s configuration S(t) at each time step t, and determine the output variables v(t) and Δθ(t).

Overview of the Navigator Architecture of the Navigator Consists of 4 main modules : Obstacle Avoider (OA), Goal Seeker (GS), Navigation Supervisor (NS) and Environment Evaluator (EE). OA determines v a and Δθ a GS determines v g and Δθ g NS fuses these two values to obtain the eventual v and Δθ. EE computes the distance sensed by the ultrasonic sensors, and determines the value of W, which is used for fuzzification by the OA.

Overview of the Navigator

Obstacle Avoider Fuzzy Control of Obstacle Avoidance Input variables of the OA are the sensor input variables, d i The outputs are v a and θ a. Membership Functions (as in figure)

Obstacle Avoider Fuzzification of input variables The value of each d i is fuzzified and expressed by the fuzzy sets VN, NR, FR Rule base construction through reinforcement learning Fuzzy Inference Fire strength of the j th rule, μ j μ j = μ D j1 d 1 μ D j2 d 2 μ D jn d n

Obstacle Avoider Defuzzification of the output variables Method of height defuzzification Low computing cost v a = μ jb 1j μ j, θ a = μ jb 2j μ j

Obstacle Avoider Rule Learning for Obstacle Avoidance The vehicle begins the learning with an initial v and θ at time step t = 0. It moves into a new position at time step t = 1, and so on, until a collision occurs at the time step t = k. The whole process, until a collision occurred, is called a trial and the time step t; (t > 0) is called the t th learning step. A failure signal is fed back to the learning network, and the rules which were used at the previous time steps k; k - 1; k - 2, are changed in order to get an improvement on the vehicle s performance.

Obstacle Avoider After the rules are updated, a new trial begins at the (k+1) th learning step. The process is iterated and terminated until no more collisions occur.

Obstacle Avoider Simulation of Rule Learning for Obstacle Avoidance For efficient learning, a tradeoff between exploration and exploitation should be achieved to maximize the effect of learning and minimize the costs of exploration. Environment Exploration Method ( EEM ) Straightforward and simple method. Explores and converges slowly in a complex environment New Training Method Phase 1: The vehicle begins its training from an arbitrarily chosen start configuration, moves in a specific direction ( learning is iterated as in the EEM ). Phase 2: The vehicle then learns to navigate in the opposite direction with a new start configuration. Upon collision, it backtracks some steps and changes direction accordingly. Both these phases are completed when vehicles maintains trajectory without collision.

Performance Analysis Embedded in a fully integrated and interactive simulator developed on the SGI IRIX operating system and the OpenInventor platform. Three cases are compared, where t 1 and t 2 are the trajectories determined by the vehicle while using the rule base constructed by the new training method with and without the EE, respectively; and t 3 is the trajectory determined by the vehicle while using the rule base constructed by the EEM (terminated at 100,000 learning steps with W = 60 cm).

Motions of t 1,t 2,t 3 Top view of a laboratory and trajectory from s1 to g1

Analysis of t 1 s motion

Analysis of t 1 s motion s 1 - Vehicle was at a velocity of about 14 cm/s. a - Turned its heading direction slightly towards the goal g 1 with a small drop in velocity. b - Accelerated and passed by the door on its right. c - Encountered another AMV, vehicle slowed down to below 12 cm/s, before making a relatively large steering change to avoid the AMV. d - Accelerated to top speed when passing the table (TB). e - Detected the presence of the human being, decelerated, and steered to the left. f - Accelerated when passing the bookshelf (BS).

Analysis of t 1 s motion g - Decelerated when approaching the two human beings. h - Slowed down to about 9 cm/s when it was directly in front of HB3, before making a turn to the right. Selected the path between HB2 and HB3 and navigated through. g 1 - Decelerated gradually until coming to a stop.

Observations - t 1 s motion Acceleration/deceleration ranges are small when the vehicle passes by an obstacle but large when the obstacles are in its path. No abrupt change of velocity (±3 cm/s). No abrupt change in the steering angle (±11.5º). t 2 - Velocity and steering angle functions are very similar to t 1 even though the EE was not used. t 3 - Abrupt changes in velocity and steering angle, velocity varied between ±6 cm/s and the steering angle varied between +40º and -29º.

Evaluation of Path Quality Six navigation tasks were conducted and the errors are tabulated. p a - Length of actual path. p e - Length of shortest path. d ae - Deviation of the vehicle s position from the shortest path. E r - Relative error = (p a - p e )/p e

Evaluation of Path Quality NAVIGATION UNDER THE RULE BASE CONSTRUCTED BY THE EEM

Evaluation of Path Quality T Pa/cm Pe/cm Er(%) Average dae(cm) Max dae Time(s) obstacles Colloision 1 833.7 785.6 6.1 14.8 40.1 73.8 7 0 2 795.0 759.8 4.6 9.8 33.8 70.2 6 0 3 682.5 660.1 3.4 7.6 28.6 62.1 5 0 4 584.3 573.0 2.0 4.3 13.2 54.6 4 0 5 493.7 485.0 1.8 3.8 10.0 49.5 2 0 6 424.1 422.9 1.2 3.1 8.5 45.6 0 0 NAVIGATION UNDER THE RULE BASE CONSTRUCTED BY THE NEW METHOD

Conclusion Fuzzy navigator performs well in complex and unknown environments, using a rule base that is learned from a simple corridor-like environment. Fusion of the obstacle avoidance and goal seeking behaviors Aided by an environment evaluator to tune the universe of discourse of the input sensor readings and enhance its adaptability. 5 distinct advantages: 270 times faster in learning speed Only 4% of the learning cost Very reliable convergence of learning 98.8% of learned rules High adaptability

References N. H. C. Yung, Cang Ye: An intelligent mobile vehicle navigator based on fuzzy logic and reinforcement learning. IEEE Transactions on Systems, Man, and Cybernetics, Part B 29(2): 314-321 (1999)

Thank You.

,Trace of j th rule, λ is trace decay rate T is the time between two steps γ is the discount-rate parameter β is a positive constant

α is the learning rate, e mj (t) is the eligibility trace of the j th rule δ is the decay rate of eligibility

EEM Iteration Current distance readings are fed into the fuzzy quantization module, where they are encoded into μ j (t). If max ( ω mj (t) ) = 0, then the initial control actions, v a and θ a, are used as the control outputs; otherwise the control outputs are determined by defuzzification the external reinforcement signal, current prediction value p m (t), and the internal reinforcement signal is calculated by the previous equations the weights of the ACE and ASE (v mj (t) and ω mj (t)) are updated by the equations, while the trace of the rule and the eligibility trace are also updated Finally, if there is no collision, the configuration of the vehicle is changed by last equation and the learning process returns to Step 1. If a collision occurs, i.e., r m (t) = -1; v mj (t); trace(t); pm(t - 1) and e mj (t) are reset to zero. The vehicle is backtracked 4 steps and its heading direction is reversed. The weights of the ASE, ω mj (t) which are learned just before the collision are then used for the next trial. The next trial begins by repeating Step 1 through Step 5 again.