AN OVERVIEW OF DATA MINING IN ROAD TRAFFIC AND ACCIDENT ANALYSIS

Similar documents
Traffic Accident Data Mining Using Machine Learning Paradigms

How To Determine The Severity Of A Road Traffic Accident

A Perspective Analysis of Traffic Accident using Data Mining Techniques

MISSOURI TRAFFIC SAFETY COMPENDIUM

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Characteristics of High Injury Severity Crashes on km/h Rural Roads in South Australia

How To Know If You Are Distracted By Cell Phones

Safe Transportation System. Tangible Result Driver Leanna Depue, Highway Safety Director

The total number of traffic collisions in Saskatchewan is up 5% from 51,733 in 2008 to 54,229 in 2009.

The spectrum of motorcycle research in Maryland

Year Fatalities Injuries

Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10

A DATA MINING APPROACH TO IDENTIFY KEY FACTORS OF TRAFFIC INJURY SEVERITY

A Bicycle Accident Study Using GIS Mapping and Analysis

Mining Road Traffic Accident Data to Improve Safety: Role of Road-related Factors on Accident Severity in Ethiopia

Traffic accidents in Hanoi: data collection and analysis

An exploratory neural network model for predicting disability severity from road traffic accidents in Thailand

Design of Severity Estimation Unit of Automotive Accidents

DATA MINING TECHNIQUES AND APPLICATIONS

ALCOHOL, 2013 HIGHLIGHTS

Hybrid System for Driver Assistance

FMCSA Webinar Nov.18th to Examine Large Truck Crash Fatalities Involving Pedestrians & Bicyclists. New Technology for Better Fleet Management

Design and Development of Seat Belt Alert System with Ignition Interlocking in Four Wheeler

A Scientific Study "ETAC" European Truck Accident Causation

Deaths/injuries in motor vehicle crashes per million hours spent travelling, July 2008 June 2012 (All ages) Mode of travel

PEDESTRIAN AND BICYCLE ACCIDENT DATA. Irene Isaksson-Hellman If Insurance Company P&C Ltd.

Introduction. A. Bellaachia Page: 1

CHAPTER 1 Land Transport

Motorcycle Related Crash Victims (What the Statistics Say) Mehdi Nassirpour Illinois Department of Transportation Division of Transportation Safety

DECISION TREE ANALYSIS: PREDICTION OF SERIOUS TRAFFIC OFFENDING

EXPERT THINKING FROM MILLIMAN. Predictive analytics, text mining, and drug-impaired driving in automobile accidents

Prediction of Stock Performance Using Analytical Techniques

Development of an Ignition Interlock Device to Prevent Illegal Driving of a Drunk Driver

Data Report. Created: North Carolina Fatalities of year-olds in Motor Vehicle Crashes

Identifying Factors Underlying Injury

Prediction of Heart Disease Using Naïve Bayes Algorithm

Crime Hotspots Analysis in South Korea: A User-Oriented Approach

are the leading cause of teen fatalities, accounting for

the Ministry of Transport is attributed as the source of the material

A Review on Road Accident in Traffic System using Data Mining Techniques

ATSB RESEARCH AND ANALYSIS REPORT ROAD SAFETY. Characteristics of Fatal Road Crashes During National Holiday Periods

How To Write A Crash Reconstruction

Review of Best Practice in Road Crash Database and Analysis System Design Blair Turner ARRB Group Ltd

A statistical comparison between severe accidents and PDO accidents in Riyadh A.S. Al-Ghamdi College of Engineering, King Sand University,

STATISTICS OF FATAL AND INJURY ROAD ACCIDENTS IN LITHUANIA,

Density Map Visualization for Overlapping Bicycle Trajectories

UNION COLLEGE MOTOR VEHICLE POLICY

Welcome. Data Mining: Updates in Technologies. Xindong Wu. Colorado School of Mines Golden, Colorado 80401, USA

Testimony of Ann Wilson House Energy & Commerce Committee Subcommittee on Commerce, Manufacturing and Trade, October 21, 2015

Introduction to Data Mining and Business Intelligence Lecture 1/DMBI/IKI83403T/MTI/UI

Traffic Safety Facts. Children Data. Motor vehicle crashes are the leading cause of death for children from 2 to 14 years old.

Bicycle Traffic Accidents in Japan

COLLISION DYNAMICS P.O. BOX 1877 GLENDORA, CALIFORNIA CD-XPRT ( )

Knowledge and Data in Road Safety Management - Research at the Center for Road Safety

Accident configurations and injuries for bicyclists based on the German In-Depth Accident Study. Chiara Orsi

CCTV - Video Analytics for Traffic Management

HELPFUL TIPS AFTER A CAR ACCIDENT

OREGON TRAFFIC ACCIDENT AND INSURANCE REPORT

BASED UPON THE PRINCIPLES OF GRADUATED DRIVER LICENSING. A Family Guide to Teen Driver Safety

Principles of Road and Traffic Engineering Designs for Mixed Traffic Flow with a Numerous Motorcycles

2013 Student Competition

A Discussion of Highway Accident Data Collection and Statistics

Motorcycle Accident Investigation

Relative Contribution/Fault in Car-Truck Crashes February 2013

Crash Outcome Data Evaluation System

REAL-WORLD PERFORMANCE OF CITY SAFETY BASED ON SWEDISH INSURANCE DATA

DOT HS Summary of Statistical Findings April Data

1. The consultation seeks views on the vision, targets and measures for improving road safety in Great Britain for the period beyond 2010.

Introduction to Data Mining

DMV. OREGON TRAFFIC ACCIDENT AND INSURANCE REPORT Tear this sheet off your report, read and carefully follow the directions.

DOT HS Summary of Statistical Findings April Data

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

New York Study of Booster Seat Effects on Injury Reduction Compared to Safety Belts in Children Aged 4-8 in Motor Vehicle Crashes

Disputed Red Light Accidents: A Primer on Signal Control. Jon B. Landerville, MSME, PE Edward C. Fatzinger, MSME, PE Philip S.

Proposal 1. I. M. Writer 520 Safe Drive St. Anytown, Anystate, USA (555) October 13, 2004

PDCA Cycle Based Traffic Accident Countermeasure Management

2007 to Motor Vehicle Fatalities in British Columbia: Statistics

Motor Vehicle Crashes, Injuries, and Deaths in Relation to Driver Age: United States,

Safety Belt Use, Ejection and Entrapment

2. Road Traffic Accident Conditions During 2009

Trends in Transit Bus Accidents and Promising Collision Countermeasures

STAFFORDSHIRE COUNTY COUNCIL ROAD CASUALTY REVIEW

INFORMATION SUBMISSION FOR NAS. December Office Vehicle Safety Research National Highway Traffic Safety Administration

BOSTON CRASH REPORT.

Strategic Highway Safety Plan

Data Mining Applications in Higher Education

ANALYSIS OF INJURY SEVERITY IN PEDESTRIAN CRASHES USING CLASSIFICATION REGRESSION TREES

Data Mining for Fun and Profit

Neural Network based Vehicle Classification for Intelligent Traffic Control

Road Safety Authority Provisional Review of Road Crashes 2013

ITARDAInstitute for Traffic Accident

Information Management course

Traffic Accident Reconstruction

Development of road crash report form for Punjab traffic police

New Mexico Traffic Crash Annual Report 2011

Transcription:

AN OVERVIEW OF DATA MINING IN ROAD TRAFFIC AND ACCIDENT ANALYSIS K. Jayasudha Dr. C. Chandrasekar Lecturer/Research Scholar, Reader, Periyar University, Salem, Department of MCA, Tamilnadu, India K.S.R College of Engineering, Email: ccsekar@gmail.com Tiruchengode 637 215, Tamilnadu, India. Email: jayasudhakaliannan@yahoo.com able to introduce safety plans in order to prevent or ABSTRACT reduce occurrences of accidents. The road traffic and Data mining is a promising area for dealing accident statistics must be presented in such a way to with the increased, stored data that has been generated make it easier to be both recognized and interpreted by in our times.it is the extraction of implicit, previously a human operator. Previous works on accident analysis unknown and useful data. In this paper we have included statistical methods and formal techniques. analyzed some of the data mining techniques, tools, Statistics tables and ordinary charting techniques are applications and search engines for accident not sufficient for present day requirements and this investigation and traffic analysis. Most of the accident causes difficulties in the effective visualization of investigation methodologies are based on scenarios of results and patterns. Another disadvantage is that the accident occurrence and simulation of accident ordinary methods limit human involvement in the situation. The costs of fatalities and injuries due to exploration tasks. In this paper we have analyzed traffic accident have a great impact on society. various studies done by various authors in accident Engineers and researchers in the automobile industry databases and also measures proposed to prevent the have tried to design and build safer automobiles, but accident and its death rate. traffic accidents are unavoidable. In recent years, researchers have been utilizing real-life data in studying various aspects of traffic accidents. So measures have to be taken to reduce accidents. It is important that the measures should be based on scientific and objective surveys of the causes of accidents and severity of injuries. Our study highlights various tools, techniques and applications of data mining in accident analysis will eliminate deficiencies of other techniques but covers their advantages. Our main aim is to overcome the death rate and the increased rate of loss of lifes by means of using some tools, techniques or various algorithms in the field of data mining using the traffic data bases. Keywords - mining, accidents, road safety 1. INTRODUCTION Road safety experts and researchers deal with large volumes of quantitative information and collected statistics, in order to understand and estimate the social and economic cost of the accidents and to be Functionalities: Data mining is the process of identifying valid and understandable patterns in the data set. It helps in extracting and refining useful knowledge from large data sets. Data mining tasks can be classified in to two categories descriptive and predictive. Descriptive mining tasks characterize the general properties of the data in the database. Predictive mining tasks perform inference on the current data in order to make predictions. Main functionalities of data mining are as follows: Data characterization which summarizes the target class(class under the study).data discrimination which compares the target class with one or more sets. Association analysis which discovers association rules showing attribute-value conditions that occur frequently together in a given set of data.(using support and confidence).classification is the process of finding a set of models that describes and distinguish data classes or concepts for the purpose of being able to use Journal of Computer Applications, Vol II, No.4, Oct Dec 2009 Page 32

the model to predict the class of objects whose class label is unknown. Prediction it is nothing but in many applications users may wish to predict some missing or unavailable data values (numerical value) is often specifically referred to as prediction. Cluster analysis which analyzes data objects without consulting a known class label. The objects are clustered or grouped based on the principle of maximizing the intra class similarity and minimizing the interclass similarity. Outlier analysis: A database may contain data objects that do not comply with the general behavior or model of the data. These data objects are called outliers. Evolution analysis which describes and models regularities or trends for objects whose behavior changes over time. 2. RELATED RESEARCH Kim[1] developed a log-linear model to clarify the role of driver characteristics and behaviors in the causal sequence leading to more severe injuries. They found that driver behaviors of alcohol or drug use and lack of seat belt use greatly increase the odds of more severe crashes and injuries. Shankar [2] applied a nested logic formulation for estimating accident severity likelihood conditioned on the occurrence of an accident. The study found that there is a greater probability of evident injury or disabling injury/fatality relative to no evident injury if at least one driver did not use a restraint system at the time of the accident. Dia used real-world data for developing a multilayered NN freeway incident detection model [3]. They compared the performance of the neural network model and the incident detection model in operation on freeways. Yang [4] used NN approach to detect safer driving patterns that have less chances of causing death and injury when a car crash occurs. Evanco [5] conducted a multivariate population-based statistical analysis to determine the relationship between fatalities and accident notification times. The analysis demonstrated that accident notification time is an important determinant of the number of fatalities for accidents on rural roadways. Mussone [6] used neural networks to analyze vehicle accident that occurred at intersections. Results showed that the highest accident index for running over of pedestrian occurs at nonsignalized intersections at nighttime. The next major approach focuses on the area of Advanced Driving Assistance Systems (ADAS) [7]. These systems concentrate on attempting to prevent specific damaging scenarios, such as vehicles rear-end collision and lane deviation. They are mostly used in Smart Cars, and they work by mining data obtained from various sensors in the car. Different data mining techniques are used in an attempt to predict a driver s moves, so that unsafe actions can be rectified, or prevented. The use of data mining to improve road safety can be categorised into two major approaches. The first approach concentrates on mining crash data, which includes various attributes relating to both driver and vehicle at the time of the crash [8,9,10]. The focus is on analyzing the data for the purpose of discovering useful, and potentially actionable, information. In [11] crash data was mined to identify the driver and vehicle attributes which are the main causes for road accidents. Principal Component Analysis was used to emphasize the relationships between characteristics such as age, gender and vehicle type, to the crash variables. Bedard. [12] applied a multivariate logistic regression to determine the independent contribution of driver, crash, and vehicle characteristics to drivers fatality risk. It was found that increasing seatbelt use, reducing speed, and reducing the number and severity of driverside impacts might prevent fatalities. Ossiander [13] used Poisson regression to analyze the association between the fatal crash rate (fatal crashes per vehicle mile traveled) and the speed limit increase and found that the speed limit increase was associated with a higher fatal crash rate and more deaths on freeways. There are many research papers about spatial data mining [14] technologies of traffic data, for instance optimization of travel plan, visualization of traffic jam, GPS assisted navigation, road design, and ICT-assisted traffic congestion. The use of data mining to improve Journal of Computer Applications, Vol II, No.4, Oct Dec 2009 Page 33

road safety can be categorized into many major approaches and one such approach concentrates on mining crash data, which includes various attributes relating to both driver and vehicle at the time of the crash [15] The focus is on analyzing the data for the purpose of discovering useful, and potentially actionable, information. Abdel-Aty [16] used the Fatality Analysis Reporting System (FARS) crash databases covering the period of 1975-2000 to analyze the effect of the increasing number of light truck vehicle (LTV) registrations on fatal angle collision trends. They investigated the number of annual fatalities that result from angle collisions as well as collision configuration (car-car, car-ltv, LTV-car, and LTV-LTV). Sohn et al.[17] applied data fusion, ensemble and clustering to improve the accuracy of individual classifiers for two categories of severity (bodily injury and property damage) of road traffic accident. 3.ROAD SAFETY DATABASES In developing an accident database at the national level, efforts were made to use the technologies available that are able to assist in developing and implementing road safety programs and enable planning at both the local and national levels. The study of these databases showed that a tremendous effort had been made by different international and regional organizations. The main objectives of these databases were to explain road accidents in compatible and homogenous formats and to reduce the effort and time spent by researchers and end users in collecting accident statistics. International and regional road safety organizations use these databases to publish annual reports and statistics, according to predefined user requirements with agreed variables and indicators. Research in this area is devoted to developing database schema that represent the road accident data and other relevant statistics (population censuses, socioeconomic and energy related data) that is useful for users and decisionmakers. Technically, all available road databases collect and process data in different ways. Most of the accident variables used in different databases are selected to give full descriptions of the crash type and persons involved. For the purpose of the research, the data set used comes from international road databases that have good quality information and have been in existence for a significant length of time. IRTAD (http://www.irtad.net/) The International Road Traffic and Accident Database (IRTAD) is an international database that gathers data on traffic and road accidents (accident and victims as well as exposure data are collected on a continuous basis) from the Organization for Economic Co-operation and Development (OECD) member countries. The main part of the database, with around 500 data items, includes aggregated data on injury accidents, road fatalities, vehicle population, and network length. GLOBESAFE (http://www.globesafe.org/) GLOBESAFE is a database and a platform used to share road safety information among road safety organizations. This database offered data for the nine ASEAN countries from 1994 onwards. ACCIDENT RESOURCE (http://www.accident-resource.com/) The ARC Network is the number one web site for traffic accident reconstruction on the Internet. It hosts an expert directory of over 500 expert witnesses and contains thousands of pages relating to all areas of accident reconstruction. It provides valuable information regarding all aspects of all traffic accidents, including motorcycle accidents, pedestrian accidents, auto accidents, commercial vehicle accidents, bicycle accidents and more. CRASHDATARETRIEVAL(BlackBox) (www.cdr-system.com) This web site is dedicated to the crash data retrieval system. This is the software system kit that allows people to download information from a vehicles "black box". This web site helps to Journal of Computer Applications, Vol II, No.4, Oct Dec 2009 Page 34

understand what is going on with the new technology. The CDR System is a tool that provides access to vehicle crash data that may be stored in late model passenger cars, light trucks. Crash Data Retrieval System is the essential tool used for better case evaluation Collection of additional evidence Settlement decisions Subrogation http://www.indiandrivingschools.com/accidents-onindian-roads.html This web site clearly explains and gives details regarding the road safety,traffic rules, what are all the conditions to be followed for driving,details of the traffic authorities, in various types of road ranges. 4. TOOLS USED Orange Tanagra Clementine(SPSS PASW Modeler) Rapid Miner SAS SAS Enterprise Miner Darwin Data cruncher Enterprise miner Gain smarts Intelligent miner Mine set Model Quest Salford CART Olpars Beyesia Scenarion S-plus 5. PROPOSED WORK With the rapid increasing popularity of the WWW, Websites are playing a crucial role to convey knowledge and information to the end users. Discovering hidden and meaningful information about Web users usage patterns is critical to determine effective marketing strategies to optimize the Web server usage for accommodating future growth. Most of the currently available Web server analysis tools provide only explicitly and statistical information without real useful knowledge for Web managers. The task of mining useful information becomes more challenging when the Web traffic volume is enormous and keeps on growing. In this paper, we propose a alert model to discover and analyze useful knowledge from the available Web data. We made use of the classification information generated by a self organizing map for pattern analysis to capture the chaotic trend to provide short-term (hourly) and longterm (daily) Web traffic trend predictions like fatal accidents, seriously injured people, slightly injured people, material damage. Our traffic networks equipped with a number of measurement devices of various kinds(inductive loops, video sensors, radar detectors) that deliver real-time information about the current traffic conditions in corresponding locations. Our proposed model will gives alert whenever the vehicle crosses the danger zone either in audio or video format. Also we aimed to extend it by sending alerts from one vehicle to another. We can make use of the sensors to gather the details with the help of the GPS technology.it was collected in the data base and gathered in it.then data was classified using the decision support systems and identify the exact decision to be given to the driver as the alert.it is one among the useful way to prevent accident rates by giving alerts to the driver. A proposed sample diagram is shown in the following figure It is identified that accident rates are more at the intersections than in the highways or main roads. Sample traffic flow at the intersections was shown below. It was the aerial view of the photograph taken. Journal of Computer Applications, Vol II, No.4, Oct Dec 2009 Page 35

6. CONCLUSION The aim of this study was to show the applications of data mining techniques in the feild of accident investigation. It was done by reviewing various papers. Association rules are proposed to use to discover the patterns and rules that cause the occurrence of accidents. Our method is at early stages of development. We are currently enhancing it by considering several issues, variation in crash occurrence may have some consequence for traffic safety measures in some counties. It is beyond the scope of this study to determine which traffic crash excesses are due to poor roadway conditions or the necessity of stop signs or traffic lights.. The determination of specific precautionary measures to reduce crashes requires study of other factors such as the identification of specific road sections that need work, etc.. A systematic method for updating the index year after year could be considered. Additionally, further explorations of traffic safety data using data mining techniques are warranted. REFERENCES [1]. Kim, K., Nitz, L., Richardson, J., & Li, L., Personal and Behavioral Predictors of Automobile Crash and Injury Severity. Accident Analysis and Prevention, Vol. 27, No. 4, 1995, pp. 469-481. [2]. Shankar, V., Mannering, F., & Barfield, W., Statistical Analysis of Accident Severity on Rural Freeways. Accident Analysis and Prevention, Vol. 28, No. 3, 1996, pp.391-401. [3]. Dia, H., & Rose, G., Development and Evaluation of Neural Network Freeway Incident Detection Models Using Field Data. Transportation Research C, Vol. 5, No. 5, 1997, pp. 313-331. [4]. Yang, W.T., Chen, H. C., & Brown, D. B., Detecting Safer Driving Patterns by A Neural Network Approach. ANNIE 99 for the Proceedings of Smart Engineering System Design Neural Network, Evolutionary Programming, Complex Systems and Data Mining, Vol. 9, pp. 839-844, Nov. 1999. [5]. Evanco, W. M., The Potential Impact of Rural Mayday Systems on Vehicular Crash Fatalities. Accident Analysis and Prevention, Vol. 31, 1999, pp. 455-462. [6]. Mussone, L., Ferrari, A., & Oneta, M., An analysis of urban collisions using an artificial intelligence model. Accident Analysis and Prevention, Vol. 31, 1999, pp. 705-718. [7]. Oliver, N. and Pentland, A. P., Graphical Models for Driver Behavior Recognition in a SmartCar, MIT IEEE intelligent vehicles symposium, 2000. [8]. Tavris, D. R., Kuhn, E. M, & Layde, P. M., Age and Gender Patterns In Motor Vehicle Crash injuries: Improtance of Type of Crash and Occupant Role. Accident Analysis and Prevention, Vol. 33, 2001, pp. 167-172. [9]. Singh, S., Identification of Driver and Vehicle Characteristics through Data Mining the Highway Crash, National Highway Traffic Safety Administration, USA, 2001. [10]. Singh, S., A Sampling Strategy for Rear-End Pre- Crash Data Collection, National Highway Traffic Safety Administration, USA, 2001. [11]. Singh, S., A Sampling Strategy for Rear-End Pre- Crash Data Collection, National Highway Traffic Safety Administration, USA, 2001. [12]. Bedard, M., Guyatt, G. H., Stones, M. J., & Hireds, J. P., The Independent Contribution of Driver, Crash, and Vehicle Characteristics to Driver Fatalities. Accident analysis and Prevention, Vol. 34, 2002, pp. 717-727. [13]. Ossiander, E. M., & Cummings, P., Freeway speed limits and Traffic Fatalities in Journal of Computer Applications, Vol II, No.4, Oct Dec 2009 Page 36

Washington State. Accident Analysis and Prevention, Vol. 34, 2002, pp. 13-18. [14]. S. Shekhar and S. Chawla, Spatial Databases: A Tour, Prentice Hall, 2002. [15]. Flach, P.A., Mladenic, D., Moyle, S., Raeymaekers, S., Rauch, J., Rawles, S., Ribeiro, R., Sclep, G., Struyf, J., Todorovski, L., Torgo, L., Blockeel, H., Wettschereck, D., Wu, S., Gartner, T., Grobelnik, M., Kavsek, B., Kejkula, M., Krzywania, D., Lavrac, N. and Ljubic, P., On the road to knowledge: mining 21 years of UK traffic accident reports., Data Mining and Decision Support: Aspects of Integration and Collaboration, pages 143-155. Kluwer Academic Publishers,January 2003 [16]. Abdel-Aty, M., & Abdelwahab, H., Analysis and Prediction of Traffic Fatalities Resulting From Angle Collisions Including the Effect of Vehicles Configuration and Compatibility. Accident Analysis and Prevention, 2003. [17]. Sohn, S. Y., & Lee, S. H., Data Fusion, Ensemble and Clustering to Improve the Classification Accuracy for the Severity of Road Traffic Accidents in Korea. Safety Science, Vol. 4, issue1, February 2003, pp. 1-14. [18]. Ching-Yao Chan, David Marco,Traffic Monitoring at Signal-Controlled Intersections and Data Mining for Safety Applications, 2004. Biography Ms.K.Jayasudha received the B.Sc.degree in Computer Technology in 2001 and M.C.A Degree in 2005 and the M.E. degree in Computer science and Engineering in 2007 from K.S.Rangasamy college of Technology. She has three years of teaching experience. She is currently working as Lecturer at K.S.R college of Engineering.She has registered for PhD in Anna University, Coimbatore, India, under the supervision of Dr.C.Chandrasekar. Her area of interest includes data mining, networks etc., Dr.C.Chandrasekar obtained his Ph.D in the year of 2005. He is in teaching profession for the past 18 years in reputed engineering colleges and universities.. He is guiding more than 30 Ph.D Scholars. He is presently working as Reader at Periyar University Salem. He has presented many papers in National Conferences International Conferences and published many papers in International Journals. His area if interest includes data mining, Image processing, Mobile Ad Hoc Networks, Network Security etc., Journal of Computer Applications, Vol II, No.4, Oct Dec 2009 Page 37