ANALYSIS OF INJURY SEVERITY IN PEDESTRIAN CRASHES USING CLASSIFICATION REGRESSION TREES



Similar documents
Evaluation of Frequency and Injury Outcomes of Lane Departure Crashes

Word Count: Body Text = 5, ,000 (4 Figures, 4 Tables) = 7,500 words

DOT HS October 2001

Pedestrian and Bicycle Crash Data Analysis:

A DATA MINING APPROACH TO IDENTIFY KEY FACTORS OF TRAFFIC INJURY SEVERITY

Pedestrian Crash Facts

North Carolina. Bicycle Crash Facts Prepared for

Characteristics of High Injury Severity Crashes on km/h Rural Roads in South Australia

MISSOURI TRAFFIC SAFETY COMPENDIUM

Crash Analysis. Identify/Prioritize. Gather Data. Analyze Crashes and Identify Improvements. Review Funding Options. Implement Improvements

DOT HS December Motor Vehicle Crashes: Overview ,000. Fatality Rate per 100M VMT

Relative Contribution/Fault in Car-Truck Crashes February 2013

DOT HS August 2012

The total number of traffic collisions in Saskatchewan is up 5% from 51,733 in 2008 to 54,229 in 2009.

TRAFFIC SAFETY FACTS. Bicyclists and Other Cyclists Data

DECISION TREE ANALYSIS: PREDICTION OF SERIOUS TRAFFIC OFFENDING

Statistical Analysis of the Traffic Safety Impacts of On-Premise Digital Signs

A statistical comparison between severe accidents and PDO accidents in Riyadh A.S. Al-Ghamdi College of Engineering, King Sand University,

Bicycle Collisions in Washington State: A Six-Year Perspective,

DRIVER ATTRIBUTES AND REAR-END CRASH INVOLVEMENT PROPENSITY

DOT HS April 2014

Traffic Safety Facts. Children Data. Motor vehicle crashes are the leading cause of death for children from 2 to 14 years old.

H. The Study Design. William S. Cash and Abigail J. Moss National Center for Health Statistics

How To Be Safe

Deaths/injuries in motor vehicle crashes per million hours spent travelling, July 2008 June 2012 (All ages) Mode of travel

Traffic Safety Facts 2008 Data

The spectrum of motorcycle research in Maryland

A model to predict the probability of highway rail crossing accidents

Level Crossing Crash Taxonomy for Connected Vehicle Safety Research

FMCSA Webinar Nov.18th to Examine Large Truck Crash Fatalities Involving Pedestrians & Bicyclists. New Technology for Better Fleet Management

An Overview and Evaluation of Decision Tree Methodology

RISER. Roadside Infrastructure for Safer European Roads

Motorcycle Related Crash Victims (What the Statistics Say) Mehdi Nassirpour Illinois Department of Transportation Division of Transportation Safety

Traffic Collision Statistics Report

Motor Vehicle Crashes, Injuries, and Deaths in Relation to Driver Age: United States,

EXPERT THINKING FROM MILLIMAN. Predictive analytics, text mining, and drug-impaired driving in automobile accidents

Trends in Transit Bus Accidents and Promising Collision Countermeasures

Juha Luoma and Michael Sivak. The University of Michigan Transportation Research Institute Ann Arbor, Michigan U.S.A.

Drunk Driving Accident Statistics

How To Know How Many People Die In A Car Crash

2013 Student Competition

PRAIRIE ROSE SCHOOL DIVISION SECTION E: SUPPORT SERVICES (PART 3: TRANSPORTATION)

A Perspective Analysis of Traffic Accident using Data Mining Techniques

Odds That An Involved Driver Was Drinking: Best Indicator Of An Alcohol-Related Crash?

School Bus Accident Report Form

EMPIRICAL ANALYSIS OF THE EFFECTIVENESS OF MANDATED MOTORCYCLE HELMET USE IN TAIWAN

THE HYBRID CART-LOGIT MODEL IN CLASSIFICATION AND DATA MINING. Dan Steinberg and N. Scott Cardell

How To Determine The Severity Of A Road Traffic Accident

The Impact of Lowering the Illegal BAC Limit to.08 in Five States in the U.S.

FATALITIES AND INJURIES IN MOTOR VEHICLE BACKING CRASHES

SEVERITY ANALYSIS OF DRIVER CRASH INVOLVEMENTS ON MULTILANE HIGH SPEED ARTERIAL CORRIDORS

Per vehicle mile traveled, motorcyclists were more than 26 times more likely than passenger car occupants to die in a traffic crash.

Roadway and Human Factors of Motorcycle Crashes in Puerto Rico

Traffic Safety Facts. Alcohol-Impaired Driving Data. Overview. Key Findings

School Bus Accident Report

the Ministry of Transport is attributed as the source of the material

ALCOHOL, 2013 HIGHLIGHTS

Spatial Distribution and Characteristics of Accident Crashes at Work Zones of Interstate Freeways in Ohio

Characteristics of Motorcycle Crashes in the U.S.

REPORT DOCUMENTATION PAGE

OREGON TRAFFIC ACCIDENT AND INSURANCE REPORT

A NATIONAL REVIEW OF RCMP MOTOR

Traffic accidents in Hanoi: data collection and analysis

Data Mining Methods: Applications for Institutional Research

ALCOHOL AND DRUGS IN ROAD CRASHES IN SOUTH AUSTRALIA

2013 State of Colorado Distracted Driver Study

The National Highway Traffic Safety Administration and Ground Ambulance Crashes. April 2014

A latent class modelling approach for identifying vehicle driver injury severity factors at highway-railway crossings

SAFE Streets for CHICAGO

DMV. OREGON TRAFFIC ACCIDENT AND INSURANCE REPORT Tear this sheet off your report, read and carefully follow the directions.

Analysis of Accidents by Older Drivers in Japan

ITSMR Research Note. Cell Phone Use and Other Driver Distractions: A Status Report KEY FINDINGS ABSTRACT INTRODUCTION.

ROAD SAFETY GUIDELINES FOR TAH ROAD INFRASTRUCTURE SAFETY MANAGEMENT

Knowledge and Data in Road Safety Management - Research at the Center for Road Safety

Traffic Safety Facts 2008 Data

Four-wheel drive vehicle crash involvement patterns

Traffic Safety Basic Facts 2012

School-related traffic congestion is a problem in

Traffic Safety Facts Research Note

Hao and Daniel 1 Motor Vehicle Driver Injury Severity Study at Highway-rail Grade Crossings in the United States

Traffic Safety Facts. Laws. Motorcycle Helmet Use Laws. Inside This Issue. Key Facts. April 2004

The Relationship between Speed and Car Driver Injury Severity

Impaired Driver John Bartlett Fatal Crash Not Fully Explained

TABLE OF CONTENTS INTRODUCTION

CLAIMS REPORTING KIT. Administered by

Gerry Hobbs, Department of Statistics, West Virginia University

DOT HS December 2012


DOT HS December 2013

Data mining techniques: decision trees

Project Accidents. Section. Responsibilities of the Inspector-in-Charge

MINIMIZING TRAFFIC-RELATED WORK ZONE CRASHES IN ILLINOIS

DOT HS December Motor Vehicle Crashes: Overview. Fatality Rate per 100M VMT

Year Fatalities Injuries

What roundabout design provides the highest possible safety?

Talking Points. About Roadway Users

Intersection Cost Comparison Spreadsheet User Manual ROUNDABOUT GUIDANCE VIRGINIA DEPARTMENT OF TRANSPORTATION

Car occupants intoxication and non-use of safety belts

Motor Vehicle Collisions in Eastern Ontario. Supplement to the Eastern Ontario Health Unit Injury Report

A Comparison of Decision Tree and Logistic Regression Model Xianzhe Chen, North Dakota State University, Fargo, ND

Transcription:

ANALYSIS OF INJURY SEVERITY IN PEDESTRIAN CRASHES USING CLASSIFICATION REGRESSION TREES By Vichika Iragavarapu, P.E. (Corresponding author) Assistant Research Engineer Texas A&M Transportation Institute, 5 TAMU College Station, TX 7784-5 Phone: 979/845-5686, fax: 979/845-6006 Email: v-iragavarapu@ttimail.tamu.edu Dominique Lord, Ph.D, P.Eng. Associate Professor and Zachry Development Professor I Zachry Department of Civil Engineering, 6 TAMU Texas A&M University College Station, TX 7784-6 Phone: 979/458-949, fax: 979/845-648 Email: d-lord@tamu.edu And Kay Fitzpatrick, Ph.D., P.E. Senior Research Engineer Texas A&M Transportation Institute, 5 TAMU College Station, TX 7784-5 Phone: 979/845-7, fax: 979/845-6006 Email: K-Fitzpatrick@tamu.edu TOTAL WORDS: 6,64 [664 Words, 9 Tables, Figure (500)] Submitted to the Transportation Research Board 94th Annual Meeting January -5, 05, Washington D.C.

Iragavarapu, Lord, Fitzpatrick ABSTRACT Texas is considered to be an opportunity state by the Federal Highway Administration (FHWA), due to the high number of pedestrian crashes. Data from the Fatality Analysis Reporting System (FARS) show that the number of pedestrian fatal crashes in Texas is the third highest in the U.S and is significantly higher than the national average. The research team explored the Texas Department of Transportation (TxDOT) Crash Record Information System (CRIS) database to identify characteristics of crashes involving pedestrians in Texas. A Classification and Regression Tree (CART) analysis of all pedestrian crashes was conducted to find the significant factors influencing the severity of crashes involving pedestrians in Texas. The classification tree identified that light condition, road class, traffic control, right shoulder width, involvement of a commercial vehicle, pedestrian age, and the collision manner, have the most influence on the severity of pedestrian crashes.

Iragavarapu, Lord, Fitzpatrick 4 5 6 7 8 9 0 4 5 6 7 8 9 0 4 5 6 7 8 9 0 4 5 6 7 8 9 40 4 4 4 44 INTRODUCTION Pedestrian crashes are a major safety issue; on average, a pedestrian is killed every two hours and injured every eight minutes in traffic crashes in the United States (). For the years 007 through 0, Texas had the third highest number of pedestrian fatal crashes in the U.S., with about 400 pedestrian fatalities per year, which is equal to 5 percent of all traffic crash fatalities in the US (, ). To understand the characteristics and factors influencing pedestrian crashes, the Texas Department of Transportation (TxDOT) requested an in-depth analysis of these crashes. Understanding of these relations will provide insight for the development of effective countermeasures focused on pedestrians. The objective of this analysis was, therefore, to find the significant factors and their interactions that influence the severity of crashes involving a pedestrian in Texas. STUDY METHODOLOGY A variety of methodologies have been used to understand factors influencing crash severities. A comprehensive review of these methodologies is provided in Savolainen et. al, 0 (4). Data mining techniques are widely applied in the areas of business, medicine, industry and engineering and are gaining attention in the transportation safety area (5, 6, 7, 8, 9). The classification-regression tree (CRT) methodology is a popular data mining technique that does not need a specific functional form and is effective with large data sets containing a large number of explanatory variables (0). The CRT framework is based on the algorithm first proposed in Breiman et al. (). The root node, which is the node with all the data, is divided into two child nodes on the basis of an independent variable (splitter) that creates the best homogeneity. This process is repeated for each child node until all data in each node has the greatest possible homogeneity. This node is called a leaf node. The most famous index for splitting of nominal data is the Gini index. For each tree created, the goodness of fit index is calculated using the misclassification error rate or misclassification cost. Pruning is performed according to the cost-complexity algorithm to avoid over-fitting of the training data and to create an optimal tree. An optimal tree is the one that has the least misclassification cost for the test data. Misclassification cost allows inclusion of information about the relative penalty associated with incorrect classification and is inversely proportional to the accuracy of prediction. Correct classifications always have a misclassification cost of 0 (discussed further below). Importance of each variable is calculated using the variable importance index and is scaled such that its summation is one. () The equation used for the Gini index is (): (Gini Index) Gini(m) = ( ) ( ) ( ) ( ), ( ) ( ) ( ), ( ) ( )

Iragavarapu, Lord, Fitzpatrick 4 5 6 7 8 9 0 4 5 6 7 8 9 0 4 5 6 7 8 9 0 Where, j is the number of target variables or classes, π(j) is the prior probability for class j, p(j m) is the conditional probability of a record being in class j, provided that it is in the node m, N j (m) is the number of records in class j of node m, N j is the number of records of class j in the root node, and Gini (m) or the Gini index is the indication of impurity in node m. The prior probability shows the proportion of observations in each class in the population. () Misclassification Error Rate = ( )[ ( )] Where, p(m) is the proportion of existing observations in the terminal node or leaf m (from all observations) and M is the number of terminal nodes. () (Variable Importance Index) VIM(x j ) = ( ( )) Where, Gini(S(x j,t)) is the reduction in the Gini index at node t that is achieved by splitting variable x j, is the proportion of the observations in the dataset that belong to node t, T is the total number of nodes and N is the total number of observations.() DATA The TxDOT Crash Record Information System (CRIS) database has three subsets: crash, person, and unit. The crash dataset contains information on the characteristics of the crash (e.g., date, time, weather) and its location (e.g., intersection relation, surface condition, traffic control devices). The person dataset has information on the characteristics of the people involved (e.g., age, gender, blood alcohol content, etc.) and injuries sustained (e.g., fatal, no injury, etc.). The unit dataset describes the characteristics of the units (e.g., type of vehicle, contributing factor for unit) involved, along with the contributing factors for each unit involved. Each of these subsets has different codes that distinguish crashes involving pedestrians from other reported crashes. 4,60 pedestrian crashes were available between 007 and 0 for use in this analysis. Table lists the variables used in this analysis.

Iragavarapu, Lord, Fitzpatrick Variable Rpt_Road_Part_ID Wthr_Cond_Rev Light_Cond_ID Road_Algn_ID Surf_Cond_ID Traffic_Cntl Rev FHE_Collsn_Rev Othr_Factr_Rev Road_Cls_ID Road_Relat_ID Month Hour DriverAge DriverGender PedestrianAge PedestrianGender Hwy_Dsgn_Lane_ID Hwy_Dsgn_Hrt_ID Hp_Shldr_Left Hp_Shldr_Right Hp_Median_Width Nbr_Of_Lane Shldr_Type_Left_ID Shldr_Type_Right_ID Median_Type_ID Adt_Curnt_Amt Trk_Aadt_Pct Description Table. Variables used in analysis. Reported roadway part on which the crash occurred Weather condition at the time of crash (Revised to merge some categories) The type and level of light at the time of the crash Roadway Alignment at the crash site The surface condition at the time and place of the crash Type of traffic control at the scene of the crash (Revised to merge some categories) The manner in which the vehicle(s) were moving prior to the first harmful event (Revised to merge some categories) Additional detail of events/circumstances concerning the crash (Revised to merge some categories) The functional classification group of the priority road the motor vehicle(s) was traveling on before the crash Where the crash occurred in relation to the roadway Month of year when the crash occurred Time of day when the crash occurred Age of one of the drivers involved in the crash Gender of one of the drivers involved in the crash Age of one of the pedestrians involved in the crash Gender of one of the pedestrians involved in the crash Lane design on the applicable section of highway for crashes located on the state highway system Part of the Highway Design Code indicating HOV (High Occupancy Vehicle), railroads, and toll roads (HRT) for crashes located on the state highway system Width of inside shoulder on divided sections, or width of shoulder traveling in descending marker direction, measured in feet, for crashes located on the state highway system Width of outside shoulder on divided sections, or width of shoulder traveling in ascending marker direction, measured in feet, for crashes located on the state highway system Median width plus both inside shoulders, measured in feet, for crashes located on the state highway system Number of lanes, not including turning and climbing lanes, for crashes located on the state highway system Type of shoulder on the left side of the road, for crashes located on the state highway system Type of shoulder on the right side of the road, for crashes located on the state highway system Median type description, for crashes located on the state highway system Average daily traffic amount for a given road segment and year for crashes located on the state highway system Adjusted average daily traffic percent for trucks for crashes located on the state highway system

Iragavarapu, Lord, Fitzpatrick 4 4 5 6 7 8 9 0 4 5 6 7 8 9 0 4 5 6 7 8 9 ANALYSIS The analysis was performed with IBM SPSS Statistics Decision Tree tool, using the tree growing method of CRT (Classification-regression Tree). The response variable assessed in this analysis was crash severity, which is defined as the level of injury sustained by the most severely injured person involved in the crash. The crash severity variable used for developing the classification tree for this analysis was categorized as either fatal or non-fatal. The tree depth was restricted to five levels and impurity was measured with the Gini index. Minimum change in impurity improvement was set at 0.000. Seventy percent of the data was randomly assigned to train the model and the remaining thirty percent was allocated to the test. The tree was pruned to avoid over-fitting. In the dataset used for this analysis, the number of non-fatal crashes was almost fifteen times than that of fatal crashes (,88 vs.,) and the overall prediction accuracy for test sample was 9.5%, whereas the prediction accuracy for fatal crashes was only 0.9%. To ensure the same prediction accuracy for both severity levels, the prior probabilities are set equal so that the target variable level that has a lower proportion is also taken into consideration in predictions. Although this decreases the overall accuracy of the model, the prediction accuracy of the data with the least proportion increases (). Table shows that with using equal prior probabilities across all categories, the overall prediction accuracy of the model (for the training and the test data) decreased slightly, but the prediction accuracy for fatal crashes improved tremendously (74.% vs. 0.9%). Table shows the misclassification costs and Table 4 shows the risk estimate for the model, which is the proportion of cases incorrectly classified after adjustment for prior probabilities and misclassification costs. As discussed above, correct classifications represented on the diagonal in Table have a misclassification cost of 0

Iragavarapu, Lord, Fitzpatrick 5 4 5 6 7 8 9 0 4 5 Sample Training Test Table. Prediction Performance for the CRT Model. Predicted Crash Severity Observed Crash Severity Percent Non-Fatal Fatal Correct Non-Fatal 786 49 78.4% Fatal 59 4 77.% Overall Percentage 74.8% 5.% 78.% Non-Fatal 7575 075 78.5% Fatal 67 48 74.% Overall Percentage 75.% 4.8% 78.% Table. Misclassification Costs for the CRT Model. Predicted Observed N Y N 0.000.000 Y.000 0.000 Table 4. Risk Estimate of the Model. Sample Estimate Standard Error Training 0. 0.005 Test 0.6 0.009 The classification tree generated (Figure ) shows that the initial split at node 0 is based on the variable of light condition, which implies that light condition is the best variable to classify and predict pedestrian crash severity (fatal versus non-fatal). More (64%) pedestrian crashes are predicted to occur in daylight, whereas a higher proportion of fatal crashes are predicted to occur in dark conditions (% vs. %, Node vs. Node ). Traffic control, road class, and pedestrian age are selected to be the splitters more than once, implying that these variables have multiple effects on the crash severity outcome.

Iragavarapu, Lord, Fitzpatrick 6 Figure. Classification Tree.

Iragavarapu, Lord, Fitzpatrick 7 4 5 6 7 8 9 0 4 5 6 7 8 9 0 4 5 6 7 8 9 0 4 5 6 7 8 9 40 4 4 4 44 45 DISCUSSION The classification tree developed in this analysis (shown in Figure ) indicates that the following variables are critical in classifying the injury severity of pedestrian crashes: Light condition Road class Traffic control Right shoulder width Involvement of a commercial vehicle Pedestrian age Manner in which the vehicle(s) were moving prior to the first harmful event Daylight conditions are associated with more pedestrian; however, the severity of the crash is higher in dark conditions. When a pedestrian is struck at night, he or she is four times more likely to be killed when compared to daylight conditions (% vs.%, Node vs. Node ). This result is in agreement with a study on pedestrian crashes in North Carolina that found that dark conditions (with and without streetlights) significantly increase the probability of fatal injury for pedestrians (). This could be a reflection of higher speeds at night, along with greater difficulty in detecting pedestrians in dark conditions; hence, not being able to reduce the speed in time. Under all light conditions, the probability of pedestrian crashes is higher on city streets, county roads, and other lower speed roads that have segment-related traffic control device (e.g. warning sign); whereas the severity of the crash is more on higher speed roads, i.e. Interstates, US & State Highways, FM roads, and Tollways (5% vs. 6%, Node 4 vs. Node, and % vs. %, Node 4 vs. Node ). This result was further investigated and is discussed in greater detail in Iragavarapu et al. (4). Kim et al. (9) also found that freeway, U.S. route, and state route increased the probability of fatal injury in pedestrian crashes, compared with local city streets (). Younger ( 60 years) pedestrians are predicted to be involved in more crashes, whereas older (> 60 years) pedestrians are more likely to be killed when stuck by a vehicle (5% vs. 4%, Node 6 vs. Node 5, and % vs. %, Node 0 vs. Node 9). This finding is expected because in general older pedestrians have lesser physical strength to cope with the injuries and are less agile to escape to a safer location just before the crash. Kim et al. also found that older pedestrians are more likely to sustain greater injury than younger pedestrians (,5). Holubowycz found the greatest fatality rates in pedestrians 75 years or older (6).This results supports the concept of a pedestrian airbag technology which would improve the chances of surviving for pedestrians hit by vehicles, more so for elderly pedestrians. Crandall et. al discuss this approach for pedestrian safety in their 00 paper (7). Commercial vehicle involvement in a pedestrian crash is associated with a greater probability of pedestrian fatality (0% vs. %, Node vs. Node ). This is obviously attributed to their larger weight, longer stopping distances, higher bumper height, and blunt geometry, which has also been documented in previous studies (, 5, 8, 9).

Iragavarapu, Lord, Fitzpatrick 8 4 5 6 7 8 9 0 4 5 6 7 8 9 0 4 Locations with no traffic control device or intersection-related traffic control devices (e.g., signal) are found to be associated with more number of pedestrian crashes; however, locations with segment traffic control devices (e.g. warning sign or flagger) or both traffic control devices (i.e., officer, flagman) are associated with higher proportion of fatal pedestrian crashes (5% vs. 4%, Node 8 vs. Node 7, and 6% vs. %, Node 6 vs. Node 5). This result is intuitive because more pedestrians are expected to be present at intersections and the vehicle speeds are relatively lower, when compared to mid-segment. The results also show that on high speed roads, more crashes are expected at locations with right shoulder width less than 8.5 feet (Node 9), when one of the vehicles involved is either going straight or backing (Node 7). However, a higher proportion of fatal crashes on high speed roads are at locations with right shoulder width more than 8.5 feet. SUMMARY AND CONCLUSIONS This study has documented a regression tree analysis for examining factors that influence crash severity in crashes involving a pedestrian. A total of 4,60 pedestrian crashes that occurred between 007 and 0 were analyzed. The regression tree provided satisfactory results. The results (summarized in Table 5) were intuitive and consistent with the results of previous studies on pedestrian crashes that used other analytical techniques, such as probabilistic models of crash injury severity. The information provided in this study should help TxDOT and other transportation agencies to better target their efforts for reducing the number and severity of pedestrian collisions.

Iragavarapu, Lord, Fitzpatrick 9 Significant Factor Light condition Road class Pedestrian age Traffic control Right shoulder width Involvement of a commercial vehicle Collision manner Table 5. Summary of Classification Tree Result. Classification Tree Relationship When a pedestrian is struck at night, they are four times more likely to be killed when compared to daylight conditions. When a pedestrian is struck on higher speed roads, they are four times more likely to be killed when compared to lower speed roads. Older (60+) pedestrians are six times more likely to be killed when compared younger pedestrians when struck in daylight; at nighttime there is not much difference. When a pedestrian is struck at a location with segment traffic control devices (e.g., warning sign or flagger) or generic traffic control devices (e.g., officer), they are four times more likely in dark conditions and three times more likely in daylight to be killed when compared to location with no traffic control or intersection traffic control (e.g., stop sign). When a pedestrian is struck on high speed roads in dark conditions, they are twice as likely to be killed if the right shoulder width is more than 8.5 feet. When a pedestrian in daylight, they are five times as likely to be killed if the crash involves a commercial vehicle. On high speed roads with right shoulder width less than 8.5 feet, pedestrian crashes in dark conditions are six times more likely when one of the vehicles involved is either going straight or backing when compared to vehicles turning left or right.

Iragavarapu, Lord, Fitzpatrick 0 ACKNOWLEDGMENTS This paper is based on research sponsored by the Texas Department of Transportation (TxDOT) and the U.S. Department of Transportation, Federal Highway Administration (FHWA). The project was under the direction of Cary Choate of TxDOT. The research was performed at the Texas A&M Transportation Institute. The contents of this report reflect the views of the authors, who are responsible for the facts and the accuracy of the data presented herein. The contents do not necessarily reflect the official views or polices of TxDOT. REFERENCES NHTSA Pedestrian Crash Fact Sheet. http://www-nrd.nhtsa.dot.gov/pubs/8748.pdf Accessed June 04. NHTSA Fatality Analysis Reporting System. http://www-fars.nhtsa.dot.gov. Accessed April 0. Fitzpatrick, K., V. Iragavarapu, M.A. Brewer, D. Lord, J. Hudson, R. Avelar, and J. Robertson. Characteristics of Texas Pedestrian Crashes and Evaluation of Driver Yielding at Pedestrian Treatments. TxDOT Report FHWA/TX-/0-670-. May 04. 4 Savolainen, P.T., Mannering, F.L., Lord, D., Quddus, M.A., 0. The statistical analysis of highway crash-injury severities: a review and assessment of methodological alternatives. Accident Analysis and Prevention 4 (5), 666 676. 5 Kuhnert, P.M., K.-A. Do, and R. McClure. Combining non-parametric models with logistic regression: an application to motor vehicle injury data. Computational Statistics and Data Analysis, Vol. 4, No., 000, pp. 7-86. 6 Karlaftis, M.G., and I. Golias. Effects of road geometry and traffic volumes on rural roadway accident rates. Accident Anaysis & Prevention, Vol 4, No., 00, pp.57-65. 7 Montella, A., M. Aria, A. D Ambrosio, and F. Mauriello. Data-Mining Techniques for Exploratory Analysis of Pedestrian Crashes. Transportation Research Record: Journal of the Transportation Research Board, No. 7, Transportation Research Board of the National Academies, Washington, D.C., 0, pp.07-6. 8 Chang, L., and W. Chen. Data Mining of Tree-Based Models to Analyze Freeway Accident Frequency. Journal of Safety Research, Vol. 6, No. 4, 005, pp. 65-75. 9 Chang, L., and H. Wang. Analysis of Traffic Injury Severity: An Application of Non- Parametric Classification Tree Techniques. Accident Analysis & Prevention, Vol 8, No. 5, 006, pp.09-07. 0 Chang, L-Y, and J-T Chien. Analysis of Driver Injury Severity in Truck-Involved Accidents Using a Non-Parametric Classification Tree Model. Safety Science, Issue 5, 0, pp. 7-. Breiman, L., J.H. Friedman, R.A. Olshen, and C.J. Stone. Classification and Regression Trees. Wadsworth International Group, Belmont, California, 984. Kashani, A.T., and A.S. Mohaymany (0). Analysis of Traffic Injury Severity on Two- Lane, Two-Way Rural Road Based on Classification Tree Models. Safety Science, Issue 49, pp. 4 0.

Iragavarapu, Lord, Fitzpatrick Kim, J., G. F. Ulfarsson, V. N. Shankar, and S. Kim. Age and Pedestrian Injury Severity in Motor-Vehicle Crashes: A Heteroskedastic Logit Analysis. Accident Analysis and Prevention, Vol. 40, No. 5, 008, pp. 695 70. 4 Iragavarapu, V., S. H. Khazraee, D. Lord, and K. Fitzpatrick. Pedestrian Fatal Crashes on Freeways in Texas. Manuscript submitted for 94 th Transportation Research Board Meeting in Washington, D.C. 5 Kim, J., G. F. Ulfarsson, V. N. Shankar, and F. Mannering. A Note on Modeling Pedestrian Injury Severity in Motor-Vehicle Crashes with the Mixed Logit Model. Accident Analysis and Prevention, Vol. 4, No. 6, 00, pp. 75 758. 6 Holubowycz, O. T. Age, Sex, and Blood Alcohol Concentration of Killed and Injured Pedestrians. Accident Analysis and Prevention, Vol. 7, No., 995, pp. 47 4. 7 Crandall, J. R., K. S. Bhalla, and N. J. Madeley. Designing Road Vehicles for Pedestrian Protection. British Medical Journal Vol. 4, No.746, 00, pp. 45-48. 8 Fitzpatrick, K., and E. S. Park. Safety Effectiveness of HAWK Pedestrian Treatment. In Transportation Research Record: Journal of the Transportation Research Board, No. 40, Transportation Research Board of the National Academies, Washington, D.C., 009, pp. 4. 9 Lefler, D. E., and H. C. Gabler. The Fatality and Injury Risk of Light Truck Impacts with Pedestrians in the United States. Accident Analysis and Prevention, Vol. 6, No., 004, pp. 95 04.