Multivariate Statistical Model for Predicting Occurrence and Location of Broken Rails
|
|
|
- Annice Murphy
- 10 years ago
- Views:
Transcription
1 Transportation Research Record 1825 Paper No I Multivariate Statistical Model for Predicting Occurrence and Location of Broken Rails C. Tyler Dick, Christopher P. L. Barkan, Edward R. Chapman, and Mark P. Stehly Broken rails are the leading cause of major accidents on U.S. railroads and frequently cause delays. A multivariate statistical model was developed to improve the prediction of broken-rail incidences (i.e., service failures).improving the prediction of conditions that cause broken rails can assist railroads in allocating inspection, detection, and preventive resources more efficiently, to enhance safety, reduce the risk of hazardous materials transportation, improve service quality, and maximize rail assets. The service failure prediction model (SFPM) uses a combi nation of engineering and traffic data commonly recorded by major railroads. A Burlington Northern Santa Fe Railway database was devel oped in which the locations of approximately 1,800 service failures over 2 years were recorded. The data on each location were supplemented with information on other engineering and traffic volume parameters. A complementary database with the same parameters was developed for a randomly selected set oflocations at which service failures had not occurred. The combined databases were analyzed using multivariate statistical methods to identify the variables and their combinations most strongly correlated with service failures. SFPM accuracy in predicting service failures at specific locations exceeded 85%. Although further validation is necessary, SFPM is promising in the quantitative prediction of broken rails, thereby improving a railroad's ability to manage its assets and risks. Derailments from broken rails have been a safety concern for more than a century (1, 2). Improvements in rail manufacturing, inspection, and rail defect detection have greatly reduced the incidence of broken rails. However, broken rails frequently cause service interruptions and are a leading cause of derailments. Improving the prediction of locations where broken rails are likely to occur has economic and safety benefits, enabling more effective allocation of resources to detect and prevent broken rails (3-5). Previous work has focused on the development of fracture-mechanic approaches in combination with empirical testing (6-9) and single-variable probabilistic methods using Wei bull analysis (1 0). The first studies have assisted in understanding the underlying mechanisms of rail defect occurrence; the latter studies have assisted in predicting the useful life of rail, given basic information on traffic volume and loads. A statistical approach based on more variables can potentially improve C. T. Dick and C. P. L. Barkan, Railroad Engineering Program, University of Illinois at Urbana-Champaign, 1201 Newmark Civil Engineering Laboratory, 205 North Mathews Avenue, Urbana, IL E. R. Chapman and M.P. Stahly, Hazardous Materials, Burlington Northern Santa Fe Railway, 2600 Lou Menk Drive, Fort Worth, TX Current affiliation for C. T. Dick: HDR Engineering, Inc., 512 Main Street, Suite BOO, Fort Worth, TX the prediction of rail life and may also provide more insight into associated mechanisms (3). Over the past decade, railroads have expanded their use of information technology to include extensive geographic and engineering data systems. Large, multivariate databases that extensively detail key parameters likely to affect the occurrence of broken rails have been developed, thereby making such an approach feasible. DEFINITION OF SEVERE DERAILMENTS This study focused on identifying derailment causes most likely to lead to a severe accident in which many cars derail at speed. Generally such accidents will have the greatest potential for harm to people, property, equipment, and track. Further, analyses of FRA accident data have shown that accidents with these characteristics strongly correlate with the release of hazardous materials, if present, in the vicinity of the train derailment (11, 12). Consequently, for both safety and economic reasons, information on such types of derailments was of particular interest. Derailment Severity-Frequency Analysis To determine the causes of accidents most likely to lead to severe derailments, a simple risk analysis was conducted using FRA data for 3,504 mainline derailments that occurred during the period (13 ). The FRA reporting system requires the identification of a primary cause (and other contributing causes if applicable). FRA groups data on accident causes hierarchically. Data at the FRA "subcause" level (the second-highest level of aggregation) were used - in this study. The average number of cars derailed in accidents attributed to each subcause was calculated and plotted against the frequency of derailments with the same subcause (Figure 1). Figure 1 is divided into four quadrants by vertical and horizontal lines that represent the average value of the two variables with respect to the x andy axes, respectively. The vertical line represents the average frequency of accidents for all recorded causes combined, and the horizontal line is the average number of cars derailed due to each cause. Causes above or below these lines are, by definition, above or below average for the respective axis. The causes in the upper right quadrant are most interesting and pose the greatest risk-they are more frequent and more severe than average. It is clear that the most frequent cause of high-consequence accidents related to the FRA cause code for rail and joint bars. More detailed analyses revealed that most of these accidents were
2 Dick et al. Paper No Ml nvironmental T2 Rail anjl Joint Bar + Defects 10 I 8! 1l 1 Ci u 'c: z i E7 Loco. De1 I + M5 Vandalism E6 Wheel Defect +E9 Mechanical f ure + + H5 Train Handling +M Loading Tl Track +E3Cou ler Defect. + E5 Axle/Bearing Defect +Geometry + M4 Mtscellaneous Defects +EOBw Defect+ E4 Truck Defect SOSignal proadlied HOUse of rakes E8Car + H6S1 d Doo + E2Bod Defect ++ + H3Switcl ing Rules T3 Frog/Switchlfraclc + Appliance Defect +H4Auth + H7 Use of Switches H2Signal ses H8 Cab Signals El Trailer/Conta ner Defect + H9Human actors HI Employee Ph ical Condition Number of Derailments FIGURE 1 Frequency-severity graph of mainline derailments, the result of broken rails. On the basis of these results, a more detailed analysis was made offactors contributing to the occurrence of broken rails (11 ). Several recent hazardous materials accidents have underscored the importance of this particular aspect of risk from broken rails. Broken Rails Mos broken rails do not result in derailments. Instead the break is detected; usually by the track circuit system or by track inspectors, and repaired (on several North American railroads, these detected broken rails are refen:eci to as service failures). Broken rail derailments appear to correlate with the occurrence of service failures (11 ). Therefore, predicting the occurrence of service failures bas a potential safety benefi.t-eling railroads to allocate broken rail prevention measures, detection technology, and inspection efforts more effectively (3-5, 7). Further, understanding the factors correlated with service failure nee could. help identify contributing causal factors, thereby enabling better preventive measures. The objective of this research was develop a probabilistic model to predict the circumstances most likely to lead to the occurrence of a service failure. Model Form end Data Set Ideally, the model developed would enable the user to input values for the relevant parameters at a specific locationolithe railroad and determine a measure of the probability of a service failure there: The output of the model is an index value between 0 and '1, With 0 mdicating the lowest probability of service failure ari.df representing the highest. Because a probability is the desired output and there are orily two possible outcomes-service failure or no service 'failure at each location-the model can be constructed as a discrete choice model. A discrete choice model, such as the logit model, fits an appropriate equation to the data and uses this equation to score each Ioeation relative to a threshold value, above whichfailure is predicted to occur (14). The logit model then uses a logistic distribution to consider the uncertainty and error in the estimated score and.threshold value.and determine the probability that the score is above the threshold value. The calculated probability is then used as an esti.irurte ofthe service failure probability at that particular location. To fit a discrete choice logit model, two sets of data were requiredone to characterize locations where service failures occurred and another to characterize locations where service failures had not occurred. Data development began with Burlington Northern Santa Fe
3 detailed infonnation on the date, location, and type of 1,903 service failures that occurred over 2 years. The data were supplemented with engineering and operational data on each service failure location. A new dependent variable was created and assigned a value of 1 for each of these records signifying that a service failure had occurred. The second set of data was created with records for locations where no service failure occurred during the same interval. This data set, of about the same size as the first, was developed by selecting a random sample of locations from the railroad and assembling the same information as for the service failure locations. The dependent variable for these records was assigned a value of 0. Ultimately, a test database was developed that contained 3,676 records with complete service failure and descriptive parameter infonnation. On the basis of a univariate analysis of the service failure data and a literature review on the circumstances of rail defect growth and broken rail occurrence (8, 15, 16), track structure and dynamic effects (17-19), and rail fracture mechanics (6, 20), the following parameters were selected for inclusion in the multivariate service failure model: Rail age, Rail weight, Degree of curve, Speed, Average tons per car, Average dynamic tons per car, Percent grade, Annual gross tonnage, Annual wheel passes, Insulated joints, and Mainline turnouts. All of the parameters are continuous variables except the last two, insulated joints and mainline turnouts, which are both discrete. The parameters were assigned a value of 1 if present at a location, and 0 if not. Model Development The service failure probability model was developed using the statistical analysis system (SAS) and the LOGISTIC procedure. The LOGISTIC procedure fits a discrete choice logit model to the test database. Stepwise regression was used to determine the most relevant parameters and combinations of parameters (two-factor interaction terms) for inclusion in the model (21). The stepwise regression procedure uses an iterative process to select variables on the basis of their ability to explain the variance in the input data. The model conducts a goodness-of-fit test for each step and adds or subtracts variables or combinations of variables until the addition of another parameter does not significantly improve the fit. At this point, the last version of the model is considered the best and the resultant parameters, coefficients, and functional relationships comprise the final model. Retrospective and Prospective Models The service failure model was developed in two steps. Frrst, the model was fit to the test database. Approximately half of the 3,676locations in this database had a service failure during a 2-year period, and the other half were a random sample of locations that did not. Because the model makes predictions about broken rails that have already occurred, it was termed a "retrospective model." This version of the model is used primarily to assess the accuracy of the model's predictions relative to the test database. The second step of the process developed a "prospective model." This model is modified from the retrospective model by adjusting a constant term to reflect the actual 1 average service failure probability over a specific portion of the railroad system. After this adjustment, the prospective model can be used to calculate the annual probability of a service failure at particular locations, or along any portion of track. Retrospective Service Failure Model The retrospective service failure probability model was developed using the LOGISTIC procedure as follows: where PsF2 =probability that a service failure occurred at a particular point during the study period; U=Z+Y; Z = , model-specific constant; Y = 0.059A AC A2C T/S W/S W 2 /S (2N- 1) P/S; A =rail age (years); C = degree of curvature(= 0 for tangent); T =annual traffic [million gross tons (MGT)]; S =rail weight (pounds); W = 4T/L =annual number of wheel passes (millions); P = L(1 + V/100) =estimated average dynamic wheel load; N = 1 if at turnout, 0 if not at turnout; L = tons per car; and V = track speed. The fitted model includes a model-specific constant or intercept term, Z, that is related to the average service failure probability. The retrospective model is fit to a data set in which approximately half of the records are for locations with service failures. The average service failure probability on an actual system would be far lower, so this term would be adjusted accordingly. Interpretation of Model Terms The service failure probability model has terms that describe different effects and relationships among service failure probability, infrastructure characteristics, and traffic characteristics. The first term in the model, 0.059A, reflects the effect of rail age. As rail age increases, service failure probability increases. This result is consistent with extensive industry experience (9, 10). Older rail is likely to have carried more tonnage, experienced more thermal stress cycles, and may have been manufactured using processes that produced more flaws in the rail. A recent study of rail failures on Railtrack in Great Britain supports the importance of this parameter (22). The second and third terms in the model, 0.025AC A 2 C 2, reflect the interaction between rail age and degree of curve. As either rail age or degree of curve increases, service failure probability is predicted to increase. Because the interaction between rail age and curvature is multiplicative, the model indicates that in terms of service failure probability, higher degree (sharper) curves are more sensitive to the effects of rail age and vice versa.
4 Dick et al. Paper No The fourth term in the model, TIS, reflects the effect of annual traffic (MGT) normalized by rail weight. As annual gross tonnage increases, service failure probability increases. However, the increase in service failure probability associated with a unit increase in annual traffic is greater on segments of track with relatively light rail. The fifth and sixth terms in the model, 217.9W/S W 2 /S 2, describe the effect of annual wheel passes or load cycles normalized by rail weight. Service failure probability increases as the number of wheel passes or load cycles increases. However, just as with gross tonnage, the increase in service failure probability associated with a unit increase in the annual number of wheel passes is greater on segments of track with relatively light rail. This situation probably reflects the greater stress of lighter rail under a given load than heavier rail. Thus, the amount of crack growth per fatigue cycle is greater in lighter rail than heavier rail. The model includes terms that describe annual traffic relative to gross tonnage and number of wheel passes. The relationship between annual traffic and service failure probability is a function of both the total amount of load applied to a section of rail and the number of times the load is applied. This relationship is consistent with fracture mechanics models of fatigue crack growth in rails that depend on both the applied stress and the number of load cycles (1 0, 20). The seventh term in the model, 0.897(2N- 1), describes the effect of mainline turnouts. The model indicates that proximity to a turnout increases the probability of a service failure. Several possible explanations relate to inferences about rail stress. Turnouts may tend to anchor the track structure, thereby causing greater thermal stress cycling as the nearby rail expands and contracts. Also, to the extent that turnouts tend to be associated with locations where trains are braking or accelerating, rails in these locations may tend to experience more traction-induced stresses. The final term in the model, P/S, describes the effect of estimated average dynamic load on service failure probability. This variable for dynamic load was not directly measured by wheel impact load detectors. Such data would have been preferable but were unavailable for most of the locations where broken rails were recorded. Instead, the value was calculated using the average gross rail load data and track speed at each location using the formula in the American Railway Engineering and Maintenance-of-Way Association manual (19, p ). The values ranged from 30,000 to 55,000 lb and thus do not represent the full spectrum of dynamic loads, particularly the most damaging ones (23 ). The final term is negative, indicating that as average dynamic load increases, service failure probability decreases. This is an unexpected result and the opposite of what was suggested by a single variable analysis conducted before developing the multivariate model (JJ). However, the relative effect of this term is weak. For example, at an annual tonnage level of 50 MGT, on 136-lb rail, in tangent track, varying the annual jwheel passes between the highest and lowest possible values changes PsF2 by approximately Under the same conditions, varying the dynamic load term between its extreme values changes PSF2 by only In the stepwise regression, this final term added to the model has the least predictive ability of the other terms (as indicated by the low chi-square value) (Table 1 ). The artificial nature of the computed value for this term, combined with the way the model handled it, suggests that it does not represent a real physical relationship. Table 1 also indicates that during the stepwise regression process, an interaction term between rail age and annual gross tonnage was initially included in the model. By multiplying rail age by annual gross tonnage, this term provided an estimate of cumulative tonnage; however, it was not a direct measure of this important variable. Although this estimated cumulative tonnage term was initially significant, as more detailed terms describing the effects of rail age, rail size, annual tonnage, turnouts, curvature, etc., were added to the model, the cumulative tonnage term became less significant and was finally removed. Thus, the variance in service failure probability that was initially explained by the estimated cumulative tonnage term in a model with two terms could be better explained by a model with more terms and a combination of effects involving other variables. This result should not be interpreted as meaning that cumulative tonnage is not an important factor in predicting the occurrence of broken rails. There are several reasons for this. First, if a direct measure of accumulated tonnage was available for the analysis, a term based on it might not have been removed. Such a variable would have been preferable, but it was not consistently available systemwide. Second, the two elements of the cumulative tonnage term-age and annual tonnage-appear in several other terms, indicating that these factors are important. Third, the calculated cumulative tonnage term was a strong predictor in the absence of other variables. This term was removed only when most of the other terms were added. This finding is consistent with industry experience that cumulative tonnage is a good predictor of broken-rail frequency. Part of the point of the multivariate statistical approach is that it reveals other variables that have subtler or perhaps interactive effects. It is also interesting which parameters did not appear in the final model. The effects of grade, speed, average wheel load, and insulated joints were tested and not found to significantly improve the predictive ability of the model and were not included in the final model. Conversely, other variables would have been useful relative to physical factors that cause broken rails, but the requisite data were unavailable. In addition to the dynamic load and cumulative tonnage variables, other TABLE 1 Model Term Selection Order Step Term Added 1 Wheel Passes I Rail Weight 2 Annual Gross Tounage x Rail Age 3 (Wheel Passes I Rail Weight) 2 4 Aunual Gross Tonnage I Rail Weight 5 RailAge 6 Turnout 7 Degree of Curve x Rail Age 8 (Degree of Curve x Rail Agel 9 Dynamic Load I Rail Weight 10 Term Removed Annual Gross Tonnage x Rail Age Chi-Squared
5 52 Paper No Transportation Research Record 1825 variables that would have been useful were rail steel type, rail surface roughness, neutral temperature, and temperature at the time of the break. One benefit of model development is that it can highlight the importance of certain types of data and the potential benefit from expanded database development. It would be advantageous to include these variables in the future development of comparable databases. Retrospective Service Failure Model Performance Two methods were used to evaluate the ability of the retrospective model to predict locations where service failures occurred. The first method calculated a goodness-of-fit statistic for the model on the basis of PSF2 computed for each record in the input data. If the model completely accounted for all of the sources of variance, P sf2 = 1 would be expected at all of the service failure locations and PSF2 = 0 at all of the locations where service failures did not occur. In this case, the summation of PsF2 over all service failure locations should equal the total number of service failures, and the summation of 1- PsF2 over all locations where service failures did not occur should equal the total number oflocations where they did not occur. All sources of variance are unlikely to be accounted for by any statistical model. Therefore, when the summations are computed for actual values of PSF2, they will correctly account for only a percentage of the total. This percentage reflects the goodness of fit or the amount of variance explained by the retrospective model (14 ). Using this approach, the goodness-of-fit statistic is calculated using the following expression, where n.r is the actual number of locations where service failures occurred, and n.osr is the number of locations where they did not: (r PsF2 + r 1 - PsF2) Goodness of fit = "" - (n.r + nnosr) (1, , 462) (1, , 815) = On the basis of this analysis, the retrospective model accounted for 80.8% percent of the variance in the service failure data. The second method to evaluate the performance of the model was to compare the value of PsF2 to the event that actually occurred at a location. The decision criterion, or threshold value, for service failure prediction was PsF2 = 0.5.1f PsF2 < 0.5, it was classified as predicting no failure and if > 0.5, it was classified as predicting a service failure. Of these predictions, 87.4% were correct (Table 2). Of the incorrect predictions, there were twice as many false positives than missed service failures. This fmding indicates that the model is somewhat conservative because it is more likely to provide a false positive than miss a service failure. The decision criterion of 0.5 could be adjusted by users of the model to make the results more or less conservative (11). Further work in which additional variables are incorporated might reduce the error rate. These two evaluations indicate. that the model had a reasonably high level of accuracy in predicting the occurrence of service failures in the database from which it was developed. The next steps in assessing the model's accuracy would be to test it using data from another time period or another railroad, or both. Prospective Service Failure Model To use the model to predict the annual probability of a service failure at a particular location, the retrospective model must be transformed into a prospective model. This transformation is accomplished by adjusting the value of the model specific constant, z; to reflect the average service failure probability across the entire system of interest. There were 1,861 service failures in the test database over the 2-year period for which complete records were available. The probability that one of these service failures falls into any given segment of track is a function of the length of the segment To capture as much detail as possible, and to avoid the use of average values over a segment that may introduce additional variance, the segments should be kept relatively short. The maximum resolution in the data available for most of the parameters of interest was 0.01 mi (52.8 ft). The total system represented by the database was approximately 23,750 mi of mainline. Thus, there were 2, 375,000 segments, each 0.01 mi in length. Given this value, the average probability that a service failure is found in any one of those segments over a 2-year period is approximately This probability can be converted into a new model-specific constant, zt", through the use of the log-odds operator (21 ): z* = z + 1n [ PsFavg ] (1 - PsFavg) = n [ J ( ) = This new model-specific constant, z*, adjusts the scale of the probability calculated by the prospective service failure model so that the model predicts service failures at a rate comparable to the observed rate. The retrospective model calculated the probability of a service failure for a 2-year period. This probability can be converted to an annual probability simply by dividing by 2 when transforming the U score into a probability. After these two adjustments are made, the annual service failure probability for any 0.01-mi segment can be calculated with the prospective service failure model. The prospective service failure probability model has the following form: TABLE 2 Results of Goodness-of-Fit Test for a Threshold Value of PsF 2 = 0.5 Model Prediction Actual Event Events Percent of Total Outcome Service Failure (PsF2 > 0.5) Service Failure 1, Correct No Failure (PsF2 < 0.5) No Failure 1513 Prediction Service Failure <PsF2 > 0.5) No Failure False Positive No Failure (PsF2 < 0.5) Service Failure Missed Failure
6 Dick et al. Paper No where PsF = annual probability of a service failure in the 0.01-mi segment of interest, U= Z*+Y, and z* = , prospective-model-specific constant. Service Failure Probability and Expected Service Failures per Mile A cursory review of the annual service failure probabilities calculated by the prospective model might suggest that they are too low. However, the probability is based on a segment of track that is only 0.01 mi long. The calculated probability is approximately equal to the expected number of service failures per year in that 0.01-mi segment. Annual service failures per mile is a metric more typically used by North American railroads, so it is useful to calculate a per-mile rate by multiplying PsF by 100. where SFIMIIYR is the expected service failure rate on segment of interest (service failures per mile per year). This rate can be applied to a segment of track of any length as long as the values of the parameters in the service failure model remain constant along that section of track. A service failure rate of 2 SFIMIIYR indicates that for every mile of track for which the rate applies, two service failures are expected to occur. If the track section to which this rate applied is 0.5 mi long, one service failure is expected along that length; if the section is 2 mi, four service failures are expected along that length. In all three cases, the service failure rate, 2 SFIMIIYR, is the same. The number of service failures expected in a section of track in which the service failure rate is constant is a linear function of the length of the section. Example of Service Failure Model Application The following example illustrates how SfPM can be used to obtain a measure of service failure probability and rate. A hypotheticall.5-mi, single-track portion of a railroad mainline is illustrated in Figure 2, and the relevant parameters are presented in Table 3. The segment has been divided into several subsegments over which the input parameters are constant. Some of the rail is 47 years old and weighs 132lb/yd. The remaining rail is 5 years old and weighs 13Mb/yd. Mainline turnouts are located at Mile 0 and also at Mile 0.7, where another mainline connects to the line. A 1 o curve is located between Mile 0.25 and Mile Track speed on the segment is 50 mph. The annual traffic is 80 MGT between Mile 0.0 and Mile At Mile 0. 7, 40 MGT are routed on the connecting mainline, with the remaining 40 MGT being routed on the segment under consideration between Mile 0.7 and Mile 1.5. The average gross rail load is 100 tons eastward and 80 westward, with the higher value of 100 tons used in the calculations. The dynamic load computes to 150 tons per car, and the annual traffic of 80 MGT and a 1 00-ton average per car results in an estimated 3.2 million wheel passes. Because this is the prospective model, z* = was used to calculate the U score for each portion of the segment of interest and then transformed into an estimate of service failure rate. The estimated service failure rate (service failures per mile per year) for each subsegment is summarized in Table 4 and presented graphically in Figure 3. Multiplying each subsegment's calculated service failure rate by its length provides an estimate of the expected number of service failures per mile per year in that subsegment. Summing all of the subsegment values provides an estimate of the expected number of service failures per year on the 1.5 mi of the segment of interest. In this case, the expected number of service failures for the segment is The service failure profile in Figure 3 highlights how interactions between the various parameters affect service failure rate. Between Mile 0.0 and Mile 0.1, the rail is relatively old and a turnout is present. The combination of these two factors results in a relatively high service failure rate prediction. At Mile 0.1, the service failure rate drops as the rail is no longer close enough to the turnout to be subject to its effects. Between Mile 0.1 and Mile 0.25, the track is tangent but the old rail produces a higher service failure rate than on the segment between Mile 0.45 and Mile 0.6, where the track is tangent but the rail is relatively new. This difference in service failure rate illustrates the importance of rail age. Under the traffic conditions in this example, the age difference of 42 years results in a service failure rate that is 16 times higher on the older section of rail. At Mile 0.25, the track transitions from tangent to a 1 o curve and the service failure rate increases approximately threefold. Compared with Mile 0.45, where the new rail transitions from curve to tangent and the service failure rate increases by a factor of only 1.5, the increase in service failure rate at Mile 0.25 is large. This increase results from the interaction of rail age and curvature that makes the old rail on this subsection of track sensitive to curvature. At Mile 0.3, the rail on the 1 o curve changes from rail that is 47 years old to rail that is 5 years old. The model suggests that newer rail is less sensitive to curvature, so the service failure rate drops from 0.86 to 0.03 service failures per mile per year. Because there is one half the traffic between Mile 0. 7 and Mile 1.5 than there is between Mile 0.0 and Mile 0. 7, the service failure rate is also correspondingly lower miles 0.2 miles 0.25 miles 0.8 miles 80 MGT tangent 1 degree curve tangent tangent 40MGT...._ miles 0.7 miles ' miles years old, 5 years old, 136lbs!yd "- 40 MGT 47 years old, 132lbs/yd 132lbs/yd MPO.O MP0.7 MP1.5 FIGURE 2 A hypothetical section of mainline track.
7 54 Paper No Transportation Research Record 1825 TABLE 3 Input Parameters for e Hypothetical Section of Mainline Track Start End A c T MP MP z (age) (degree) (MGT) NoTE: MP = milepost. s w p I N (pounds) (million) (tons) (turnout) u CONCLUSIONS A simple risk analysis showed that broken rails are the leading cause of severe accidents as measured by the number of cars derailing. Improved detection and prevention of broken rails potentially has important safety and economic benefits. Further, service quality and reliability benefits can accrue if the incidence of broken rails can be reduced. Improving the ability to predict the conditions that can lead to broken rails can help railroads allocate inspection, detection, and preventive resources more efficiently, thereby enhancing safety and reducing service interruptions due to broken rails. A statistical model was developed that provides probabilistic estimates of the likelihood of service failure occurrence on the basis of engineering and operational input parameters. Although further validation needs to be conducted, the service failure prediction model shows promise in improving the ability to predict the occurrence of broken rails. If the requisite data for a railway system can be systematically developed in a consistent, easily accessed, electronic format, the model can be applied to any portion of a system to generate location-specific estimates of service failure probability. If the data include appropriate geographical information, the service failure model could be incorporated into a geographic information system that would generate service failure and broken rail derailment profiles automatically from railway databases. Previous models have been based on a combination of fatigue and fracture principles and a limited number of parameters available for in-service rail. The information technology and computer revolution has resulted in large, comprehensive databases and made practical the use of powerful statistical tools. The present research would not have been feasible 10 years ago. The results of these analyses, coupled with the graphical output capabilities typical of current Pes, can improve managers' access to information and enhance the quality and pace of decision making. The potential benefit of the approach is greater precision in predicting the occurrence of broken rails, along with wider availability and enhanced interpretation of the results. This capability is important as railroads strive to improve safety and at the same time more efficiently use their resources and extract more value from assets such as rail. ACKNOWLEDGMENTS Douglas Simpson and Todd Treichel provided helpful assistance and review of the statistical methods used. Thanks also to Hank Lees, Tom Wright, and Scott Staples, who assisted in obtaining the data needed for the analysis. Frederick Lawrence patiently shared his insight regarding the fracture mechanics of rail, and Kevin Sawley and David Davis also provided helpful discussion. The first two TABLE 4 Service Failure Probabilities Alonge Hypothetical Track Section StartMP EndMP Length u SFIMIIYR Expected SF O.Q Total (0.0 to 1.5)
8 Dick et al. Papr No !l &! 1.00 '"ij8. "' Q) til E Q) 0.40 r n Mile FIGURE 3 Service feilure probability along a hypothetical track segment. authors would like to express their gratitude to the Burlington Northern Santa Fe Railway for its support of this research. REFERENCES 1. Thompson, A. W. Service Performance of Railroad Rails. In Symposium on Railroad History, Vol. 2 (A. W. Thompson, ed.), A. C. K.almbach Library, Chattanooga, Tenn., Aldrich, M. The Peril of the Broken Rail: Carriers, Steel Companies, and Rail Technology, Technology and Culture, Vol. 40, 1999, pp Shyr, F.-Y., and M. Ben-Akiva. Modeling Rail Fatigue Behavior with Multiple Hazards. Journal of Infrastructure Systems, Vol. 2, No.2, June 1996, pp Palese, J. W., and T. W. Wright. Risk-Based Ultrasonic Rail Test Scheduling on Burlington Northern Santa Fe. Proc., AREMA 2000 Annual Conference (CD-ROM), Chicago, m., American Railway Engineering and Maintenance-of-Way Association, Landover, Md., Palese, J. W., and A.M. Zarembski. BNSF Tests Risk-Based Ultrasonic Detection, Railway Track and Structures. Vol. 97, No.2, 2001, pp Orrlnger, 0., and M. W. Bush. Applying Modern Fracture Mechanics to Improve the Control of Rail Fatigue Defects in Track. American Railway Engineering Association Bulletin, Bulletin 689, Vol. 83, 1982, pp Orringer, 0., Y. H. Tang, D. Y. Jeong, and A. B. Perlman. Risk-Benefit Assessment of Delayed Action Concept for Rail Inspection. Office of Research and Development, FRA, U.S. Department of Transportation, Lawrence, F. V., C. P. L. Barkan, and H. M. Reis. Summary of a Workshop on New Technologies for the Detection of Broken Rail. Railroad Engineering Program, University of illinois, Urbana-Champaign, Jeong, D. Y. Progress in Rail Integrity Research. Proc., AREMA 2001 Annual Conference (CD-ROM), Chicago, ill., American Railway Engineering and Maintenance-of-Way Association, Landover, Md., Roney, M.D., and W. Ebersohn. Maintaining Optimal Wheel and Rail Performance. In Guidelines to Best Practices for Heavy Haul Railway Operdtions: Wheel and Rail Interface Issues. International Heavy Haul Association, Virginia Beach, Va., May Dick, C. T. Factors Affecting the Frequency and Lccation of Broken Railroad Rails and Broken Rail Derailments. M.S. thesis. University of illinois, Urbana-Champaign, Barkan, C. P. L., C. T. Dick, and R. Anderson. Railroad Derailment Factors Affecting Hazardous Materials Transportation Risk. In Transportation Research Record: Journal of the Transportation Research Board, No. 1825, TRB, National Research Council, Washington, D.C., 2003, pp FRA. Accident Incident Database, dot.gov/officeofsafety/. Accessed May Ben-Akiva, M., and S. R. Lerman. Discrete Choice Analysis. MIT Press, Cambridge, Mass., Reiff, R. P. Proc., Rail Defect and Broken Rail Defects Expanded Workshop, July Transportation Technology Center, Association of American Railroads, Pueblo, Colo., Oark, R. Defective and Broken Rails-How They Occur. In Summary of a Workshop on New Technologies for the Detection of Broken Rail (F. V. Lawrence, C. P. L. Barkan, and H. M. Reis, eds.), Railroad Engineering Program, University of illinois, Urbana-Champaign, Hay, W. W. Railroad Engineering, 2nd ed. John Wiley and Sons, New York, Selig, E. T., and J. M. Waters. Track Geotechnology and Substructure Management. Thomas Telford Publications, London, Manual for Railway Engineering (CD-ROM). American Railway Engineering and Maintenance-of-Way Association, Landover, Md., Lawrence, F. V. An Analytical Model for Broken Rail. In Summary of a Workshop on New Technologies for the Detection of Broken Rail (F. V. Lawrence, C. P. L. Barkan, and H. M. Reis, eds.), Railroad Engineering Program, University of illinois, Urbana-Champaign, McCullagh, P., and J. A. Neider. Generalized Linear Models, 2nd ed., Chapman and Hall, New York, Sawley, K., and R. Reiff. Rail Failure Assessment for the Office of the Rail Regulator. Transportation Technology Center, Association of American Railroads, Pueblo, Colo., Kalay, S., and A. Tajaddini. Condemning Wheels Due to Impact Loads: Preliminary Survey-Six Railroads' Experience. Report R-754. Chicago Technical Center, Association of American Railroads, Chicago, ill., Publication of this paper sponsored by Committee on Railway Maintenance.
Analysis of Causes of Major Train Derailment and Their Effect on Accident Rates
Analysis of Causes of Major Train Derailment and Their Effect on Accident Rates Xiang Liu, M. Rapik Saat, and Christopher P. L. Barkan Analysis of the causes of train accidents is critical for rational
Wayside Defect Detector Data Mining to Predict Potential WILD Train Stops
Wayside Defect Detector Data Mining to Predict Potential WILD Train Stops Leila Hajibabai a, M. Rapik Saat a, Yanfeng Ouyang a, Christopher P.L. Barkan a, Zengyi Yang a, Kim Bowling b, Kamalesh Somani
A Quantitative Decision Support Framework for Optimal Railway Capacity Planning
A Quantitative Decision Support Framework for Optimal Railway Capacity Planning Y.C. Lai, C.P.L. Barkan University of Illinois at Urbana-Champaign, Urbana, USA Abstract Railways around the world are facing
FLEXURAL PERFORMANCE REQUIREMENTS FOR PRESTRESSED CONCRETE TIES BY FACTORING * * *
FLEXURAL PERFORMANCE REQUIREMENTS FOR PRESTRESSED CONCRETE TIES BY FACTORING By P. Jeffrey McQueen, P.E., Consulting Engineer Abstract The AREMA Manual for Railway Engineering, Chapter 30, Part 4, gives
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
Some Examples of Big Data in Railroad Engineering
2014 IEEE International Conference on Big Data Some Examples of Big Data in Railroad Engineering Allan M Zarembski Department of Civil and Environmental Engineering University of Delaware Newark, DE, USA
Chapter 3. Track and Wheel Load Testing
Chapter 3 Track and Wheel Load Testing This chapter describes the track, truck, and testing equipment that were used by the Transportation Technology Center, Inc. (TTCI) for collecting the data that was
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
RAILROAD ACCIDENT INVESTIGATION REPORT NO. 4129 THE PENNSYLVANIA RAILROAD COMPANY WEST JEFFERSON, O. AUGUST 4, 1967 DEPARTMENT OF TRANSPORTATION
RAILROAD ACCIDENT INVESTIGATION REPORT NO. 4129 THE PENNSYLVANIA RAILROAD COMPANY WEST JEFFERSON, O. AUGUST 4, 1967 DEPARTMENT OF TRANSPORTATION FEDERAL RAILROAD ADMINISTRATION Washington Summary DATE:
NATIONWIDE WAYSIDE DETECTOR SYSTEM
Nationwide Wayside Detector System Keeps Rail s Record Safety Performance on Track Freight railroads make safety their #1 priority and continually look to improve on their record safety performance through
Railway Track Design
Chapter Railway Track Design Basic considerations and guidelines to be used in the establishment of railway horizontal and vertical alignments. The route upon which a train travels and the track is constructed
Framework for Short-Line Railroad Track Asset Management and Condition Reporting
Framework for Short-Line Railroad Track Asset Management and Condition Reporting Michael N. Grussing 1, and Donald R.Uzarski, Ph.D., RE. 2, 1. Research Civil Engineer, Engineer Research and Development
Reducing Rail Surface Defect Service Failures on the CSXT Railroad
Reducing Rail Surface Defect Service Failures on the CSXT Railroad Bill Bell Manager Rail Services; CSX Transportation Ron Bright Direcr Track Testing; CSX Transportation Dennis Witt, PE Engineer Standards
Tutorial Paper on Quantitative Risk Assessment. Mohammad Reza Sohizadeh Abyaneh Seyed Mehdi Mohammed Hassanzadeh Håvard Raddum
Tutorial Paper on Quantitative Risk Assessment Mohammad Reza Sohizadeh Abyaneh Seyed Mehdi Mohammed Hassanzadeh Håvard Raddum Abstract This paper shows how to carry out a quantitative risk assessment,
WMATA S Automated Track Analysis Technology & Data Leveraging for Maintenance Decisions
WMATA S Automated Track Analysis Technology & Data Leveraging for Maintenance Decisions 1 WMATA System 6 Lines: 5 radial and 1 spur 234 mainline track miles and 91 stations Crew of 54 Track Inspectors
CSX Track Chart and Engineering Information System
Roy Anderson (CSX Transportation), David Maas (CSX Transportation), Brian Mack (Optram, Inc.), Ted Selig (Optram, Inc.) September 10, 2000 ABSTRACT Railroads have made a considerable investment in information
LOGISTIC REGRESSION ANALYSIS
LOGISTIC REGRESSION ANALYSIS C. Mitchell Dayton Department of Measurement, Statistics & Evaluation Room 1230D Benjamin Building University of Maryland September 1992 1. Introduction and Model Logistic
Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios
Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios By: Michael Banasiak & By: Daniel Tantum, Ph.D. What Are Statistical Based Behavior Scoring Models And How Are
Easily Identify Your Best Customers
IBM SPSS Statistics Easily Identify Your Best Customers Use IBM SPSS predictive analytics software to gain insight from your customer database Contents: 1 Introduction 2 Exploring customer data Where do
Big Data in Railroad Engineering
Big Data in Railroad Engineering Dr. Allan M Zarembski Director of Railroad Engineering and Safety Program Department of Civil and Environmental Engineering University of Delaware Newark, Delaware [email protected]
AN ILLUSTRATION OF COMPARATIVE QUANTITATIVE RESULTS USING ALTERNATIVE ANALYTICAL TECHNIQUES
CHAPTER 8. AN ILLUSTRATION OF COMPARATIVE QUANTITATIVE RESULTS USING ALTERNATIVE ANALYTICAL TECHNIQUES Based on TCRP B-11 Field Test Results CTA CHICAGO, ILLINOIS RED LINE SERVICE: 8A. CTA Red Line - Computation
STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and
Clustering Techniques and STATISTICA Case Study: Defining Clusters of Shopping Center Patrons STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table
Process Compensated Resonant Testing
Process Compensated Resonant Testing Lessons Learned about a New, but Maturing NDT Method that Detects Structural Degradation of Parts Prepared By: Lemna Hunter Vibrant Corporation (505) 314 1511 & Robert
How to Get More Value from Your Survey Data
Technical report How to Get More Value from Your Survey Data Discover four advanced analysis techniques that make survey research more effective Table of contents Introduction..............................................................2
Correlation key concepts:
CORRELATION Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson s coefficient of correlation c) Spearman s Rank correlation coefficient d)
Calculating the Probability of Returning a Loan with Binary Probability Models
Calculating the Probability of Returning a Loan with Binary Probability Models Associate Professor PhD Julian VASILEV (e-mail: [email protected]) Varna University of Economics, Bulgaria ABSTRACT The
Strategies for Identifying Students at Risk for USMLE Step 1 Failure
Vol. 42, No. 2 105 Medical Student Education Strategies for Identifying Students at Risk for USMLE Step 1 Failure Jira Coumarbatch, MD; Leah Robinson, EdS; Ronald Thomas, PhD; Patrick D. Bridge, PhD Background
Overview of Factor Analysis
Overview of Factor Analysis Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,
2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
VEHICLE SURVIVABILITY AND TRAVEL MILEAGE SCHEDULES
DOT HS 809 952 January 2006 Technical Report VEHICLE SURVIVABILITY AND TRAVEL MILEAGE SCHEDULES Published By: NHTSA s National Center for Statistics and Analysis This document is available to the public
Railroad Safety Data Frequently Asked Questions (FAQ)
Railroad Safety Data Frequently Asked Questions (FAQ) Table of Contents Railroad Classifications and Groupings...3 How are railroads classified on this site and what definitions are used for those classifications?...
Maximizing Fleet Efficiencies with Predictive Analytics
PHH Arval - Trucks Maximizing Fleet Efficiencies with Predictive Analytics Neil Gaynor Manager of Business Development 29 December 2012 1 Maximizing Fleet Efficiencies with the use of Predictive Analytics
Integration of PTC and Ride Quality Data. Presented by: Wabtec Railway Electronics, I-ETMS PTC Supplier. and
Integration of PTC and Ride Quality Data Presented by: Wabtec Railway Electronics, I-ETMS PTC Supplier and dfuzion, Inc., rmetrix Ride Performance Assessment System Supplier The FRA mandate to implement
How To Check For Differences In The One Way Anova
MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way
Introduction to Railroad Track Structural Design
BCR2A 09 Railroad Track Design Including Asphalt Trackbeds Pre-Conference Workshop Introduction to Railroad Track Structural Design Don Uzarski, Ph.D., P.E. [email protected] Interaction, Vertical Load
Simple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
Module 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
A model to predict the probability of highway rail crossing accidents
321 A model to predict the probability of highway rail crossing accidents G M McCollister and C C Pflaum Spectrum Economics, Inc., Overland Park, Kansas, USA The manuscript was received on 26 April 2006
Nondestructive Testing and Evaluation of Steel Bridges
Abstract Nondestructive Testing and Evaluation of Steel Bridges James Bader ENCE 710 Spring 2008 Nondestructive evaluation (NDE) is a means of evaluating structural components without damaging them. It
Prediction of Stock Performance Using Analytical Techniques
136 JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE, VOL. 5, NO. 2, MAY 2013 Prediction of Stock Performance Using Analytical Techniques Carol Hargreaves Institute of Systems Science National University
11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
Reducing the Risk of Rail Transport of Hazardous Materials by Route Rationalization
Reducing the Risk of Rail Transport of Hazardous Materials by Route Rationalization Athaphon Kawprasert (Corresponding Author) Graduate Research Assistant Railroad Engineering Program Department of Civil
MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly
On Correlating Performance Metrics
On Correlating Performance Metrics Yiping Ding and Chris Thornley BMC Software, Inc. Kenneth Newman BMC Software, Inc. University of Massachusetts, Boston Performance metrics and their measurements are
Multivariate Normal Distribution
Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues
Deploying Regional Jets to Add New Spokes to a Hub. Ian Savage* and Burgess Scott Northwestern University
Deploying Regional Jets to Add New Spokes to a Hub by Ian Savage* and Burgess Scott Northwestern University Correspondence Address Professor Ian Savage Department of Economics Northwestern University 2001
An analysis of the 2003 HEFCE national student survey pilot data.
An analysis of the 2003 HEFCE national student survey pilot data. by Harvey Goldstein Institute of Education, University of London [email protected] Abstract The summary report produced from the first
Impact / Performance Matrix A Strategic Planning Tool
Impact / Performance Matrix A Strategic Planning Tool Larry J. Seibert, Ph.D. When Board members and staff convene for strategic planning sessions, there are a number of questions that typically need to
IV. INDUSTRIAL TRACK DESIGN
IV. INDUSTRIAL TRACK DESIGN 4.01 GENERAL The following sections govern the criteria to be used in the designing of industry tracks served, or to be served, by the Railway Company. Any deviation from these
EXPERIMENTAL ERROR AND DATA ANALYSIS
EXPERIMENTAL ERROR AND DATA ANALYSIS 1. INTRODUCTION: Laboratory experiments involve taking measurements of physical quantities. No measurement of any physical quantity is ever perfectly accurate, except
Gamma Distribution Fitting
Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics
Modeling Customer Lifetime Value Using Survival Analysis An Application in the Telecommunications Industry
Paper 12028 Modeling Customer Lifetime Value Using Survival Analysis An Application in the Telecommunications Industry Junxiang Lu, Ph.D. Overland Park, Kansas ABSTRACT Increasingly, companies are viewing
Part 1 : 07/27/10 21:30:31
Question 1 - CIA 593 III-64 - Forecasting Techniques What coefficient of correlation results from the following data? X Y 1 10 2 8 3 6 4 4 5 2 A. 0 B. 1 C. Cannot be determined from the data given. D.
Preventive Grinding Moves into the 21 st Century on Canadian Pacific Railway
Preventive Grinding Moves into the 21 st Century on Canadian Pacific Railway Robert devries Canadian Pacific Railway Suite 700 Gulf Canada Square 401-9 th Avenue Calgary, Alberta, Canada T2P 4Z4 Telephone:
Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@
Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@ Yanchun Xu, Andrius Kubilius Joint Commission on Accreditation of Healthcare Organizations,
In mathematics, there are four attainment targets: using and applying mathematics; number and algebra; shape, space and measures, and handling data.
MATHEMATICS: THE LEVEL DESCRIPTIONS In mathematics, there are four attainment targets: using and applying mathematics; number and algebra; shape, space and measures, and handling data. Attainment target
Introduction to Longitudinal Data Analysis
Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction
VALIDATION OF ANALYTICAL PROCEDURES: TEXT AND METHODOLOGY Q2(R1)
INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE ICH HARMONISED TRIPARTITE GUIDELINE VALIDATION OF ANALYTICAL PROCEDURES: TEXT AND METHODOLOGY
Interpreting Data in Normal Distributions
Interpreting Data in Normal Distributions This curve is kind of a big deal. It shows the distribution of a set of test scores, the results of rolling a die a million times, the heights of people on Earth,
The AASHO Road Test site (which eventually became part of I-80) at Ottawa, Illinois, was typical of northern climates (see Table 1).
Página 1 de 12 AASHO Road Test The AASHO Road Test, a $27 million (1960 dollars) investment and the largest road experiment of its time, was conceived and sponsored by the American Association of State
Making Accurate Voltage Noise and Current Noise Measurements on Operational Amplifiers Down to 0.1Hz
Author: Don LaFontaine Making Accurate Voltage Noise and Current Noise Measurements on Operational Amplifiers Down to 0.1Hz Abstract Making accurate voltage and current noise measurements on op amps in
SYSTEMWIDE REQUIREMENTS
SYSTEMWIDE REQUIREMENTS for the Peninsula Rail Program San Francisco to San Jose on the Caltrain Corridor Description of the Systemwide Context for the High Speed Train Project This document provides a
The Consequences of Missing Data in the ATLAS ACS 2-TIMI 51 Trial
The Consequences of Missing Data in the ATLAS ACS 2-TIMI 51 Trial In this white paper, we will explore the consequences of missing data in the ATLAS ACS 2-TIMI 51 Trial and consider if an alternative approach
Predict the Popularity of YouTube Videos Using Early View Data
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
Principle Component Analysis and Partial Least Squares: Two Dimension Reduction Techniques for Regression
Principle Component Analysis and Partial Least Squares: Two Dimension Reduction Techniques for Regression Saikat Maitra and Jun Yan Abstract: Dimension reduction is one of the major tasks for multivariate
ESTIMATING THE DISTRIBUTION OF DEMAND USING BOUNDED SALES DATA
ESTIMATING THE DISTRIBUTION OF DEMAND USING BOUNDED SALES DATA Michael R. Middleton, McLaren School of Business, University of San Francisco 0 Fulton Street, San Francisco, CA -00 -- [email protected]
Local outlier detection in data forensics: data mining approach to flag unusual schools
Local outlier detection in data forensics: data mining approach to flag unusual schools Mayuko Simon Data Recognition Corporation Paper presented at the 2012 Conference on Statistical Detection of Potential
C. Wohlin, "Is Prior Knowledge of a Programming Language Important for Software Quality?", Proceedings 1st International Symposium on Empirical
C. Wohlin, "Is Prior Knowledge of a Programming Language Important for Software Quality?", Proceedings 1st International Symposium on Empirical Software Engineering, pp. 27-36, Nara, Japan, October 2002.
Sandia National Laboratories New Mexico Wind Resource Assessment Lee Ranch
Sandia National Laboratories New Mexico Wind Resource Assessment Lee Ranch Data Summary and Transmittal for September December 2002 & Annual Analysis for January December 2002 Prepared for: Sandia National
Cluster Analysis for Evaluating Trading Strategies 1
CONTRIBUTORS Jeff Bacidore Managing Director, Head of Algorithmic Trading, ITG, Inc. [email protected] +1.212.588.4327 Kathryn Berkow Quantitative Analyst, Algorithmic Trading, ITG, Inc. [email protected]
Analysis of Load Frequency Control Performance Assessment Criteria
520 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 16, NO. 3, AUGUST 2001 Analysis of Load Frequency Control Performance Assessment Criteria George Gross, Fellow, IEEE and Jeong Woo Lee Abstract This paper presents
Introduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
Teaching Multivariate Analysis to Business-Major Students
Teaching Multivariate Analysis to Business-Major Students Wing-Keung Wong and Teck-Wong Soon - Kent Ridge, Singapore 1. Introduction During the last two or three decades, multivariate statistical analysis
Highway Motor Transportation
Highway Motor Transportation WW I: Motor carrier industry started, converted automobiles were used for pick up and delivery in local areas Railroads encouraged the growth. They had difficulty with small
CHARACTERISTICS IN FLIGHT DATA ESTIMATION WITH LOGISTIC REGRESSION AND SUPPORT VECTOR MACHINES
CHARACTERISTICS IN FLIGHT DATA ESTIMATION WITH LOGISTIC REGRESSION AND SUPPORT VECTOR MACHINES Claus Gwiggner, Ecole Polytechnique, LIX, Palaiseau, France Gert Lanckriet, University of Berkeley, EECS,
Visualizing Data from Government Census and Surveys: Plans for the Future
Censuses and Surveys of Governments: A Workshop on the Research and Methodology behind the Estimates Visualizing Data from Government Census and Surveys: Plans for the Future Kerstin Edwards March 15,
Guidance for Industry
Guidance for Industry Q2B Validation of Analytical Procedures: Methodology November 1996 ICH Guidance for Industry Q2B Validation of Analytical Procedures: Methodology Additional copies are available from:
Factor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models
Factor Analysis Principal components factor analysis Use of extracted factors in multivariate dependency models 2 KEY CONCEPTS ***** Factor Analysis Interdependency technique Assumptions of factor analysis
CRS Report for Congress
December 3, 2007 CRS Report for Congress Lump-Sum Distributions under the Pension Protection Act Summary Patrick Purcell Specialist in Income Security Domestic Social Policy Division The Pension Protection
Comparing Alternate Designs For A Multi-Domain Cluster Sample
Comparing Alternate Designs For A Multi-Domain Cluster Sample Pedro J. Saavedra, Mareena McKinley Wright and Joseph P. Riley Mareena McKinley Wright, ORC Macro, 11785 Beltsville Dr., Calverton, MD 20705
Credit Risk Models. August 24 26, 2010
Credit Risk Models August 24 26, 2010 AGENDA 1 st Case Study : Credit Rating Model Borrowers and Factoring (Accounts Receivable Financing) pages 3 10 2 nd Case Study : Credit Scoring Model Automobile Leasing
Relationships Between Two Variables: Scatterplots and Correlation
Relationships Between Two Variables: Scatterplots and Correlation Example: Consider the population of cars manufactured in the U.S. What is the relationship (1) between engine size and horsepower? (2)
Noncrash fire safety recall losses
Bulletin Vol. 30, No. 33 : December 2013 Noncrash fire safety recall losses Vehicles with known fire-related defects have significantly higher risk of noncrash fire insurance losses compared with vehicles
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression
Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering
Engineering Problem Solving and Excel EGN 1006 Introduction to Engineering Mathematical Solution Procedures Commonly Used in Engineering Analysis Data Analysis Techniques (Statistics) Curve Fitting techniques
Software Development and Testing: A System Dynamics Simulation and Modeling Approach
Software Development and Testing: A System Dynamics Simulation and Modeling Approach KUMAR SAURABH IBM India Pvt. Ltd. SA-2, Bannerghatta Road, Bangalore. Pin- 560078 INDIA. Email: [email protected],
Determining Factors of a Quick Sale in Arlington's Condo Market. Team 2: Darik Gossa Roger Moncarz Jeff Robinson Chris Frohlich James Haas
Determining Factors of a Quick Sale in Arlington's Condo Market Team 2: Darik Gossa Roger Moncarz Jeff Robinson Chris Frohlich James Haas Executive Summary The real estate market for condominiums in Northern
Binary Logistic Regression
Binary Logistic Regression Main Effects Model Logistic regression will accept quantitative, binary or categorical predictors and will code the latter two in various ways. Here s a simple model including
Access Spacing and Accidents
Access Spacing and Accidents A Conceptual Analysis HERBERT S. LEVINSON Transportation Consultant 40 Hemlock Road New Haven, CT 06515 ABSTRACT This paper develops a method for predicting the safety of arterial
Testing Metrics. Introduction
Introduction Why Measure? What to Measure? It is often said that if something cannot be measured, it cannot be managed or improved. There is immense value in measurement, but you should always make sure
Assessing the Impact of a Tablet-PC-based Classroom Interaction System
STo appear in Proceedings of Workshop on the Impact of Pen-Based Technology on Education (WIPTE) 2008. Assessing the Impact of a Tablet-PC-based Classroom Interaction System Kimberle Koile David Singer
