An Investigation of the Performance of the Generalized S-X 2 Item-Fit Index for Polytomous IRT Models. Taehoon Kang Troy T. Chen

Size: px
Start display at page:

Download "An Investigation of the Performance of the Generalized S-X 2 Item-Fit Index for Polytomous IRT Models. Taehoon Kang Troy T. Chen"

Transcription

1

2 An Investgaton of the Performance of the eneralzed S-X Item-Ft Index for Polytomous IRT Models Taehoon Kang Troy T. Chen

3

4 Abstract Orlando and Thssen (, 3) proposed an tem-ft ndex, S-X, for dchotomous tem response theory (IRT) models, whch has performed better than tradtonal tem-ft statstcs such as Yen s (98) Q and McKnley and Mll s (985). Ths study extends the utlty of S-X to polytomous IRT models, ncludng the generalzed partal credt model (PCM: Murak, 99), partal credt model (PCM: Masters, 98), and ratng scale model (RSM: Andrch, 978). The performance of the generalzed S-X n assessng tem-model ft was studed n terms of emprcal Type I error rates and power as compared to results obtaned for provded by the computer program PARSCALE (Murak & Bock, 997). The results show that the generalzed S-X s a promsng tem-ft ndex for polytomous tems n educatonal and psychologcal testng programs. Keywords: tem response theory; tem ft; S-X ; polytomous IRT; PARSCALE

5

6 An Investgaton of the Performance of the eneralzed S-X Item-Ft Index for Polytomous IRT Models Introducton Ever snce Lawley (943) and Lord (95) establshed the basc concepts of tem response theory (IRT), many IRT models have been developed and appled to varous felds of study such as psychologcal scalng and educatonal measurement. Unless an approprate IRT model for a gven data set s used, however, the benefts of IRT for applcatons such as test development, tem bankng, dfferental tem functonng (DIF), computerzed adaptve testng (CAT), and test equatng mght not be attaned. In bref, the success of IRT applcatons requres satsfactory ft between the model and the data. The most crtcal problem caused by model-data msft may be that the hallmark feature of IRT, parameter nvarance, no longer apples (Shepard, Camll & Wllams, 984; Bolt, ; Rupp & Zumbo, 4). Numerous statstcal procedures have been developed to evaluate tem ft under an IRT model, and goodness-of-ft studes have been conducted and reported n the volumnous IRT lterature (see Bock, 97; Douglas & Cohen, ; las & Suarez-Falcon, 3; Lang & Wells, 7; McKnley & Mlls, 985; Orlando & Thssen,, 3; Snharay, 3, 5; Stone, ; Stone & Zhang, 3; Suarez-Falcon & las, 3; Wells, 4; Yen, 98). Among them, several Ch-square based tem-level goodness-of-ft ndces usng sgnfcance tests such as Yen s (98) Q for dchotomous tems, the tradtonal loglkelhood Ch-square,, for both dchotomous and polytomous tems (McKnley & Mlls, 985), and Orlando and Thssen s (, 3) S-X for dchotomous tems have been utlzed for IRT applcatons. Type I error rates for these goodness-of-ft ndces have been nvestgated and reported.

7 A shortcomng of the tem-ft tests based on and s ther senstvty to test length and sample sze. For nstance, on a short test of dchotomous tems, these tradtonal statstcs exhbted nflated emprcal Type I error rates as hgh as.96 and.97, Q respectvely, for a gven nomnal rejecton rate of α =.5 (Orlando & Thssen, ). DeMars (5) smulaton studes wth a sample sze of, used PARSCALE s (Murak & Bock, 997) ft ndex whch s smlar to McKnley and Mll s (985), and dscovered emprcal Type I error rates of.4 under the partal credt model (PCM: Masters, 98) and.34 under the graded response model (RM: Samejma, 969) on a polytomous tem test. Stone and Hansen () found that an tem-ft test usng for a 3 polytomous tem test under the RM showed nflated emprcal Type I error rates between.4 and.8 for cases wth, examnees, and between.9 and.396 for cases wth, examnees, even though the true tem parameter values were used n calculatng predcted proportons. Besdes studes on and, there have been noteworthy studes on S-X by Q Orlando and Thssen (, 3). In ther smulaton studes usng test lengths of, 4, and 8 dchotomous tems and a sample sze of,, they showed that S-X adequately controlled Type I error rates (Orlando & Thssen, ). Under the -, -, and 3-parameter logstc models (PLM, PLM, and 3PLM, respectvely), the emprcal Type I error rates for tests based on S-X were found to be between.4 and.7 wth nomnal α of.5. Addtonally, the emprcal power of S-X mproved as sample sze ncreased from 5 to, (Orlando & Thssen, 3). The S-X ndex could also be generalzed and appled to the goodness-of-ft test for polytomous tems (Roberts, n press). The man purpose of ths study s to assess the performances of the generalzed S-X under the polytomous IRT models ncludng the generalzed partal credt model (PCM: Murak, 99), PCM, and the ratng scale model (RSM: Andrch, 978) for dfferent combnatons of test length and sample sze. The paper

8 3 begns wth a revew of,, and S-X statstcs followed by a dscusson on the Q generalzaton of S-X for polytomous tems. Fnally, the performances of the generalzed S-X and PARSCALE s are compared through a smulaton study. Ch-Square Based Item Ft Indces Accordng to Hambleton, Swamnathan and Rogers (99), and Stone (), a common strategy for assessng tem-ft of an IRT model can be summarzed as follows: () estmate the tem and ablty parameters under the chosen model, () classfy examnees nto K homogeneous groups n terms of ther ablty estmates, (3) calculate observed response proportons n each group for the tem under nvestgaton, (4) derve predcted response proportons n each group usng the tem and ablty parameter estmates under the IRT model of nterest, and (5) compute Ch-square based statstcs by comparng the observed and predcted values. The Tradtonal Ch-Square Based Ft Indces Q Both and are consdered to be tradtonal Ch-square based ft ndces. Yen s (98) Q for a dchotomous tem can be expressed as follows: ( O E ) ( O E ) Q =, () kz kz k k N k = N k k = z= Ekz k = Ek( Ek ) where z ndcates the tem score, k represents the number of groups of examnees, N k s the number of examnes n group k, and (= - ) and (= - ) are, respectvely, Ok Ok Ek E k the observed and predcted proportons of correct responses for group k. To compute, the tem and ablty parameter estmates are frst obtaned under the chosen IRT model, and the total number of groups (K) s set to wth each group havng approxmately an equal number of examnees. The predcted proportons of correct responses,, s computed as the mean predcted proporton of each group. Snce K =, the degrees of freedom (df) E k Q

9 assocated wth Q 4 equal m where m s the number of tem parameters estmated. For assessng model-tem ft for both dchotomous and polytomous tems, PARSCALE provdes as the goodness-of-ft ndex. ven an tem denoted, can be computed as follows: = K Z k= r kz z= ln N k r P kz z, () ( θ ) k where z ndcates tem scores rangng from zero to the hghest tem score of Z, k represents the number of groups of examnees, r kz equals the observed number of examnees scorng z n group k, Nk s total number of examnees n group k, and P θ ) s the response z ( k functon for tem score z evaluated at the mean ablty of examnees n group k. The total number of groups K could however vary across tems because neghborng groups can be collapsed to avod expected values, N k Pz ( θ k ), less than 5. In general, the df of for dchotomous tems equals K whch s dfferent from that of Q Yen s as no adjustment for the number of estmated parameters m s made for. Mslevy and Bock (99) argued that m s not consdered n determnng the df for because the parameter estmaton procedure has nothng to do wth mnmzng. Orlando and Thssen s S-X Index for Dchotomous Items Though Orlando and Thssen s () S-X procedure follows much the same pattern as the procedure, t has a notable advantage over Q and. For both Q and, Q the groupng procedure reles on sample- and model- dependent cut scores, whereas S-X s based on test scores (.e., number-correct scores). Usng the same notaton defned earler, S-X for a dchotomous tem on an I-tem test s gven by:

10 5 S-X = I I ( Okz Ekz ) ( Ok Ek) k = N k k= z= Ekz k= Ek( Ek) N. (3) The summaton over the test score k n equaton (3) excludes k = and k = I, snce the proporton of examnees who respond to an tem correctly and have a test score of zero s always zero, and t s always for those havng a perfect test score of I. Also, n equaton (3), the expected proporton of examnees n x = k group who got tem rght, Ek, could be calculated usng the followng formula: P ( θ ) f ( k θ ) φ( θ ) θ Ek =, (4) f ( k θ ) φ( θ ) θ where f ( k θ ) s the condtonal predcted test score ( x = k ) dstrbuton gven θ, f ( k θ ) represents the condtonal predcted test score dstrbuton wthout tem, and φ (θ ) s the populaton dstrbuton of θ. f ( k θ ) and f ( k θ ) could be computed wth the help of the recursve algorthm developed by Lord and Wngersky (984). Smlar to, neghborng groups could be collapsed to mantan a mnmum expected cell frequency of 5. For S-X, the mnmum value s set to accordng to the studes of Larntz (978) and Orlando and Thssen s (). If t s not necessary to collapse groups, then the df equals I m where m s the number of tem parameters estmated; otherwse an adjustment for the number of groups beng collapsed s needed. The eneralzed S-X Index for Polytomous Items The current study extends the applcaton of S-X to the assessment of tem ft for polytomous tems. For a polytomous tem denoted on a test of I polytomous tems wth each havng Z + scorng categores (.e., category score z =,,, Z ), the generalzed S-X can, usng notaton defned earler, be expressed as follows:

11 6 S-X = ( F Z ) Z k = Z z= N k ( O kz E E kz kz ), (5) where z ndcates category scores, Z s the hghest score of tem, and F s a perfect test I score (.e., F = Z ). The expected category proportons, Ekz, n equaton (5), can be computed usng the followng formula: P ( z θ ) f ( k z θ ) φ( θ ) θ Ekz =. (6) f ( k θ ) φ( θ ) θ To compute f ( z θ ) and f ( z θ ) n equaton (6), the generalzed recursve algorthm developed by Thssen, Pommerch, Blleaud and Wllams (995) can be used. Smlar to that for dchotomous tems, the computaton for the generalzed S-X procedure excludes groups wth a test score of zero (.e., k = ) or perfect test score (.e., k = F). Also, n equaton (5), the summaton for k s from Z through F Z. Ths s because wthn some groups wth extremely low or hgh test scores, the expected proportons of examnees ( ) for some categores are always zero. For nstance, for the group wth k = E kz 38 on a test of polytomous tems wth each havng 5 categores (.e., z =,,, 3, 4) n Table, the observed and predcted proportons of examnees for the z = and z = categores are always zero. Smlarly, for the group wth k =, they wll always be zero for the z = 3 and z = 4 categores. For the generalzed S-X procedure, such groups are collapsed to the groups wth k = Z or k = F Z. For example, the groups wth k =, and 3 for the llustratve tem n Table wll be combned wth the k = 4 group, and the groups wth k = 37, 38, and 39 wll be merged wth the k = 36 group. Hence, for an tem wth Z +

12 7 scorng categores, the total number of groups ncluded n the generalzed S-X procedure s I equal to K = F Z + = Z Z +.

13 8 TABLE The Observed and Expected Cell Frequences of an Illustratve Item Observed Frequences ( N O k kz ) Expected Frequences( N E k kz ) Test Item Category Score Item Category Score Score roup k ndcates that the cell has a value of zero always. ndcates that the expected value s less than.5 but larger than

14 9 In lght of Orlando and Thssen (), neghborng test score groups need to be collapsed to mantan a mnmum expected category frequency ( N E k kz ) of. However, ths collapsng method could be nfeasble for the case of polytomous tems. For example, f Orlando and Thssen s cell collapsng approach s appled to the llustratve tem n Table, only two groups wth k = 8 and 9 would reman, and too much nformaton would be lost. An even worse scenaro would be that only one group remans. Thus, nstead of collapsng test score groups, the current study suggests, f needed, collapsng adjacent cells of tem score categores for a gven group k to ensure a mnmum expected category frequency of. For nstance, n the group wth k = of the llustratve tem n Table, the z = 3 and z = 4 categores could be combned wth the z = category. Pett (997) mentoned that cell-collapsng should be undertaken carefully so that the combned cells make ntutve sense. Snce the tem score categores are ordered, the suggested method s consdered reasonable. Also, Murak (996) used ths approach for combnng response categores. Wth ths collapsng algorthm, the generalzed S-X has df of K Z m C where m s the number of tem parameters estmated, and C ndcates the total number of tem score categores beng collapsed. For the llustratve tem wth 5 scorng categores n Table, the df = = 66 under the PCM. Also, the generalzed S-X for ths tem was found to be 6. (p-value =.6). So, the PCM appeared to ft the tem adequately. Method Desgn of Smulaton Study To assess the performance of the generalzed S-X ndex, a smulaton study varyng polytomous IRT models, test lengths, and sample szes was conducted. The ndex selected for comparson was PARSCALE s because t appears to be the most frequently employed ndex n appled settngs. Three commonly used polytomous IRT models (RSM,

15 PCM and PCM) were explored n ths study. Under the PCM, the probablty that an examnee j scores z wth z =,,, Z on tem wth Z + response categores s modeled by the followng: z y= c= [ θ ( β τ )] exp α j c c= P( z θ j, α, β, τ c ) = Z, (7) y exp α [ θ ( β τ )] j c where α s the dscrmnaton of tem, β denotes the dffculty of tem, and τ c represents the locaton parameter for a category on tem. Equaton (7) needs to set τ, = Z c= τ = and exp α [ θ ( β τ ] = c c= j c ) for model dentfcaton. The three polytomous IRT models consdered n ths study are herarchcally related. If α n equaton (7) s fxed at across tems, equaton (7) reduces to the PCM. Moreover, f theτ values for each category are, respectvely, the same across tems, equaton (7) further reduces to the RSM. Consequently, RSM s nested wthn PCM, whle PCM s nested wthn PCM. In addton to the three generatng models (RSM, PCM, and PCM), ths smulaton study employed three test lengths (I = 5,, and tems), and four sample szes (N = 5,,,,, and 5, examnees). The three test lengths mmc tests havng small, moderate and large numbers of polytomously scored tems. The four sample szes represent small, moderate, large and very large samples. Data eneraton A standard method was employed for tem response generaton for ths study. The steps for data generaton nclude: () generate tem and ablty parameters, () under the chosen IRT model, calculate the probablty, P( z θ, α, β, τ ), for the responses usng the j c

16 generated tem and ablty parameters, and then the cumulatve probablty denoted P ( z θ, α, β, τ ) = j c z x= P( x θ, α, β, τ j c ), and (3) wth a random number denoted u drawn from the unform dstrbuton, U(,), assgn a response of f u P (), or z f P ( z ) < u P ( z) for z =,,, or Z. For ths study, the tem parameters used for smulatng response data under the PCM were obtaned as follows. The dscrmnaton parameters (α ) were randomly sampled from a lognormal (,.5 ) dstrbuton. For each tem, four tem step parameters (.e., δ c = β τ where c =,, 3, or 4) were randomly drawn from four normal dstrbutons c wth a common standard devaton of.5 and means of -.5, -.5,.5, and.5, respectvely. The mean of these four step parameters s then used as the tem dffculty parameter ( β ), and the dfference between β and δ c s taken as τ c. Ths tem-parameter generatng procedure was repeated for all I tems. The values for the θ parameter were randomly drawn from the standard normal dstrbuton, N(,). Wth these tem and ablty parameters, a response dataset under the PCM was generated. The generatng procedure for datasets under the PCM s the same as that for the PCM except that the α parameters were fxed at. For generatng the dataset under the RSM, only one random sample of each step parameters ( τ c ) was generated and used for all tems, whle the dscrmnaton parameters for each tem were also fxed at. Fnally, there were a total of 36 dfferent condtons smulated n ths study (3 generatng models 3 test lengths 4 sample szes). One hundred replcatons were generated for each condton, and each condton mmcked dfferent I-tem tests from the same tem pool admnstered to equvalent groups of n examnees, respectvely.

17 Item Parameter Calbraton The tem and ablty parameters n each smulated dataset were calbrated usng the computer program PARSCALE (Murak & Bock, 997). Examples of the PARSCALE codes used for PCM, PCM, and RSM calbraton are gven n Appendx A. Parallelng Orlando and Thssen s () study, the number of tem parameters of the calbraton model (CM) was always fewer than or equal to that of the generatng model (M). As shown n Table, when the CM and M were the same, results were used to evaluate the emprcal Type I error rates of the generalzed S-X and PARSCALE s. And, when the CM was a smpler model than the M, results were used to calculate the emprcal power of the tem-ft ndces. TABLE Model Calbraton Desgn n the Smulaton Study Calbratng Model (CM) eneratng Model (M) RSM PCM PCM RSM Type I Error Rate Power Power PCM - Type I Error Rate Power PCM - - Type I Error Rate Ths study used a nomnal α of.5. An tem was flagged for msft f the sgnfcance level (.e., p-value) for the observed ft ndex under nvestgaton was less than.5. Under each condton, the smulated Type I error rates or power for the ft ndces were obtaned by dvdng the number of flagged tems by I upon the completon of the replcatons.

18 3 Results When the CM was same as the M, the PARSCALE calbratons ran properly and converged successfully for all study condtons. However, when the CM s a smpler model than the M, the calbraton runs dd not converge successfully for some condtons wth EM cycles. The worst stuaton occurred for the condton wth tems and, examnees under CM = PCM and M = PCM. For ths condton, 4 out of data sets were not successfully calbrated. In the current study, the problematc cases were excluded from the calculatons of emprcal power. The proportons of tems wrongly flagged for msft are shown n Table 3 for each condton. At a glance, the smulated Type I error rates of the generalzed S-X dd not change much across condtons, whle the values for the PARSCALE s dffered drastcally. In the short (5-tem) and medum (-tem) test length condtons, the Type I error rates of appeared to be severely nflated n most cases. Ths s not surprsng because t s a well-known problem for the tradtonal tem-ft ndces as mentoned earler. Also, even though the test length s as long as tems, had nflated Type I error rates n the 5, examnee condtons (.3 for the RSM,.8 for the PCM, and.74 for the PCM). In contrast to the poor performance of, the generalzed S-X dd not seem to be affected by test length, and had emprcal Type I error rates rangng from.4 and.65 for all study condtons except those wth long (-tem) test length and small sample sze of 5 examnees. In the long (-tem) test condtons, the performance of S-X appeared to mprove consstently as the sample sze ncreased from 5 to,, but ths pattern was not observed for.

19 4 TABLE 3 Type I Error Rates: Proportons of Indces wth Sgnfcance Level reater Than.5, Under M = CM Test Length and eneralzed S-X PARSCALE Sample Sze RSM PCM PCM RSM PCM PCM , , , , , , , , , Table 4 shows the emprcal power of the generalzed S-X and when the M s more complex than the CM. As expected, the greater the dfference that exsts between M and CM, the hgher the power s. When CM was RSM, the generalzed S-X was found to be more senstve n detectng msft wth M = PCM than wth M = PCM. Smlarly, for the condtons wth M = PCM, the condtons wth CM = RSM yeld hgher power than those wth CM = PCM. For all combnatons of test length and sample sze, the hghest power was found when M = PCM and CM = RSM. These fndngs reflect the nested structure of the three IRT models. Although both the generalzed S-X and showed better power as the sample sze ncreased, the power values for were not very useful because of the nflated Type I error rates of ths ndex under most study condtons.

20 5 TABLE 4 Emprcal Power: Proportons of Indces wth Sgnfcance Level reater Than.5, Under M > CM eneralzed S-X PARSCALE Test Length and Sample Sze PCM > PCM > PCM > PCM > PCM > PCM > RSM RSM PCM RSM RSM PCM , , , , , , , , , Dscusson As the emprcal Type I error rates of were found to be napproprately nflated, ts performance s not dscussed further n ths secton. The dscusson presented here focuses on ssues related to the generalzed S-X ndex. Ths secton begns wth a scrutny on the df adjustment for the number of estmated tem parameters followed by the dscusson on the performance of the generalzed S-X. The df Adjustment for the Number of Estmated Item Parameters For the applcatons of tem-ft ndces, controverses over the df adjustment for the number of estmated tem parameters have been found n the IRT lterature. For nstance, the df of Yen s Q s adjusted for the number of estmated tem parameters, whle the df of PARSCALE s s not adjusted. DeMars (5) explaned that the dsagreement comes from the dfference n tem parameter estmaton methods. Yen s Q s desgned for tem-ft analyss when the jont maxmum lkelhood (JML) method s used. In contrast, PARSCALE employs the margnal maxmum lkelhood (MML). Mslevy and Bock (99) mentoned, for the MML approach, the resduals are not under lnear constrants and there s

21 6 no loss of degrees of freedom due to the fttng of the tem parameters (p.-). Even though Orlando and Thssen (, 3) used the MML method wth the computer program MULTILO (Thssen, 99), they appled the df adjustment. An argument for usng the df adjustment was gven by Stone and Zhang (3): Regardless of tem parameter estmaton approach, the df adjustment should be used to account for the fact that expected frequences are based on estmated tem parameters (p. 347). Applyng the df adjustment to ther studes on S-X for dchotomous tems, both Orlando and Thssen () and Stone and Zhang (3) found emprcal Type I error rates to be close to the nomnal rates. Based upon these dscussons, a prelmnary nvestgaton was conducted to compare the applcatons of the generalzed S-X wth and wthout the df adjustment. The results showed no dramatc dfference n the smulated Type I error rates and power even though the cases wthout the df adjustment were slghtly more conservatve. Addtonally, snce the generalzed S-X was derved on the bass of Orlando and Thssen s S-X, the df adjustment was utlzed for the current study. Performance of the eneralzed S-X Smlar to Orlando and Thssen s () study on S-X for dchotomous tems, the current study found that the generalzed S-X exhbted emprcal Type I error rates rangng from.4 to.65 for all study condtons except that wth test length of tems and number of examnees equal to 5. For ths condton, the Type I error rates of the generalzed S-X appeared to be nflated as hgh as.95 for the RSM,.98 for the PCM, and.6 the PCM, as shown n Table 3. Ths could be explaned by the nevtable sparseness n expected frequences. ven that there are more score categores and hence more total score groups on an I polytomous tem test than on an I dchotomous tem test, the applcatons of generalzed S-X would encounter more sparseness n expected frequences

22 7 for condtons wth long tests (e.g., tems) and small sample szes (e.g., 5). Another ssue related to control of Type I errors s how close the samplng dstrbuton of the tem-ft statstcs under the null hypothess s to the theoretcal dstrbuton. To study the extent to whch the null dstrbuton of the S-X ndex followed a Ch-square dstrbuton, Orlando and Thssen () examned the frst two moments of the ndex. For a Ch-square dstrbuton, the mean and varance equal the df and df, respectvely. Also, Stone () used the technque of Q-Q plots to compare the emprcal dstrbuton of the ft statstcs of to the expected Ch-square dstrbuton wth an estmated df. These two methods were employed for examnng the emprcal dstrbuton of the generalzed S-X ndex. It s beleved that the generalzed S-X has approxmately a Ch-square dstrbuton. However, for a gven test length and a chosen IRT model, the df would vary due to the total number of cells beng collapsed. For example, the observed df ranged from 5 to 47 for the replcates under the condton wth test length of 5 tems and sample sze of 5, n the Type I error rate analyss under the PCM. From the 5 smulated tems (5 tems replcates) under ths condton, the observed mode of the df was 4 wth a frequency count of 65. The mean and varance of these 65 emprcal S-X values were found to be 4. and 8.99, respectvely. For other condtons, the two moments were also found to be close to ther expected values. In addton, Fgure presents the Q-Q plot to compare the emprcal dstrbuton of the 65 S-X observatons to a theoretcal Ch-square dstrbuton wth df = 4. The plot has a slope and ntercept close to and, respectvely.

23 8 FIURE. Q-Q Plot of Emprcal S-X Dstrbuton Compared to a Ch-Square Dstrbuton wth df = Expected Ch-square Value Observed Value The results of power analyss n a smulaton study for tem-ft statstcs are closely related to how to generate msfttng tems under the non-null cases. As shown n Table 4, the more dfferent the M and CM were, the better the observed power analyss results. Therefore, the generalzed S-X had the hghest power when the model ft of the RSM was nvestgated for the data smulated under the PCM. Also, for these condtons, a moderate sample sze of, would yeld adequate power. When the PCM was the M and the PCM was the CM, however, a very large sample sze (e.g., 5, examnees) was requred to yeld acceptable power hgher than.7 regardless of the test length. Smlarly, when the PCM was the M and the RSM was the CM, sample szes of, or hgher are needed for the generalzed S-X ndex to produce satsfactory power.

24 9 Concluson Smlar to the fndngs reported n the lterature, the results of the current study showed that the performance of appeared to be poor n most condtons wth short and moderate test lengths or very large sample szes (e.g., 5, examnees). In contrast, for such tests wth 5 or polytomous tems, the generalzed S-X showed superor performance n terms of Type I error rate control and power. Smlar performance pattern of the ndex was also found for the cases wth very large sample szes regardless of test length. Consequently, the generalzed S-X s a promsng ndex for nvestgatng tem-ft n educatonal and psychologcal assessments. To gan a better understandng of ths promsng tem-ft ndex, addtonal studes need to be conducted. Frst, wth the desgn of ths smulaton study, all generated tems were consdered msft for the power study. Followng Orlando and Thssen s (3) study, the senstvty of the generalzed S-X n detectng dfferent percentages of msft tems on a test form needs to be further studed. Second, the number of test score groups and hence the df are determned by the test length and the numbers of tem score categores. All of the smulated tems n the current study had a fxed number of score categores. Thus, studes on the mpact due to dfferent numbers of tem score categores could provde some nsght on the behavor of the ndex. Thrd, t s not rare for ablty dstrbutons to be non-normal for an educatonal or psychologcal assessment (Mccer, 989). The performance of the generalzed S-X needs to be nvestgated under condtons where the ablty dstrbuton s not normal (e.g., unform or skewed). Fnally, noteworthy behavor of S-X has been found for dchotomous and polytomous tems separately, yet t mght behave dfferently for mxed format tests because of format effect or multdmensonalty. So, further studes on the S-X ndex for mxed format test data are requred.

25

26 References Andrch, D. (978). Applcaton of a psychometrc model to ordered categores whch are scored wth successve ntegers. Appled Psychologcal Measurement,, Bock, R. D. (97). Estmatng tem parameters and latent ablty when responses are scored n two or more nomnal categores. Psychometrka, 37, 9-5. Bolt, D. M. (). A Monte Carlo comparson of parametrc and nonparametrc polytomous DIF detecton methods. Appled Measurement n Educaton, 5, 3-4. DeMars, C. E. (5). Type I error rates for PARSCALE s ft ndex. Educatonal and Psychologcal Measurement, 65, 4-5. Douglas, J., & Cohen, A. S. (). Nonparametrc tem response functon estmaton for assessng parametrc model ft. Appled Psychologcal Measurement, 5, las, C. A. W., & Suarez-Falcon, J. C. (3). A comparson of tem-ft statstcs for the three-parameter logstc model. Appled Psychologcal Measurement, 7, Hambleton, R. K., Swamnathan, H., & Rogers, H. J. (99). Fundamentals of tem response theory. Newbury Park, CA:Sage. Larntz, K. (978). Small-sample comparsons of exact levels for Ch-squared goodness-of-ft statstcs. Journal of the Amercan Statstcal Assocaton, 73, Lawley, D.N. (943). On problems connected wth tem selecton and test constructon. Proceedngs of the Royal Socety of Ednburgh, 6A, Lang, T., & Wells, C. S. (7). A model ft statstc for generalzed partal credt model. Paper presented at the annual meetng of the Natonal Councl on Measurement n Educaton, Chcago, IL. Lord, F.M. (95). A theory of test scores. Psychometrc Monograph, No. 7. Lord, F. M., & Wngersky, M. S. (984). Comparson of IRT true-score and equpercentle observed-score equatongs". Appled Psychologcal Measurement, 8, Masters,. N. (98). A Rasch model for partal credt scorng. Psychometrka, 47, McKnley, R. L., & Mlls, C. N. (985). A comparson of several goodness-of-ft statstcs, Appled Psychologcal Measurement, 9, Mccer, T. (989). The uncorn, the normal curve and other mprobable creatures. Psychologcal Bulletn, 5:, Mslevy, R. J., & Bock, R. D. (99). BILO: Item analyss and test scorng wth bnary logstc models [computer software]. Chcago, IL: Scentfc Software. Murak, E. (99). A generalzed partal credt model: Applcaton of an EM algorthm. Appled Psychologcal Measurement, 6, Murak, E. (996). A generalzed partal credt model. In Van der Lnden, W. J., & Hambleton, R. K. (Eds.), Handbook of modern tem response theory. New York: Sprnger.

27 Murak, E. & Bock, R. D. (997). PARSCALE: IRT tem analyss and test scorng for ratngscale data [Computer software]. Chcago: Scentfc Software. Orlando, M., & Thssen, D. (). New tem ft ndces for dchotomous tem response theory models. Appled Psychologcal Measurement, 4, Orlando, M., & Thssen, D. (3). Further nvestgaton of the performance of S-X : An tem ft ndex for use wth dchotomous tem response theory models. Appled Psychologcal Measurement, 7, Pett, M. A. (997). Nonparametrc statstcs for health care research: Statstcs for small samples and unusual dstrbutons. Thousand Oaks, CA: Sage Publcaton, Inc. Roberts, J. S. (n press). Modfed lkelhood-based tem ft statstcs for the generalzed graded unfoldng model. Appled Psychologcal Measurement. Rupp, A. A., & Zumbo, B. D. (4). A note on how to quantfy and report whether IRT parameter nvarance holds: When Pearson correlatons are not enough. Educatonal and Psychologcal Measurement, 65, Samejma, F. (969). Estmaton of latent ablty usng a response pattern of graded scores. Psychometrka Monograph, 7. Shepard, L., Camll,., & Wllams, D. M. (984). Accountng for statstcal artfacts n tem bas research. Journal of Educatonal Statstcs, 9, Snharay, S. (3). Bayesan tem analyss for dchotomous tem response theory models (ETS RR-3-34). Retreved May, 5, from the ETS Web ste Snharay, S. (5). Bayesan tem ft analyss for undmensonal tem response models. Paper presented at the annual meetng of the Natonal Councl on Measurement n Educaton, Montreal, Canada. Stone, C. A. (). Monte Carlo based null dstrbuton for an alternatve goodness-of-ft test statstc n IRT models. Journal of Educatonal Measurement, 37, Stone, C. A., & Hansen, M. A. (). The effect of errors n estmatng ablty on goodnessof-ft tests for IRT models. Educatonal and Psychologcal Measurement, 6, Stone, C. A., & Zhang, B. (3). Assessng goodness of ft of tem response theory models: a comparson of tradtonal and alternatve procedures. Journal of Educatonal Measurement, 4, Suarez-Falcon, J. C., & las, C. A. W. (3). Evaluaton of global testng procedures for tem ft to the Rasch model. Brtsh Journal of Mathematcal and Statstcal Psychology, Thssen, D. (99). MULTILO: Multple, categorcal tem analyss and test scorng usng tem response theory. Chcago, IL: Scentfc Software. [Computer Program.] Thssen, D., Pommerch, M., Blleaud, K., & Wllams, V. (995). Item response theory for scores on tests ncludng polytomous tems wth ordered responses. Appled Psychologcal Measurement, 9, Wells, C. S. (4). Investgaton of model msft n IRT and a new approach based on smultaneous parametrc and nonparametrc IRT estmaton. Unpublshed doctoral dssertaton. Unversty of Wsconsn-Madson.

28 Yen, W. M. (98). Usng smulaton results to choose a latent trat model. Appled Psychologcal Measurement, 5,

29 4

30 5 Appendx A Examples of the PARSCALE Codes Used for PCM, PCM and RSM Calbraton < A-: PCM > >FILE DFNAME='ca.dat', SAVE; >SAVE PARM='gpgpca.par'; >INPUT NIDW=3, NTOTAL=, NTEST=, LENTH(), NFMT=; (3A,T,A) >TEST TNAME=gpca, ITEM=(()), NBLOCK=; >BLOCK BNAME=SBLOCK, NITEMS=, NCAT=5, SCORIN=(,,3,4,5), REPEAT = ; >CALIB PARTIAL, LOISTIC, CYCLES=(,,,), SCALE=., NQPTS=4, ITEMFIT=; >SCORE NOSCORE; < A-: PCM > >FILE DFNAME='ca.dat', SAVE; >SAVE PARM='pgpca.par'; >INPUT NIDW=3, NTOTAL=, NTEST=, LENTH(), NFMT=; (3A,T,A) >TEST TNAME=gpca, ITEM=(()), NBLOCK=; >BLOCK BNAME=SBLOCK, NITEMS=, NCAT=5, SCORIN=(,,3,4,5), REPEAT = ; >CALIB PARTIAL, LOISTIC, CYCLES=(,,,), SCALE=., NQPTS=4, ITEMFIT=,SPRIOR, PRIORREAD; >PRIORS SMU=(()), SSIMA=(.()); >SCORE NOSCORE; < A-3: RSM > >FILE DFNAME='ca.dat', SAVE; >SAVE PARM='rgpca.par'; >INPUT NIDW=3, NTOTAL=, NTEST=, LENTH(), NFMT=; (3A,T,A) >TEST TNAME=gpca, ITEM=(()), NBLOCK=; >BLOCK BNAME=SBLOCK, NITEMS=, NCAT=5, SCORIN=(,,3,4,5); >CALIB PARTIAL, LOISTIC, CYCLES=(,,,), SCALE=., NQPTS=4, ITEMFIT=,SPRIOR, PRIORREAD; >PRIORS SMU=(()), SSIMA=(.()); >SCORE NOSCORE;

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 12 14 The Ch-squared dstrbuton PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 1 If a normal varable X, havng mean µ and varance σ, s standardsed, the new varable Z has a mean 0 and varance 1. When ths standardsed

More information

Can Auto Liability Insurance Purchases Signal Risk Attitude?

Can Auto Liability Insurance Purchases Signal Risk Attitude? Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159-164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? Chu-Shu L Department of Internatonal Busness, Asa Unversty, Tawan Sheng-Chang

More information

Forecasting the Direction and Strength of Stock Market Movement

Forecasting the Direction and Strength of Stock Market Movement Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye cjngwe@stanford.edu mchen5@stanford.edu nanye@stanford.edu Abstract - Stock market s one of the most complcated systems

More information

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6

NPAR TESTS. One-Sample Chi-Square Test. Cell Specification. Observed Frequencies 1O i 6. Expected Frequencies 1EXP i 6 PAR TESTS If a WEIGHT varable s specfed, t s used to replcate a case as many tmes as ndcated by the weght value rounded to the nearest nteger. If the workspace requrements are exceeded and samplng has

More information

Calculation of Sampling Weights

Calculation of Sampling Weights Perre Foy Statstcs Canada 4 Calculaton of Samplng Weghts 4.1 OVERVIEW The basc sample desgn used n TIMSS Populatons 1 and 2 was a two-stage stratfed cluster desgn. 1 The frst stage conssted of a sample

More information

An Alternative Way to Measure Private Equity Performance

An Alternative Way to Measure Private Equity Performance An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate

More information

SIMPLE LINEAR CORRELATION

SIMPLE LINEAR CORRELATION SIMPLE LINEAR CORRELATION Smple lnear correlaton s a measure of the degree to whch two varables vary together, or a measure of the ntensty of the assocaton between two varables. Correlaton often s abused.

More information

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol

CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK Sample Stablty Protocol Background The Cholesterol Reference Method Laboratory Network (CRMLN) developed certfcaton protocols for total cholesterol, HDL

More information

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006

Latent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson Statstcs for Psychosocal Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson (LCR) What s t and when do we use t? Recall the standard latent class model

More information

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation

Exhaustive Regression. An Exploration of Regression-Based Data Mining Techniques Using Super Computation Exhaustve Regresson An Exploraton of Regresson-Based Data Mnng Technques Usng Super Computaton Antony Daves, Ph.D. Assocate Professor of Economcs Duquesne Unversty Pttsburgh, PA 58 Research Fellow The

More information

An Interest-Oriented Network Evolution Mechanism for Online Communities

An Interest-Oriented Network Evolution Mechanism for Online Communities An Interest-Orented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne

More information

Statistical algorithms in Review Manager 5

Statistical algorithms in Review Manager 5 Statstcal algorthms n Reve Manager 5 Jonathan J Deeks and Julan PT Hggns on behalf of the Statstcal Methods Group of The Cochrane Collaboraton August 00 Data structure Consder a meta-analyss of k studes

More information

Evaluating credit risk models: A critique and a new proposal

Evaluating credit risk models: A critique and a new proposal Evaluatng credt rsk models: A crtque and a new proposal Hergen Frerchs* Gunter Löffler Unversty of Frankfurt (Man) February 14, 2001 Abstract Evaluatng the qualty of credt portfolo rsk models s an mportant

More information

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..

More information

How To Understand The Results Of The German Meris Cloud And Water Vapour Product

How To Understand The Results Of The German Meris Cloud And Water Vapour Product Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPP-ATBD-ClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller

More information

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek

THE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek HE DISRIBUION OF LOAN PORFOLIO VALUE * Oldrch Alfons Vascek he amount of captal necessary to support a portfolo of debt securtes depends on the probablty dstrbuton of the portfolo loss. Consder a portfolo

More information

How To Calculate The Accountng Perod Of Nequalty

How To Calculate The Accountng Perod Of Nequalty Inequalty and The Accountng Perod Quentn Wodon and Shlomo Ytzha World Ban and Hebrew Unversty September Abstract Income nequalty typcally declnes wth the length of tme taen nto account for measurement.

More information

Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001

Proceedings of the Annual Meeting of the American Statistical Association, August 5-9, 2001 Proceedngs of the Annual Meetng of the Amercan Statstcal Assocaton, August 5-9, 2001 LIST-ASSISTED SAMPLING: THE EFFECT OF TELEPHONE SYSTEM CHANGES ON DESIGN 1 Clyde Tucker, Bureau of Labor Statstcs James

More information

) of the Cell class is created containing information about events associated with the cell. Events are added to the Cell instance

) of the Cell class is created containing information about events associated with the cell. Events are added to the Cell instance Calbraton Method Instances of the Cell class (one nstance for each FMS cell) contan ADC raw data and methods assocated wth each partcular FMS cell. The calbraton method ncludes event selecton (Class Cell

More information

Statistical Methods to Develop Rating Models

Statistical Methods to Develop Rating Models Statstcal Methods to Develop Ratng Models [Evelyn Hayden and Danel Porath, Österrechsche Natonalbank and Unversty of Appled Scences at Manz] Source: The Basel II Rsk Parameters Estmaton, Valdaton, and

More information

An Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services

An Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services An Evaluaton of the Extended Logstc, Smple Logstc, and Gompertz Models for Forecastng Short Lfecycle Products and Servces Charles V. Trappey a,1, Hsn-yng Wu b a Professor (Management Scence), Natonal Chao

More information

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).

benefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ). REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or

More information

Project Networks With Mixed-Time Constraints

Project Networks With Mixed-Time Constraints Project Networs Wth Mxed-Tme Constrants L Caccetta and B Wattananon Western Australan Centre of Excellence n Industral Optmsaton (WACEIO) Curtn Unversty of Technology GPO Box U1987 Perth Western Australa

More information

CHAPTER 14 MORE ABOUT REGRESSION

CHAPTER 14 MORE ABOUT REGRESSION CHAPTER 14 MORE ABOUT REGRESSION We learned n Chapter 5 that often a straght lne descrbes the pattern of a relatonshp between two quanttatve varables. For nstance, n Example 5.1 we explored the relatonshp

More information

Staff Paper. Farm Savings Accounts: Examining Income Variability, Eligibility, and Benefits. Brent Gloy, Eddy LaDue, and Charles Cuykendall

Staff Paper. Farm Savings Accounts: Examining Income Variability, Eligibility, and Benefits. Brent Gloy, Eddy LaDue, and Charles Cuykendall SP 2005-02 August 2005 Staff Paper Department of Appled Economcs and Management Cornell Unversty, Ithaca, New York 14853-7801 USA Farm Savngs Accounts: Examnng Income Varablty, Elgblty, and Benefts Brent

More information

Estimation of Dispersion Parameters in GLMs with and without Random Effects

Estimation of Dispersion Parameters in GLMs with and without Random Effects Mathematcal Statstcs Stockholm Unversty Estmaton of Dsperson Parameters n GLMs wth and wthout Random Effects Meng Ruoyan Examensarbete 2004:5 Postal address: Mathematcal Statstcs Dept. of Mathematcs Stockholm

More information

DEFINING %COMPLETE IN MICROSOFT PROJECT

DEFINING %COMPLETE IN MICROSOFT PROJECT CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMI-SP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,

More information

Assessing short-term individual consistency using IRT-based statistics

Assessing short-term individual consistency using IRT-based statistics Pscológca (2010), 31, 319-334. Assessng short-term ndvdual consstency usng IRT-based statstcs Pere J. Ferrando * Rovra Vrgl Unversty, Span Ths artcle proposes a procedure, based on a global statstc, for

More information

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña

A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION. Michael E. Kuhl Radhamés A. Tolentino-Peña Proceedngs of the 2008 Wnter Smulaton Conference S. J. Mason, R. R. Hll, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATION-BASED OPTIMIZATION

More information

STATISTICAL DATA ANALYSIS IN EXCEL

STATISTICAL DATA ANALYSIS IN EXCEL Mcroarray Center STATISTICAL DATA ANALYSIS IN EXCEL Lecture 6 Some Advanced Topcs Dr. Petr Nazarov 14-01-013 petr.nazarov@crp-sante.lu Statstcal data analyss n Ecel. 6. Some advanced topcs Correcton for

More information

The OC Curve of Attribute Acceptance Plans

The OC Curve of Attribute Acceptance Plans The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4

More information

INVESTIGATION OF VEHICULAR USERS FAIRNESS IN CDMA-HDR NETWORKS

INVESTIGATION OF VEHICULAR USERS FAIRNESS IN CDMA-HDR NETWORKS 21 22 September 2007, BULGARIA 119 Proceedngs of the Internatonal Conference on Informaton Technologes (InfoTech-2007) 21 st 22 nd September 2007, Bulgara vol. 2 INVESTIGATION OF VEHICULAR USERS FAIRNESS

More information

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network *

Data Broadcast on a Multi-System Heterogeneous Overlayed Wireless Network * JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 24, 819-840 (2008) Data Broadcast on a Mult-System Heterogeneous Overlayed Wreless Network * Department of Computer Scence Natonal Chao Tung Unversty Hsnchu,

More information

Realistic Image Synthesis

Realistic Image Synthesis Realstc Image Synthess - Combned Samplng and Path Tracng - Phlpp Slusallek Karol Myszkowsk Vncent Pegoraro Overvew: Today Combned Samplng (Multple Importance Samplng) Renderng and Measurng Equaton Random

More information

A multiple objective test assembly approach for exposure control problems in Computerized Adaptive Testing

A multiple objective test assembly approach for exposure control problems in Computerized Adaptive Testing Pscológca (2), 3, 335-355. A multple objectve test assembly approach for exposure control problems n Computerzed Adaptve Testng Bernard P. Veldkamp * (), Angela J. Verschoor (2) & Theo J.H.M. Eggen (2)

More information

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis

The Development of Web Log Mining Based on Improve-K-Means Clustering Analysis The Development of Web Log Mnng Based on Improve-K-Means Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna wangtngzhong2@sna.cn Abstract.

More information

Gender differences in revealed risk taking: evidence from mutual fund investors

Gender differences in revealed risk taking: evidence from mutual fund investors Economcs Letters 76 (2002) 151 158 www.elsever.com/ locate/ econbase Gender dfferences n revealed rsk takng: evdence from mutual fund nvestors a b c, * Peggy D. Dwyer, James H. Glkeson, John A. Lst a Unversty

More information

PRIVATE SCHOOL CHOICE: THE EFFECTS OF RELIGIOUS AFFILIATION AND PARTICIPATION

PRIVATE SCHOOL CHOICE: THE EFFECTS OF RELIGIOUS AFFILIATION AND PARTICIPATION PRIVATE SCHOOL CHOICE: THE EFFECTS OF RELIIOUS AFFILIATION AND PARTICIPATION Danny Cohen-Zada Department of Economcs, Ben-uron Unversty, Beer-Sheva 84105, Israel Wllam Sander Department of Economcs, DePaul

More information

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements

CS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements Lecture 3 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 5329 Sennott Square Next lecture: Matlab tutoral Announcements Rules for attendng the class: Regstered for credt Regstered for audt (only f there

More information

Binomial Link Functions. Lori Murray, Phil Munz

Binomial Link Functions. Lori Murray, Phil Munz Bnomal Lnk Functons Lor Murray, Phl Munz Bnomal Lnk Functons Logt Lnk functon: ( p) p ln 1 p Probt Lnk functon: ( p) 1 ( p) Complentary Log Log functon: ( p) ln( ln(1 p)) Motvatng Example A researcher

More information

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting

Causal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting Causal, Explanatory Forecastng Assumes cause-and-effect relatonshp between system nputs and ts output Forecastng wth Regresson Analyss Rchard S. Barr Inputs System Cause + Effect Relatonshp The job of

More information

A Probabilistic Theory of Coherence

A Probabilistic Theory of Coherence A Probablstc Theory of Coherence BRANDEN FITELSON. The Coherence Measure C Let E be a set of n propostons E,..., E n. We seek a probablstc measure C(E) of the degree of coherence of E. Intutvely, we want

More information

Portfolio Loss Distribution

Portfolio Loss Distribution Portfolo Loss Dstrbuton Rsky assets n loan ortfolo hghly llqud assets hold-to-maturty n the bank s balance sheet Outstandngs The orton of the bank asset that has already been extended to borrowers. Commtment

More information

An Empirical Study of Search Engine Advertising Effectiveness

An Empirical Study of Search Engine Advertising Effectiveness An Emprcal Study of Search Engne Advertsng Effectveness Sanjog Msra, Smon School of Busness Unversty of Rochester Edeal Pnker, Smon School of Busness Unversty of Rochester Alan Rmm-Kaufman, Rmm-Kaufman

More information

Characterization of Assembly. Variation Analysis Methods. A Thesis. Presented to the. Department of Mechanical Engineering. Brigham Young University

Characterization of Assembly. Variation Analysis Methods. A Thesis. Presented to the. Department of Mechanical Engineering. Brigham Young University Characterzaton of Assembly Varaton Analyss Methods A Thess Presented to the Department of Mechancal Engneerng Brgham Young Unversty In Partal Fulfllment of the Requrements for the Degree Master of Scence

More information

Enabling P2P One-view Multi-party Video Conferencing

Enabling P2P One-view Multi-party Video Conferencing Enablng P2P One-vew Mult-party Vdeo Conferencng Yongxang Zhao, Yong Lu, Changja Chen, and JanYn Zhang Abstract Mult-Party Vdeo Conferencng (MPVC) facltates realtme group nteracton between users. Whle P2P

More information

Brigid Mullany, Ph.D University of North Carolina, Charlotte

Brigid Mullany, Ph.D University of North Carolina, Charlotte Evaluaton And Comparson Of The Dfferent Standards Used To Defne The Postonal Accuracy And Repeatablty Of Numercally Controlled Machnng Center Axes Brgd Mullany, Ph.D Unversty of North Carolna, Charlotte

More information

1. Measuring association using correlation and regression

1. Measuring association using correlation and regression How to measure assocaton I: Correlaton. 1. Measurng assocaton usng correlaton and regresson We often would lke to know how one varable, such as a mother's weght, s related to another varable, such as a

More information

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic

Institute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic Lagrange Multplers as Quanttatve Indcators n Economcs Ivan Mezník Insttute of Informatcs, Faculty of Busness and Management, Brno Unversty of TechnologCzech Republc Abstract The quanttatve role of Lagrange

More information

Variance estimation for the instrumental variables approach to measurement error in generalized linear models

Variance estimation for the instrumental variables approach to measurement error in generalized linear models he Stata Journal (2003) 3, Number 4, pp. 342 350 Varance estmaton for the nstrumental varables approach to measurement error n generalzed lnear models James W. Hardn Arnold School of Publc Health Unversty

More information

What is Candidate Sampling

What is Candidate Sampling What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble

More information

Psicológica Universidad de Valencia psicologica@uv.es ISSN (Versión impresa): 0211-2159 ISSN (Versión en línea): 1576-8597 ESPAÑA

Psicológica Universidad de Valencia psicologica@uv.es ISSN (Versión impresa): 0211-2159 ISSN (Versión en línea): 1576-8597 ESPAÑA Pscológca Unversdad de Valenca pscologca@uv.es ISSN (Versón mpresa): 02-259 ISSN (Versón en línea): 576-8597 ESPAÑA 2000 Vcenta Serra / Vcenç Quera / Anton Solanas AUTOCORRELATION EFFECT ON TYPE I ERROR

More information

Single and multiple stage classifiers implementing logistic discrimination

Single and multiple stage classifiers implementing logistic discrimination Sngle and multple stage classfers mplementng logstc dscrmnaton Hélo Radke Bttencourt 1 Dens Alter de Olvera Moraes 2 Vctor Haertel 2 1 Pontfíca Unversdade Católca do Ro Grande do Sul - PUCRS Av. Ipranga,

More information

Recurrence. 1 Definitions and main statements

Recurrence. 1 Definitions and main statements Recurrence 1 Defntons and man statements Let X n, n = 0, 1, 2,... be a MC wth the state space S = (1, 2,...), transton probabltes p j = P {X n+1 = j X n = }, and the transton matrx P = (p j ),j S def.

More information

8 Algorithm for Binary Searching in Trees

8 Algorithm for Binary Searching in Trees 8 Algorthm for Bnary Searchng n Trees In ths secton we present our algorthm for bnary searchng n trees. A crucal observaton employed by the algorthm s that ths problem can be effcently solved when the

More information

RESEARCH ON DUAL-SHAKER SINE VIBRATION CONTROL. Yaoqi FENG 1, Hanping QIU 1. China Academy of Space Technology (CAST) yaoqi.feng@yahoo.

RESEARCH ON DUAL-SHAKER SINE VIBRATION CONTROL. Yaoqi FENG 1, Hanping QIU 1. China Academy of Space Technology (CAST) yaoqi.feng@yahoo. ICSV4 Carns Australa 9- July, 007 RESEARCH ON DUAL-SHAKER SINE VIBRATION CONTROL Yaoq FENG, Hanpng QIU Dynamc Test Laboratory, BISEE Chna Academy of Space Technology (CAST) yaoq.feng@yahoo.com Abstract

More information

Chapter XX More advanced approaches to the analysis of survey data. Gad Nathan Hebrew University Jerusalem, Israel. Abstract

Chapter XX More advanced approaches to the analysis of survey data. Gad Nathan Hebrew University Jerusalem, Israel. Abstract Household Sample Surveys n Developng and Transton Countres Chapter More advanced approaches to the analyss of survey data Gad Nathan Hebrew Unversty Jerusalem, Israel Abstract In the present chapter, we

More information

Quantization Effects in Digital Filters

Quantization Effects in Digital Filters Quantzaton Effects n Dgtal Flters Dstrbuton of Truncaton Errors In two's complement representaton an exact number would have nfntely many bts (n general). When we lmt the number of bts to some fnte value

More information

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network

Forecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network 700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School

More information

On-Line Fault Detection in Wind Turbine Transmission System using Adaptive Filter and Robust Statistical Features

On-Line Fault Detection in Wind Turbine Transmission System using Adaptive Filter and Robust Statistical Features On-Lne Fault Detecton n Wnd Turbne Transmsson System usng Adaptve Flter and Robust Statstcal Features Ruoyu L Remote Dagnostcs Center SKF USA Inc. 3443 N. Sam Houston Pkwy., Houston TX 77086 Emal: ruoyu.l@skf.com

More information

Analysis of Premium Liabilities for Australian Lines of Business

Analysis of Premium Liabilities for Australian Lines of Business Summary of Analyss of Premum Labltes for Australan Lnes of Busness Emly Tao Honours Research Paper, The Unversty of Melbourne Emly Tao Acknowledgements I am grateful to the Australan Prudental Regulaton

More information

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING

ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING Matthew J. Lberatore, Department of Management and Operatons, Vllanova Unversty, Vllanova, PA 19085, 610-519-4390,

More information

Returns to Experience in Mozambique: A Nonparametric Regression Approach

Returns to Experience in Mozambique: A Nonparametric Regression Approach Returns to Experence n Mozambque: A Nonparametrc Regresson Approach Joel Muzma Conference Paper nº 27 Conferênca Inaugural do IESE Desafos para a nvestgação socal e económca em Moçambque 19 de Setembro

More information

A Multistage Model of Loans and the Role of Relationships

A Multistage Model of Loans and the Role of Relationships A Multstage Model of Loans and the Role of Relatonshps Sugato Chakravarty, Purdue Unversty, and Tansel Ylmazer, Purdue Unversty Abstract The goal of ths paper s to further our understandng of how relatonshps

More information

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence

How Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence 1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh

More information

The Application of Fractional Brownian Motion in Option Pricing

The Application of Fractional Brownian Motion in Option Pricing Vol. 0, No. (05), pp. 73-8 http://dx.do.org/0.457/jmue.05.0..6 The Applcaton of Fractonal Brownan Moton n Opton Prcng Qng-xn Zhou School of Basc Scence,arbn Unversty of Commerce,arbn zhouqngxn98@6.com

More information

Forecasting and Stress Testing Credit Card Default using Dynamic Models

Forecasting and Stress Testing Credit Card Default using Dynamic Models Forecastng and Stress Testng Credt Card Default usng Dynamc Models Tony Bellott and Jonathan Crook Credt Research Centre Unversty of Ednburgh Busness School Verson 4.5 Abstract Typcally models of credt

More information

Predicting Software Development Project Outcomes *

Predicting Software Development Project Outcomes * Predctng Software Development Project Outcomes * Rosna Weber, Mchael Waller, June Verner, Wllam Evanco College of Informaton Scence & Technology, Drexel Unversty 3141 Chestnut Street Phladelpha, PA 19104

More information

Number of Levels Cumulative Annual operating Income per year construction costs costs ($) ($) ($) 1 600,000 35,000 100,000 2 2,200,000 60,000 350,000

Number of Levels Cumulative Annual operating Income per year construction costs costs ($) ($) ($) 1 600,000 35,000 100,000 2 2,200,000 60,000 350,000 Problem Set 5 Solutons 1 MIT s consderng buldng a new car park near Kendall Square. o unversty funds are avalable (overhead rates are under pressure and the new faclty would have to pay for tself from

More information

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008

Risk-based Fatigue Estimate of Deep Water Risers -- Course Project for EM388F: Fracture Mechanics, Spring 2008 Rsk-based Fatgue Estmate of Deep Water Rsers -- Course Project for EM388F: Fracture Mechancs, Sprng 2008 Chen Sh Department of Cvl, Archtectural, and Envronmental Engneerng The Unversty of Texas at Austn

More information

Validity evidence based on internal structure

Validity evidence based on internal structure Pscothema 014, Vol. 6, No. 1, 108-116 do: 10.7334/pscothema013.60 ISSN 014-9915 CODEN PSOTEG Copyrght 014 Pscothema www.pscothema.com Valdty evdence based on nternal structure Joseph Ros and Crag Wells

More information

1 Example 1: Axis-aligned rectangles

1 Example 1: Axis-aligned rectangles COS 511: Theoretcal Machne Learnng Lecturer: Rob Schapre Lecture # 6 Scrbe: Aaron Schld February 21, 2013 Last class, we dscussed an analogue for Occam s Razor for nfnte hypothess spaces that, n conjuncton

More information

Survival analysis methods in Insurance Applications in car insurance contracts

Survival analysis methods in Insurance Applications in car insurance contracts Survval analyss methods n Insurance Applcatons n car nsurance contracts Abder OULIDI 1 Jean-Mare MARION 2 Hervé GANACHAUD 3 Abstract In ths wor, we are nterested n survval models and ther applcatons on

More information

Logistic Regression. Steve Kroon

Logistic Regression. Steve Kroon Logstc Regresson Steve Kroon Course notes sectons: 24.3-24.4 Dsclamer: these notes do not explctly ndcate whether values are vectors or scalars, but expects the reader to dscern ths from the context. Scenaro

More information

On the Optimal Control of a Cascade of Hydro-Electric Power Stations

On the Optimal Control of a Cascade of Hydro-Electric Power Stations On the Optmal Control of a Cascade of Hydro-Electrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;

More information

CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES

CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES In ths chapter, we wll learn how to descrbe the relatonshp between two quanttatve varables. Remember (from Chapter 2) that the terms quanttatve varable

More information

The Current Employment Statistics (CES) survey,

The Current Employment Statistics (CES) survey, Busness Brths and Deaths Impact of busness brths and deaths n the payroll survey The CES probablty-based sample redesgn accounts for most busness brth employment through the mputaton of busness deaths,

More information

Abstract. 260 Business Intelligence Journal July IDENTIFICATION OF DEMAND THROUGH STATISTICAL DISTRIBUTION MODELING FOR IMPROVED DEMAND FORECASTING

Abstract. 260 Business Intelligence Journal July IDENTIFICATION OF DEMAND THROUGH STATISTICAL DISTRIBUTION MODELING FOR IMPROVED DEMAND FORECASTING 260 Busness Intellgence Journal July IDENTIFICATION OF DEMAND THROUGH STATISTICAL DISTRIBUTION MODELING FOR IMPROVED DEMAND FORECASTING Murphy Choy Mchelle L.F. Cheong School of Informaton Systems, Sngapore

More information

Transition Matrix Models of Consumer Credit Ratings

Transition Matrix Models of Consumer Credit Ratings Transton Matrx Models of Consumer Credt Ratngs Abstract Although the corporate credt rsk lterature has many studes modellng the change n the credt rsk of corporate bonds over tme, there s far less analyss

More information

Marginal Benefit Incidence Analysis Using a Single Cross-section of Data. Mohamed Ihsan Ajwad and Quentin Wodon 1. World Bank.

Marginal Benefit Incidence Analysis Using a Single Cross-section of Data. Mohamed Ihsan Ajwad and Quentin Wodon 1. World Bank. Margnal Beneft Incdence Analyss Usng a Sngle Cross-secton of Data Mohamed Ihsan Ajwad and uentn Wodon World Bank August 200 Abstract In a recent paper, Lanjouw and Ravallon proposed an attractve and smple

More information

A Secure Password-Authenticated Key Agreement Using Smart Cards

A Secure Password-Authenticated Key Agreement Using Smart Cards A Secure Password-Authentcated Key Agreement Usng Smart Cards Ka Chan 1, Wen-Chung Kuo 2 and Jn-Chou Cheng 3 1 Department of Computer and Informaton Scence, R.O.C. Mltary Academy, Kaohsung 83059, Tawan,

More information

Hot and easy in Florida: The case of economics professors

Hot and easy in Florida: The case of economics professors Research n Hgher Educaton Journal Abstract Hot and easy n Florda: The case of economcs professors Olver Schnusenberg The Unversty of North Florda Cheryl Froehlch The Unversty of North Florda We nvestgate

More information

Depreciation of Business R&D Capital

Depreciation of Business R&D Capital Deprecaton of Busness R&D Captal U.S. Bureau of Economc Analyss Abstract R&D deprecaton rates are crtcal to calculatng the rates of return to R&D nvestments and captal servce costs, whch are mportant for

More information

General Iteration Algorithm for Classification Ratemaking

General Iteration Algorithm for Classification Ratemaking General Iteraton Algorthm for Classfcaton Ratemakng by Luyang Fu and Cheng-sheng eter Wu ABSTRACT In ths study, we propose a flexble and comprehensve teraton algorthm called general teraton algorthm (GIA)

More information

Time Domain simulation of PD Propagation in XLPE Cables Considering Frequency Dependent Parameters

Time Domain simulation of PD Propagation in XLPE Cables Considering Frequency Dependent Parameters Internatonal Journal of Smart Grd and Clean Energy Tme Doman smulaton of PD Propagaton n XLPE Cables Consderng Frequency Dependent Parameters We Zhang a, Jan He b, Ln Tan b, Xuejun Lv b, Hong-Je L a *

More information

Measures of Fit for Logistic Regression

Measures of Fit for Logistic Regression ABSTRACT Paper 1485-014 SAS Global Forum Measures of Ft for Logstc Regresson Paul D. Allson, Statstcal Horzons LLC and the Unversty of Pennsylvana One of the most common questons about logstc regresson

More information

BERNSTEIN POLYNOMIALS

BERNSTEIN POLYNOMIALS On-Lne Geometrc Modelng Notes BERNSTEIN POLYNOMIALS Kenneth I. Joy Vsualzaton and Graphcs Research Group Department of Computer Scence Unversty of Calforna, Davs Overvew Polynomals are ncredbly useful

More information

"Research Note" APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES *

Research Note APPLICATION OF CHARGE SIMULATION METHOD TO ELECTRIC FIELD CALCULATION IN THE POWER CABLES * Iranan Journal of Scence & Technology, Transacton B, Engneerng, ol. 30, No. B6, 789-794 rnted n The Islamc Republc of Iran, 006 Shraz Unversty "Research Note" ALICATION OF CHARGE SIMULATION METHOD TO ELECTRIC

More information

Is There A Tradeoff between Employer-Provided Health Insurance and Wages?

Is There A Tradeoff between Employer-Provided Health Insurance and Wages? Is There A Tradeoff between Employer-Provded Health Insurance and Wages? Lye Zhu, Southern Methodst Unversty October 2005 Abstract Though most of the lterature n health nsurance and the labor market assumes

More information

Financial Mathemetics

Financial Mathemetics Fnancal Mathemetcs 15 Mathematcs Grade 12 Teacher Gude Fnancal Maths Seres Overvew In ths seres we am to show how Mathematcs can be used to support personal fnancal decsons. In ths seres we jon Tebogo,

More information

Regression Models for a Binary Response Using EXCEL and JMP

Regression Models for a Binary Response Using EXCEL and JMP SEMATECH 997 Statstcal Methods Symposum Austn Regresson Models for a Bnary Response Usng EXCEL and JMP Davd C. Trndade, Ph.D. STAT-TECH Consultng and Tranng n Appled Statstcs San Jose, CA Topcs Practcal

More information

FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES

FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES Zuzanna BRO EK-MUCHA, Grzegorz ZADORA, 2 Insttute of Forensc Research, Cracow, Poland 2 Faculty of Chemstry, Jagellonan

More information

Trade Adjustment and Productivity in Large Crises. Online Appendix May 2013. Appendix A: Derivation of Equations for Productivity

Trade Adjustment and Productivity in Large Crises. Online Appendix May 2013. Appendix A: Derivation of Equations for Productivity Trade Adjustment Productvty n Large Crses Gta Gopnath Department of Economcs Harvard Unversty NBER Brent Neman Booth School of Busness Unversty of Chcago NBER Onlne Appendx May 2013 Appendx A: Dervaton

More information

AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE

AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE Yu-L Huang Industral Engneerng Department New Mexco State Unversty Las Cruces, New Mexco 88003, U.S.A. Abstract Patent

More information

FINAL REPORT. City of Toronto. Contract 47016555. Project No: B000203-3

FINAL REPORT. City of Toronto. Contract 47016555. Project No: B000203-3 Cty of Toronto SAFETY IMPACTS AD REGULATIOS OF ELECTROIC STATIC ROADSIDE ADVERTISIG SIGS TECHICAL MEMORADUM #2C BEFORE/AFTER COLLISIO AALYSIS AT SIGALIZED ITERSECTIO FIAL REPORT 3027 Harvester Road, Sute

More information

Control Charts with Supplementary Runs Rules for Monitoring Bivariate Processes

Control Charts with Supplementary Runs Rules for Monitoring Bivariate Processes Control Charts wth Supplementary Runs Rules for Montorng varate Processes Marcela. G. Machado *, ntono F.. Costa * * Producton Department, Sao Paulo State Unversty, Campus of Guaratnguetá, 56-4 Guaratnguetá,

More information

Underwriting Risk. Glenn Meyers. Insurance Services Office, Inc.

Underwriting Risk. Glenn Meyers. Insurance Services Office, Inc. Underwrtng Rsk By Glenn Meyers Insurance Servces Offce, Inc. Abstract In a compettve nsurance market, nsurers have lmted nfluence on the premum charged for an nsurance contract. hey must decde whether

More information

Approximating Cross-validatory Predictive Evaluation in Bayesian Latent Variables Models with Integrated IS and WAIC

Approximating Cross-validatory Predictive Evaluation in Bayesian Latent Variables Models with Integrated IS and WAIC Approxmatng Cross-valdatory Predctve Evaluaton n Bayesan Latent Varables Models wth Integrated IS and WAIC Longha L Department of Mathematcs and Statstcs Unversty of Saskatchewan Saskatoon, SK, CANADA

More information

7 ANALYSIS OF VARIANCE (ANOVA)

7 ANALYSIS OF VARIANCE (ANOVA) 7 ANALYSIS OF VARIANCE (ANOVA) Chapter 7 Analyss of Varance (Anova) Objectves After studyng ths chapter you should apprecate the need for analysng data from more than two samples; understand the underlyng

More information

Adaptive Clinical Trials Incorporating Treatment Selection and Evaluation: Methodology and Applications in Multiple Sclerosis

Adaptive Clinical Trials Incorporating Treatment Selection and Evaluation: Methodology and Applications in Multiple Sclerosis Adaptve Clncal Trals Incorporatng Treatment electon and Evaluaton: Methodology and Applcatons n Multple cleross usan Todd, Tm Frede, Ngel tallard, Ncholas Parsons, Elsa Valdés-Márquez, Jeremy Chataway

More information