Inferring Individual Level Relationships from Aggregate Data *


 Jade Evans
 1 years ago
 Views:
Transcription
1 Inferrng Indvdual Level Relatonshps from Aggregate Data * Khong Eom **, Youngjae Jn *** < ABSTRACT > Ths paper ntroduces a technque for nferrng ndvdual level relatonshps from aggregate data. Socal scentsts encounter wth the ecologcal fallacy problem where ndvdual level data are not avalable, yet the ndvdual level relatonshp s sought. In partcular, t s the case that socal scentsts attempt to examne a subject for whch no survey data are avalable or relable. A seres of crosslevel nference technques have been ntroduced snce the Goodman s semnal work (1959). We ntroduce a new technque of sgnfcantly mprovng the crosslevel nference, the Gary Kng s soluton. For the purpose of verfcaton, we examned regonal votng n the 16 th Natonal Assembly electons of South Korea. The estmates of regonal votng are compared wth those of survey results. We found that the estmates from the Kng s soluton are closely matchng wth those from survey results f cell frequency n survey results s large enough. The Kng s soluton produces more relable estmates n a case that cell frequency n survey results s small. Key words : Crosslevel nference, ecologcal fallacy, aggregate data, ndvdual level relatonshp * An earler verson was presented at the 53rd meetng of the Internatonal Statstcal Insttute, Seoul, Korea, August 2229, ** Frostburg State Unversty, Lecturer (emal: *** Yonse Unversty, Assocate Professor (emal:
2 I. Introducton Who voted for Chunghee Park n South Korea n the 1963 presdental electon? What s the level of regonal votng n that electon? How many tmes dd female have an experence on aborton for her lfe tme? They all are nterestng questons, yet hard to examne, partly because t s a hstorcal matter so that survey data s not avalable or partly because a survey respondent had a poltcally correct answer, f data were avalable, and thus results mght be based. The purpose of ths paper s to ntroduce a new technque for solvng these problems, the Gary Kng s ecologcal regresson. To verfy ths technque, we compare estmates from survey results wth those from the Kng s method. The case selected for the verfcaton s "regonal votng" n the 16 th Korean Natonal Assembly electons. The regonal votng refers to the concentraton of votes along regonal party lnes n a number of Korean regons (Km 1994; Lee 1998; Lee and Brunn 1996). 1) Stated another way, voters whose hometown s Jeolla mostly vote for the canddate of the party whose leaders were born n the regon. Snce ths appears to happen regardless of the qualty of canddates and the deology of party, votng patterns result n partes often beng representatves of regons 1) By defnton, regonalsm refers to the voters affectve dentfcatons wth, and support for, canddates wth roots n ther respectve regons (Km and Koh 1980, 81)
3 nstead of dstrcts or the naton (Shn, Jn, Gross and Eom 2005). However, the level of regonal votng has been a dffcult topc to be examned, because votng s secret and a survey respondent may have a poltcally correct answer. Snce the Goodman s semnal work (1959), several technques for the crosslevel nference s developed to solve the ecologcal fallacy problem (Palmqust 1993). The Kng s soluton (1997) s well known for producng effcent and robust estmates. In addton, t contans nformaton on the uncertanty of estmates at the level of analyss. After the explanaton of the Kng s soluton, we analyzed regonal votng n the 16th Natonal Assembly electons. The estmates from the Kng's soluton are compared wth those from survey results. We found that estmates from the Kng s soluton are closely matchng wth those from survey results f cell frequency n survey results s large enough. The Kng s soluton produces more relable estmates n a case that cell frequency n survey results s small. II. Ecologcal Fallacy and Ecologcal Regressons The crosslevel nference s "the process of usng aggregate (.e., "ecologcal") data to nfer dscrete ndvdual level relatonshps of nterest" (Kng 1997, xv). It provdes a soluton for the problem of ecologcal fallacy. In ths secton, we ntroduce the problem of  2 
4 ecologcal fallacy. We then move to descrbe a seres of efforts to solve ths problem. 1. Ecologcal Fallacy It s well known that usng aggregate level data to fgure out ndvdual level relatonshps generates the ecologcal fallacy problem whch produces based and neffcent estmates (Palmqust 1993). For example, suppose that our research queston s to examne the level of lteracy between the foregn born and the natve (Robnson 1950). Further, assume that we have three groups (the sophstcated, the regular and the foregn born), and both the sophstcated and the foregn born prefer to lve n a cty and the regular lke to lve n a rural area. If a researcher regresses the percentage of the foregn born on lteracy rate at the county level, he or she may fnd that the greater the percentage of foregn born, the hgher the lteracy rate. It would be a shockng result, because the foregn born are not lkely to be lterate. However, f one analyzes the relatonshp at the ndvdual level, he or she may fnd a dfferent and more convncng result; the natve tend to have a hgher lteracy than the foregn born. Ths dscrepancy occurs because the sophstcated as well as the foregn born resde n the same type of area,.e., cty. Wthout consderaton of aggregaton unt, the fndngs from aggregate data mslead the ndvdual level relatonshp. It shows the napproprateness of usng aggregate data to examne the ndvdual level relatonshp
5 2. Ecologcal Regressons To solve the aggregaton bas, several methods have been ntroduced. A common assumpton the models make can be descrbed n below table. <Table 1> The Robnson s problem Lterate (L) Illterate (IL) Margnal The Natve(N)?? The Foregn born (F)?? 1000 Margnal Let s suppose that n a total populaton of 21,000 we observe only margnal populaton values for the Natve (N) and the Foregn born (F): 20,000 and 1,000. Also we know margnal values for the Lterate (L) and the Illterate (IL). Our research problem s to fnd cell frequency noted as queston marks; how many are the lterate among the natve and how many are the lterate among the foregn born? We then calculate lteracy ratos between the natve and the foregn born and examne whether the brthplace s related to the lteracy. One of the ways to solve ths problem can be suggested as follows. Let s suppose that we know the value for the left upper corner by pure luck; the number of the lterate among the natve s 15,000. Once we have ths nformaton, we can accordngly calculate the rest of cell values. The results are shown n table
6 <Table 2> A soluton for the Robnson s problem Lterate (L) Illterate (IL) Margnal The Natve(N) (5000) The Foregn born (F) (0) (1000) 1000 Margnal Snce the natve populaton who can read and wrte s 15,000, the number of the llterate among the natve s 5000 n a populaton of In addton, the number of the entre lterate s 15,000 and the number of the lterate natve s 15,000, and thus the number of the lterate foregn born s zero. For a purpose of comparson, table 2 can be rewrtten n table 3. <Table 3> Lteracy Ratos Lterate (L) Illterate (IL) Margnal The Natve(N) The Foregn born (F) Margnal Of the natve, the lteracy rato s 0.75 whle t s 0.00 for the foregn born. Therefore, t leads to a concluson that the natve are more lkely to be lterate compared to the foregn born. Ths example shows that we are able to dsaggregate aggregate data f we "correctly" mpose some constrants on the parameter of our nterest. In ths case, we assumed that some nformaton on the number of the  5 
7 lterate among the natve s avalable. We can generalze table 3 n the followng table. <Table 4> General Form of Ecologcal Regresson Lterate (L) Illterate (IL) Margnal The Natve(N) β N 1β N X The Foregn(F) β F 1β F 1X Margnal T 1T X s the proporton of the natve, β N s the lteracy rato for the natve, and β F s the lteracy rato for the foregn born. T s the proporton of the lterate and "" s an aggregaton unt. Wth some constrants, the parameters of our nterest (β N and β F ) can be calculated usng aggregate values of X and T. Wth ths general form, two approaches for ecologcal regresson have been developed: method of bounds and statstcal approach. Method of bounds uses determnstc nformaton n data (Achen and Shvely 1995). Let s suppose once agan we attempt to estmate the lteracy rato among the natve and the foregn born wth aggregate nformaton. The relatonshp n table 4 can be wrtten as follows: T = β N Then, X + β F (1X ) 1) β N = T X X 1 X β F  6 
8 Snce βs are a proporton, t should be between 0 and 1; 0 β T 1. In addton, f β F = 0, X becomes a maxmum value for β N, whle f β F T 1+ X = 1, X becomes a mnmum value for β N. Hence, lower and upper lmts for β N are: T 1+ X Max X,0, T Mn X,1. Wth the same procedure, we can obtan lower and upper lmts for β F as follows: T X Max 1 X,0, T Mn 1 X,1. Wth our example, the range of plausble values for β N s = [Max (0.7, 0), Mn ( )]= [0.7, 0.75]. Therefore, we can sgnfcantly narrow down plausble values of parameter β N. However, n often cases, method of bounds produces too broad nformaton, especally when the dstrbuton of proportons (X and/or T) s consderably skewed. For example, the range of β F values as n our example s [Max (5, 0), Mn (15, 1)] = [0, 1]. In ths case, method of bounds does not reduce the range of plausble values for
9 The second approach for ecologcal regressons has been developed to use logc of statstcal assocaton. If there s assocaton between varables, t wll occur across unts wth some fluctuaton. The frst model was developed by Leo A. Goodman (1959). He argues that f we can reasonably assume three thngs, we can nfer ndvdual behavor from aggregate data. Hs assumptons are constant effect of parameters, lnear functon, and normal dstrbuton of resduals. Followng hs suggeston, the equaton 1) can be rewrtten as follows: T = β N X + β F (1X ) + e, 2) Where e s resduals. If three condtons are met, he argues, parameter βs and ther standard errors are correctly nferred. The Goodman s model, however, has several problems (Voss 2000). Frst, hs constant effect assumpton s not substantvely reasonable. For example, beng the constant lteracy rato for the natve across unts are too restrctve. If the parameters (βs) covary wth a unt, the estmates may over or underestmates true βs due to the aggregaton bas. Second, snce the Goodman s model produces only a sngle estmate, t s hard to know ndvdual behavor wthn a unt. A seres of models have been developed to solve or relax these assumptons (Achen and Shverly 1995). For example, the  8 
10 homogeneous model utlzes, rather than estmates, nformaton from observed data. That s, wth our example, the homogeneous model observes the lteracy rato among the entre natve, and then uses ths rato as a benchmark for nferrng ndvdual relatonshps. The same procedure s appled for the lteracy rato for the entre foregn born. It s only useful when unts are hghly segregated, however. It becomes unrelable when both the natve and the foregn born are mxed n the same unt. The nformed assumpton model uses nformed knowledge nstead of observed lteracy rato. For example, we may have pror nformaton that the entre foregn born are llterate. In ths case, β F becomes zero, and thus we can use ths nformaton and then calculate β N and the rest of βs, as shown n table 4. However, n most cases, pror nformaton s unattanable. And, researchers may not receve a warnng when ths nformed knowledge s ncorrect, whch results n based estmates of parameters (Voss 2000). The fnal example for ecologcal regressons has a dfferent premse. The neghborhood model assumes that parameters of our nterest s the same wthn a unt (β N = β F ), yet vares across unts (β N β N j, where j). Therefore, the equaton 2) becomes T = + β (1X ) + e = β + e, where β s a functon of X. For example, ths model assumes that the lteracy rato between the natve and the foregn born s the same wthn the same unt, whle the lteracy rato vares across unts. As one may notce, the assumpton  9 
11 the neghborhood model makes s too strong. Even t s a plausble assumpton, we do not have to estmate a model, because we have an answer for our research queston; whether the brthplace s related to the level of the lteracy. Wth an excepton of the neghborhood model, a survey of ecologcal regressons shows some common problems. Frst, all of the models assumed the constant effect of parameters. It seems to be too restrctve, because parameters of our nterest are hardly constant across unts. Second, the models produce only a sngle estmate. Snce we attempt to nfer the ndvdual level relatonshp, t s not lkely to be satsfed wth a sngle estmate. Fnally, f an equaton has more than two parameters to be estmated, t s hard to magne how these methods can be extended. Gary Kng (1997) provdes an nterestng method to solve these problems. Frst, he does not assume a constant effect; rather he assumes that a parameter vares wth a common underlyng dmenson. Second, because of the varyng parameter, we may have an estmate per unt. In addton, snce hs method uses addtonal nformaton from method of bounds, the estmates become more effcent. Hs method can be wrtten as follows (Kng 1997, 9394): T = β N X + β F (1X ) + e, where P(β N, β F ) = TN (β N, β F Β, Σ) 3)
12 Probablty densty of parameters (β N, β F ) follows truncated normal dstrbuton of (β N, β F ) wth lmts β N, = [0, 1] and β F = [0, 1]. Wth the help of method of bounds, these lmts for (β N, β F ) can be narrowed down as follows: β N, = T 1+ X Max X,0, T Mn X,1 β F = T X Max 1 X,0, T Mn 1 X,1. The mean and varance matrx of (β N, β F ) are Β Β = Β N F and 2 σ N = σ σ NF Σ 2 NF σ F If hs three assumptons are met, estmaton produces an effcent and robust estmate. 2) The estmaton procedure of the Kng s soluton can be summarzed as follows: 1) The frst step calculates the bounds of parameters. 2) The second step estmates parameters from truncated bvarate normal dstrbutons wthn the bounds. If one extends a model wth more than two parameters, the estmates from the frst estmaton are used for margnal values. For 2) Three assumptons are sngle model of parameter, the absence of spatal correlaton, and no correlaton of margnal and parameter. Kng, Rosen, and Tanner (1999, 6768) show, however, that the volaton of the thrd assumpton does not produce based estmates f the bounds of parameters are low enough
13 example, f one s nterested n the proporton of regonal votng, he or she frst estmates a turnout rate among those who were born n a certan regon n a gven dstrct. The estmated turnout rate s used as margnal values for the proporton of regonal votng. It can be dagramed below: <Fgure 1> Kng s Soluton: the frst step Jeolla Vote Not vote Margnal β J 1β J X Other Regons β J ' 1β J ' 1X Margnal T 1T Where X s the proporton of votng age populaton who were born n Jeolla, T s the proporton of voters those who turn out to vote, β J s a turnout rate among those who were born n Jeolla, β J ' s a turnout rate among those who were born n a regon other than Jeolla, and "" s a dstrct ndcator. <Fgure 2> Kng s Soluton: the second step Vote Not vote Margnal NCNP Other partes Jeolla λ J 1λ J β J 1β J x Other Regons λ J ' 1λ J ' β J ' 1β J ' 1x P 1  P T Where "x" s the estmated proporton of voters whose hometown s n Jeolla and who turn out to vote, P s the vote share for a canddate whose party label s the Natonal Congress for New Poltcs (NCNP), and proporton of regonal votng. s the
14 The frst step s to examne turnout rate ( and ) for those who were born n Jeolla (X ) and for those who were born n areas other than Jeolla (1X ). Once we obtan estmates for βs, these estmates are used to calculate margnal values for regonal votng estmates (x and 1x ). The second step starts wth the calculaton of bounds of parameters (λs) and then estmates the parameters across unts. Note, however, that snce has a component to be estmated, t s not a fxed varable. Therefore, extendng tables produce more uncertan estmates due to added uncertanty orgnatng from the frst estmaton. 3) In next secton, we apply the Kng s soluton to fnd the level of regonal votng n the Korean Natonal Assembly electons of III. Applcaton: Dsaggregatng Regonal Votng The 2000 electon outcomes n Korea suggest that there are three regons whch tend to exhbt partsan regonalsm: Jeolla, Gyeongsang, and Chungcheong. Jeolla regon covers Jeollabukdo and Jeollanamdo areas, Gyeongsang regon refers to Gyeongsangbukdo and Gyeongsangnamdo areas, and Chungcheong regon means Chungcheongbukdo and Chungcheongnamdo areas. Regonal domnance by a partcular party was specfed n terms of the 3) Note that the parameters (λ and (1λ)) are weghted by the number of votng age populaton n a gven dstrct
15 brthplace of partcular party leaders. A leader of the Grand Natonal Party, Km Yong Sam was born n Gyeongsang regon a leader of the Natonal Congress for New Poltcs, Km Dae Jung n Jeolla regon and a leader of the Unted Lberal Democrats, Km Chong Phl n Chungcheong regon. Ths lnk between the brthplace of a party leader and the domnance of a partcular party s well documented n contemporary Korean poltcs (Km 1994; Lee 1998; Lee and Brunn 1996). In ths secton, usng the Gary Kng s ecologcal regresson we attempt to dsaggregate aggregate votes along the level of regonal party lnes n a dstrct. The Kng s method nfers regonal votng at the canddate level. The percentage of regonal votng at the canddate level wll be averaged out across regonal blocs and compared to estmates from survey results. The followng equatons are to be estmated:, 4), 5), 6) where P s the vote share of a canddate, λ s the proporton of regonal votng, and λ' s the proporton of nonregonal votng. J ndcates Jeolla, G Gyeongsang, and C Chungcheong. "x " s, where β s a turnout rate for those were born n a certan
16 regon, and X s the proporton of voters for those who were born n a certan regon. "" ndcates a dstrct. The level of analyss s the canddate level. Estmaton s done by the program called "EzI." 4) In the 16 th Natonal Assembly electons of Korea (Aprl 13, 2000), 194 ncumbent and 449 nonncumbent canddates ran for offce (Natonal Electon Commsson 2000). We focus our analyss on the vote share for canddates of the three major partes. 5) The percentage of those who regstered ther brthplace n a gven dstrct s collected wth the help of one of major partes. 6) The results are shown n table 5. <Table 5> Regonal Votng Estmates from Ecologcal Inference Regonal Votng Level (Average λs) Regonal Blocs GNP NCNP ULD Seoul 61.00% 75.27% 2.57% Busan 67.29% 64.66% 1.62% Daegu 62.02% 70.49% 2.26% Incheon 61.05% 73.75% 2.51% Gwangju 56.14% 79.46% 0.95% Ulsan 57.61% 66.25% 1.23% Gyeonggdo 60.43% 73.92% 2.48% Gangwondo 60.14% 71.80% 2.61% Jeollabukdo 57.52% 61.04% 1.76% Jeollanamdo 43.85% 63.86% 0.00% 4) "EzI" are developed by Kenneth Benot and Gary Kng (released n 2001). It s avalable from vsted May 1, ) We focus on only these three partes because they comprsed over 96% of the sngle member dstrct seats n the 16 th Natonal Assembly Electon. 6) Because of a contrbutor s request, the source of data has not been released. Data on dstrcts n Chungcheongdo are not avalable so that the number of dstrcts n ths study are
17 Gyeongsangbukdo 54.88% 71.17% 1.83% Gyeongsangnamdo 55.35% 68.99% 2.18% Average 52.21% 61.74% 1.04% Source: compled by the authors. Note: Average λ s calculated by averagng out dstrct level regonal votngs (λ ) along wth regonal blocs. GNP stands for the Grand Natonal Party, NCNP for the Natonal Congress for New Poltcs and ULD for the Unted Lberal Democrats. Table 5 shows that on average the percentage of regonal votng (61.74%) s the hghest among those who were born n Jeolla and t may be beneft to canddates of the Natonal Congress for New Poltcs. The percentage of regonal votng for those who were born n Gyeongsang ranked the second. Not surprsngly, the level of regonal votng are the lowest for those who were born n Chungcheong. It resulted n less concentraton of votes on canddates runnng under the Unted Lberal Democrats (ULD). In the 15 th Natonal Assembly Electons of 1996, the ULD won 25 of the 28 seats. But, by the electons of 2000, the ULD was only able to wn 11 of the 24 seats n Chungcheong. It s also the case when one examnes the percentage of regonal votng wthn a regonal bloc. For example, percent of those who were born n Jeolla cast a regonal votng f they resde n dstrcts wthn Gwangju. More than 70 percent of voters also voted for canddates of the NCNP n dstrcts wthn Seoul, Daegu, Incheon, Gyeonggdo, Gangwondo and Gyeongsangbukdo f they were born n Jeolla
18 Those who were born n Gyeongsang tend to cast a slghtly less regonal votng, yet qute a sgnfcant level. On average, more than half of voters who were born n Gyeongsang cast a regonal votng n the 16 th Natonal Assembly electons. It s especally the case when one examnes n dstrcts wthn Seoul, Busan, Daegu, Incheon, Gyeonggdo, and Gangwondo more than 60 percent of voters voted for canddates of the GNP f they were born n Gyeongsang. It s also the case, though to less extent, f he or she resdes n Gwangju, Ulsan, Jeollabukdo, Gyeongsangbukdo, and Gyeongsangnamdo. Not surprsngly, those who were born n Chungcheong cast the least extent of regonal votng. Only handful of voters who were born n Chungcheong cast regonal votng on average. However, t should be noted that data for dstrcts wthn Chungcheong area were not avalable and thus estmates may be underestmated. In sum, regonal votng estmates from the Kng s method provde supportve evdence for the argument that regonal votng s a natonwde problem (Km 1994; Lee 1998; Lee and Brunn 1996). Not only s the level of regonal votng sgnfcant n dstrcts wthn the the known regonal votng blocs (Jeolla and Gyeongsang), but also t appears to be substantal n dstrcts outsde these regonal blocs. However, there s a sgnfcant fluctuaton at the level of regonal votng across regons. For example, the percentage of regonal votng for those who were born n Jeolla vares from 61.04% n Jeollabukdo
19 to 79.46% n Gwangju, whle t vares from 43.85% n Jeollanamdo to 67.29% n Busan f voters were born n Gyeongsang. We can conclude that the level of regonal votng s not constant, but vares across regonal blocs. The results from ecologcal nference can be verfed by survey results. The procedure s the same above except that fgures are obtaned from ndvdual level data. The frst step s to dentfy voters who were born n a certan regon, and then calculate how many these voters turn out to vote for the pertnent party. The Korean Socal Scence Data Center conducted a survey of the 16 th Natonal Assembly Electons n Aprl 13, 2000 (Korean Socal Scence Data Center 2000). Multstage quota samplng technque was used to collect a random sample by regonal blocs. 1,100 ntervews were completed wth a rejecton rate of 5 percent. Fortunately, the survey ncludes a queston on the hometown of and the vote choce of a respondent. These two questons were used to construct a regonal votng; for example, f he or she was born n Jeolla area and voted for the NCNP, t s coded as a regonal votng for the NCNP. In Seoul, ffty fve respondents were born n Jeolla. Thrty four out of the ffty fve voted for the NCNP. Therefore, the percentage of regonal votng for the NCNP s percent for Seoul. Table 6 shows the percentage of regonal votng n regonal blocs. <Table 6> Regonal Votng Estmates from Survey Results
20 Regonal Votng (Percentage/Frequency) Regonal Blocs GNP NCNP ULD Seoul 46.88% 61.82% 3.03% (32) (55) (33) Incheon/Gyeonggdo 65.00% 57.14% 7.50% (20) (35) (40) Gangwondo 0.00% % 0.00% (1) (1) (3) Daejeon/Chungcheongnamdo 0.00% 20.00% 20.34% (4) (5) (59) Chungcheongbukdo 33.33% 0.00% 27.59% (3) (1) (29) Gwangju/Jeollanamdo 25.00% 50.68% 0.00% (4) (73) (4) Jeollabukdo 0.00% 60.00% 0.00% (1) (40) (5) Busan/Ulsan/Gyeongsangnamdo 64.24% 31.25% 0.00% (151) (16) (8) Daegu/Gyeongsangbukdo 53.15% 33.33% 25.00% (111) (3) (4) Average 30.09% 46.02% 9.27% Source: The Korean Socal Scence Data Center (2000). Fgures n parenthess are the number of respondents. Table 6 shows that regonal votng s the most evdent for those who were born n Jeolla, followed by those who were born n Gyeongsang and n Chungcheong. The level of regonal votng sslghtly low compared to that from the Kng s soluton. On average, percent voted for the NCNP f they were born n Jeolla, whle t s percent f voters were born n Gyeongsang. A sgnfcant fluctuaton appeared across regonal blocs. In partcular, f cell frequency s too small, the varaton of regonal votng s beyond the acceptable range. For example, n Gangwondo where cell frequency s one, the percentage of regonal votng s
21 percent out of those who were born n Jeolla, whle t s zero percent n Chungcheongbukdo where cell frequency s also one. If one may focus on the level of regonal votng where the number of respondents are suffcent enough, we can fnd smlarty n the level of regonal votng between estmates from the Kng s method and estmates from survey results. For example, accordng to survey results, the percentage of regonal votng n Seoul s percent for the NCNP whle the comparable fgure s percent by the ecologcal regresson. It s 60 percent n Jeollabukdo by survey results, whle t s percent by the ecologcal regresson. We can safely conclude that estmates from the Kng s soluton are closely matchng wth those from survey results. IV. Concluson Applyng aggregate level fndngs for the ndvdual level relatonshps generates based estmates, known as the ecologcal fallacy problem. Socal scentsts often encounters wth a dffculty to conduct a research at the ndvdual level wth aggregate data. In partcular, f a research queston s related to the past event when survey data are not avalable, t s almost mpossble to pursue a research. Further, f there s a poltcally correct answer on survey questons, t s hard to obtan unbased estmates
22 In ths paper, we ntroduced a way to nfer the ndvdual level relatonshps from aggregate data. We began wth the aggregaton bas whch leads to the ecologcal fallacy problem. A seres of efforts have been suggested to solve the aggregaton bas. The method by Gary Kng, whch combnes method of bounds and statstcal assocaton, s emphaszed. The Kng s soluton s well known for a method to produce a robust and effcent estmate, even though there s a severe aggregaton bas. The soluton appled to nfer regonal votng at the canddate level. The percentage of regonal votng was averaged out across regonal blocs. The average percentages, then, were compared to estmates from survey results. We found that estmates from the Kng s method are closely matchng wth estmates from survey results f cell frequency n survey results s large enough. We also found that the former s more relable than the latter f cell frequency n survey results s small. Ecologcal regressons offer a new venue to examne prevously mpossble questons. For example, we can examne who voted for Chunghee Park n the 1963 Korean presdental electon. We can further queston why they voted for hm; for example, was the generaton effect related to the outcome of the 1963 Korean presdental electon? Furthermore, we can use ecologcal regressons to examne whether or not a voter casts a vote for a party canddate n a congressonal electon, whle the same voter casts a dfferent party canddate for a presdental electon (Burden and Kmball 1998). We
23 should note, however, that ecologcal regressons also show some lmtaton. If tables are extended more than 2 by 2, the uncertanty of estmates gets thcker. Scholars of ecologcal regresson attempt to reduce ths uncertanty (Kng, Rosen, Tanner 1999; Rosen, Jang, Kng forthcomng)
24 < REFERENCE > Achen, Chrstopher H. and W. Phllps Shvely (1995). CrossLevel Inference. Chcago: Unversty of Chcago press. Benot, Kenneth and Gary Kng (1996). "A Prevew of EI and EzI: Program for Ecologcal Inference." Socal Scence Computer Revew 14: Burden, Barry C. and Davd C. Kmball (1998). "A New Approach to the Study of Tcket Splttng." Amercan Poltcal Scence Revew 92: Goodman, Leo (1959). "Some Alternatves to Ecologcal Correlaton." Amercan Journal of Socology 64: Km, JaeOn and B.C. Koh (1980). "The Dynamcs of Electoral Poltcs: Socal Development, Poltcal Partcpaton, and Manpulaton of Electoral Laws." n Poltcal Partcpaton n Korea: Democracy, Moblzaton, and Stablty edted by Chong Lm Km. Santa Barbara: CLIO books Kng, Gary, Or Rosen, and Martn A. Tanner (1999). "BnomalBeta Herarchcal Models for Ecologcal Inference." Socologcal Methods & Research 28: Kng, Gary (1997). A Soluton to the Ecologcal Inference Problem: Reconstructng Indvdual Behavor from Aggregate Data. Prnceton, NJ: Prnceton Unversty Press
25 Korean Socal Scence Data Center (2000). A Survey on Voters Atttudes toward the 16th General Electon. Seoul: Korean Socal Scence Data Center. Lee, Dong Ok and Stanley D. Brunn (1996). "Poltcs and regons n Korea: an analyss of the recent presdental electon." Poltcal Geography 15: Lee, Nam Young (1998). "Regonalsm and Votng Behavor n South Korea." Korea Observer 29: Natonal Electon Commsson. ( ). Palmqust, Bradley Lowell (1993). Ecologcal Inference, Aggregate Data Analyss of U. S. Electons, and the Socalst Party of Amerca. Ph. D. dssertaton at the Unversty of Calforna, Berkley. Robnson, W. S (1950). "Ecologcal Correlatons and the Behavor of Indvduals." Amercan Socologcal Revew 15: Rosen, Or, Wenxn Jang, Gary Kng, and Martn A. Tanner (Forthcomng). "Bayesan and Frequentst Inference for Ecologcal Inference: the R X C Case." Statstca Neerlandca. Shn, Myungsoon, Youngjae, Jn, Donald A. Gross, and Khong Eom (2005). "Money Matters n PartyCentered Poltcs: Campagn Spendng n Korean Congressonal Electons." Electoral Studes 24: Voss, D. Stephen (2000). Famlarty Doesn t Breed Contempt: The Poltcal Geography of Racal Polarzaton. Ph. D. dssertaton at Harvard Unversty
Can Auto Liability Insurance Purchases Signal Risk Attitude?
Internatonal Journal of Busness and Economcs, 2011, Vol. 10, No. 2, 159164 Can Auto Lablty Insurance Purchases Sgnal Rsk Atttude? ChuShu L Department of Internatonal Busness, Asa Unversty, Tawan ShengChang
More informationInequality and The Accounting Period. Quentin Wodon and Shlomo Yitzhaki. World Bank and Hebrew University. September 2001.
Inequalty and The Accountng Perod Quentn Wodon and Shlomo Ytzha World Ban and Hebrew Unversty September Abstract Income nequalty typcally declnes wth the length of tme taen nto account for measurement.
More informationCalculation of Sampling Weights
Perre Foy Statstcs Canada 4 Calculaton of Samplng Weghts 4.1 OVERVIEW The basc sample desgn used n TIMSS Populatons 1 and 2 was a twostage stratfed cluster desgn. 1 The frst stage conssted of a sample
More informationPSYCHOLOGICAL RESEARCH (PYC 304C) Lecture 12
14 The Chsquared dstrbuton PSYCHOLOGICAL RESEARCH (PYC 304C) Lecture 1 If a normal varable X, havng mean µ and varance σ, s standardsed, the new varable Z has a mean 0 and varance 1. When ths standardsed
More informationModule 2 LOSSLESS IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur
Module LOSSLESS IMAGE COMPRESSION SYSTEMS Lesson 3 Lossless Compresson: Huffman Codng Instructonal Objectves At the end of ths lesson, the students should be able to:. Defne and measure source entropy..
More informationbenefit is 2, paid if the policyholder dies within the year, and probability of death within the year is ).
REVIEW OF RISK MANAGEMENT CONCEPTS LOSS DISTRIBUTIONS AND INSURANCE Loss and nsurance: When someone s subject to the rsk of ncurrng a fnancal loss, the loss s generally modeled usng a random varable or
More informationThe Analysis of Outliers in Statistical Data
THALES Project No. xxxx The Analyss of Outlers n Statstcal Data Research Team Chrysses Caron, Assocate Professor (P.I.) Vaslk Karot, Doctoral canddate Polychrons Economou, Chrstna Perrakou, Postgraduate
More informationDEFINING %COMPLETE IN MICROSOFT PROJECT
CelersSystems DEFINING %COMPLETE IN MICROSOFT PROJECT PREPARED BY James E Aksel, PMP, PMISP, MVP For Addtonal Informaton about Earned Value Management Systems and reportng, please contact: CelersSystems,
More informationAn Alternative Way to Measure Private Equity Performance
An Alternatve Way to Measure Prvate Equty Performance Peter Todd Parlux Investment Technology LLC Summary Internal Rate of Return (IRR) s probably the most common way to measure the performance of prvate
More informationRECENT DEVELOPMENTS IN QUANTITATIVE COMPARATIVE METHODOLOGY:
Federco Podestà RECENT DEVELOPMENTS IN QUANTITATIVE COMPARATIVE METHODOLOGY: THE CASE OF POOLED TIME SERIES CROSSSECTION ANALYSIS DSS PAPERS SOC 302 INDICE 1. Advantages and Dsadvantages of Pooled Analyss...
More informationOn the Optimal Control of a Cascade of HydroElectric Power Stations
On the Optmal Control of a Cascade of HydroElectrc Power Statons M.C.M. Guedes a, A.F. Rbero a, G.V. Smrnov b and S. Vlela c a Department of Mathematcs, School of Scences, Unversty of Porto, Portugal;
More informationAn Empirical Study of Search Engine Advertising Effectiveness
An Emprcal Study of Search Engne Advertsng Effectveness Sanjog Msra, Smon School of Busness Unversty of Rochester Edeal Pnker, Smon School of Busness Unversty of Rochester Alan RmmKaufman, RmmKaufman
More informationInstitute of Informatics, Faculty of Business and Management, Brno University of Technology,Czech Republic
Lagrange Multplers as Quanttatve Indcators n Economcs Ivan Mezník Insttute of Informatcs, Faculty of Busness and Management, Brno Unversty of TechnologCzech Republc Abstract The quanttatve role of Lagrange
More informationStatistical Methods to Develop Rating Models
Statstcal Methods to Develop Ratng Models [Evelyn Hayden and Danel Porath, Österrechsche Natonalbank and Unversty of Appled Scences at Manz] Source: The Basel II Rsk Parameters Estmaton, Valdaton, and
More informationWhat is Candidate Sampling
What s Canddate Samplng Say we have a multclass or mult label problem where each tranng example ( x, T ) conssts of a context x a small (mult)set of target classes T out of a large unverse L of possble
More informationCHAPTER 14 MORE ABOUT REGRESSION
CHAPTER 14 MORE ABOUT REGRESSION We learned n Chapter 5 that often a straght lne descrbes the pattern of a relatonshp between two quanttatve varables. For nstance, n Example 5.1 we explored the relatonshp
More informationThe OC Curve of Attribute Acceptance Plans
The OC Curve of Attrbute Acceptance Plans The Operatng Characterstc (OC) curve descrbes the probablty of acceptng a lot as a functon of the lot s qualty. Fgure 1 shows a typcal OC Curve. 10 8 6 4 1 3 4
More informationExhaustive Regression. An Exploration of RegressionBased Data Mining Techniques Using Super Computation
Exhaustve Regresson An Exploraton of RegressonBased Data Mnng Technques Usng Super Computaton Antony Daves, Ph.D. Assocate Professor of Economcs Duquesne Unversty Pttsburgh, PA 58 Research Fellow The
More informationTrafficlight a stress test for life insurance provisions
MEMORANDUM Date 006097 Authors Bengt von Bahr, Göran Ronge Traffclght a stress test for lfe nsurance provsons Fnansnspetonen P.O. Box 6750 SE113 85 Stocholm [Sveavägen 167] Tel +46 8 787 80 00 Fax
More information1.1 The University may award Higher Doctorate degrees as specified from timetotime in UPR AS11 1.
HIGHER DOCTORATE DEGREES SUMMARY OF PRINCIPAL CHANGES General changes None Secton 3.2 Refer to text (Amendments to verson 03.0, UPR AS02 are shown n talcs.) 1 INTRODUCTION 1.1 The Unversty may award Hgher
More informationCHOLESTEROL REFERENCE METHOD LABORATORY NETWORK. Sample Stability Protocol
CHOLESTEROL REFERENCE METHOD LABORATORY NETWORK Sample Stablty Protocol Background The Cholesterol Reference Method Laboratory Network (CRMLN) developed certfcaton protocols for total cholesterol, HDL
More informationTo manage leave, meeting institutional requirements and treating individual staff members fairly and consistently.
Corporate Polces & Procedures Human Resources  Document CPP216 Leave Management Frst Produced: Current Verson: Past Revsons: Revew Cycle: Apples From: 09/09/09 26/10/12 09/09/09 3 years Immedately Authorsaton:
More information1. Measuring association using correlation and regression
How to measure assocaton I: Correlaton. 1. Measurng assocaton usng correlaton and regresson We often would lke to know how one varable, such as a mother's weght, s related to another varable, such as a
More informationHollinger Canadian Publishing Holdings Co. ( HCPH ) proceeding under the Companies Creditors Arrangement Act ( CCAA )
February 17, 2011 Andrew J. Hatnay ahatnay@kmlaw.ca Dear Sr/Madam: Re: Re: Hollnger Canadan Publshng Holdngs Co. ( HCPH ) proceedng under the Companes Credtors Arrangement Act ( CCAA ) Update on CCAA Proceedngs
More informationProceedings of the Annual Meeting of the American Statistical Association, August 59, 2001
Proceedngs of the Annual Meetng of the Amercan Statstcal Assocaton, August 59, 2001 LISTASSISTED SAMPLING: THE EFFECT OF TELEPHONE SYSTEM CHANGES ON DESIGN 1 Clyde Tucker, Bureau of Labor Statstcs James
More informationThe Current Employment Statistics (CES) survey,
Busness Brths and Deaths Impact of busness brths and deaths n the payroll survey The CES probabltybased sample redesgn accounts for most busness brth employment through the mputaton of busness deaths,
More informationA DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATIONBASED OPTIMIZATION. Michael E. Kuhl Radhamés A. TolentinoPeña
Proceedngs of the 2008 Wnter Smulaton Conference S. J. Mason, R. R. Hll, L. Mönch, O. Rose, T. Jefferson, J. W. Fowler eds. A DYNAMIC CRASHING METHOD FOR PROJECT MANAGEMENT USING SIMULATIONBASED OPTIMIZATION
More informationAn InterestOriented Network Evolution Mechanism for Online Communities
An InterestOrented Network Evoluton Mechansm for Onlne Communtes Cahong Sun and Xaopng Yang School of Informaton, Renmn Unversty of Chna, Bejng 100872, P.R. Chna {chsun,yang}@ruc.edu.cn Abstract. Onlne
More informationNONCONSTANT SUM REDANDBLACK GAMES WITH BETDEPENDENT WIN PROBABILITY FUNCTION LAURA PONTIGGIA, University of the Sciences in Philadelphia
To appear n Journal o Appled Probablty June 2007 OCOSTAT SUM REDADBLACK GAMES WITH BETDEPEDET WI PROBABILITY FUCTIO LAURA POTIGGIA, Unversty o the Scences n Phladelpha Abstract In ths paper we nvestgate
More informationANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING
ANALYZING THE RELATIONSHIPS BETWEEN QUALITY, TIME, AND COST IN PROJECT MANAGEMENT DECISION MAKING Matthew J. Lberatore, Department of Management and Operatons, Vllanova Unversty, Vllanova, PA 19085, 6105194390,
More information1. Fundamentals of probability theory 2. Emergence of communication traffic 3. Stochastic & Markovian Processes (SP & MP)
6.3 /  Communcaton Networks II (Görg) SS20  www.comnets.unbremen.de Communcaton Networks II Contents. Fundamentals of probablty theory 2. Emergence of communcaton traffc 3. Stochastc & Markovan Processes
More informationPRIVATE SCHOOL CHOICE: THE EFFECTS OF RELIGIOUS AFFILIATION AND PARTICIPATION
PRIVATE SCHOOL CHOICE: THE EFFECTS OF RELIIOUS AFFILIATION AND PARTICIPATION Danny CohenZada Department of Economcs, Benuron Unversty, BeerSheva 84105, Israel Wllam Sander Department of Economcs, DePaul
More informationCHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES
CHAPTER 5 RELATIONSHIPS BETWEEN QUANTITATIVE VARIABLES In ths chapter, we wll learn how to descrbe the relatonshp between two quanttatve varables. Remember (from Chapter 2) that the terms quanttatve varable
More informationA Secure PasswordAuthenticated Key Agreement Using Smart Cards
A Secure PasswordAuthentcated Key Agreement Usng Smart Cards Ka Chan 1, WenChung Kuo 2 and JnChou Cheng 3 1 Department of Computer and Informaton Scence, R.O.C. Mltary Academy, Kaohsung 83059, Tawan,
More informationRecurrence. 1 Definitions and main statements
Recurrence 1 Defntons and man statements Let X n, n = 0, 1, 2,... be a MC wth the state space S = (1, 2,...), transton probabltes p j = P {X n+1 = j X n = }, and the transton matrx P = (p j ),j S def.
More informationFREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES
FREQUENCY OF OCCURRENCE OF CERTAIN CHEMICAL CLASSES OF GSR FROM VARIOUS AMMUNITION TYPES Zuzanna BRO EKMUCHA, Grzegorz ZADORA, 2 Insttute of Forensc Research, Cracow, Poland 2 Faculty of Chemstry, Jagellonan
More informationStart me up: The Effectiveness of a SelfEmployment Programme for Needy Unemployed People in Germany*
Start me up: The Effectveness of a SelfEmployment Programme for Needy Unemployed People n Germany* Joachm Wolff Anton Nvorozhkn Date: 22/10/2008 Abstract In recent years actvaton of meanstested unemployment
More informationQuantification of qualitative data: the case of the Central Bank of Armenia
Quantfcaton of qualtatve data: the case of the Central Bank of Armena Martn Galstyan 1 and Vahe Movssyan 2 Overvew The effect of nonfnancal organsatons and consumers atttudes on economc actvty s a subject
More informationForecasting the Demand of Emergency Supplies: Based on the CBR Theory and BP Neural Network
700 Proceedngs of the 8th Internatonal Conference on Innovaton & Management Forecastng the Demand of Emergency Supples: Based on the CBR Theory and BP Neural Network Fu Deqang, Lu Yun, L Changbng School
More informationAnalysis of Premium Liabilities for Australian Lines of Business
Summary of Analyss of Premum Labltes for Australan Lnes of Busness Emly Tao Honours Research Paper, The Unversty of Melbourne Emly Tao Acknowledgements I am grateful to the Australan Prudental Regulaton
More informationTHE METHOD OF LEAST SQUARES THE METHOD OF LEAST SQUARES
The goal: to measure (determne) an unknown quantty x (the value of a RV X) Realsaton: n results: y 1, y 2,..., y j,..., y n, (the measured values of Y 1, Y 2,..., Y j,..., Y n ) every result s encumbered
More informationA Novel Methodology of Working Capital Management for Large. Public Constructions by Using Fuzzy Scurve Regression
Novel Methodology of Workng Captal Management for Large Publc Constructons by Usng Fuzzy Scurve Regresson ChengWu Chen, Morrs H. L. Wang and TngYa Hseh Department of Cvl Engneerng, Natonal Central Unversty,
More informationAPPLICATION OF PROBE DATA COLLECTED VIA INFRARED BEACONS TO TRAFFIC MANEGEMENT
APPLICATION OF PROBE DATA COLLECTED VIA INFRARED BEACONS TO TRAFFIC MANEGEMENT Toshhko Oda (1), Kochro Iwaoka (2) (1), (2) Infrastructure Systems Busness Unt, Panasonc System Networks Co., Ltd. Saedocho
More informationAnswer: A). There is a flatter IS curve in the high MPC economy. Original LM LM after increase in M. IS curve for low MPC economy
4.02 Quz Solutons Fall 2004 MultpleChoce Questons (30/00 ponts) Please, crcle the correct answer for each of the followng 0 multplechoce questons. For each queston, only one of the answers s correct.
More informationStaff Paper. Farm Savings Accounts: Examining Income Variability, Eligibility, and Benefits. Brent Gloy, Eddy LaDue, and Charles Cuykendall
SP 200502 August 2005 Staff Paper Department of Appled Economcs and Management Cornell Unversty, Ithaca, New York 148537801 USA Farm Savngs Accounts: Examnng Income Varablty, Elgblty, and Benefts Brent
More informationHOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA*
HOUSEHOLDS DEBT BURDEN: AN ANALYSIS BASED ON MICROECONOMIC DATA* Luísa Farnha** 1. INTRODUCTION The rapd growth n Portuguese households ndebtedness n the past few years ncreased the concerns that debt
More informationUsing Series to Analyze Financial Situations: Present Value
2.8 Usng Seres to Analyze Fnancal Stuatons: Present Value In the prevous secton, you learned how to calculate the amount, or future value, of an ordnary smple annuty. The amount s the sum of the accumulated
More informationCausal, Explanatory Forecasting. Analysis. Regression Analysis. Simple Linear Regression. Which is Independent? Forecasting
Causal, Explanatory Forecastng Assumes causeandeffect relatonshp between system nputs and ts output Forecastng wth Regresson Analyss Rchard S. Barr Inputs System Cause + Effect Relatonshp The job of
More informationHigh Correlation between Net Promoter Score and the Development of Consumers' Willingness to Pay (Empirical Evidence from European Mobile Markets)
Hgh Correlaton between et Promoter Score and the Development of Consumers' Wllngness to Pay (Emprcal Evdence from European Moble Marets Ths paper shows that the correlaton between the et Promoter Score
More informationLatent Class Regression. Statistics for Psychosocial Research II: Structural Models December 4 and 6, 2006
Latent Class Regresson Statstcs for Psychosocal Research II: Structural Models December 4 and 6, 2006 Latent Class Regresson (LCR) What s t and when do we use t? Recall the standard latent class model
More informationADVERSE SELECTION IN INSURANCE MARKETS: POLICYHOLDER EVIDENCE FROM THE U.K. ANNUITY MARKET *
ADVERSE SELECTION IN INSURANCE MARKETS: POLICYHOLDER EVIDENCE FROM THE U.K. ANNUITY MARKET * Amy Fnkelsten Harvard Unversty and NBER James Poterba MIT and NBER * We are grateful to Jeffrey Brown, PerreAndre
More informationThe covariance is the two variable analog to the variance. The formula for the covariance between two variables is
Regresson Lectures So far we have talked only about statstcs that descrbe one varable. What we are gong to be dscussng for much of the remander of the course s relatonshps between two or more varables.
More information1 Approximation Algorithms
CME 305: Dscrete Mathematcs and Algorthms 1 Approxmaton Algorthms In lght of the apparent ntractablty of the problems we beleve not to le n P, t makes sense to pursue deas other than complete solutons
More informationPortfolio Loss Distribution
Portfolo Loss Dstrbuton Rsky assets n loan ortfolo hghly llqud assets holdtomaturty n the bank s balance sheet Outstandngs The orton of the bank asset that has already been extended to borrowers. Commtment
More informationCS 2750 Machine Learning. Lecture 3. Density estimation. CS 2750 Machine Learning. Announcements
Lecture 3 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 5329 Sennott Square Next lecture: Matlab tutoral Announcements Rules for attendng the class: Regstered for credt Regstered for audt (only f there
More informationScale Dependence of Overconfidence in Stock Market Volatility Forecasts
Scale Dependence of Overconfdence n Stoc Maret Volatlty Forecasts Marus Glaser, Thomas Langer, Jens Reynders, Martn Weber* June 7, 007 Abstract In ths study, we analyze whether volatlty forecasts (judgmental
More informationTime Series Analysis in Studies of AGN Variability. Bradley M. Peterson The Ohio State University
Tme Seres Analyss n Studes of AGN Varablty Bradley M. Peterson The Oho State Unversty 1 Lnear Correlaton Degree to whch two parameters are lnearly correlated can be expressed n terms of the lnear correlaton
More informationWhen Talk is Free : The Effect of Tariff Structure on Usage under Two and ThreePart Tariffs
0 When Talk s Free : The Effect of Tarff Structure on Usage under Two and ThreePart Tarffs Eva Ascarza Ana Lambrecht Naufel Vlcassm July 2012 (Forthcomng at Journal of Marketng Research) Eva Ascarza
More informationPowerofTwo Policies for Single Warehouse MultiRetailer Inventory Systems with Order Frequency Discounts
Powerofwo Polces for Sngle Warehouse MultRetaler Inventory Systems wth Order Frequency Dscounts José A. Ventura Pennsylvana State Unversty (USA) Yale. Herer echnon Israel Insttute of echnology (Israel)
More informationINVESTIGATION OF VEHICULAR USERS FAIRNESS IN CDMAHDR NETWORKS
21 22 September 2007, BULGARIA 119 Proceedngs of the Internatonal Conference on Informaton Technologes (InfoTech2007) 21 st 22 nd September 2007, Bulgara vol. 2 INVESTIGATION OF VEHICULAR USERS FAIRNESS
More informationSmall pots lump sum payment instruction
For customers Small pots lump sum payment nstructon Please read these notes before completng ths nstructon About ths nstructon Use ths nstructon f you re an ndvdual wth Aegon Retrement Choces Self Invested
More informationSupport Vector Machines
Support Vector Machnes Max Wellng Department of Computer Scence Unversty of Toronto 10 Kng s College Road Toronto, M5S 3G5 Canada wellng@cs.toronto.edu Abstract Ths s a note to explan support vector machnes.
More informationAn Evaluation of the Extended Logistic, Simple Logistic, and Gompertz Models for Forecasting Short Lifecycle Products and Services
An Evaluaton of the Extended Logstc, Smple Logstc, and Gompertz Models for Forecastng Short Lfecycle Products and Servces Charles V. Trappey a,1, Hsnyng Wu b a Professor (Management Scence), Natonal Chao
More informationForecasting the Direction and Strength of Stock Market Movement
Forecastng the Drecton and Strength of Stock Market Movement Jngwe Chen Mng Chen Nan Ye cjngwe@stanford.edu mchen5@stanford.edu nanye@stanford.edu Abstract  Stock market s one of the most complcated systems
More informationSection 5.3 Annuities, Future Value, and Sinking Funds
Secton 5.3 Annutes, Future Value, and Snkng Funds Ordnary Annutes A sequence of equal payments made at equal perods of tme s called an annuty. The tme between payments s the payment perod, and the tme
More informationx f(x) 1 0.25 1 0.75 x 1 0 1 1 0.04 0.01 0.20 1 0.12 0.03 0.60
BIVARIATE DISTRIBUTIONS Let be a varable that assumes the values { 1,,..., n }. Then, a functon that epresses the relatve frequenc of these values s called a unvarate frequenc functon. It must be true
More informationMAPP. MERIS level 3 cloud and water vapour products. Issue: 1. Revision: 0. Date: 9.12.1998. Function Name Organisation Signature Date
Ttel: Project: Doc. No.: MERIS level 3 cloud and water vapour products MAPP MAPPATBDClWVL3 Issue: 1 Revson: 0 Date: 9.12.1998 Functon Name Organsaton Sgnature Date Author: Bennartz FUB Preusker FUB Schüller
More informationThe Development of Web Log Mining Based on ImproveKMeans Clustering Analysis
The Development of Web Log Mnng Based on ImproveKMeans Clusterng Analyss TngZhong Wang * College of Informaton Technology, Luoyang Normal Unversty, Luoyang, 471022, Chna wangtngzhong2@sna.cn Abstract.
More informationTraffic State Estimation in the Traffic Management Center of Berlin
Traffc State Estmaton n the Traffc Management Center of Berln Authors: Peter Vortsch, PTV AG, Stumpfstrasse, D763 Karlsruhe, Germany phone ++49/72/965/35, emal peter.vortsch@ptv.de Peter Möhl, PTV AG,
More informationIDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS
IDENTIFICATION AND CORRECTION OF A COMMON ERROR IN GENERAL ANNUITY CALCULATIONS Chrs Deeley* Last revsed: September 22, 200 * Chrs Deeley s a Senor Lecturer n the School of Accountng, Charles Sturt Unversty,
More informationHow Sets of Coherent Probabilities May Serve as Models for Degrees of Incoherence
1 st Internatonal Symposum on Imprecse Probabltes and Ther Applcatons, Ghent, Belgum, 29 June 2 July 1999 How Sets of Coherent Probabltes May Serve as Models for Degrees of Incoherence Mar J. Schervsh
More informationThe Racial and Gender Interest Rate Gap. in Small Business Lending: Improved Estimates Using Matching Methods*
The Racal and Gender Interest Rate Gap n Small Busness Lendng: Improved Estmates Usng Matchng Methods* Yue Hu and Long Lu Department of Economcs Unversty of Texas at San Antono Jan Ondrch and John Ynger
More informationErrorPropagation.nb 1. Error Propagation
ErrorPropagaton.nb Error Propagaton Suppose that we make observatons of a quantty x that s subject to random fluctuatons or measurement errors. Our best estmate of the true value for ths quantty s then
More informationHARVARD John M. Olin Center for Law, Economics, and Business
HARVARD John M. Oln Center for Law, Economcs, and Busness ISSN 10456333 ASYMMETRIC INFORMATION AND LEARNING IN THE AUTOMOBILE INSURANCE MARKET Alma Cohen Dscusson Paper No. 371 6/2002 Harvard Law School
More informationEvaluating the generalizability of an RCT using electronic health records data
Evaluatng the generalzablty of an RCT usng electronc health records data 3 nterestng questons Is our RCT representatve? How can we generalze RCT results? Can we use EHR* data as a control group? *) Electronc
More informationAn empirical study for credit card approvals in the Greek banking sector
An emprcal study for credt card approvals n the Greek bankng sector Mara Mavr George Ioannou Bergamo, Italy 1721 May 2004 Management Scences Laboratory Department of Management Scence & Technology Athens
More informationSection 5.4 Annuities, Present Value, and Amortization
Secton 5.4 Annutes, Present Value, and Amortzaton Present Value In Secton 5.2, we saw that the present value of A dollars at nterest rate per perod for n perods s the amount that must be deposted today
More informationChapter 4 ECONOMIC DISPATCH AND UNIT COMMITMENT
Chapter 4 ECOOMIC DISATCH AD UIT COMMITMET ITRODUCTIO A power system has several power plants. Each power plant has several generatng unts. At any pont of tme, the total load n the system s met by the
More informationToday s class. Chapter 13. Sources of uncertainty. Decision making with uncertainty
Today s class Probablty theory Bayesan nference From the ont dstrbuton Usng ndependence/factorng From sources of evdence Chapter 13 1 2 Sources of uncertanty Uncertan nputs Mssng data Nosy data Uncertan
More informationEvaluating credit risk models: A critique and a new proposal
Evaluatng credt rsk models: A crtque and a new proposal Hergen Frerchs* Gunter Löffler Unversty of Frankfurt (Man) February 14, 2001 Abstract Evaluatng the qualty of credt portfolo rsk models s an mportant
More informationEvaluating the Effects of FUNDEF on Wages and Test Scores in Brazil *
Evaluatng the Effects of FUNDEF on Wages and Test Scores n Brazl * Naérco MenezesFlho Elane Pazello Unversty of São Paulo Abstract In ths paper we nvestgate the effects of the 1998 reform n the fundng
More informationL10: Linear discriminants analysis
L0: Lnear dscrmnants analyss Lnear dscrmnant analyss, two classes Lnear dscrmnant analyss, C classes LDA vs. PCA Lmtatons of LDA Varants of LDA Other dmensonalty reducton methods CSCE 666 Pattern Analyss
More informationSurvival analysis methods in Insurance Applications in car insurance contracts
Survval analyss methods n Insurance Applcatons n car nsurance contracts Abder OULIDI 1 JeanMare MARION 2 Hervé GANACHAUD 3 Abstract In ths wor, we are nterested n survval models and ther applcatons on
More information8 Algorithm for Binary Searching in Trees
8 Algorthm for Bnary Searchng n Trees In ths secton we present our algorthm for bnary searchng n trees. A crucal observaton employed by the algorthm s that ths problem can be effcently solved when the
More informationDemographic and Health Surveys Methodology
samplng and household lstng manual Demographc and Health Surveys Methodology Ths document s part of the Demographc and Health Survey s DHS Toolkt of methodology for the MEASURE DHS Phase III project, mplemented
More informationBrigid Mullany, Ph.D University of North Carolina, Charlotte
Evaluaton And Comparson Of The Dfferent Standards Used To Defne The Postonal Accuracy And Repeatablty Of Numercally Controlled Machnng Center Axes Brgd Mullany, Ph.D Unversty of North Carolna, Charlotte
More informationAnalysis of Subscription Demand for PayTV
Analyss of Subscrpton Demand for PayTV Manabu Shshkura * Norhro Kasuga ** Ako Tor *** Abstract In ths paper, we wll conduct an analyss from an emprcal perspectve concernng broadcastng demand behavor and
More informationPrediction of Disability Frequencies in Life Insurance
Predcton of Dsablty Frequences n Lfe Insurance Bernhard Köng Fran Weber Maro V. Wüthrch October 28, 2011 Abstract For the predcton of dsablty frequences, not only the observed, but also the ncurred but
More informationA 'Virtual Population' Approach To Small Area Estimation
A 'Vrtual Populaton' Approach To Small Area Estmaton Mchael P. Battagla 1, Martn R. Frankel 2, Machell Town 3 and Lna S. Balluz 3 1 Abt Assocates Inc., Cambrdge MA 02138 2 Baruch College, CUNY, New York
More informationRiskbased Fatigue Estimate of Deep Water Risers  Course Project for EM388F: Fracture Mechanics, Spring 2008
Rskbased Fatgue Estmate of Deep Water Rsers  Course Project for EM388F: Fracture Mechancs, Sprng 2008 Chen Sh Department of Cvl, Archtectural, and Envronmental Engneerng The Unversty of Texas at Austn
More informationReturns to Experience in Mozambique: A Nonparametric Regression Approach
Returns to Experence n Mozambque: A Nonparametrc Regresson Approach Joel Muzma Conference Paper nº 27 Conferênca Inaugural do IESE Desafos para a nvestgação socal e económca em Moçambque 19 de Setembro
More informationBinomial Link Functions. Lori Murray, Phil Munz
Bnomal Lnk Functons Lor Murray, Phl Munz Bnomal Lnk Functons Logt Lnk functon: ( p) p ln 1 p Probt Lnk functon: ( p) 1 ( p) Complentary Log Log functon: ( p) ln( ln(1 p)) Motvatng Example A researcher
More informationPrediction of Disability Frequencies in Life Insurance
1 Predcton of Dsablty Frequences n Lfe Insurance Bernhard Köng 1, Fran Weber 1, Maro V. Wüthrch 2 Abstract: For the predcton of dsablty frequences, not only the observed, but also the ncurred but not yet
More informationDiagnostic Tests of Cross Section Independence for Nonlinear Panel Data Models
DISCUSSION PAPER SERIES IZA DP No. 2756 Dagnostc ests of Cross Secton Independence for Nonlnear Panel Data Models Cheng Hsao M. Hashem Pesaran Andreas Pck Aprl 2007 Forschungsnsttut zur Zukunft der Arbet
More information2.4 Bivariate distributions
page 28 2.4 Bvarate dstrbutons 2.4.1 Defntons Let X and Y be dscrete r.v.s defned on the same probablty space (S, F, P). Instead of treatng them separately, t s often necessary to thnk of them actng together
More informationCalculating the high frequency transmission line parameters of power cables
< ' Calculatng the hgh frequency transmsson lne parameters of power cables Authors: Dr. John Dcknson, Laboratory Servces Manager, N 0 RW E B Communcatons Mr. Peter J. Ncholson, Project Assgnment Manager,
More informationTHE DISTRIBUTION OF LOAN PORTFOLIO VALUE * Oldrich Alfons Vasicek
HE DISRIBUION OF LOAN PORFOLIO VALUE * Oldrch Alfons Vascek he amount of captal necessary to support a portfolo of debt securtes depends on the probablty dstrbuton of the portfolo loss. Consder a portfolo
More informationThe Application of Fractional Brownian Motion in Option Pricing
Vol. 0, No. (05), pp. 738 http://dx.do.org/0.457/jmue.05.0..6 The Applcaton of Fractonal Brownan Moton n Opton Prcng Qngxn Zhou School of Basc Scence,arbn Unversty of Commerce,arbn zhouqngxn98@6.com
More informationA Probabilistic Theory of Coherence
A Probablstc Theory of Coherence BRANDEN FITELSON. The Coherence Measure C Let E be a set of n propostons E,..., E n. We seek a probablstc measure C(E) of the degree of coherence of E. Intutvely, we want
More informationSimple Interest Loans (Section 5.1) :
Chapter 5 Fnance The frst part of ths revew wll explan the dfferent nterest and nvestment equatons you learned n secton 5.1 through 5.4 of your textbook and go through several examples. The second part
More information