Data-driven Multi-touch Attribution Models
|
|
|
- Damon Hancock
- 10 years ago
- Views:
Transcription
1 Data-driven Multi-touch Attribution Models Xuhui Shao Turn, Inc. 835 Main St. Redwood City, CA Lexin Li Department of Statistics North Carolina State University Raleigh, NC ABSTRACT In digital advertising, attribution is the problem of assigning credit to one or more advertisements for driving the user to the desirable actions such as making a purchase. Rather than giving all the credit to the last ad a user sees, multitouch attribution allows more than one ads to get the credit based on their corresponding contributions. Multi-touch attribution is one of the most important problems in digital advertising, especially when multiple media channels, such as search, display, social, mobile and video are involved. Due to the lack of statistical framework and a viable modeling approach, true data-driven methodology does not exist today in the industry. While predictive modeling has been thoroughly researched in recent years in the digital advertising domain, the attribution problem focuses more on accurate and stable interpretation of the influence of each user interaction to the final user decision rather than just user classification. Traditional classification models fail to achieve those goals. In this paper, we first proposeabivariate metric, one measures the variability of the estimate, and the other measures the accuracy of classifying the positive and negative users. We then develop a bagged logistic regression model, which we show achieves a comparable classification accuracy as a usual logistic regression, but a much more stable estimate of individual advertising channel contributions. We also propose an intuitive and simple probabilistic model to directly quantify the attribution of different advertising channels. We then apply both the bagged logistic model and the probabilistic model to a real-world data set from a multi-channel advertising campaign for a well-known consumer software and services brand. The two models produce consistent general conclusions and thus offer useful cross-validation. The Xuhui Shao is Chief Technology Officer, Turn, Inc. Lexin Li is the corresponding author and Associate Professor, Department of Statistics, North Carolina State University. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for prof t or commercial advantage and that copies bear this notice and the full citation on the f rst page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specif c permission and/or a fee. KDD 11, August 21 24, 2011, San Diego, California, USA. Copyright 2011 ACM /11/08...$ results of our attribution models also shed several important insights that have been validated by the advertising team. We have implemented the probabilistic model in the production advertising platform of the first author s company, and plan to implement the bagged logistic regression in the next product release. We believe availability of such datadriven multi-touch attribution metric and models is a breakthrough in the digital advertising industry. Categories and Subject Descriptors I.6.5 [Computing Methodologies]: Simulation and Modeling, Model Development General Terms Algorithms, Performance, Theory Keywords Digital Advertising, Multi-touch Attribution Model, Bagged Logistic Regression 1. INTRODUCTION Digital advertising started 16 years ago as a new media where traditional print ads can appear [1]. When internet continues to grow with an exploding rate, advertising industry embraced digital advertising and has made it a $40 Billion a year mega industry in US alone. Digital advertising s appeal is not only in its ability to precisely target different groups of consumers with customized ad messages and ad placements, but probably more importantly in its ability to track responses and performances almost instantaneously. Advertising campaigns are often launched across multiple channels. Traditional advertising channels include outdoor billboard, TV, radio, newspapers and magazines, and direct mailing. Digital advertising channels include search, online display, social, video, mobile and . In this article, we focus on the digital advertising channels. Typically multiple advertising channels have delivered advertisement impressions to a user. When the user then makes a purchase decision or signs up to a service being advertised, the advertiser wants to determine which ads have contributed to the user s decision. This step is critical in completing the feedback loop so that one can analyze, report and optimize an advertising campaign. This problem of interpreting the influence of advertisements to the user s decision process is called the attribution problem. 258
2 Figure 1: An illustration of multi-touch attribution problem. The goal of attribute modeling is to pin-point the credit assignment of each positive user to one or more advertising touch point, which is illustrated in Figure 1. The resulting user-level assignment can be aggregated along different dimensions including media channel to derive overall insights. Attribution modeling is not to be confused with marketing mix modeling (MMM), which is limited to the temporal analysis of marketing channels and can not perform any inference at the user level or any dimensions other than marketing channel. To determine which media channel or which ad is to be credited, initially a simple rule was developed and quickly adopted by the online advertising industry: The last ad the user clicked on before he made the purchase or sign up decision, or say, conversion, gets 100% of the credit. This lastclick win model was extended to include last-view win if none of the ads was clicked within a reasonable time window before user conversion. We call both these two models last-touch attribution (LTA), where touch or touch point is defined to be any ad impression, click or advertising related interaction the user has experienced from the advertiser. The last-touch attribution model is simple. However, it completely ignores the influences of all ad impressions except the last one. It is a highly flawed model as pointed out by [2]. Alternatively, the concept of multi-touch attribution(mta) model has been recently proposed, where more than one touch point can each have a fraction of the credit based on the true influence each touch point has on the outcome, i.e., user s conversion decision. Atlas institute, a division of Microsoft Advertising first proposed the notion of MTA [2]. However, in that paper and other related research from Microsoft Atlas, there is no proposal for how to assign the percentage of credit statistically based on the campaign data. Clearsaleing is a consulting company specialized in attribution analysis, whose attribution model assigns equal fraction of credits to the first and the last touch point, and collectively all the touch points in between[3]. While a datadriven custom model is described as available upon request, the methodology of the custom model is not publicized. Another company, C3 Metric, also offers a rule-based MTA model [4]. But like [3], their model assigns credit to certain touch points simply based on the temporal order of touch points and with fixed percentages. In our opinion, because user s decision process is largely dependent on the advertiser, the product offer, and how advertising messages and creative design are structured, a desirable attribution model should be campaign-specific and be driven by a solid statistical analysis of user response data. In addition to the lack of a true data-driven MTA model, a good metric to evaluate different MTA models is not available either. Intuitively, a good MTA model should have a high degree of accuracy in correctly classifying a user as positive (with a conversion action) or negative (without a conversion action). Equally or more important in digital advertising is that, a good MTA model should provide a stable estimation of individual variable s(for example, media channel) contribution. Unlike predictive models, the stability of the estimation is especially important here because attribution model determines the performance metric for the ad campaign. Every advertising company and every advertising tactic ultimately are judged by the performance metric set forth in the attribution model. Having stable and reproducible result is by definition what a performance metric needs to be. Ideally the attribution model should be easy to interpret as the results of attribution analysis are often used to derive insights to the ad campaign and its optimization strategy. Although in recent years predictive modeling has been thoroughly researched in the digital advertising domain, for example in [5] and [6], the focus has been on the classification accuracy. The resulting models, many generated from a black-box type predictive approach, are very hard to interpret. Furthermore, little attention has been paid to the stability issue of the variable contribution estimate. There is also the problem of variable correlation when one tries to interpret the model coefficients directly, which was discussed in section of [7]. In this paper, we first propose a new bivariate metric. One component of this metric measures the variability of the estimate, and the other measures the accuracy of classifying the positive and negative users. We then develop a bagged logistic regression model, which we show achieves a comparable classification accuracy as a usual logistic regression, but a much more stable estimate of individual variable contributions. We also propose a simple and intuitive probabilistic model to compute the attribution of different variables based on a combination of first and second order conditional probabilities. We evaluate both models using the proposed bivariate metric, and find the two generate consistent results. We then analyze a large advertising campaign data set, which has 72.5 million anonymized users with over 2 billion ad impressions coming from search, display, social, and video channels over a four-week period. As for implementation, the probabilistic model has been deployed in the production advertising system of the first author s company. The bagged logistic regression model is currently being developed for future product release in the production system. The rest of the paper is organized as follows. We present the bivariate metric in Section 2, and the two data-driven multi-touch attribution models in Section 3. We evaluate the empirical performance the proposed models in Section 4. We conclude the paper with a discussion. 2. A BIVARIATE METRIC It is always of interest to identify if a user is to make a 259
3 purchase or sign up for a service based on his exposure to various advertisement channels. This is a typical classification problem, where the outcome is binary, with positive meaning a user is to make a purchase action and negative meaning otherwise, and the covariates are the number of touch points of different channels. Towards that end, we employ the usual misclassification error rate as part of an evaluation metric for an MTA model. On the other hand, human behavior is complex and the user data are highly correlated. As a consequence, a simple MTA model, e.g., a usual logistic regression, could have highly variable estimate which would make the model difficult to interpret. In addition, the high collinearity in attributes also causes strong variables to suppress weaker, correlated variables as described in Section of [7]. Therefore we aim to capture the variability of an MTA model in our model evaluation metric. Towards that goal, we employ the notion of standard deviation and also take advantage of the fact that the advertising campaign data almost always have a large number of users. More specifically, we first obtain a random subset of samples of both positive and negative users as a training data set, then another random subset as a testing data set. To avoid having too few positive users in the samples, we fix the ratio of positive versus negative users. In our numerical analysis, we have experimented this ratio with 1 : 1 and 1 : 4 and the two yield very similar results. For brevity we only report the results based on 1 : 4 ratio below. We then fit an MTA model to the training data. We record the contribution of each advertisement channel, i.e., the coefficient estimate, from the fitted MTA model. We also evaluate the fitted model on the independent testing data and record the misclassification error rate. We then repeat the above process multiple times in order to compute the standard deviation of individual coefficient estimates across multiple repetitions. We report the average of all standard deviations across different channels as the variability measure (V-metric), and the average of misclassification error rates across data repetitions as the accuracy measure (A-metric). We evaluate an MTA model based upon the bivariate metric of both the variability and the accuracy (the V-A-metric). A small A-metric indicates that the model under investigation has a high accuracy of predicting the active or inactive user, while a small V-metric indicates that the model has a stable estimate. Ideally a good MTA model should have both metrics small. 3. MULTI-TOUCH ATTRIBUTION MODELS 3.1 A Bagged Logistic Regression There have been intensive research on classification modeling in the literature. Some well known examples include support vector machines [8], neural networks [10], and other unique methods designed for online advertising in [6] and [9]. See [7], [10] and [11] for a good review. Most of those methods generate a complex model, some of which are of a black-box type. The resulting classification boundary is rather flexible, so it can achieve a competent classification accuracy. However, in attribution modeling, it is more of a concern to obtain a model that is stable and relatively easy to interpret, so that advertisers can develop a clear strategy to optimize their resource allocations and optimization among multiple advertising channels. The bagging approach as a meta learning method was first proposed in [12]. One of the most popular bagged approaches is random forest[13] where decision tree models are stacked to increase performance and robustness. Bagged logistic regression is not of much interest in terms of predictive modeling, since it is more productive to combine nonlinear models in order to increase the prediction accuracy. It has been shown to be outperformed by the tree-based method [14]. On the other hand, the bagging approach possesses the ability to isolate variable collinearity, as discussed in Section of [7]. In our context of attribution modeling, we combine the commonly used logistic regression, which is simple and easy to interpret, and the bagging idea, which is to help reduce the estimation variability due to the highly correlated covariates. This results in the bagged logistic regression, which retains the ease of interpretation of a simple logistic model, whereas achieving a stable and reproducible estimation result. More specifically, the bagged logistic regression is fitted using the following steps. Step 1. For a given data set, sample a proportion p s of all the sample observations and a proportion p c of all the covariates. Fit a logistic regression model on the sampled covariates and the sampled data. Record the estimated coefficients. Step 2. Repeat Step 1 for M iterations, and the final coefficient estimate for each covariate is taken as the average of estimated coefficients in M iterations. The sample proportion p s, the covariate proportion p c, and the number of iterations M are the parameters of the bagged logistic regression. We will examine their choices in detail in Section 4. Our observations are that, for a range of values of p s and p c that are not close to either 0 or 1, the bagged logistic regression yield similar results. Besides, the results are not overly sensitive to the choice of M. When evaluating the model using the proposed V-A-metric, we find that, the bagged logistic regression achieves a very similar misclassification rate (A-metric) but enjoys a much smaller variability (V-metric) compared to a usual logistic regression, which is desirable for attribution modeling. 3.2 A Simple Probabilistic Model In addition to the bagged logistic regression model, we also develop a probabilistic model based on a combination of first and second-order conditional probabilities. This new model is even simpler than a logistic model. Such a model simplicity translates into both low estimation variability and ease of interpretation, meanwhile it trades off accuracy. As such, compared to the bagged logistic model, we expect the new model would achieve a smaller V-metric but a larger A- metric. Our numerical analysis confirms this expectation. The probabilistic model is generated using the following steps: Step 1. For a given data set, compute the empirical probability of the main factors, P(y x i) = N positive(x i) N positive(x i)+n negative(x i) (1) 260
4 and the pair-wise conditional probabilities P(y x i,x j) = N positive(x i,x j) N positive(x i,x j)+n negative(x i,x j), (2) for i j. Here y is a binary outcome variable denoting a conversion event (purchase or sign-up), and x i,i = 1,...,p, denotepdifferentadvertisingchannels. N positive(x i) and N positive(x i) denote the number of positive or negative users exposed to channel i, respectively, and N positive(x i,x j) and N negative(x i,x j) denote the number of positive or negative users exposed to both channels i and j. Step 2. The contributionof channeliis then computed at each positive user level as: C(x i) = p(y x 1 { i)+ p(y x i,x j) 2N j i j i } p(y x i) p(y x j), (3) where N j i denotes the total number of j s not equal to i. In this case it equals to N-1, or the total number of channels minus one (the channel i itself) for a particular user. The model is essentially a second-order probability estimation. Due to the similarly designed advertising messages and user s exposure to multiple media channels, there are a fair amount of overlapping between the influences of different touch points. Therefore it is critically important to include the second-order interaction terms in the probability model. Theoretically we can go to the third-order, fourthorder interactions or higher. However, the number of observations with the same third-order interaction drops significantly for even a data set as large as the one analyzed in Section 4. Therefore, it is of little practical use to attempt to estimate the empirical probability with the third or higher order. Furthermore, we make an important assumption in the probability model in that the net effect of the second-order interaction goes evenly to each of the two factors involved. Based on the Occam s Razor principle, we feel this is the minimal assumption we need to make without any data evidence to suggest otherwise. Focusing on the first and second-order terms also helps to reduce any assumption to a minimum for example, trying to split the effect in the third-order interactions can be more hazardous than in the second-order interactions. In Section 4, we will employ both the bagged logistic regression model and the probabilistic model to analyze the same advertising campaign data set for overall attribution results across all main media channels. Our results show that, while there are small differences, the general conclusion is consistent between the two models. The reason that we consider more than one model is the following. Digital advertising relies on a fair amount of subjectivity. Having two different modeling approaches give advertiser the flexibility to choose. The bagged logistic regression model is more accurate and more flexible with a larger number of covariates. It is slightly more difficult to interpret. On the other hand, the probabilistic model is less accurate but much more intuitive to interpret. In addition, the result from both models can cross-validate the general conclusion reached in the overall advertising campaign analysis. 4. NUMERICAL ANALYSIS 4.1 Data Background In this section, we analyze a large advertising campaign data set using both proposed methods. This is a 2010 advertising campaign of a consumer software and services company. The campaign ran over a four week period. The size of the data set is over 300GB compressed. We sampled onethird, i.e., 72.5 million anonymous users. In total these 72.5 million users received over 2 billion ad impressions coming from search, display, social, and video channels over a four-week period. Because search advertising is priced as pay-per-click model, only search clicks are reported for each user. Furthermore, more than a dozen advertising networks or equivalent media buying channels are involved in delivering identically designed advertisements. In our study, there are 39 channels in total. It is an unresolved but critically important problem for the advertiser to determine the true effectiveness of each media buying channels. This attribution analysis is not only important for ranking the effectiveness of the channels, but also in deriving insights so that different optimization tactic can be deployed under different circumstances. We apply the bagged logistic regression model and the simple probabilistic model to analyze this data. 4.2 Bagged Logistic Regression Analysis In this section we examine the empirical performance of the bagged logistic regression model, and compare with the usual logistic regression using the V-A-metric. In addition, we also examine the choice of the tuning parameters in the bagged logistic regression. The simulation setup is based upon the following scheme. Step 1. Randomly sample a subset of N users as the training data. We choose N = 50,000, and the ratio between the active and inactive users is 1 : 4. (The results for the ratio of 1 : 1 are very similar, so are omitted for brevity.) This leads to 10,000 randomly selected active users and 40,000 inactive users. Step 2. Randomly sample another independent subset of N users as the testing data. Step 3. Fit the bagged logistic regression to the training data, with the pre-specified sample proportion p s and the covariate proportion p c, and obtain the coefficient estimate. Step 4. Fit the usual logistic regression to the training data, and obtain the coefficient estimate. Step 5. Evaluate the misclassification error rate of both regression models on the testing data. Step 6. Repeat Steps 1 to 5 for S = 100 times. Compute the V-A-metric for both regression models. Because each sampling is random, all data have chance of being selected as training or testing data. We set the sample proportion as p s = 0.25,0.5 and 0.75, and the covariate proportion as p c = 0.25,0.5 and 0.75, respectively. Table 1 reports the results. It is seen from the table that, when p s and p c are both close to zero, the bagged 261
5 Table 1: Comparison of the bagged logistic regression (BLR) and the usual logistic regression (LR) in terms of the V-A-metric. p s p c V-metric A-metric V-metric A-metric V-metric A-metric LR BLR LR BLR LR BLR logistic model achieves a substantially smaller V-metric but also a worse A-metric compared to the usual logistic model. When p s and p c take some value in the middle range of zero and one, e.g., when p s = 0.5 and p c = 0.5, we clearly see that the bagged model achieves a variability measure that is much smaller than the variability of the usual logistic model, whereas the accuracy measure of the two models become almost identical. As p s and p c increase closer to one, the bagged model exhibits a A-metric that is essentially identical to that of the usual logistic model, but with a lower V-metric. As such we recommend to choose p s and p c to take values around 0.5 if both the variability and the accuracy are of the concern. For the number of iterations M, we have experimented with a number of values and observe the same qualitative patterns. For brevity, we only report in Table 1 the results based on M = 1000 iterations. We also note that the V-metric for the usual logistic regression varies a little although it does not depend on the varying parameters p s and p c. This is due to the random sampling variation, which to some extent reflects how variable the usual logistic model can be for the advertising data even a random subset of samples would cause visible estimation variation. 4.3 Probabilistic Model Analysis We next apply the simple probabilistic model to the same data set, and we evaluate the model with the V-A-metric. The resulting V-metric is 0.026, whereas the A-metric is Comparing with the results in Table 1, we see that the probabilistic model achieves a very low variability due to its deterministic logic and simple model structure. On the other hand, its misclassification rate is higher than the bagged logistic model, which again is intuitively attributable to the low model complexity. These observations reflect the well known bias-variance tradeoff. Although more complicated models, e.g., a higher order probabilistic model, could improve estimation accuracy, it would also induce higher variation. Besides, higher order models are often computationally infeasible for ad data of such a scale. We also compare the bagged logistic regression model and the simple probabilistic model in terms of MTA user-level assignment. For the bagged logistic model, we take the linear term ˆβ x i as the contribution of the channel i, where ˆβ denotes the coefficient estimate based on the bagged model. For the simple probabilistic model, we use equation (3) to compute user-level assignment for each channel. We resample the data S = 100 times, and show the box plot for the two models in Figure 2. First, we observe that the two models yield very similar patterns, suggesting a good agreement of the two models. Second, the bagged logistic regression model exhibits a relatively low variability across data re-sampling, whereas the simple probabilistic model shows a even smaller variability due to its model simplicity. We also comment that, for ease of comparison, we choose the simplest feature construction scheme for all the models, i.e., we only encode the presence of each channel as a binary variable. The actual model can take on more complex features such as the creative design, web-site category, time of advertisement, frequency of the user s exposure to the same ad, among others. While the scaling constants are different, both proposed models have a computation complexity of O(p 2 N), where p is the number of dimensions and N is the data sample size. These additional variables have been implemented in the production environment of the first author s company with the help of a cluster of multi-core Linux servers. The general conclusion reached in this paper extends well to those more complex models. 4.4 Interpretation of the Results We presented the user-level attribution analysis to the advertising team. Some interesting observations were made when comparing the MTA model with advertiser s existing LTA model. The comparison is show in Table 2 for a subset of channels that are of particular interests to the advertising team. As seen from the table, for search click, click, retail click and social click, MTA and LTA get very similar numbers. Essentially these types of user initiated responses are both: highly correlated to the final purchase decision; and temporally occurring very close to the purchase decision. On the other hand, the effectiveness of display ad networks are widely different. Overall, display ads (or banner ads) are undervalued by the LTA model since these ad impressions are usually further away in time from the purchase action than, say, search click. In addition, some ad networks (for example, Network G) are doing much better and some (for example, Network A) are doing much worse. This may be attributed to a trick some ad networks play in gaming the LTA model. It is called cookie bombing where large amount of low-cost almost invisible ads are shown to large amount of users. While these impressions do not have much real influence on user s decision, they appear quite often as the last ad impression user sees and therefore gets the credit from LTA model. 262
6 bagged logistic regression model V1 V3 V5 V7 V9 V11 V13 V15 V17 V19 V21 V23 V25 V27 V29 V31 V33 V35 V37 V39 simple probabilistic model Table 2: The MTA user-level attribution analysis. Channel MTA Total LTA Total Difference Search Click 17,494 17,017 97% Click 6,938 7, % Display Network A 5,567 8, % Display Network G 2, % Display Network B 1,818 1,272 70% Display Trading Desk 1,565 1,367 87% Display Network C 1,494 1,373 92% Display Network D 1,491 1,233 83% View 1, % Display Network E 1,187 1,138 96% Brand Campaign 907 1, % Social 768 1, % Display Network H % Display Network F % Display Network I % Retail Click % Display Network J % Retail % Social Click % Video % V1 V3 V5 V7 V9 V11 V13 V15 V17 V19 V21 V23 V25 V27 V29 V31 V33 V35 V37 V39 Figure 2: MTA user-level assignment for the bagged logistic regression model and the simple probabilistic model. Our models provided some important insights that helped the advertiser to gauge the true effectiveness of each media channel and root out those gaming tactics. By this change alone, it is estimated that the advertiser can improve the overall campaign performance by as much as 30%. 5. DISCUSSION In this article we proposed two statistical multi-touch attribution models. We also proposed a bivariate metric that canbeusedtoevaluateandselectadata-drivenmtamodel. We consider the main body of this work falls under descriptive or interpretive modeling, a field that has been largely ignored in comparison to predictive modeling. For digital advertising, having the right attribution model is critically important as it drives performance metric, advertising in- sights and optimization strategy. We believe our work makes some useful and unique contribution in this field. Current state-of-the-art attribution models are represented by [2], [3] and [4]. Comparing to our proposed models, none of the existing publicized models are statistically derived from the advertising data in question. To apply those models, one needs to either rely on some universal rule that would result in identical assignment regardless of advertisers or user context ([3] and [4]), or one needs to come up with some subjective assignment rule oneself based on human intuition. By contrast, our methods are data-driven and are based upon the most relevant advertising data, and as such are believed to be more accurate and objective. The probabilistic model is currently deployed in the production environment of the first author s company. It is the industry s first data-driven multi-touch attribution model commercially available to the best of our knowledge. Because of this, at the time of this writing, a number of top-5 media holding companies and several Fortune 100 advertisers have signed up to test this MTA model. We are also planning to develop and deploy the bagged logistic regression model as a follow-up version so that advertisers can choose either model to focus more on accuracy or more on interpretation. While we believe both methods are statistically sound, to make MTA models useful for digital advertising requires additional heuristics in the following areas: 1. Select the right dimensions to model on. Introducing unnecessary dimensions would introduce noise and make results difficult to interpret. 2. Control the dimensionality and cardinality. Higher dimensionality and cardinality would either significantly increase the amount of data needed for statistical significance or drown out the important conclusions. 263
7 3. Carefully encode variables so that domain knowledge could help choose a compact yet effective model. There are a number of avenues for future research. First, bagging process is a wrapper method that can be applied to many types of learning machines. For example Random Forrest [13] is a very popular bagged decision tree model. We choose logistic regression for the ease of implementation and the simple interpretation of the coefficients. One area of the future development is to extend this MTA framework to other learning machines so that we can choose a more powerful learning method while still be able to easily derive the user-level attribution assignment. Another area of development is in formalizing the heuristics needed for building specific types of MTA models that can address typical digital advertising questions such as budget allocation, crosschannel optimization, and message sequencing. The third area is in incorporating the MTA model into predictive advertising models. Attribution model defines the success metric of each advertising campaign. Because of the dominance of the LTA model, many predictive models used today are influenced by it. New predictive models are needed when advertisers start to adopt the new attribution model. 6. REFERENCES [1] D Angelo, F. Happy Birthday, Digital Advertising! [2] Chandler-Pepelnjak, J. Atlas Institute, Microsoft Advertising. Measuring ROI Beyond the Last AD. Atlas_Institute/Published_Content/ dmi-measuringroibeyondlastad.pdf. [3] Clearsaleing Inc. Clearsaleing Attribution Model. accurate-attribution-management/ [4] C3 Metric, Inc. What is C3 Metric. [5] Provost, F., Dalessandro, B., Hook, R., Zhang, X., and Murray, A. Audience Selection for On-line Brand Advertising: Privacy-friendly Social Network Targeting. In Proceedings of the Fifteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, [6] Li, W., Wang, X., Zhang, R., Cui, Y., Mao, J., and Jin, R. Exploitation and Exploration in a Performance Based Contextual Advertising System. In Proceedings of the Fifteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, [7] Hastie, T., Tibshirani, R., and Friedman, J. The Elements of Statistical Learning: Data Mining, Inference and Prediction, 2nd Edition, Springer, New York, [8] Cortes, C., and Vapnik, V. Support-Vector Networks. Machine Learning, 20, , [9] Jin, X., Li, Y., Mah, T., and Tong, J. Sensitive Webpage Classification for Content Advertising. In Proceedings of the 1st international workshop on Data mining and audience intelligence for advertising, [10] Bishop, C.M. Neural Networks for Pattern Recognition, Oxford University Press, [11] Bishop, C.M. Pattern Recognition and Machine Learning, Springer, [12] Breiman, L. Bagging Predictors. Machine Learning, 24, , [13] Breiman, L. Random Forests. Machine Learning, 45, 5-32, [14] Perlich, Cl, Provost, F., and Simonoff, J.S. Tree Induction vs. Logistic Regression: A Learning-Curve Analysis. Journal of Machine Learning Research, 4, ,
Purchase Conversions and Attribution Modeling in Online Advertising: An Empirical Investigation
Purchase Conversions and Attribution Modeling in Online Advertising: An Empirical Investigation Author: TAHIR NISAR - Email: [email protected] University: SOUTHAMPTON UNIVERSITY BUSINESS SCHOOL Track:
DATA BROUGHT TO YOU BY
DATA BROUGHT TO YOU BY WHY BEING CUSTOMER-CENTRIC MEANS BEING DATA-DRIVEN A CROSS-CHANNEL ANALYSIS OF HOW MULTI-TOUCH ATTRIBUTION CHALLENGES CATEGORY MARKETING NORMS. Changing habits and the rapid rise
Marketing Mix Modelling and Big Data P. M Cain
1) Introduction Marketing Mix Modelling and Big Data P. M Cain Big data is generally defined in terms of the volume and variety of structured and unstructured information. Whereas structured data is stored
MYTH-BUSTING SOCIAL MEDIA ADVERTISING Do ads on social Web sites work? Multi-touch attribution makes it possible to separate the facts from fiction.
RE MYTH-BUSTING SOCIAL MEDIA ADVERTISING Do ads on social Web sites work? Multi-touch attribution makes it possible to separate the facts from fiction. Data brought to you by: In recent years, social media
Classification of Bad Accounts in Credit Card Industry
Classification of Bad Accounts in Credit Card Industry Chengwei Yuan December 12, 2014 Introduction Risk management is critical for a credit card company to survive in such competing industry. In addition
SYNTASA DATA SCIENCE SERVICES
SYNTASA DATA SCIENCE SERVICES A 3 : Advanced Attribution Analysis A Data Science Approach Joseph A. Marr, Ph.D. Oscar O. Olmedo, Ph.D. Kirk D. Borne, Ph.D. February 11, 2015 The content and the concepts
Multichannel Attribution
Accenture Interactive Point of View Series Multichannel Attribution Measuring Marketing ROI in the Digital Era Multichannel Attribution Measuring Marketing ROI in the Digital Era Digital technologies have
Data Mining - Evaluation of Classifiers
Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010
Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets
Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets http://info.salford-systems.com/jsm-2015-ctw August 2015 Salford Systems Course Outline Demonstration of two classification
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
Redefining Measurement from Awareness to Conversion. Smart Market: Vol. 4 Data-Driven Marketing, Demystified
Smart Market: Vol. 4 Data-Driven Marketing, Demystified Redefining Measurement from Awareness to Conversion PROGRAMMATIC MARKETING & THE NEW CUSTOMER JOURNEY In today s multiscreen world, the odds that
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
Data Mining. Nonlinear Classification
Data Mining Unit # 6 Sajjad Haider Fall 2014 1 Nonlinear Classification Classes may not be separable by a linear boundary Suppose we randomly generate a data set as follows: X has range between 0 to 15
Digital Enterprise. White Paper. Multi-Channel Strategies that Deliver Results with the Right Marketing Attribution Model
Digital Enterprise White Paper Multi-Channel Strategies that Deliver Results with the Right Marketing Model About the Authors Vishal Machewad Head Marketing Services Practice Vishal Machewad has over 13
Mobile Real-Time Bidding and Predictive
Mobile Real-Time Bidding and Predictive Targeting AdTheorent s Real-Time Learning Machine : An Intelligent Solution FOR Mobile Advertising Marketing and media companies must move beyond basic data analysis
Why do statisticians "hate" us?
Why do statisticians "hate" us? David Hand, Heikki Mannila, Padhraic Smyth "Data mining is the analysis of (often large) observational data sets to find unsuspected relationships and to summarize the data
Getting Even More Out of Ensemble Selection
Getting Even More Out of Ensemble Selection Quan Sun Department of Computer Science The University of Waikato Hamilton, New Zealand [email protected] ABSTRACT Ensemble Selection uses forward stepwise
Chapter 6. The stacking ensemble approach
82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described
D-optimal plans in observational studies
D-optimal plans in observational studies Constanze Pumplün Stefan Rüping Katharina Morik Claus Weihs October 11, 2005 Abstract This paper investigates the use of Design of Experiments in observational
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms
Comparing the Results of Support Vector Machines with Traditional Data Mining Algorithms Scott Pion and Lutz Hamel Abstract This paper presents the results of a series of analyses performed on direct mail
CHARACTERISTICS IN FLIGHT DATA ESTIMATION WITH LOGISTIC REGRESSION AND SUPPORT VECTOR MACHINES
CHARACTERISTICS IN FLIGHT DATA ESTIMATION WITH LOGISTIC REGRESSION AND SUPPORT VECTOR MACHINES Claus Gwiggner, Ecole Polytechnique, LIX, Palaiseau, France Gert Lanckriet, University of Berkeley, EECS,
Ensemble Methods. Knowledge Discovery and Data Mining 2 (VU) (707.004) Roman Kern. KTI, TU Graz 2015-03-05
Ensemble Methods Knowledge Discovery and Data Mining 2 (VU) (707004) Roman Kern KTI, TU Graz 2015-03-05 Roman Kern (KTI, TU Graz) Ensemble Methods 2015-03-05 1 / 38 Outline 1 Introduction 2 Classification
Nine Common Types of Data Mining Techniques Used in Predictive Analytics
1 Nine Common Types of Data Mining Techniques Used in Predictive Analytics By Laura Patterson, President, VisionEdge Marketing Predictive analytics enable you to develop mathematical models to help better
Chapter 12 Discovering New Knowledge Data Mining
Chapter 12 Discovering New Knowledge Data Mining Becerra-Fernandez, et al. -- Knowledge Management 1/e -- 2004 Prentice Hall Additional material 2007 Dekai Wu Chapter Objectives Introduce the student to
Multiple Linear Regression in Data Mining
Multiple Linear Regression in Data Mining Contents 2.1. A Review of Multiple Linear Regression 2.2. Illustration of the Regression Process 2.3. Subset Selection in Linear Regression 1 2 Chap. 2 Multiple
Fractional Revenue Attribution: Precision intelligence for multi-channel marketers
Fractional Revenue Attribution: Precision intelligence for multi-channel marketers Today most customer interactions and sales happen over a broad spectrum of marketing channels. This makes the big picture
LISTENING, INTERPRETING, AND ASKING BIG DATA MARKETING QUESTIONS
LISTENING, INTERPRETING, AND ASKING BIG DATA MARKETING QUESTIONS 1 LISTENING, INTERPRETING, AND ASKING BIG DATA MARKETING QUESTIONS IN A RECENT SURVEY BY MCKINSEY AND COMPANY, AS MANY AS 50% OF RESPONDENTS
Prediction of Stock Performance Using Analytical Techniques
136 JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE, VOL. 5, NO. 2, MAY 2013 Prediction of Stock Performance Using Analytical Techniques Carol Hargreaves Institute of Systems Science National University
THE HYBRID CART-LOGIT MODEL IN CLASSIFICATION AND DATA MINING. Dan Steinberg and N. Scott Cardell
THE HYBID CAT-LOGIT MODEL IN CLASSIFICATION AND DATA MINING Introduction Dan Steinberg and N. Scott Cardell Most data-mining projects involve classification problems assigning objects to classes whether
Bootstrapping Big Data
Bootstrapping Big Data Ariel Kleiner Ameet Talwalkar Purnamrita Sarkar Michael I. Jordan Computer Science Division University of California, Berkeley {akleiner, ameet, psarkar, jordan}@eecs.berkeley.edu
Data Mining Practical Machine Learning Tools and Techniques
Ensemble learning Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter 8 of Data Mining by I. H. Witten, E. Frank and M. A. Hall Combining multiple models Bagging The basic idea
Predict the Popularity of YouTube Videos Using Early View Data
000 001 002 003 004 005 006 007 008 009 010 011 012 013 014 015 016 017 018 019 020 021 022 023 024 025 026 027 028 029 030 031 032 033 034 035 036 037 038 039 040 041 042 043 044 045 046 047 048 049 050
A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier
A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier G.T. Prasanna Kumari Associate Professor, Dept of Computer Science and Engineering, Gokula Krishna College of Engg, Sullurpet-524121,
Benchmarking of different classes of models used for credit scoring
Benchmarking of different classes of models used for credit scoring We use this competition as an opportunity to compare the performance of different classes of predictive models. In particular we want
AdTheorent s. The Intelligent Solution for Real-time Predictive Technology in Mobile Advertising. The Intelligent Impression TM
AdTheorent s Real-Time Learning Machine (RTLM) The Intelligent Solution for Real-time Predictive Technology in Mobile Advertising Worldwide mobile advertising revenue is forecast to reach $11.4 billion
Point of View: Programmatic Ad Viewability in China
Presents Point of View: Programmatic Ad Viewability in China Market First Industry Benchmark and Analysis Powered by Introduction In digital advertising, many marketers and agencies assume that ads are
Leveraging Ensemble Models in SAS Enterprise Miner
ABSTRACT Paper SAS133-2014 Leveraging Ensemble Models in SAS Enterprise Miner Miguel Maldonado, Jared Dean, Wendy Czika, and Susan Haller SAS Institute Inc. Ensemble models combine two or more models to
Knowledge Discovery and Data Mining. Bootstrap review. Bagging Important Concepts. Notes. Lecture 19 - Bagging. Tom Kelsey. Notes
Knowledge Discovery and Data Mining Lecture 19 - Bagging Tom Kelsey School of Computer Science University of St Andrews http://tom.host.cs.st-andrews.ac.uk [email protected] Tom Kelsey ID5059-19-B &
Analyzing Customer Churn in the Software as a Service (SaaS) Industry
Analyzing Customer Churn in the Software as a Service (SaaS) Industry Ben Frank, Radford University Jeff Pittges, Radford University Abstract Predicting customer churn is a classic data mining problem.
Telecommunications, Media, and Technology. Leveraging big data to optimize digital marketing
Telecommunications, Media, and Technology Leveraging big data to optimize digital marketing Leveraging big data to optimize digital marketing 3 Leveraging big data to optimize digital marketing Given
Supervised Learning (Big Data Analytics)
Supervised Learning (Big Data Analytics) Vibhav Gogate Department of Computer Science The University of Texas at Dallas Practical advice Goal of Big Data Analytics Uncover patterns in Data. Can be used
Applying Data Science to Sales Pipelines for Fun and Profit
Applying Data Science to Sales Pipelines for Fun and Profit Andy Twigg, CTO, C9 @lambdatwigg Abstract Machine learning is now routinely applied to many areas of industry. At C9, we apply machine learning
Beyond the Click : The B2B Marketer s Guide to Display Advertising
Beyond the Click : The challenge In B2B marketing, the ultimate objective of any campaign is to generate the most qualified leads possible, convert these leads into new opportunities that fill the sales
DATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
Media attribution: Optimising digital marketing spend in Financial Services
Media : Optimising digital marketing spend in Financial Services A multi touch study on the role of Facebook, display and search marketing in Financial Services. Executive summary Media solves the key
CAUSALLY MOTIVATED ATTRIBUTION FOR ONLINE ADVERTISING
CAUSALLY MOTIVATED ATTRIBUTION FOR ONLINE ADVERTISING ABSTRACT Brian Dalessandro M6D Research, NYC,USA [email protected] Claudia Perlich M6D Research, NYC,USA [email protected] In many online advertising campaigns,
The Quantcast Display Play-By-Play. Unlocking the Value of Display Advertising
The Quantcast Display Play-By-Play Unlocking the Value of Display Advertising Introduction In 2013, businesses will spend nearly $18 billion on display advertising. Over the past few years, we've seen
Comparison of machine learning methods for intelligent tutoring systems
Comparison of machine learning methods for intelligent tutoring systems Wilhelmiina Hämäläinen 1 and Mikko Vinni 1 Department of Computer Science, University of Joensuu, P.O. Box 111, FI-80101 Joensuu
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives
A BETTER WAY TO ADVERTISE. Your guide to SmartAds & Internet Advertising
A BETTER WAY TO ADVERTISE Your guide to SmartAds & Internet Advertising CONTENTS The Basics Of Internet Advertising It all starts with Cookies Segmentation Targeting Re- Targeting Geographic Targeting
Strategic Online Advertising: Modeling Internet User Behavior with
2 Strategic Online Advertising: Modeling Internet User Behavior with Patrick Johnston, Nicholas Kristoff, Heather McGinness, Phuong Vu, Nathaniel Wong, Jason Wright with William T. Scherer and Matthew
Generalizing Random Forests Principles to other Methods: Random MultiNomial Logit, Random Naive Bayes, Anita Prinzie & Dirk Van den Poel
Generalizing Random Forests Principles to other Methods: Random MultiNomial Logit, Random Naive Bayes, Anita Prinzie & Dirk Van den Poel Copyright 2008 All rights reserved. Random Forests Forest of decision
A Learning Algorithm For Neural Network Ensembles
A Learning Algorithm For Neural Network Ensembles H. D. Navone, P. M. Granitto, P. F. Verdes and H. A. Ceccatto Instituto de Física Rosario (CONICET-UNR) Blvd. 27 de Febrero 210 Bis, 2000 Rosario. República
A Decision Theoretic Approach to Targeted Advertising
82 UNCERTAINTY IN ARTIFICIAL INTELLIGENCE PROCEEDINGS 2000 A Decision Theoretic Approach to Targeted Advertising David Maxwell Chickering and David Heckerman Microsoft Research Redmond WA, 98052-6399 [email protected]
Numerical Algorithms Group
Title: Summary: Using the Component Approach to Craft Customized Data Mining Solutions One definition of data mining is the non-trivial extraction of implicit, previously unknown and potentially useful
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM Thanh-Nghi Do College of Information Technology, Cantho University 1 Ly Tu Trong Street, Ninh Kieu District Cantho City, Vietnam
Mining Direct Marketing Data by Ensembles of Weak Learners and Rough Set Methods
Mining Direct Marketing Data by Ensembles of Weak Learners and Rough Set Methods Jerzy B laszczyński 1, Krzysztof Dembczyński 1, Wojciech Kot lowski 1, and Mariusz Paw lowski 2 1 Institute of Computing
Content Affiliates and Digital Advertising through Online Sales Channels
Content Affiliates and Digital Advertising through Online Sales Channels Abstract. Digital advertisers can use affiliate related sales channels to drive more qualified and targeted user traffic to their
Using multiple models: Bagging, Boosting, Ensembles, Forests
Using multiple models: Bagging, Boosting, Ensembles, Forests Bagging Combining predictions from multiple models Different models obtained from bootstrap samples of training data Average predictions or
How To Solve The Kd Cup 2010 Challenge
A Lightweight Solution to the Educational Data Mining Challenge Kun Liu Yan Xing Faculty of Automation Guangdong University of Technology Guangzhou, 510090, China [email protected] [email protected]
Advanced Ensemble Strategies for Polynomial Models
Advanced Ensemble Strategies for Polynomial Models Pavel Kordík 1, Jan Černý 2 1 Dept. of Computer Science, Faculty of Information Technology, Czech Technical University in Prague, 2 Dept. of Computer
Feature vs. Classifier Fusion for Predictive Data Mining a Case Study in Pesticide Classification
Feature vs. Classifier Fusion for Predictive Data Mining a Case Study in Pesticide Classification Henrik Boström School of Humanities and Informatics University of Skövde P.O. Box 408, SE-541 28 Skövde
HAS PROFOUNDLY CHANGED THE WAY AMERICANS SHOP...
The Internet HAS PROFOUNDLY CHANGED THE WAY AMERICANS SHOP...... a fact that has opened up new possibilities for advertising your business. As consumers continue to turn to online sources in the process
Studying Auto Insurance Data
Studying Auto Insurance Data Ashutosh Nandeshwar February 23, 2010 1 Introduction To study auto insurance data using traditional and non-traditional tools, I downloaded a well-studied data from http://www.statsci.org/data/general/motorins.
REVIEW OF ENSEMBLE CLASSIFICATION
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IJCSMC, Vol. 2, Issue.
STATISTICA. Financial Institutions. Case Study: Credit Scoring. and
Financial Institutions and STATISTICA Case Study: Credit Scoring STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table of Contents INTRODUCTION: WHAT
ELITE SEM OVERVIEW SEM & SEO
ELITE SEM OVERVIEW SEM & SEO Zach Morrison Vice President 212.863.9699 [email protected] AGENDA Introductions Agency Background Search Engine Marketing SEM Strategic Approach Sample Reporting and Analytics
Chapter 12 Bagging and Random Forests
Chapter 12 Bagging and Random Forests Xiaogang Su Department of Statistics and Actuarial Science University of Central Florida - 1 - Outline A brief introduction to the bootstrap Bagging: basic concepts
Machine Learning for Display Advertising
Machine Learning for Display Advertising The Idea: Social Targeting for Online Advertising Performance Current ad spending seems disproportionate Online Advertising Spending Breakdown (2009) Source: IAB
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING USING NEURAL NETWORKS JothiKumar.R 1, Sivabalan.R.V 2 1 Research scholar, Noorul Islam University, Nagercoil, India Assistant Professor, Adhiparasakthi College
Maximize Social Media Effectiveness with Data Science. An Insurance Industry White Paper from Saama Technologies, Inc.
Maximize Social Media Effectiveness with Data Science An Insurance Industry White Paper from Saama Technologies, Inc. February 2014 Table of Contents Executive Summary 1 Social Media for Insurance 2 Effective
A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH
205 A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH ABSTRACT MR. HEMANT KUMAR*; DR. SARMISTHA SARMA** *Assistant Professor, Department of Information Technology (IT), Institute of Innovation in Technology
fragment of your imagination?
Is good digital advertising a fragment of your imagination? Ken Mallon Global President, Digital Allan Thompson Vice President, Digital Product Development This media revolution can be an extremely sharp
not possible or was possible at a high cost for collecting the data.
Data Mining and Knowledge Discovery Generating knowledge from data Knowledge Discovery Data Mining White Paper Organizations collect a vast amount of data in the process of carrying out their day-to-day
New Ensemble Combination Scheme
New Ensemble Combination Scheme Namhyoung Kim, Youngdoo Son, and Jaewook Lee, Member, IEEE Abstract Recently many statistical learning techniques are successfully developed and used in several areas However,
How To Identify A Churner
2012 45th Hawaii International Conference on System Sciences A New Ensemble Model for Efficient Churn Prediction in Mobile Telecommunication Namhyoung Kim, Jaewook Lee Department of Industrial and Management
Paper AA-08-2015. Get the highest bangs for your marketing bucks using Incremental Response Models in SAS Enterprise Miner TM
Paper AA-08-2015 Get the highest bangs for your marketing bucks using Incremental Response Models in SAS Enterprise Miner TM Delali Agbenyegah, Alliance Data Systems, Columbus, Ohio 0.0 ABSTRACT Traditional
MSCA 31000 Introduction to Statistical Concepts
MSCA 31000 Introduction to Statistical Concepts This course provides general exposure to basic statistical concepts that are necessary for students to understand the content presented in more advanced
Azure Machine Learning, SQL Data Mining and R
Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:
Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Mining the Software Change Repository of a Legacy Telephony System
Mining the Software Change Repository of a Legacy Telephony System Jelber Sayyad Shirabad, Timothy C. Lethbridge, Stan Matwin School of Information Technology and Engineering University of Ottawa, Ottawa,
CI6227: Data Mining. Lesson 11b: Ensemble Learning. Data Analytics Department, Institute for Infocomm Research, A*STAR, Singapore.
CI6227: Data Mining Lesson 11b: Ensemble Learning Sinno Jialin PAN Data Analytics Department, Institute for Infocomm Research, A*STAR, Singapore Acknowledgements: slides are adapted from the lecture notes
A semi-supervised Spam mail detector
A semi-supervised Spam mail detector Bernhard Pfahringer Department of Computer Science, University of Waikato, Hamilton, New Zealand Abstract. This document describes a novel semi-supervised approach
Regularized Logistic Regression for Mind Reading with Parallel Validation
Regularized Logistic Regression for Mind Reading with Parallel Validation Heikki Huttunen, Jukka-Pekka Kauppi, Jussi Tohka Tampere University of Technology Department of Signal Processing Tampere, Finland
The Top 5 Mobile Marketing Mistakes
The Top 5 Mobile Marketing Mistakes And How to Avoid Them Data fueled mobile marketing powered by miq www.fiksu.com The challenges of effectively marketing on mobile are numerous. And while each marketer
DIGITAL MARKETING BASICS: PPC
DIGITAL MARKETING BASICS: PPC Search Engine Marketing (SEM) is an umbrella term referring to all activities that generate visibility in search engine result pages (SERPs) through the use of paid placement,
Chapter 20: Data Analysis
Chapter 20: Data Analysis Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 20: Data Analysis Decision Support Systems Data Warehousing Data Mining Classification
Study Guide #2 for MKTG 469 Advertising Types of online advertising:
Study Guide #2 for MKTG 469 Advertising Types of online advertising: Display (banner) ads, Search ads Paid search, Ads on social networks, Mobile ads Direct response is growing faster, Not all ads are
Multivariate testing. Understanding multivariate testing techniques and how to apply them to your email marketing strategies
Understanding multivariate testing techniques and how to apply them to your email marketing strategies An Experian Marketing Services white paper Testing should be standard operating procedure Whether
Android App Marketing and Google Play
Android App Marketing and Google Play WHAT YOU NEED TO KNOW Learn how to improve Android app discovery, drive more installs, and generate long-term, loyal usage. TO SUCCEED, MARKETERS NEED VISIBILITY.
What is the Cost of an Unseen Ad?
What is the Cost of an Unseen Ad? Steven Millman, comscore ZhiWei Tan, comscore Abstract In order to determine the relationship between viewability and campaign lift, viewable and non-viewable ads - as
State of Marketing Measurement Survey Report
2014 State of Marketing Measurement Survey Report 1 Foreword Welcome to the Ifbyphone 2014 State of Marketing Measurement Survey report. Since we last published our trends on the fast-evolving marketing
II. RELATED WORK. Sentiment Mining
Sentiment Mining Using Ensemble Classification Models Matthew Whitehead and Larry Yaeger Indiana University School of Informatics 901 E. 10th St. Bloomington, IN 47408 {mewhiteh, larryy}@indiana.edu Abstract
Specific Usage of Visual Data Analysis Techniques
Specific Usage of Visual Data Analysis Techniques Snezana Savoska 1 and Suzana Loskovska 2 1 Faculty of Administration and Management of Information systems, Partizanska bb, 7000, Bitola, Republic of Macedonia
2015 Workshops for Professors
SAS Education Grow with us Offered by the SAS Global Academic Program Supporting teaching, learning and research in higher education 2015 Workshops for Professors 1 Workshops for Professors As the market
How I won the Chess Ratings: Elo vs the rest of the world Competition
How I won the Chess Ratings: Elo vs the rest of the world Competition Yannis Sismanis November 2010 Abstract This article discusses in detail the rating system that won the kaggle competition Chess Ratings:
A General Approach to Incorporate Data Quality Matrices into Data Mining Algorithms
A General Approach to Incorporate Data Quality Matrices into Data Mining Algorithms Ian Davidson 1st author's affiliation 1st line of address 2nd line of address Telephone number, incl country code 1st
