Multivariate Data Analysis In Practice 5th Edition


An Introduction to Multivariate Data Analysis and Experimental Design

Kim H. Esbensen
Ålborg University, Esbjerg

with contributions from Dominique Guyot, Frank Westad and Lars P. Houmøller

CAMO Software AS. Nedre Vollgate 8, N-0158, Oslo, NORWAY Tel: (47) Fax: (47)
CAMO Software Inc. One Woodbridge Center, Suite 319, Woodbridge, NJ 07095, USA Tel: (732) Fax: (973)
CAMO Software India Pvt. Ltd. 14 & 15, Krishna Reddy Colony Domlur Layout, Bangalore , INDIA Tel: (91) Fax: (91)

This book was produced using Doc-To-Help together with Microsoft Word. Visio and Excel were used to make some of the illustrations. The screen captures were taken with Paint Shop Pro.

Trademark Acknowledgments

Doc-To-Help is a trademark of WexTech Systems, Inc. Microsoft is a registered trademark and Windows 95, Windows NT, Excel and Word are trademarks of the Microsoft Corporation. Paint Shop Pro is a trademark of JASC, Inc. Visio is a trademark of the Shapeware Corporation.

Information in this book is subject to change without notice. No part of this document may be reproduced or transmitted in any form or by any means, electronic or mechanical, for any purpose, without the express written permission of CAMO Process AS.

ISBN
CAMO Process AS
All rights reserved.
5th edition. Re-print December 2004

Preface

October 2001

Learning to do multivariate data analysis is in many ways like learning to drive a car: you are not let loose on the road without mandatory training, theoretical and practical, as required by current concern for traffic safety. As a minimum you need to know how a car functions and you need to know the traffic code. On the other hand, everybody would agree that the real practical learning begins only after you have obtained your driver's license. This is when your personal experience really starts to accumulate. There is a strong interaction between the theory absorbed and the practice gained in this secondary, personal training period.

Please substitute "multivariate data analysis" for "driving a car" in all of the above. In this context, too, you are not let out on the data analytical road without mandatory training, theoretical and practical. The analogy is actually very apt!

This book presents a basic theoretical foundation for bilinear (projection-based) multivariate data modeling and gives a conceptual framework for starting to do your own data modeling on the data sets provided. There are some 25 data sets included in this training package. By doing all the exercises included you're off to a flying start!

Driving your newly acquired multivariate data analysis car is very much an evolutionary process: this introductory textbook is filled with illustrative examples, many practical exercises and a full set of self-examination real-world data analysis problems (with corresponding data sets). If, after all of this, you are able to work confidently on your own applications, you'll have reached the goal set for this book.

This is the 5th revised edition of this book. The first three editions were mainly reprints, the only major change being the inclusion of a completely revised chapter on Introduction to experimental design, which first appeared in the 3rd edition (CAMO). The 4th revised edition (published March 2000), however, saw many major extensions and improvements:

Text completely rewritten by the senior author, based on five years of extensive use in teaching at both university and dedicated course levels. More than copies in use.
30% new theory & text material added, reflecting extensive student response, full integration of PCA, PLS1 & PLS2 NIPALS algorithms and explanations.
Text revised with an augmented self-learning objective throughout.
Four new master data sets added (with extended self-exercise potential):
1. Master violin data (PCA/PLS)
2. Norwegian car dealerships (PCA/PLS)
3. Vintages (PCA/PLS)
4. Acoustic chemometric calibration (PCR/PLS)
Additional chapter on experimental design: new features include mixture designs and D-optimal designs.
New chapter on the powerful, novel Martens Uncertainty Test.
Comprehensive glossary of terms.

This 5th edition also includes essential additional revisions and improvements: Lars P. Houmøller, Ålborg University Esbjerg, has carried out a complete work-through of all demonstrations and exercises. Many of these had not been updated with respect to several of the intervening UNSCRAMBLER software versions. We are happy to have finally eliminated this most frustrating nuisance.

About the authors

Kim H. Esbensen, Ph.D., has more than 20 years of experience in multivariate data analysis and applied chemometrics. He was professor in chemometrics at the Norwegian Telemark Institute of Technology (HIT/TF), Institute of Process Technology (PT), where he was also head of the Chemometrics Department, Tel-Tek, Telemark Industrial R&D Center, Porsgrunn. Between these institutions he founded ACRG: the Applied Chemometrics Research Group, HIT/TF-Tel-Tek, which among other things hosted SSC6, the 6th Scandinavian Symposium on Chemometrics, in August 1999, as well as numerous other international courses, workshops and meetings. On July 1st, 2001 he moved to a position as research professor in Applied Chemometrics at Ålborg University, Esbjerg, Denmark (AUE), where he is currently leading ACACSRG: the Applied Chemometrics, Analytical Chemistry and Sampling Research Group. As the name implies, applied chemometrics activities continue in Esbjerg while new activities are added, most notably through close collaboration with assoc. prof. Lars P. Houmøller, who independently built up the area of analytical chemistry/chemometrics at AUE before Prof. Esbensen's arrival. Most recently the discipline of sampling (proper sampling) has been added, in recognition of the immense importance of sampling in any data analytical discipline, including chemometrics.

Kim H. Esbensen has published more than 60 papers and technical reports on a wide range of chemical, geochemical, industrial, technological, remote sensing, image analytic and acoustic chemometric applications. Together with Paul Geladi he has been instrumental in co-developing the concept of Multivariate Image Analysis (MIA); with ACRG he pioneered the development of the novel area of acoustic chemometrics. His M.Sc. (geology, geochemistry) is from the University of Aarhus, Denmark (1978), while his Ph.D. was conferred by the Technical University of Denmark (DTH) in 1981 within the areas of metallurgy, meteoritics and multivariate data analysis. He then did post-doctoral work for two years with the Research Group for Chemometrics at the University of Umeå, after which he worked in a Swedish geochemical exploration company, Terra Swede, for two more years. Moving to Norway, he then spent eight years as data analytical research scientist at the Norwegian Computing Center (NCC), Oslo, after which he became a senior research scientist at SINTEF, the Norwegian Foundation for Industrial and Technological Research, for four additional years. In between these two assignments he was a visiting guest professor at Norsk Hydro's Research Center in Bergen, Norway. He also holds a position as Chercheur associé (now Chercheur affilié) du Centre de Recherche en Géomatique, Université Laval, Quebec. He is a member of the editorial board of Journal of Chemometrics, Wiley Publishers, and is a member of ICS, AGU and several other geological, data analytical and statistical associations.

Dominique Guyot, educated in Statistics, Economics and Biomathematics (ENSAE and Université de Paris 7, France), has 15 years of experience in the field of chemometrics. She gained industrial experience from her work in the pharmaceutical and cosmetic industries, before joining CAMO in 1995. With CAMO, Dominique worked as a Senior Consultant, and was particularly involved in food applications. She put together a practical strategy for efficient product development, based on experimental design and multivariate data analysis. This strategy was implemented in the Guideline + software package, complemented by an integrated training course focusing on multivariate methods for food product developers. Dominique is now studying music and singing at the Conservatoire of Trondheim, Norway.

Frank Westad has an M.Sc. in physical chemistry from the University of Trondheim, Norway. He has 13 years of experience in applied multivariate data analysis, and he completed a Ph.D. in multivariate regression. Frank has given numerous courses in experimental design and multivariate analysis for companies in Europe and in the U.S.A. His main research fields include variable selection, shift modelling and image analysis.

Lars P. Houmøller has an M.Sc. in chemistry and physics from the University of Aarhus, Denmark. He has 12 years of experience in analytical chemistry and has worked 5-7 years with chemometrics. His teaching experience includes chemometrics, analytical chemistry, spectroscopy, physical chemistry, general and technical chemistry, organic and inorganic chemistry, unit operations and fluid dynamics. His research field covers NIR spectroscopic applications over a very broad industrial spectrum. He also has experience from working in the Danish food production industry.

Interaction with the authors: Kim Esbensen, Dominique Guyot, Frank Westad, Lars P. Houmøller

About this book

Since 1986, when CAMO ASA first commercialized and started marketing THE UNSCRAMBLER, many customers have asked for basic, easy-to-understand literature on chemometrics. In 1993 a group of data analysts at different competence levels was invited to a one-day seminar at CAMO, Trondheim, to discuss their experience from both learning and teaching chemometrics. The result was a blueprint outline for what came to be this introductory book: the specifications called for a comprehensive training package, involving basic, practical, easy-to-read, largely non-mathematical theory, with plenty of hands-on examples and exercises on real-world data sets. CAMO contracted SINTEF to write this book (first three editions), and the parties agreed to cooperate on the completion of the complete training package.

In the intervening years, this book was published in some copies and was used for the introductory basic training in some 15 universities and in several hundred industrial companies; reactions were many and largely constructive. We learned a lot from these criticisms; we thank all who contributed!

By 1999 the time was ripe for a complete revision of the entire package. This was undertaken by the senior author in the summer of 1999, with significant assistance from his then Ph.D. student Jun Huang (now with CAMO, Norway), from Frank Westad (Matforsk), who wrote chapter 14 (Martens Uncertainty Test), and from Dominique Guyot (CAMO), who wrote the entirely new chapter 17 (Complex Experimental Design Problems), and with further invaluable editorial and managerial contributions from Michael Byström (CAMO) and Valérie Lengard (CAMO). A most sincere thank you goes to Peter Hindmarch (CAMO, UK) for very effective linguistic streamlining of the 4th edition! The authors and CAMO also take this opportunity to acknowledge Suzanne Schönkopf's (CAMO) contribution to the editions previous to the 4th one. The present edition of this book still bears the fruit of her very important past efforts.

The publication of the 4th edition, in March 2000, was unfortunately somewhat marred by a less than complete revision of the exercises and illustrative UNSCRAMBLER runs in the book, which was not considered fatal at the time. This soon proved to be a serious mistake; disappointment and frustration from several generations of students, who wanted to follow all the exercises closely, followed rapidly. A Danish university teacher who had himself experienced this frustration close up when using the book for his own teaching, assoc. prof. Lars P. Houmøller at the University of Ålborg, Esbjerg, voluntarily took it upon himself to carry out a complete work-through of this essential didactic aspect of the book. His very valuable demo and exercise revisions, as well as a very thorough text consistency check, have now been included in toto in the 5th edition.

Today, this book is a collaborative effort between the senior author and CAMO Process AS; the tie with SINTEF is now defunct.

There is little academic glamour in writing an introductory level textbook, as the senior author has well experienced, and glamour was never the goal anyway. On the other hand, the introductory level is definitely where the largest audience and potential market exist, as CAMO has well experienced. The senior author has used the book for six consecutive years teaching introductory chemometrics, largely to engineering (M.Sc.) students, as well as for extensive course work in industrial and foreign university environments. The response from some accumulated 500 students has made this author happy, while some 5500 sales have made CAMO equally satisfied. Thus all is well with the training package!

We hope that this revised 5th edition will continue to meet the challenging demands of the market, hopefully now in an improved form. Writing for precisely this introductory audience/market constitutes the highest scientific and didactic challenge, and is thus (still) irresistible!

Acknowledgements

The authors wish to thank the following persons, institutions and companies for their very valuable help in the preparation of this training package:

Hans Blom, Østlandskonsult AS, Fredrikstad, Norway
Frode Brakstad, Norsk Hydro F-Center, Porsgrunn, Norway
Rolf Carlson, Department of Chemistry, University of Tromsø, Norway
Chevron Research & Technology Co, Richmond, CA, USA
Lennart Eriksson, Dept. of Organic Chemistry, University of Umeå, Sweden (now with Umetrics, Inc.)
Professor Magni Martens, The Royal Veterinary & Agricultural University, Denmark
Geological Survey of Greenland, Denmark
IKU, Institute for Petroleum Research, Trondheim, Norway
Norwegian Food Research Institute (MATFORSK), Ås, Norway
Norwegian Society of Process Control
Norwegian Chemometrics Society
International Chemometrics Society
UOP Guided Wave, CA, USA
Pierre Gy, Cannes, France (for a gentleman's introduction to the finest French wines)
Zander & Ingerström, Oslo, Norway
Tomas Öberg Konsult AB, Karlskoga, Sweden
KAPITAL (weekly Norwegian economic magazine), no 14/1994, pp. 50-55
Hlif Sigurjonsdottir, Reykjavik, Iceland (owner of G. Sgarabotto violin no 9)
Birgitta Spur, LSO, Reykjavik, Iceland (permission to use the Sgarabotto oeuvre data)
Sensorteknikk A/S, Bærum, Oslo (Bjørn Hope: sensor technology entrepreneur extraordinaire; Evy: for innumerable occasions: warm company, coffee and waffles, waffles, waffles)
Thorbjørn T. Lied, Maths Halstensen, Tore Gravermoen, Rune Mathisen and others (for enormous help in developing acoustic chemometrics)
Anonymous wine importer, Odense, Denmark
Helpful wine assessors (partly anonymous), Manson, WA, USA

Finally, the author(s) and CAMO wish to thank all THE UNSCRAMBLER users during the last seven years for their close relationships with us, which have given us so much added experience in teaching multivariate data analysis. And thanks for all the constructive criticism of the earlier editions of this book.

Last, but certainly not least, a warm thank you to all the students at HIT/TF, at Ålborg University, Esbjerg and many, many others who have been associated with the teachings of the authors, nearly all of whom have been very constructive in their ongoing criticism of the entire teaching system embedded in this training package. We even learned from the occasional not-so-friendly criticisms.

Communication

The period of seven years that has been the formative period for the training package has come of age. By now we are actually beginning to be rather satisfied with it! And yet: the author(s) and CAMO always welcome all critical responses to the present text. They are seriously needed in order for this work to keep improving.

Contents

1. Introduction to Multivariate Data Analysis - Overview
Indirect Observations and Correlation
Hidden Data Structures
Multivariate Data Analysis vs. Multivariate Statistics
Main Objectives of Multivariate Data Analytical Techniques
Multivariate Techniques as Projections

Getting Started - with Descriptive Statistics
Purpose
Data Set 1: Quality of Green Peas
Data Set 2: Economic Characteristics of Car Dealerships in Norway

Principal Component Analysis (PCA) Introduction
Representing the Data as a Matrix
The Variable Space - Plotting Objects in p Dimensions
Plotting Objects in Variable Space
Exercise - Plotting Raw Data (People)
The First Principal Component
Extension to Higher-Order Principal Components
Principal Component Models - Scores and Loadings
Model Center
Loadings - Relations Between X and PCs
Scores - Coordinates in PC Space
Object Residuals
Objectives of PCA
Score Plot - Map of Samples
Loading Plot - Map of Variables
3.10 Exercise: Plotting and Interpreting a PCA-Model (People)
PC-Models
The PC Model: X = TP^T + E = Structure + Noise
Residuals - The E-Matrix
How Many PCs to Use?
Variable Residuals
More about Variances - Modeling Error Variance
Exercise - Interpreting a PCA Model (Peas)
Exercise - PCA Modeling (Car Dealerships)
PCA Modeling The NIPALS Algorithm

Principal Component Analysis (PCA) - In Practice
Scaling or Weighting
Outliers
Scaling, Transformation and Normalization are Highly Problem Dependent Issues
PCA Step by Step
The Unscrambler and PCA
Summary of PCA
Interpretation of PCA-Models
Interpretation of Score Plots Look for Patterns
Summary - Interpretation of Score Plots
Summary - Interpretation of Loading Plots
PCA - What Can Go Wrong?
Exercise - Detecting Outliers (Troodos)

PCA Exercises Real-World Application Examples
Exercise - Find Clusters (Iris Species Discrimination)
Exercise - PCA for Experimental Design (Lewis Acids)
Exercise - Mud Samples
Exercise - Scaling (Troodos)

Multivariate Calibration (PCR/PLS)
Multivariate Modeling (X,Y): The Calibration Stage
Multivariate Modeling (X,Y): The Prediction Stage
Calibration Set Requirements (Training Data Set)
Introduction to Validation
Number of Components (Model Dimensionality)
Univariate Regression (y|x) and MLR
Univariate Regression (y|x)
Multiple Linear Regression, MLR
Collinearity
PCR - Principal Component Regression
Exercise - Interpretation of Jam (PCR)
Weaknesses of PCR
PLS-Regression (PLS-R)
PLS - A Powerful Alternative to PCR
PLS (X,Y): Initial Comparison with PCA(X), PCA(Y)
PLS2 NIPALS Algorithm
Interpretation of PLS Models
The PLS1 NIPALS Algorithm
Exercise - Interpretation of PLS1 (Jam)
Exercise - Interpretation PLS2 (Jam)
When to Use which Method?
Exercise - Compare PCR and PLS1 (Jam)
Summary

Validation: Mandatory Performance Testing
The Concept of Test Set Validation
Calculating the Calibration Variance (Modeling Error)
Calculating the Validation Variance (Prediction Error)
Studying the Calibration and Validation Variances
Requirements for the Test Set
Cross Validation
Leverage Corrected Validation

How to Perform PCR and PLS-R
PLS and PCR - Step by Step
Optimal Number of Components in Modeling
Information in Later PCs
Exercises on PLS and PCR: the Heart-of-the-Matter!
Exercise - PLS2 (Peas)
Exercise - PLS1 or PLS2? (Peas)
Exercise - Is PCR better than PLS? (Peas)

Multivariate Data Analysis in Practice: Miscellaneous Issues
Data Constraints
Data Matrix Dimensions
Missing Data
Data Collection
Use Historical Data
Monitoring Data from an On-Going Process
Data Generated by Planned Experiments
Perform Experiments or Collect Data - Always by Careful Reflection
The Random Design A Powerful Alternative
Selecting from Abundant Data
Selecting a Calibration Data Set from Abundant Training Data
Selecting a Validation Data Set
Error Sources
Replicates - A Means to Quantify Errors
Estimates of Experimental - and Measurement Errors
Error in Y (Reference Method): Reproducibility
Stability over Consecutive Measurements: Repeatability
Handling Replicates in Multivariate Modeling
Validation in Practice
Test Set
Cross Validation
Leverage Correction
The Multivariate Model Validation Alternatives
How Good is the Model: RMSEP and Other Measures
Residuals
Residual Variances (Calibration, Prediction)
Correction for Degrees of Freedom
RMSEP and RMSEC - Average, Representative Errors in Original Units
RMSEP, SEP and Bias
Comparison Between Prediction Error and Measurement Error
Compare RMSEP for Different Models
Compare Results with Other Methods
Other Measures of Errors
Prediction of New Data
Getting Reliable Prediction Results
How Does Prediction Work?
Prediction Used as Validation
Uncertainty at Prediction
Study Prediction Objects and Training Objects in the Same Plot
Coding Category Variables: PLS-DISCRIM
Scaling or Weighting Variables
Using the B- and the Bw-Coefficients
Calibration of Spectroscopic Data
Spectroscopic Data: Calibration Options
Interpretation of Spectroscopic Calibration Models
Choosing Wavelengths

PLS (PCR) Exercises: Real-World Application Examples - I
Exercise - Prediction of Gasoline Octane Number
Exercise - Water Quality
Exercise - Freezing Point of Jet Fuel
Exercise - Paper

PLS (PCR) Multivariate Calibration In Practice
Outliers and Subgroups
Scores
X-Y Relation Outlier Plots (T vs. U Scores)
Residuals
Dangerous Outliers or Interesting Extremes?
Systematic Errors
Y-Residuals Plotted Against Objects
Residuals Plotted Against Predicted Values
Normal Probability Plot of Residuals
Transformations
Logarithmic Transformations
Spectroscopic Transformations
Multiplicative Scatter Correction
Differentiation
Averaging
Normalization
Non-Linearities
How to Handle Non-Linearities?
Deleting Variables
Procedure for Refining Models
11.6 Precise Measurements vs. Noisy Measurements
How to Interpret the Residual Variance Plot
Summary: The Unscrambler Plots Revealing Problems

PLS (PCR) Exercises: Real-World Applications - II
Exercise - Log-Transformation (Dioxin)
Exercise - Multiplicative Scatter Correction (Alcohol)
Exercise - Dirty Data (Geologic Data with Severe Uncertainties)
Exercise - Spectroscopy Calibration (Wheat)
Exercise - QSAR (Cytotoxicity)

Master Data Sets: Interim Examination
Sgarabotto Master Violin Data Set
Norwegian Car Dealerships - Revisited
Vintages
Acoustic Chemometrics (a.c.)

Uncertainty Estimates, Significance and Stability (Martens Uncertainty Test)
Uncertainty Estimates in Regression Coefficients, b
Rotation of Perturbed Models
Variable Selection
Model Stability
Introduction
An Example Using the Paper Data
Exercise - Paper - Uncertainty Test and Model Stability

SIMCA: An Introduction to Classification
SIMCA - Fields of Use
How to Make SIMCA Class-Models?
Basic SIMCA Steps: A Standard Flow-Sheet
How Do we Classify new Samples?
Classification Results
Statistical Significance Level and its Use: An Introduction
Graphical Interpretation of Classification Results
The Coomans Plot
The Si vs. Hi Plot (Distance vs. Leverage)
Si/S0 vs. Hi
Model Distance
Variable Discrimination Power
Modeling Power
SIMCA-Exercise IRIS Classification

Introduction to Experimental Design
Experimental Design
Screening Designs
Full Factorial Designs
Fractional Factorial Designs
Plackett-Burman Designs
Analyzing a Screening Design
Significant effects
Using F-Test and P-Values to Determine Significant Effects
Exercise - Willgerodt-Kindler Reaction
Optimization Designs
Central Composite Designs
Box-Behnken Designs
Analyzing an Optimization Design
Exercise - Optimization of Enamine Synthesis
Practical Aspects of Making an Experimental Design
Extending a Design
Validation of Designed Data Sets
Problems in Designed Data Sets
Detect and Interpret Effects
How to Separate Confounded Effects?
Blocking and Repeated Response Measurements
Fold-Over Designs
What Do We Do if We Cannot Keep to the Planned Variable Settings?
A Random Design
Modeling Uncoded Data
Exercise - Designed Data with Non-Stipulated Values (Lacotid)
Experimental Design Procedure in The Unscrambler

Complex Experimental Design Problems
17.1 Introduction to Complex Experimental Design Problems
Constraints Between the Levels of Several Design Variables
A Special Case: Mixture Situations
Alternative Solutions
The Mixture Situation
An Example of Mixture Design
Screening Designs for Mixtures
Optimization Designs for Mixtures
Designs that Cover a Mixture Region Evenly
How To Deal With Constraints
Introduction to the D-Optimal Principle
Non-Mixture D-Optimal Designs
Mixture D-Optimal Designs
Advanced Topics
How To Analyze Results From Constrained Experiments
Use of PLS Regression For Constrained Designs
Relevant Regression Models
The Mixture Response Surface Plot
Exercise - Build a Mixture Design - Wines

Comparison of Methods for Multivariate Data Analysis - And their Validation
Comparison of Selected Multivariate Methods
Principal Component Analysis (PCA)
Factor Analysis (FA)
Cluster Analysis (CA)
Linear Discriminant Analysis (LDA)
Comparison: Projection Dimensionality in Multivariate Data Analysis
Multiple Linear Regression, (MLR)
Principal Component Regression (PCR)
Partial Least Squares Regression (PLS-R)
Increasing Projection Dimensionality in Regression Modeling
Choosing Multivariate Methods Is Not Optional!
Problem Formulation
Unsupervised Methods
Supervised Methods
18.5 A Final Discussion about Validation
Test Set Validation
Cross Validation
Leverage Corrected Validation
Selecting a Validation Approach in Practice
Summary of Basic Rules for Success

From Here You Are on Your Own. Good Luck!

Literature

Appendix: Algorithms
PCA
PCR
PLS
PLS

Appendix: Software Installation and User Interface
Welcome to The Unscrambler
How to Install and Configure The Unscrambler
Problems You Can Solve with The Unscrambler
The Unscrambler Workplace
The Editor
The Viewer
Dockable Views
Dialogs
The Help System
Tooltips
Using The Unscrambler Efficiently
Analyses
Some Tips to Make Your Work Easier

Glossary of Terms

Index

Monitoring chemical processes for early fault detection using multivariate data analysis methods

Monitoring chemical processes for early fault detection using multivariate data analysis methods Bring data to life Monitoring chemical processes for early fault detection using multivariate data analysis methods by Dr Frank Westad, Chief Scientific Officer, CAMO Software Makers of CAMO 02 Monitoring

More information

Chemometric Analysis for Spectroscopy

Chemometric Analysis for Spectroscopy Chemometric Analysis for Spectroscopy Bridging the Gap between the State and Measurement of a Chemical System by Dongsheng Bu, PhD, Principal Scientist, CAMO Software Inc. Chemometrics is the use of mathematical

More information

Multivariate Chemometric and Statistic Software Role in Process Analytical Technology

Multivariate Chemometric and Statistic Software Role in Process Analytical Technology Multivariate Chemometric and Statistic Software Role in Process Analytical Technology Camo Software Inc For IFPAC 2007 By Dongsheng Bu and Andrew Chu PAT Definition A system Understand the Process Design

More information

How To Use Mva And Doe

How To Use Mva And Doe Bring data to life Multivariate Data Analysis for Biotechnology and Bio-processing Powerful Multivariate Data Analysis and Design of Experiments methods are giving biotechnology companies greater insights

More information

All-in-one Multivariate Data Analysis and Design of Experiments software

All-in-one Multivariate Data Analysis and Design of Experiments software Bring data to life Version 10.3 All-in-one Multivariate Data Analysis and Design of Experiments software Powerful multivariate analysis methods and design of experiments Easy data importing options with

More information

NIRCal Software data sheet

NIRCal Software data sheet NIRCal Software data sheet NIRCal is an optional software package for NIRFlex N-500 and NIRMaster, that allows the development of qualitative and quantitative calibrations. It offers numerous chemometric

More information

Working with partners to deliver exceptional value to customers

Working with partners to deliver exceptional value to customers CAMO Software OEM PARTNER PROGRAM Working with partners to deliver exceptional value to customers Introduction to CAMO Software Our solutions and value proposition Program benefits and eligibility Bring

More information

O2PLS for improved analysis and visualization of complex data

O2PLS for improved analysis and visualization of complex data O2PLS for improved analysis and visualization of complex data Lennart Eriksson 1, Svante Wold 2 and Johan Trygg 3 1 Umetrics AB, POB 7960, SE-907 19 Umeå, Sweden, lennart.eriksson@umetrics.com 2 Umetrics

More information

Multivariate Tools for Modern Pharmaceutical Control FDA Perspective

Multivariate Tools for Modern Pharmaceutical Control FDA Perspective Multivariate Tools for Modern Pharmaceutical Control FDA Perspective IFPAC Annual Meeting 22 January 2013 Christine M. V. Moore, Ph.D. Acting Director ONDQA/CDER/FDA Outline Introduction to Multivariate

More information

How To Understand Multivariate Models

How To Understand Multivariate Models Neil H. Timm Applied Multivariate Analysis With 42 Figures Springer Contents Preface Acknowledgments List of Tables List of Figures vii ix xix xxiii 1 Introduction 1 1.1 Overview 1 1.2 Multivariate Models

More information

Partial Least Squares (PLS) Regression.

Partial Least Squares (PLS) Regression. Partial Least Squares (PLS) Regression. Hervé Abdi 1 The University of Texas at Dallas Introduction Pls regression is a recent technique that generalizes and combines features from principal component

More information

Teaching Multivariate Analysis to Business-Major Students

Teaching Multivariate Analysis to Business-Major Students Teaching Multivariate Analysis to Business-Major Students Wing-Keung Wong and Teck-Wong Soon - Kent Ridge, Singapore 1. Introduction During the last two or three decades, multivariate statistical analysis

More information

Regression Modeling Strategies

Regression Modeling Strategies Frank E. Harrell, Jr. Regression Modeling Strategies With Applications to Linear Models, Logistic Regression, and Survival Analysis With 141 Figures Springer Contents Preface Typographical Conventions

More information

SIMCA 14 MASTER YOUR DATA SIMCA THE STANDARD IN MULTIVARIATE DATA ANALYSIS

SIMCA 14 MASTER YOUR DATA SIMCA THE STANDARD IN MULTIVARIATE DATA ANALYSIS SIMCA 14 MASTER YOUR DATA SIMCA THE STANDARD IN MULTIVARIATE DATA ANALYSIS 02 Value From Data A NEW WORLD OF MASTERING DATA EXPLORE, ANALYZE AND INTERPRET Our world is increasingly dependent on data, and

More information

Statistics for Experimenters

Statistics for Experimenters Statistics for Experimenters Design, Innovation, and Discovery Second Edition GEORGE E. P. BOX J. STUART HUNTER WILLIAM G. HUNTER WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION FACHGEBIETSBGCHEREI

More information

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Overview This 4-day class is the first of the two data science courses taught by Rafal Lukawiecki. Some of the topics will be

More information

All-in-one Multivariate Data Analysis and Design of Experiments software

Bring data to life with Design-Expert Version 10.4 All-in-one Multivariate Data Analysis and Design of Experiments software Powerful and user friendly multivariate analysis methods Extensive and intuitive

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19

PREFACE xi 1 INTRODUCTION 1 1.1 Overview 1 1.2 Definition 1 1.3 Preparation 2 1.3.1 Overview 2 1.3.2 Accessing Tabular Data 3 1.3.3 Accessing Unstructured Data 3 1.3.4 Understanding the Variables and Observations

Overview of Factor Analysis

Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,

Azure Machine Learning, SQL Data Mining and R

Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:

Statistical Rules of Thumb

Second Edition Gerald van Belle University of Washington Department of Biostatistics and Department of Environmental and Occupational Health Sciences Seattle, WA WILEY A JOHN

MarkerView Software 1.2.1 for Metabolomic and Biomarker Profiling Analysis

Overview MarkerView software is a novel program designed for metabolomics applications and biomarker profiling workflows 1. Using

Multivariate Analysis of Ecological Data

MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology

Statistical Analysis. NBAF-B Metabolomics Masterclass. Mark Viant

1. Introduction 2. Univariate analysis Overview of lecture 3. Unsupervised multivariate analysis Principal components analysis (PCA) Interpreting

Empirical Model-Building and Response Surfaces

GEORGE E. P. BOX NORMAN R. DRAPER New York John Wiley

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

Multivariate Data Analysis

Multivariate Data Analysis FOR DUMmIES CAMO SOFTWARE SPECIAL EDITION by Brad Swarbrick, CAMO Software A John Wiley and Sons, Ltd, Publication Multivariate Data Analysis For Dummies, CAMO Software Special

On 4 December 1995, the National Faculty Meeting for Legal Studies agreed on the following statement:

THE UNIVERSITY OF BERGEN The Faculty of Law THE LEVEL OF DOCTORAL DEGREES IN LAW Guidelines for the Faculties of Law at the University of Bergen and the University of Oslo, adopted by the board of the

Asian Journal of Food and Agro-Industry ISSN 1906-3040 Available online at www.ajofai.info

As. J. Food Ag-Ind. 008, (0), - Research Article Analysis of NIR spectral reflectance linearization and gradient

TRAINING SCHOOL IN EXPERIMENTAL DESIGN & STATISTICAL ANALYSIS OF BIOMEDICAL EXPERIMENTS

March 3 1 April 15 University of Coimbra, Portugal Supporters: CPD accreditation: FRAME delivers regular training

Data Mining and Visualization

Jeremy Walton NAG Ltd, Oxford Overview Data mining components Functionality Example application Quality control Visualization Use of 3D Example application Market research

Introduction to Engineering System Dynamics

CHAPTER 0 Introduction to Engineering System Dynamics 0.1 INTRODUCTION The objective of an engineering analysis of a dynamic system is prediction of its behaviour or performance. Real dynamic systems are

1st day Basic Training Course

DATES AND LOCATIONS 13-14 April 2015 Princeton Marriott at Forrestal, 100 College Road East, Princeton NJ 08540, New Jersey 16-17 April 2015 Hotel Nikko San Francisco 222 Mason Street, San Francisco, CA

Security Metrics: A Beginner's Guide

Caroline Wong McGraw Hill New York Chicago San Francisco Lisbon London Madrid Mexico City Milan New Delhi San Juan Seoul Singapore Sydney Toronto Contents FOREWORD

An Introduction to Partial Least Squares Regression

Randall D. Tobias, SAS Institute Inc., Cary, NC Abstract Partial least squares is a popular method for soft modelling in industrial applications. This

Network Anomaly Detection: A Machine Learning Perspective

Dhruba Kumar Bhattacharyya Jugal Kumar Kalita CRC Press Taylor & Francis Group Boca Raton London New York CRC Press is an imprint of the Taylor

Multivariate Statistical Inference and Applications

ALVIN C. RENCHER Department of Statistics Brigham Young University A Wiley-Interscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim

4. Simple regression. QBUS6840 Predictive Analytics. https://www.otexts.org/fpp/4

Outline The simple linear model Least squares estimation Forecasting with regression Non-linear functional forms Regression

Time series experiments

Why is this a separate lecture: The price of microarrays are decreasing more time series experiments are coming Often a more complex experimental design

Application Note. The Optimization of Injection Molding Processes Using Design of Experiments

PROBLEM Manufacturers have three primary goals: 1) produce goods that meet customer specifications; 2) improve process efficiency

Diagnosis of Students Online Learning Portfolios

Chien-Ming Chen 1, Chao-Yi Li 2, Te-Yi Chan 3, Bin-Shyan Jong 4, and Tsong-Wuu Lin 5 Abstract - Online learning is different from the instruction provided

Introduction to Principal Components and Factor Analysis

Multivariate Analysis often starts out with data involving a substantial number of correlated variables. Principal Component Analysis (PCA) is a

Data Mining for Business Intelligence. Concepts, Techniques, and Applications in Microsoft Office Excel with XLMiner. 2nd Edition

Brochure More information from http://www.researchandmarkets.com/reports/2170926/ Data Mining for Business Intelligence. Concepts, Techniques, and Applications in Microsoft Office Excel with XLMiner. 2nd

Analysis of Financial Time Series

Financial Econometrics RUEY S. TSAY University of Chicago A Wiley-Interscience Publication JOHN WILEY & SONS, INC. This book is printed

Principal Component Analysis

ERS70D George Fernandez INTRODUCTION Analysis of multivariate data plays a key role in data analysis. Multivariate data consists of many different attributes or variables recorded

RARITAN VALLEY COMMUNITY COLLEGE ACADEMIC COURSE OUTLINE MATH 111H STATISTICS II HONORS

I. Basic Course Information A. Course Number and Title: MATH 111H Statistics II Honors B. New or Modified Course:

Data Analysis on the ABI PRISM 7700 Sequence Detection System: Setting Baselines and Thresholds. Overview. Data Analysis Tutorial

Overview In order for accuracy and precision to be optimal, the assay must be properly evaluated and a few

HOW TO USE MINITAB: DESIGN OF EXPERIMENTS. Noelle M. Richard 08/27/14

CONTENTS 1. Terminology 2. Factorial Designs When to Use? (preliminary experiments) Full Factorial Design General Full Factorial Design

THE STANDARD FOR DOCTORAL DEGREES IN LAW AT THE FACULTY OF LAW, UNIVERSITY OF TROMSØ

THE FACULTY OF LAW THE STANDARD FOR DOCTORAL DEGREES IN LAW AT THE FACULTY OF LAW, UNIVERSITY OF TROMSØ Guidelines for the Faculty of Law in Tromsø, adopted by the Faculty Board on 31 May 2010. 1 Background

Computer-Aided Multivariate Analysis

FOURTH EDITION Abdelmonem Afifi Virginia A. Clark and Susanne May CHAPMAN & HALL/CRC A CRC Press Company Boca Raton London New York Washington, D.C Contents Preface

Experiment #1, Analyze Data using Excel, Calculator and Graphs.

Physics 182 - Fall 2014 - Experiment #1 1 Experiment #1, Analyze Data using Excel, Calculator and Graphs. 1 Purpose (5 Points, Including Title. Points apply to your lab report.) Before we start measuring

Application of Automated Data Collection to Surface-Enhanced Raman Scattering (SERS)

Application Note: 52020 Application of Automated Data Collection to Surface-Enhanced Raman Scattering (SERS) Timothy O. Deschaines, Ph.D., Thermo Fisher Scientific, Madison, WI, USA Key Words Array Automation

Simple Predictive Analytics Curtis Seare

Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use

Graduate Certificate in Systems Engineering

Systems Engineering is a multi-disciplinary field that aims at integrating the engineering and management functions in the development and creation of a product,

Succession planning in Chinese family-owned businesses in Hong Kong: an exploratory study on critical success factors and successor selection criteria

By Ling Ming Chan BEng (University of Newcastle upon

Time Series Analysis: Univariate and Multivariate Methods

SECOND EDITION William W. S. Wei Department of Statistics The Fox School of Business and Management Temple University PEARSON Addison Wesley Boston

Data Visualization. Principles and Practice. Second Edition. Alexandru Telea

First edition published in 2007 by A K Peters, Ltd. Cover image: The cover shows the combination of scientific visualization and

What Is School Mathematics?

Lisbon, Portugal January 30, 2010 H. Wu *I am grateful to Alexandra Alves-Rodrigues for her many contributions that helped shape this document. The German conductor Herbert

Chapter 5: Analysis of The National Education Longitudinal Study (NELS:88)

Introduction The National Educational Longitudinal Survey (NELS:88) followed students from 8th grade in 1988 to 10th grade in

How to report the percentage of explained common variance in exploratory factor analysis

UNIVERSITAT ROVIRA I VIRGILI How to report the percentage of explained common variance in exploratory factor analysis Tarragona 2013 Please reference this document as: Lorenzo-Seva, U. (2013). How to report

Integrated Reservoir Asset Management

Principles and Best Practices John R. Fanchi AMSTERDAM. BOSTON. HEIDELBERG. LONDON NEW YORK. OXFORD. PARIS. SAN DIEGO SAN FRANCISCO.

Practical Data Mining

Monte F. Hancock, Jr. Chief Scientist, Celestech, Inc. CRC Press Taylor & Francis Group Boca Raton London New York CRC Press is an imprint of the Taylor & Francis Group, an Informa

CITY UNIVERSITY OF HONG KONG 香港城市大學. Self-Organizing Map: Visualization and Data Handling 自組織神經網絡：可視化和數據處理

Submitted to Department of Electronic Engineering 電子工程學系 in Partial Fulfillment

The electrical field produces a force that acts

Physics Equipotential Lines and Electric Fields Plotting the Electric Field MATERIALS AND RESOURCES ABOUT THIS LESSON EACH GROUP 5 alligator clip leads 2 batteries, 9 V 2 binder clips, large computer LabQuest

Dimensionality Reduction: Principal Components Analysis

In data mining one often encounters situations where there are a large number of variables in the database. In such situations it is very likely

Why participation works

Full title Why participation works: the role of employee involvement in the implementation of the customer relationship management type of organizational change. Key words Participation,

QUALITY MANAGEMENT IN VETERINARY TESTING LABORATORIES

NB: Version adopted by the World Assembly of Delegates of the OIE in May 2012 CHAPTER 1.1.4. QUALITY MANAGEMENT IN VETERINARY TESTING LABORATORIES SUMMARY Valid laboratory results are essential for diagnosis,

MATHEMATICAL METHODS OF STATISTICS

By HARALD CRAMER PROFESSOR IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS

Data Mining Techniques in CRM

Inside Customer Segmentation Konstantinos Tsiptsis CRM & Customer Intelligence Expert, Athens, Greece Antonios Chorianopoulos Data Mining Expert, Athens, Greece WILEY A John

Prerequisite: High School Chemistry.

ACT 101 Financial Accounting The course will provide the student with a fundamental understanding of accounting as a means for decision making by integrating preparation of financial information and written

A Comparison of Variable Selection Techniques for Credit Scoring

K. Leung and F. Cheong and C. Cheong School of Business Information Technology, RMIT University, Melbourne, Victoria, Australia E-mail:

Advanced Topics in Statistical Process Control

The Power of Shewhart's Charts Second Edition Donald J. Wheeler SPC Press Knoxville, Tennessee Contents Preface to the Second Edition Preface The Shewhart

Validation and Calibration. Definitions and Terminology

ACCEPTANCE CRITERIA: The specifications and acceptance/rejection criteria, such as acceptable quality level and unacceptable quality level, with an

Practical Applications of DATA MINING. Sang C Suh Texas A&M University Commerce JONES & BARTLETT LEARNING

Contents Preface xi Foreword by Murat M. Tanik xvii Foreword by John Kocur xix Chapter 1 Introduction

Contents. List of Figures. List of Tables. List of Examples. Preface to Volume IV

Contents List of Figures List of Tables List of Examples Foreword Preface to Volume IV xiii xvi xxi xxv xxix IV.1 Value at Risk and Other Risk Metrics 1 IV.1.1 Introduction 1 IV.1.2 An Overview of Market

Probability and Statistics

Syllabus for the TEMPUS SEE PhD Course (Podgorica, April 4-29, 2011) Franz Kappel 1 Institute for Mathematics and Scientific Computing University of Graz Žaneta Popeska 2 Faculty

1) Chemical Engg. PEOs & POs Programme Educational Objectives

The Programme has the following educational objectives: To prepare students for successful practice in diverse fields of chemical engineering

Design & Analysis of Ecological Data. Landscape of Statistical Methods...

Design & Analysis of Ecological Data Landscape of Statistical Methods: Part 3 Topics: 1. Multivariate statistics 2. Finding groups - cluster analysis 3. Testing/describing group differences 4. Unconstrained

Alignment and Preprocessing for Data Analysis

Preprocessing tools for chromatography Basics of alignment GC FID (D) data and issues PCA F Ratios GC MS (D) data and issues PCA F Ratios PARAFAC Piecewise

Syllabus for MATH 191 MATH 191 Topics in Data Science: Algorithms and Mathematical Foundations Department of Mathematics, UCLA Fall Quarter 2015

Lecture: MWF: 1:00-1:50pm, GEOLOGY 4645 Instructor: Mihai

CLASSIFYING SERVICES USING A BINARY VECTOR CLUSTERING ALGORITHM: PRELIMINARY RESULTS

Venkat Venkateswaran Department of Engineering and Science Rensselaer Polytechnic Institute 275 Windsor Street Hartford,

CROP CLASSIFICATION WITH HYPERSPECTRAL DATA OF THE HYMAP SENSOR USING DIFFERENT FEATURE EXTRACTION TECHNIQUES

Proceedings of the 2nd Workshop of the EARSeL SIG on Land Use and Land Cover CROP CLASSIFICATION WITH HYPERSPECTRAL DATA OF THE HYMAP SENSOR USING DIFFERENT FEATURE EXTRACTION TECHNIQUES Sebastian Mader

Design of Experiments for Analytical Method Development and Validation

Thomas A. Little Ph.D. 2/12/2014 President Thomas A. Little Consulting 12401 N Wildflower Lane Highland, UT 84003 1-925-285-1847 drlittle@dr-tom.com

D-optimal plans in observational studies

Constanze Pumplün Stefan Rüping Katharina Morik Claus Weihs October 11, 2005 Abstract This paper investigates the use of Design of Experiments in observational

COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments

Contents List of Figures Foreword Preface xxv xxiii xv Acknowledgments xxix Chapter 1 Fraud: Detection, Prevention, and Analytics! 1 Introduction 2 Fraud! 2 Fraud Detection and Prevention 10 Big Data for

MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL. by Michael L. Orlov Chemistry Department, Oregon State University (1996)

INTRODUCTION In modern science, regression analysis is a necessary part

American Statistical Association Draft Guidelines for Undergraduate Programs in Statistical Science

Guidelines Workgroup (Beth Chance, Steve Cohen, Scott Grimshaw, Johanna Hardin, Tim Hesterberg, Roger

Introduction to Regression and Data Analysis

Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

Machine Learning and Data Mining. Regression Problem. (adapted from) Prof. Alexander Ihler

Overview Regression Problem Definition and define parameters ϴ. Prediction using ϴ as parameters Measure the error

Methods for Meta-analysis in Medical Research

Alex J. Sutton University of Leicester, UK Keith R. Abrams University of Leicester, UK David R. Jones University of Leicester, UK Trevor A. Sheldon University

Regression Analysis: A Complete Example

This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

vii TABLE OF CONTENTS CHAPTER TITLE PAGE DECLARATION DEDICATION ACKNOWLEDGEMENT ABSTRACT ABSTRAK

TABLE OF CONTENTS LIST OF TABLES LIST OF FIGURES LIST OF ABBREVIATIONS LIST OF SYMBOLS LIST OF APPENDICES

How To Evaluate The Performance Of The Process Industry Supply Chain

Performance Evaluation of the Process Industry Supply Chain: Case of the Petroleum Industry in India By Siddharth Varma Submitted in fulfillment of requirements of the degree of DOCTOR OF PHILOSOPHY

A Correlation of. to the. South Carolina Data Analysis and Probability Standards

INTRODUCTION This document demonstrates how Stats in Your World 2012 meets the indicators of the South Carolina Academic Standards

CLUSTER ANALYSIS WITH R

[cluster analysis divides data into groups that are meaningful, useful, or both] LEARNING STAGE ADVANCED DURATION 3 DAY WHAT IS CLUSTER ANALYSIS? Cluster Analysis or Clustering

Using Excel for Statistics Tips and Warnings

November 2000 University of Reading Statistical Services Centre Biometrics Advisory and Support Service to DFID Contents 1. Introduction 3 1.1 Data Entry and

Exploratory Data Analysis with MATLAB

Computer Science and Data Analysis Series Exploratory Data Analysis with MATLAB Second Edition Wendy L Martinez Angel R. Martinez Jeffrey L. Solka CRC Press Taylor & Francis Group Boca Raton

FT-NIR for Online Analysis in Polyol Production

Application Note: 51594 FT-NIR for Online Analysis in Polyol Production Key Words Acid Number Ethylene Oxide FT-NIR Hydroxyl Value Polyester Polyols Abstract Hydroxyl value and other related parameters