All-in-one Multivariate Data Analysis and Design of Experiments software



Similar documents
All-in-one Multivariate Data Analysis and Design of Experiments software

Monitoring chemical processes for early fault detection using multivariate data analysis methods

Chemometric Analysis for Spectroscopy

How To Use Mva And Doe

NIRCal Software data sheet

User Manual. Version 1.1

Multivariate Chemometric and Statistic Software Role in Process Analytical Technology

SIMCA 14 MASTER YOUR DATA SIMCA THE STANDARD IN MULTIVARIATE DATA ANALYSIS

DeCyder Extended Data Analysis module Version 1.0

2015 Workshops for Professors

Introduction to SAS Business Intelligence/Enterprise Guide Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

Tutorial for proteome data analysis using the Perseus software platform

Statistical Analysis. NBAF-B Metabolomics Masterclass. Mark Viant

Oracle Data Miner (Extension of SQL Developer 4.0)

Multivariate Tools for Modern Pharmaceutical Control FDA Perspective

Impact Analysis for Software changes in OpenLAB CDS A.01.04

Intellect Platform - Tables and Templates Basic Document Management System - A101

How To Use Statgraphics Centurion Xvii (Version 17) On A Computer Or A Computer (For Free)

Data Mining. SPSS Clementine Clementine Overview. Spring 2010 Instructor: Dr. Masoud Yaghini. Clementine

Data Analysis Tools. Tools for Summarizing Data

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19

Structural Health Monitoring Tools (SHMTools)

Azure Machine Learning, SQL Data Mining and R

Getting Answers at the Source Requires a New Way of Thinking About Software

EDM SOFTWARE ENGINEERING DATA MANAGEMENT SOFTWARE

Expanding Uniformance. Driving Digital Intelligence through Unified Data, Analytics, and Visualization

TIBCO Spotfire Business Author Essentials Quick Reference Guide. Table of contents:

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

GeoGebra Statistics and Probability

AssurX Makes Quality & Compliance a Given Not Just a Goal

Application of Automated Data Collection to Surface-Enhanced Raman Scattering (SERS)

SAS Enterprise Guide in Pharmaceutical Applications: Automated Analysis and Reporting Alex Dmitrienko, Ph.D., Eli Lilly and Company, Indianapolis, IN

DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7

Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER

Knowledge Discovery from patents using KMX Text Analytics

Spectrum 10 Spectroscopy Software

3D Viewer. user's manual _2

Hands-on Guide. FileMaker 8. Making FileMaker Pro 8 your Spreadsheet Companion. Making FileMaker Pro 8 your Spreadsheet Companion

Tableau Data Visualization Cookbook

1 st day Basic Training Course

Hierarchical Clustering Analysis

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

imc FAMOS 6.3 visualization signal analysis data processing test reporting Comprehensive data analysis and documentation imc productive testing

Instructions for Use. CyAn ADP. High-speed Analyzer. Summit G June Beckman Coulter, Inc N. Harbor Blvd. Fullerton, CA 92835

Maschinelles Lernen mit MATLAB

imc FAMOS 6.3 visualization signal analysis data processing test reporting Comprehensive data analysis and documentation imc productive testing

Benefits of Upgrading to Phoenix WinNonlin 6.2

Supervised Classification workflow in ENVI 4.8 using WorldView-2 imagery

STATISTICA. Financial Institutions. Case Study: Credit Scoring. and

Analyzing the Effect of Treatment and Time on Gene Expression in Partek Genomics Suite (PGS) 6.6: A Breast Cancer Study

Polynomial Neural Network Discovery Client User Guide

Data Mining and Visualization

Spreadsheet software for linear regression analysis

How To Use Gss Software In Trimble Business Center

Oracle Advanced Analytics 12c & SQLDEV/Oracle Data Miner 4.0 New Features

MarkerView Software for Metabolomic and Biomarker Profiling Analysis

NVMS User Manual

MM8000 safety and security with smart danger management. A scalable and flexible management station for any requirement. Answers for infrastructure.

O2PLS for improved analysis and visualization of complex data

Exiqon Array Software Manual. Quick guide to data extraction from mircury LNA microrna Arrays

Data Visualization Techniques

Industrial Roadmap for Connected Machines. Sal Spada Research Director ARC Advisory Group

Big Data: Rethinking Text Visualization

Corona process The new ZEISS spectrometer system for the food industry

How to Get More Value from Your Survey Data

Data Visualization Techniques

Vetting Smart Instruments for the Nuclear Industry

CHAPTER 6: ANALYZE MICROSOFT DYNAMICS NAV 5.0 DATA IN MICROSOFT EXCEL

LEGENDplex Data Analysis Software

How To Make A Global Cad/Cam Solution For A Large Company

SPPA-D3000 Plant Monitor Technical Description

Heat Map Explorer Getting Started Guide

Mascot Integra: Data management for Proteomics ASMS 2004

Data Mining mit der JMSL Numerical Library for Java Applications

An In-Depth Look at In-Memory Predictive Analytics for Developers

Additional sources Compilation of sources:

Files Used in this Tutorial

Getting started manual

Information Server Documentation SIMATIC. Information Server V8.0 Update 1 Information Server Documentation. Introduction 1. Web application basics 2

Nonlinear Iterative Partial Least Squares Method

Figure 1. An embedded chart on a worksheet.

BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES

XpoLog Center Suite Log Management & Analysis platform

A Streamlined Workflow for Untargeted Metabolomics

IBM SPSS Statistics 20 Part 1: Descriptive Statistics

DATASHEET. Xythos on Demand. Productivity and collaboration tools

SAS VISUAL ANALYTICS AN OVERVIEW OF POWERFUL DISCOVERY, ANALYSIS AND REPORTING

BusinessObjects Enterprise InfoView User's Guide

MultiQuant Software 2.0 for Targeted Protein / Peptide Quantification

Product Overview. Features: Toolbox. Ease of Use of Features

Scatter Plots with Error Bars

Tieto s Collection Suite Nova. Bring power to your debt collection business

Pure1 Manage User Guide

Digital Asset Management. Content Control for Valuable Media Assets

How To Cluster

Data Mining with SQL Server Data Tools

Transcription:

Bring data to life Version 10.3 All-in-one Multivariate Data Analysis and Design of Experiments software Powerful multivariate analysis methods and design of experiments Easy data importing options with intuitive workflows and interface Outstanding graphics, plots and interactive data visualization tools Cutting through complex data sets to underlying structures... is simplicity itself This intelligent engine borders upon data mining, as it cuts through prediction and classification problems

02 CAMO The Unscrambler X The Unscrambler is much more intuitive and has all the features I need plus more advanced methods than generalist statistical software Søren Bech, Head of Research, Bang & Olufsen

The Unscrambler X 03 CAMO Software for every industry For almost 30 years, The Unscrambler has enabled organizations across many industries and research fields to improve product development, process understanding and quality control through deeper data insights. Pharmaceuticals Chemicals/Petrochem Agriculture Food & Beverage Pulp & Paper Mining & Metals Oil & Gas Manufacturing Research & Academia Energy/Renewables Automotive Medical Devices Electronics Engineering Retail Semi-conductors Marketing Aerospace

04 CAMO The Unscrambler X Multivariate data analysis is a highly visual approach that helps you identify and understand patterns in large or complex data sets. Easily accessible raw data plots are a very useful tool in the analysis. In the 3D Scatter Plot above, data from paper manufacturing is shown, where each of the samples can be seen in relationship to its level of scatter, opacity and brightness. This plot can be rotated to see sample groups from different perspectives and zoomed in on individual data points. The Unscrambler X is the only major multivariate data analysis software that includes a seamlessly integrated Design of Experiments (DoE) module, giving you a powerful all-in-one tool for your data analysis needs. Multivariate analysis and Design of Experiments often go hand in hand. The Unscrambler X combines both of these powerful tools so you can use multivariate models to analyze experiments that are not well suited for classical analysis (i.e. ANOVA). Deeper data insights, faster & easier than ever When CAMO software was founded in 1984, we were pioneers and leaders in multivariate data analysis. Almost 30 years later, The Unscrambler X continues to set the standard in chemometrics and multivariate data analysis software with over 25,000 data analysts, researchers, engineers and scientists across a wide range of industries and research fi elds using the program. We believe that while your data might be complex, your software shouldn t be. That s why we focus on making The Unscrambler X easy to use. An intuitive user interface, tutorials within the software and project-based workfl ows make it easy to fi nd the data, plots and results you need quickly. And as data sets have become even more complex The Unscrambler X has also evolved, with improved handling of big data and advanced multivariate analysis tools to cut through even the most challenging data for faster, easier results. All-in-one multivariate analysis & design of experiments The latest version of The Unscrambler X builds on its tradition of powerful multivariate analytical methods and ease of use, adding even more integration options, enhanced ability to handle process data, improved Design of Experiments module and useful features for creating spectroscopic calibrations. Multivariate regression and prediction methods Multivariate classifi cation methods Exploratory data analysis tools such as PCA Extensive data pre-processing tools for spectra Classical statistics and statistical tests Integrated Design of Experiments module Easy data importing in wide range of formats Intuitive and user-friendly

The Unscrambler X 05 CAMO The plot above shows Partial Least Squares (PLS) regression analysis of designed data. PLS analysis is a useful alternative to classical DoE (ANOVA) when constraints or missing data give a non-orthogonal design that cannot or should not be analyzed using ANOVA. From the dialog box, users can chose classical ANOVA or PLS analysis depending on the orthogonality of the design and correlation between responses. Statistics are shown in the dialog box to help guide the choice of analysis. The fi gure above shows a scores plot with PCA overview using Kennard-Stone sample selection, where the points marked in green are calibration samples and points marked in orange are validation samples. Kennard-Stone sample selection is used for selecting an evenly separated set of observations for robust calibration. This is useful if the data are huge or poorly distributed. Highlights in version 10.3 Better handling of process data Option to defi ne units and tags for input variables to be saved with models Ability to set alarm and warning limits for calibration models for use in run-time applications Improved integration options through OSIsoft PI* and OPC DA* (*Not included as standard part of The Unscrambler X) Hierarchical Modeling*, Instrument Diagnostics* and more (*Not included as standard part of The Unscrambler X) Improved Design of Experiments module Enhanced D-Optimal Designs augmented with space fi lling points for more robust designs Better handling of non-simplex mixture and process/mixture designs DoE PLS for fl exible analysis of non-standard or modifi ed designs New response surface plots include control panel sliders for greatly improved interactivity and graphical optimization Improved calibration and validation sample selection Double Kennard-Stone sample selection for selecting representative samples for calibration and validation Improved features for creating sample ranges for NIR or other spectroscopic calibrations Enhanced sample labelling for easier interpretation New outlier detection tools and plots

06 CAMO The Unscrambler X The image above shows a Partial Least Squares (PLS) Regression analysis of petrochemical data in a number of different plots including Scores, Loadings, Explained Variance and Predicted vs Reference values. The samples are clearly divided into their octane levels from low (blue) to medium (red) and high (green). PLS is a powerful regression method which fi nds the best possible linear combination of indirect variables for predicting the desired outcome e.g. product quality or yield. The Unscrambler X allows you to see several different plots on the same screen, giving you a complete picture of your data from different perspectives for easier interpretation and analysis. The image above shows a plot of petrochemical data analyzed with Linear Discriminant Analysis (LDA) to classify octane levels. As shown, the samples group very clearly according to their octane levels. LDA is a powerful classifi cation method which fi nds the best separation between classes using linear or quadratic discrimination functions. When combined with PCA, LDA can be used on data with any number of correlated variables such as spectra. Regression and Prediction methods Develop models from existing data and predict the value of new samples with the powerful regression analysis tools in The Unscrambler X. These models can also be used for monitoring processes on-line, at-line or in-line. Multiple Linear Regression (MLR) Principal Component Regression (PCR) Partial Least Squares Regression (PLSR) L-shaped Partial Least Squares Regression (L-PLSR) Support Vector Machine Regression (SVM-R) Multivariate Classification methods Predict which category a sample belongs to with advanced classifi cation methods in The Unscrambler X. Classifi cation is the separation, or sorting, of a group of objects into one or more classes based on distinctive features in the objects. Linear Discriminant Analysis (LDA) Support Vector Machine Classifi cation (SVM-C) Partial Least Squares Discriminant Analysis (PLS-DA) Soft Independent Modeling of Class Analogy (SIMCA)

The Unscrambler X 07 CAMO The Unscrambler X includes a number of advanced data mining methods including Principal Component Analysis (PCA), which is a powerful tool to see, for example, how a process evolves by measuring the trajectory of samples. PCA will also help identify the variables with the most infl uence on a model. In the image above, PCA was used to model batch process data for a pharmaceutical product, with the calibration sample in blue showing how the batch evolves. A subsequent batch was projected onto the model (green), with the data showing the batch started later but closely following the calibration model s trajectory, indicating it is in the desired operating range. Pre-processing data is important when analyzing spectral data or building robust process models. The Unscrambler X includes all of the most common pre-processing options as well as more advanced tools unique to the software, such as Extended Multiplicative Scatter Correction (EMSC), which allows you to discard interference in spectra but retain interesting information relating to the chemical constituents. The Unscrambler X also has the option to preview spectra with treatments applied, so you can see the effect of the pre-processing before doing it to real data, as shown in the dialogue box above. Exploratory data analysis tools Cut through complex data to fi nd patterns easily using the powerful exploratory data analysis tools in The Unscrambler X. Exploratory data analysis, or data mining, fi nds hidden structures in large data sets. Descriptive statistics, principal component analysis and clustering are often used in initial explorations. Principal Component Analysis (PCA) Cluster Analysis Multivariate Curve Resolution (MCR) Descriptive statistics and classical statistics including T-tests, F-tests Advanced pre-treatment options Ensuring data is clean and in shape to be analyzed is essential, especially with instrument data, such as spectra. The Unscrambler X offers the most comprehensive and advanced range of data pre-treatment tools, making it the ideal software for spectroscopic applications. Smoothing Normalization Derivatives Baseline correction Standard Normal Variate (SNV) Multiplicative Scatter Correction (MSC) and Extended MSC Orthogonal Signal Correction (OSC)

08 CAMO The Unscrambler X Using The Unscrambler, we were able to resolve a product quality problem which has saved us $1M per year. It also gave us better process understanding which we have used to optimize other manufacturing processes Siri Sølberg, Senior Process Engineer, Nidar AS

The Unscrambler X 09 CAMO The enhanced DoE in The Unscrambler X includes ANOVA and response surface analysis also on constrained (D-Optimal) designs. The screen above shows a DoE overview where a corner of the response surface is removed as it is outside of the specifi ed design region. The D-Optimal algorithm ensures the design can still be analyzed using classical ANOVA. The latest version of The Unscrambler X has greatly improved graphics and interactivity. For example, the new response surface plot includes sliders for setting the levels of non-plotted design variables as well as constraints on both design and response variables. This lets you fi nd the parameter settings that give the optimal response, irrespective of the constraints in the design. Design of Experiments (DoE) The DoE module in The Unscrambler X has been improved with with enhanced D-Optimal Designs, graphical optimization and DoE PLS for more fl exible analysis of non-standard or modifi ed designs. Complete range of full and fractional factorial designs Enhanced optimization designs including Central Composite and Box-Behnken Mixture designs including Axial, Simplex Lattice and Simplex Centroid ANOVA tables, cube plots, response surfaces, Analysis of Effects, interactive tables Exceptional data visualization Understanding complex data is easy with the excellent visualization tools in The Unscrambler X. Patterns can be clearly shown, and individual samples or groups of data can be visualized from a number of perspectives. Produce publication standard plots which can be annotated as needed. The ability to see several plots on one screen simultaneously for easier interpretation The option to rotate certain plots to see data from different angles Annotate and add comments or lines to plots as required

10 CAMO The Unscrambler X The Unscrambler X has a wide range of data import options for standard formats such as Excel and ASCII as well as various scientifi c instrument formats. The software has a range of useful data import preview options, as shown above. For example auto-selection and interpolation enables similar spectra collected on different wavelengths (instruments) to be matched, and displays the spectral fi les in your folder with details on wavelength regions, size of spectra, step size etc. When The Unscrambler X is installed in compliance mode, the user is required to input their Windows credentials to log on. The audit trail will show who has logged on and which changes they have made. Easy data importing In many statistical programs, getting data into software in a format that is usable can be the fi rst hurdle, but it is easy in The Unscrambler X. The software accepts data from a wide range of scientifi c instruments, ASCII and Excel, making importing easy. Drag and drop data directly from Excel Preview data and see preview plots to ensure it looks as expected Works with most leading spectrometer data formats Accepts OSIsoft PI* and OPC DA* (*Not included as standard part of The Unscrambler X) Data security If you are working in a regulated industry, you ll understand the importance of data security. The Unscrambler X has greatly improved security options to help you meet your compliance needs. It is also helpful if you want to have models developed centrally then distributed to analytical centers. Compliance mode Digital signatures Audit trail Meets requirements of 21 CFR Part 11 (Electronic Records)

The Unscrambler X 11 CAMO Technical Specification Overview Exploratory Data Analysis Descriptive Statistics Mean/Std Dev/Quartiles/ Cross Correlations/Scatter Effects Statistical Tests Normality Test/t-Tests/F-Tests/Mardia s Multivariate Test Cluster Analysis K-Means Hierarchical Cluster Analysis (HCA) with many distance measures and cluster methods Principal Component Analysis (PCA) Choice of using NIPALS or SVD algorithms Rotation methods including Varimax, Equimax, Quartimax and Parsimax Multivariate Curve Resolution (MCR) Resolve time evolving data such as chemical reaction or chromatographic data into pure constituent profi les and pure spectra Regression and Classification Regression Methods Multiple Linear Regression (MLR)/Principal Component Regression (PCR) and Partial Least Squares Regression (PLSR) + SVR Choice of algorithms, NIPALS and SVD for PCR and NIPALS, Kernel Methods and Orthogonal Scores for PLSR L-PLS, incorporating three data tables for greater insights into data structure Advanced Classification Methods Projection using PCA and PLS models Soft Independent Modelling of Class Analogy (SIMCA) Linear Discriminant Analysis (LDA) Support Vector Machines (SVM) Classifi cation with numerous kernel types Data Pretreatments Spectral Functions Smoothing Derivatives: Moving Average/ Norris Gap/ Savitsky-Golay Baseline Correction Normalization Scatter Correction and Advanced Functions Multiplicative and Extended Multiplicative Scatter Correction (MSC/EMSC) Standard Normal Variate (SNV) Orthogonal Signal Correction (OSC) Deresolve Detrending General Transforms Choice of Centre and Scale options Spectroscopic: Refl ectance / Transmission/ Kubelka-Munck / Basic ATR correction NEW Interaction & Squares and Individual Variable Weighting Compute General Fill Missing Values Correlation Optimization Warping (COW) Design Of Experiments Improved Design Wizard Interactive design setup with full descriptions and Beginner/ Expert modes Complete range of full and fractional factorial designs Enhanced optimization designs including Central Composite (CCD), Inscribed (ICC), Face Centred (FCC) and Box-Behnken (BB) designs. Enhanced mixture designs including Axial, Simplex Lattice and Simplex Centroid designs. A new D-optimal design module with option to augment design with space-fi lling points NEW A completely new response surface plotting module with high resolution, fast graphics rendering and improved plotting controls for graphical optimization NEW Choice of Analysis Algorithms Intuitive analysis dialog helps select the best method NEW Classical DoE handles most orthogonal and weakly constrained designs Scheffé s polynomial model for mixture experiments DoE PLS can be used for any design including mixture/process design NEW Comprehensive Analysis Overview ANOVA tables and other tabular results Cube Plots Response Surfaces Analysis of Effects Interactive Tables Process Data New Alarms tab in analysis dialogs of PCA, MLR, PCR and PLSR and right-click option for setting alarm limits in the project navigator (these limits are applied for online prediction using some of our prediction engines) NEW New dialog for assigning Scalar/Vector tags as well as units. This information is used for collecting data from various sources during online monitoring of processes NEW General enhancements and bug fi xes NEW General Improvements Project Navigator Save data and analyses into projects Send complete projects to colleagues for further analysis/investigation Save and export models within a project to other applications Interactive Data Import Improved ASCII and Excel data import: View entire worksheets and interactively select data ranges before import simply using a mouse Preview data before importing from 3rd Party applications Wide range of spectral data imports including OPUS (Bruker), OMNIC (Thermo), NSAS (Foss NIRSystems), Indico (ASD), Brimrose, Guided Wave, Zeiss, Varian, Perten, JCAMP-DX, GRAMS, Matlab, previous Unscrambler versions, Perkin Elmer and Net-CDF fi les Import from Databases and OPC servers (plug-ins) Improved Security Windows domain authentication Revised 21 CFR Part 11 compliance including time stamping and time zone logging New and Improved Algorithms and Methods Basic ATR correction of absorbance transformed spectra NEW Introduced Double Kennard-Stone sample selection for PLSR, PCR and PCA NEW Plotting and Data Visualization Wide selection of plotting options Line, Bar and Scatter plots 3D and Matrix plots Histograms and Normal Probability plots Multiple Scatter plots for pair wise comparison of multiple rows or columns Enhanced Plotting Features Colour coding of unlimited categorical variables Easy access to analysis results matrices from the project navigator for plotting Interactive marking of samples or variables from plots for defi ning data ranges for analysis Add data to existing plots from other sources View Hotelling s T 2 ellipses at multiple confi dence intervals in PCA/ PCR and PLSR scores plots Plot settings in Tools Options Viewer can be used to change the default appearance of plots NEW New plots and plot layouts for Residuals and Infl uence plots in PCA, PCR, PLSR and Projection, including F-residuals with confi dence limits NEW Point labeling using value of any matching variable (Sample Grouping) NEW

Additional CAMO Software Products & Services Unscrambler X Process Pulse Real-time process monitoring software that lets you predict, identify and correct deviations in a process before they become problems. Affordable, easy to set up and use. Enterprise solutions Customized solutions which can be integrated into automation and control systems to enhance their analytical capabilities. Available for client-server and web-based architectures. Analytical Engines Software integrated directly into analytical or scientifi c instruments for on-line predictions, classifi cations or hierarchical models directly from the instrument. Training Our experienced trainers can help you use multivariate analysis to get more value from your data. Classroom, online or tailored in-house training courses from beginner to expert levels available. Consultancy and Data Analysis Services Do you have a lot of data and information but don t have resources in house or time to analyze it? Our consultants offer world-leading data analysis combined with hands-on industry expertise. Our partners CAMO Software works with a wide range of instrument and system vendors. For more information please contact your regional CAMO Software offi ce or visit www.camo.com/partners Find out more For more information please contact your regional CAMO offi ce or email sales@camo.com www.camo.com Connect with CAMO http://www.linkedin.com/company/camo-software https://twitter.com/camosoftware https://www.facebook.com/camosoftware http://dataanalysis.blog.camo.com/ http://www.youtube.com/camoseo NORWAY USA INDIA JAPAN AUSTRALIA Nedre Vollgate 8, One Woodbridge Center 14 & 15, Krishna Reddy Shibuya 3-chome Square Bldg 2F PO Box 97 N-0158 Suite 319, Woodbridge Colony, Domlur Layout 3-5-16 Shibuya Shibuya-ku St Peters Oslo NJ 07095 Bangalore - 560 071 Tokyo, 150-0002 NSW, 2044 Tel: (+47) 223 963 00 Tel: (+1) 732 726 9200 Tel: (+91) 80 4125 4242 Tel: (+81) 3 6868 7669 Tel: (+61) 4 0888 2007 Fax: (+47) 223 963 22 Fax: (+1) 973 556 1229 Fax: (+91) 80 4125 4181 Fax: (+81) 3 6730 9539 2013 CAMO Software AS