STATISTICS and PROBABILITY Definition Statistics is the science and practice of developing human knowledge through the use of empirical data



Similar documents
STATISTICAL DATA ANALYSIS

Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems.

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

How To Understand Data Science

Homeopathic times - Irish Society of Homeopaths - July 2002

Lecture 19: Chapter 8, Section 1 Sampling Distributions: Proportions

Basic Concepts in Research and Data Analysis

MASTER S DEGREE PROGRAMS Academic Year

CHANCE ENCOUNTERS. Making Sense of Hypothesis Tests. Howard Fincher. Learning Development Tutor. Upgrade Study Advice Service

Certification Requirements by Discipline. Allied Health Sciences

Big Data Big Knowledge?

BOR 6335 Data Mining. Course Description. Course Bibliography and Required Readings. Prerequisites

Calibration and Linear Regression Analysis: A Self-Guided Tutorial

Area13 Riviste di classe A

STATUS OF NUCLEAR TECHNOLOGY EDUCATION IN MONGOLIA

Gonzaga MBA Electives

The University of Vermont: Degrees Awarded by Degree Program,

TEACHING OF STATISTICS IN KENYA. John W. Odhiambo University of Nairobi Nairobi

Virtual Site Event. Predictive Analytics: What Managers Need to Know. Presented by: Paul Arnest, MS, MBA, PMP February 11, 2015

School of Business TRINITY COLLEGE DUBLIN. Masters in Finance

Teaching Engineering Students vs. Technology Students: Early Observations

Data Analytics in Organisations and Business

VI - Academic Degree Programs of the University

Database Marketing simplified through Data Mining

Chapter 5 Estimating Demand Functions

Teaching & Learning Plans. Plan 1: Introduction to Probability. Junior Certificate Syllabus Leaving Certificate Syllabus

Probability Using Dice

Undecided? You re not alone You wil excel at a major that you enjoy studying. Your college major puts FEW restrictions on your career choices.

STAT355 - Probability & Statistics

EDMS 769L: Statistical Analysis of Longitudinal Data 1809 PAC, Th 4:15-7:00pm 2009 Spring Semester

$ ( $1) = 40

Prerequisites. Course Outline

Statistics for BIG data

Kant s Dialectic. Lecture 3 The Soul, part II John Filling jf582@cam.ac.uk

Operations and Supply Chain Management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology Madras

POL 204b: Research and Methodology

QMB 3302 Business Analytics CRN Spring 2015 T R -- 11:00am - 12:15pm -- Lutgert Hall 2209

Miami University Degrees Awarded By College/School, Major Field of Study and Type of Degree

Area 13 - Elenco delle Riviste di Classe A per Settore Concorsuale

APPENDIX B: COMPUTER SOFTWARE TO COMPILE AND ANALYZE DATA

Audit Analytics. --An innovative course at Rutgers. Qi Liu. Roman Chinchila

ANALYTICS CENTER LEARNING PROGRAM

THE DEPARTMENT OF SOCIAL AND BEHAVIORAL SCIENCES POLITICAL SCIENCE PROGRAM HANDBOOK

USING EXCEL IN AN INTRODUCTORY STATISTICS COURSE: A COMPARISON OF INSTRUCTOR AND STUDENT PERSPECTIVES

Area 13 - Elenco delle Riviste di Classe A per Settore Concorsuale (AGGIORNATO AL 01/10/2015)

The Friedman Test with MS Excel. In 3 Simple Steps. Kilem L. Gwet, Ph.D.

Claims of Fact, Value, and Policy. A multidisciplinary approach to informal argumentation

Geography 4203 / GIS Modeling. Class (Block) 9: Variogram & Kriging

Astrology s Mirror: Full-Phase Aspects

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12

Preface ANNUAL REPORT 2005

Presentation on the European Law School, Prof. Dr. Martin Heger, Humboldt Universität Berlin TDP Workshop, Berlin, April 18-20, 2008

Chapter Test B. Chapter: Measurements and Calculations

SQL INJECTION ATTACKS By Zelinski Radu, Technical University of Moldova

1 J (Gr 6): Summarize and describe distributions.

MATH Probability & Statistics - Fall Semester 2015 Dr. Brandon Samples - Department of Mathematics - Georgia College

University of Michigan Dearborn Graduate Psychology Assessment Program

Statistics Review PSY379

BOSTON UNIVERSITY SCHOOL OF PUBLIC HEALTH PUBLIC HEALTH COMPETENCIES

Introduction to Big Data! with Apache Spark" UC#BERKELEY#

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Karyn Ruiz-Cordell, MA, PhD Shunda Irons-Brown, PhD, MBA, CHCP Tamar Sapir, PhD

Being a Guarantor. Financial Series. in Alberta. What is a Guarantor? June Has someone you know asked you to be a Guarantor?

IARC Summer School in Cancer Epidemiology Application form

Study Program Handbook Psychology

Digital Cadastral Maps in Land Information Systems

Computational Science and Informatics (Data Science) Programs at GMU

Solving the math problem

What Is Probability?

Few things are more feared than statistical analysis.

WHAT IS CAPITAL BUDGETING?

What is the Purpose of College Algebra? Sheldon P. Gordon Farmingdale State College of New York

This focus on common themes has led to IFNA s motto of understanding through GLOBAL DIVERSITY, COOPERATION AND COLLABORATION.

What is Data Analysis. Kerala School of MathematicsCourse in Statistics for Scientis. Introduction to Data Analysis. Steps in a Statistical Study

International Engineering Alliance. Graduate Attributes and Professional Competencies

Preface to the Second Edition

Programme Specification 1

Prediction Markets, Fair Games and Martingales

Predictive Analytics Certificate Program

How To Teach Social Science To A Class

AP Statistics Chapters Practice Test MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

3.2 Roulette and Markov Chains

STATISTICA. Clustering Techniques. Case Study: Defining Clusters of Shopping Center Patrons. and

Fundamental Statistical Analysis for a Future with Big Data

Recall this chart that showed how most of our course would be organized:

Transcription:

STATISTICS and PROBABILITY Definition Statistics is the science and practice of developing human knowledge through the use of empirical data expressed in quantitative form. It is based on statistical theory which is supposed to be a branch of applied mathematics. Within statistical theory, randomness and uncertainty are modeled by probability theory. Because one aim of statistics is to produce the "best" information from available data, some authors consider statistics a branch of decision theory. Statistical practice includes the planning, summarizing, and interpreting of observations, allowing for variability and uncertainty. Origin The word statistics comes from - the modern Latin phrase statisticum collegium (lecture about state affairs), from which came - the Italian word statista, which means "statesman" or "politician" (compare to status) and - the German Statistik, originally designating the analysis of data about the state. It acquired the meaning

of the collection and classification of data generally in the early nineteenth century. The collection of data about states and localities continues, largely through national and international statistical services; in particular, censuses provide regular information about the population. Statistical methods We describe our knowledge (and ignorance) mathematically and attempt to learn more from whatever we can observe. This requires us to - plan our observations to control their variability (experiment design), - summarize a collection of observations to feature their commonality by suppressing details (descriptive statistics), and - reach consensus about what the observations tell us about the world we observe (statistical inference). In some forms of descriptive statistics, notably data mining, the second and third of these steps become so prominent that the first step (planning) appears to become less important. In these disciplines, data often are collected outside the control of the person doing the analysis, and the result of the analysis may be more an

operational model than a consensus report about the world. Probability The probability of an event is often defined as a number between one and zero. In reality however there is virtually nothing that has a probability of 1 or 0. Could you say that the sun will certainly rise in the morning? Your answer may be Yes. But what if an extremely unlikely event destroys the sun? What if there is a nuclear war and the sky is covered in ash and smoke? We often round the probability of such things up or down because they are so likely or unlikely to occur, that it's easier to recognize them as a probability of one or zero. However, this can often lead to misunderstandings and dangerous behavior, because people are unable to distinguish between, e.g., a probability of 10 4 and a probability of 10 9, despite the very practical difference between them. If you expect to cross the road about 10 5 or 10 6 times in your life, then reducing your risk of being run over per road crossing to 10 9 will make you

safe for your whole life, while a risk per road crossing of 10 4 will make it very likely that you will have an accident, despite the intuitive feeling that 0.01% is a very small risk. Specialized disciplines Some sciences use applied statistics so extensively that they have specialized terminology. These disciplines include: Biostatistics Business statistics Economic statistics Engineering statistics Statistical physics Demography Psychological statistics Social statistics (for all the social sciences) Process analysis and chemometrics (for analysis of data from analytical chemistry and chemical engineering) Reliability Engineering Statistics form a key basis tool in business and manufacturing as well. It is used to understand measurement systems variability, control processes (as

in statistical process control or SPC), for summarizing data, and to make data-driven decisions. In these roles it is a key tool, and perhaps the only reliable tool. Software Modern statistics is supported by computers to perform some of the very large and complex calculations required. Whole branches of statistics have been made possible by computing, for example neural networks. The computer revolution has implications for the future of statistics, with a new emphasis on 'experimental' statistics. A list of statistical packages in common use R programming language S programming language MATLAB GNU Octave Macanova MS Excel OpenOffice Calc SAS

SPSS STATA MiniTab StatPro - Free Excel add-in (http://bl-busdotnet3.ads.iu.edu/albrightbooks/default.htm#add-ins)