TEXATA 2015 PREPARATION GUIDE

This booklet provides participants, educators and event partners with a preparation guide for TEXATA, the 2015 Big Data Analytics World Championships. TEXATA is a fun, independent and challenging business education competition for Big Data Analytics. Its mission is to improve well-rounded technical skills, awareness and understanding of the Big Data Analytics disciplines in business. We seek to celebrate the world's best organizations, business leaders and community partners. We hope to give students and young professionals the courage to pursue exciting career paths within Big Data, Data Science and Business Analytics, and to collaborate with event partners.

The competition involves two Online Qualification Rounds, followed by a Live World Finals event in Austin, Texas, USA. This preparation booklet outlines the core concepts to be tested during TEXATA 2015. Testing will examine a diverse range of practical, technical and business themes at the heart of Big Data Analytics, including Sentiment Analysis, Machine Learning, Statistical Methods, Predictive Modeling and Analytics Insights. Round 1 and Round 2 Qualification questions will combine multiple-choice questions, short-answer questions and real-world business implementation case studies. The World Finals is an advanced business case study challenge, with in-depth interviews and face-to-face presentations before leading global judging authorities and industry leaders.

We hope you enjoy Round 1 on Saturday, September 26, worldwide. Good luck!

The TEXATA Team

Competition Structure (1 of 2)

The Online Qualification Rounds

Round 1 (4 hours): Saturday, September 26, 2015

Round 1 will be multiple choice, with some theory questions and some practical questions. Datasets will be open data. Final scores for Round 1 will be determined by the weighted proportion of correct answers, with additional marks available for prompt submission (earlier is better). Depending on performance, the top 20%-50% of participants in Round 1 will progress to Round 2.

Round 2 (4 hours): Saturday, October 10, 2015

Round 2 will have a greater focus on case studies and real-world practical questions. Theoretical questions will involve detailed treatment of technical concepts covered in Round 1. Machine Learning principles and algorithms will also be explored more deeply in Round 2. Practical questions will involve competitors implementing predictive models on very large structured and unstructured datasets. Big data sets will be open source. Competitors are welcome to include any other public domain data they feel may improve their answers. Round 2 scores will be determined by a combination of multiple-choice answer correctness, assessment of free-text answers against a rubric, predictive modeling score, and submission time (earlier is better).

Competition Structure (2 of 2)

World Finals (Austin, Texas, 6 hours): November 8-9, 2015

As part of the technical presentations, Finalists will perform a complete data analysis workflow (i.e. beginning with user interviews and ending with a results presentation). As part of the business presentations, Finalists will be interviewed by a variety of industry leaders and judging panelists on their proposed creative Big Data Analytics solution and on real-world business challenges. Finalists will have access to real business data to solve the issues identified. Finalists are responsible for their problem definition, scope, execution, and communication of business insights.

TEXATA 2015 winners will be decided by a panel of judges and the Question Design Team (including the problem owner). Evaluation will be based on a rubric covering problem identification and decomposition, approach to solution, implementation effectiveness and clarity of results communication.

Technical Requirements (1 of 2)

Programming Capabilities

Competitors will be required to write code to compete in TEXATA 2015. Competitors are free to use any languages and frameworks with which they are familiar and comfortable. Competitors will need to be comfortable performing numerical computations over data (e.g. "What is the mean of value X in this dataset?"), data processing such as aggregating and normalizing data, and working with geospatial data. More advanced machine learning and predictive modeling skills will be applicable in Round 2 and the World Finals. Whilst we are not focused on code quality or style in Rounds 1 and 2, judges may request a code review as part of their overall assessment and the judging panel interviews and presentations at the Live World Finals in Texas.

Business Results

TEXATA 2015 explores the commercial impact and real-world business insights of Big Data Analytics. It focuses on applications in business industries (e.g. financial services, e-commerce and mobility). Round 1 and Round 2 performances will be assessed on objective, fact-driven results and business insights.
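As a rough illustration of the kind of numerical computation, aggregation and normalization mentioned under Programming Capabilities, the following is a minimal Python sketch using only the standard library. The records, field names and helper functions are hypothetical and are not drawn from any TEXATA dataset; min-max scaling is shown as just one common way to normalize values.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records; field names and values are illustrative only.
records = [
    {"city": "Austin", "value": 10.0},
    {"city": "Austin", "value": 20.0},
    {"city": "Dallas", "value": 30.0},
    {"city": "Dallas", "value": 60.0},
]


def column_mean(rows, field):
    """Mean of one numeric field across all rows."""
    return mean(row[field] for row in rows)


def mean_by_group(rows, key, field):
    """Aggregate: mean of `field` for each distinct value of `key`."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(row[field])
    return {group: mean(values) for group, values in groups.items()}


def min_max_normalize(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: map everything to 0.0 to avoid dividing by zero.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


overall = column_mean(records, "value")
by_city = mean_by_group(records, "city", "value")
scaled = min_max_normalize([row["value"] for row in records])
```

On datasets too large for memory, the same three operations would typically be expressed in whichever big data framework the competitor prefers.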

Technical Requirements (2 of 2)

Amazon Web Services

Big Data sets used in the TEXATA 2015 Online Rounds will be hosted by Amazon Web Services. Competitors should be comfortable accessing and/or processing data stored in Amazon S3. Competitors are welcome to download the data from S3 to their preferred storage solution. Access details for the datasets will be provided in the days prior to the competition. TEXATA will not provide technical support for accessing the datasets beyond basic connection details.

Competition Interface

TEXATA Rounds 1 and 2 will be conducted through a web browser. Participants will have 4 hours (240 minutes) to complete each Round. Participants are expected to have access to a computer with internet access and to their preferred big data analytics environment during this time. The competition is independent and product agnostic: every participant can use any technological tool, methodology and process to produce their competition solution. Competitors will enter their multiple-choice answers and written case study answers via the HackerRank technology competition platform.
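For the Amazon S3 download step mentioned under Amazon Web Services, one possible approach in Python is sketched below. It assumes the third-party boto3 library is installed and that AWS credentials are already configured locally. The bucket name, object key and helper functions are hypothetical; the real access details are the ones TEXATA provides before the competition.

```python
from urllib.parse import urlparse


def parse_s3_uri(uri):
    """Split an 's3://bucket/key' URI into (bucket, key)."""
    parsed = urlparse(uri)
    key = parsed.path.lstrip("/")
    if parsed.scheme != "s3" or not parsed.netloc or not key:
        raise ValueError(f"not a valid S3 object URI: {uri!r}")
    return parsed.netloc, key


def download_from_s3(uri, local_path):
    """Download one S3 object to a local file (hypothetical helper).

    Assumes boto3 is installed and credentials are configured;
    boto3 is imported here so parse_s3_uri works without it.
    """
    import boto3  # third-party: pip install boto3

    bucket, key = parse_s3_uri(uri)
    boto3.client("s3").download_file(bucket, key, local_path)
    return local_path


# Hypothetical location, for illustration only.
example_uri = "s3://example-bucket/path/to/data.csv"
bucket, key = parse_s3_uri(example_uri)
```

Calling `download_from_s3(example_uri, "data.csv")` would then fetch the object, provided the bucket and key actually exist and the configured credentials permit access.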

Skills & Expertise

Competitors preparing to enter TEXATA should review the following topics and skill areas. This list is neither exhaustive nor definitive. TEXATA has a strong industry focus, so don't be too concerned if you're not very experienced in matrix algebra: as long as you have the technical skills to implement big data analytics, and the business understanding to apply them effectively, you will be a strong competitor.

Statistics: Probability theory; Probability distributions; Precision, recall, accuracy measures; A/B(/n) testing experiment design & interpretation

Computer Science: Algorithm description & identification; Linear algebra; Database fundamentals; Map/Reduce program design; Big data system design; Linux command line tools

Machine Learning: Geospatial data analysis; Social network analysis; Mobile data analysis; Text analytics

Business Skills: Big data industry awareness; Stakeholder engagement; Communication of results; Data visualization
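To make one of the Statistics topics above concrete, here is a small Python sketch of precision, recall and accuracy for a binary classifier, using their standard definitions in terms of true/false positives and negatives. The labels and function names are invented for illustration, and returning 0.0 when a denominator is zero is a convention chosen for this sketch rather than a fixed rule.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true positives, false positives, false negatives, true negatives."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def precision_recall_accuracy(y_true, y_pred, positive=1):
    """Return (precision, recall, accuracy) for binary labels."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + fn + tn
    # Precision: of everything predicted positive, how much was correct.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: of everything actually positive, how much was found.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # Accuracy: fraction of all predictions that were correct.
    accuracy = (tp + tn) / total if total else 0.0
    return precision, recall, accuracy


# Invented labels: 1 = positive class, 0 = negative class.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0, 0, 0]
precision, recall, accuracy = precision_recall_accuracy(y_true, y_pred)
```

With these labels there are 2 true positives, 1 false positive, 1 false negative and 4 true negatives, so precision and recall are both 2/3 while accuracy is 0.75, which shows how accuracy can look stronger than the other two when negatives dominate.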