Big Data and Scripting. (lecture, computer science, bachelor/master/phd)

Size: px
Start display at page:

Download "Big Data and Scripting. (lecture, computer science, bachelor/master/phd)"

Transcription

1 Big Data and Scripting (lecture, computer science, bachelor/master/phd)

2 Big Data and Scripting - abstract/organization abstract introduction to Big Data and involved techniques lecture (2+2) practical exercises to be turned in dates 2 lectures (Mon 1:30 pm, M628 and Thu 10 am G302) 2 lab courses (Fri 10:00 am and 1:30 pm in Z613) oral exam, end of semester me Uwe Nagel [email protected]

3 Big Data and Scripting - organizational stuff exercises website: (about) 3 projects (bash, R, NOSQL/Hadoop) programming skills usefull, but not required discussion and help in lab course (Friday)

4 agenda - contents of this lecture prologue: What is Big Data and why bother? concrete examples identify qualitatively what sets Big Data approaches apart tools and techniques for (distributed) computation (some) basic notions of data handling Unix command line scripting in R NOSQL by example the map/reduce paradigm (example: Hadoop)

5 What this lecture does not cover basics of data mining we are using some dm-techniques this is not a data mining course lecture Data Mining: Artificial Intelligence recommender systems we will touch those without detail seminar/lecture Recommender Systems

6 Prologue what does Big Data mean and why is that interesting Big Data and distributed computing seems like a fashion is there really an advantage? where does this advantage come from? 3 example applications increasing level of detail

7 What is Big Data and why bother? a simple example - Amazon basically a selling platform provides: connection of suppliers to (private) customers a common market place (one interface for all) additional services (storage, shipment, payment) recommendation what is the difference to competitors? Amazon knows customers, products, sales and views same is true for its competitors

8 What is Big Data and why bother? in comparison, Amazon has much more customers more customers, more transactions, more views a larger data collection better recommendations estimate 1 : 1/3 of Amazon s sales generated by recommendations more data = better predictions? simple answer: essentially yes real answer: it s a bit more complicated 1 elusive-big-data

9 What is Big Data? - extraction from examples what are we trying to find out? learning/data mining and artificial intelligence are not that new somehow huge amounts of data can make a difference question: how and why? approach: analyze examples using big data 1. where is the big data 2. what kind of data is involved 3. what makes a large data base crucial

10 Target and the pregnant teen Target a large discounter chain (similar to Walmart) uses data analysis for targeted marketing central to one of the most famous big data stories the story Target predicts pregnancy better than family members source: how-target-figured-out-a-teen-girl-was-pregnant-before-her-fathe

11 Target and the pregnant teen - How? in a nutshell collect data about customers predict what they are interested in adjust advertisement to the specific person

12 Target and the pregnant teen - How? 1. step: data collection create large base of data available about customers each customer gets some unique ID (credit card, ,... ) everything that can be connected to the customer is collected connected to customer ID used for interest prediction example of data to collect items purchased together time/place of purchase weather? - whatever can be collected

13 Target and the pregnant teen - How? next: search for patterns simple: people buy what they always bought recommendation: customers who bought this usually also buy... concrete targeting, example: young parents a new child is a perfect opportunity: parents have to buy a lot of stuff (without having too much money) at this stage they are more likely bound to brands prediction of pregnancy is crucial for advertisement

14 Target and the pregnant teen - How? remark: this is how one could do it, not necessary how it was done. ground truth? customers are described by their purchases goal: identify patterns typical for pregnant women first steps: identify purchase records of pregnant women (i.e. positive label, group P) of non-pregnant customers (i.e. negative label, group N) searching for hints find commonalities within P find features distinguishing P from N build predictor: P(c P) (it is unknown, how exactly Target is doing this)

15 Target and the pregnant teen - results identified patterns quoting a Target analyst: they identified 25 products when analyzed together these allow a pregnancy prediction score P(c P) example: pregnant women buy supplements like calcium, magnesium and zinc sometime in the 20 first weeks business impact start of program: 2002 revenue growth: $44 Billion (2002) $67 Billion (2010) it is assumed that data mining was crucial for this growth

16 a second example: machine translation the task automatic translation of text given: text T in language A result: text T in language B example: Google s translator URL:

17 machine translation: a naive approach word mappings hold a dictionary W : A B replace each w T by W (w) 1. problem: words don t match exactly between languages 2. problem: grammar learning grammar 1. problem: grammar is hard, especially with semantics mixed in c.f. Chomsky s hierarchy of grammars 2. problem: language is noisy

18 machine translation: a statistical approach learning from big data new approach: don t understand or analyze instead: translation by example examples are taken from a corpus of manually translated documents basic idea (roughly) learn probability P that T is translation of T find T with maximal P approach: breaking down probabilities note: the following explains the principle and is not correct in every detail

19 machine translation: breaking down probabilities example: translate french text F to english text E P(E F ) - prob. that E is correct translation of F let F = f 1 f 2... (f i sentence, E analogous) first splitting assumption: f 1 corresponds to e i E is correct, if each e i translates its f i P(E F ) = P(e i f i ) i try to maximize P(e i f i )

20 machine translation: breaking down probabilities consider a concrete pair of sentences: Je ne vous connais pas. I don t know you. Je - I vous - you connais - know ne... pas - don t some observations words are translated (Je I) some words change place (vous you) some words change number (e.g. ne... pas don t)

21 machine translation: breaking down probabilities formalize our observations into concrete probabilities: translation P(f e) f is translation of e (Je I) distortion P(t s, l) word at position t is replaced (you nous) by word at position s in sentence of lengt fertility P(n e) e is replaced by n french words (ne pas don t)

22 machine translation: breaking down probabilities how does this help for P(E F )? recall assumption P(E F ) = i P(e i f i ) P(E F ) is high, if every P(e i f i ) is high same principle can be applied on the sentence level breaking up sentences P(f i, e i ) has many parts translation, distortion, fertility for every word some more, unknown combination by product (assuming independence) P(f i, e i ) 1, if all the parts 1 use translation, distortion, fertility as indicators

23 machine translation: missing data/open questions how are partial probabilities determined? estimation by observation recall: translation by example derive approximate probabilities by counting in corpus what is left basis: large corpus of translated documents additional: matching of sentences, words not considered here, further information:

24 discussion why does this work? it does not (translate a text into your native language and you ll see) translate.google.com still the quality of the results is surprising does it scale? why is it not always correct? what would be the impact of adding more data? can it be parallelized?

Big Data and Scripting

Big Data and Scripting Big Data and Scripting 1, 2, Big Data and Scripting - abstract/organization contents introduction to Big Data and involved techniques schedule 2 lectures (Mon 1:30 pm, M628 and Thu 10 am F420) 2 tutorials

More information

TYLER JUNIOR COLLEGE School of Continuing Studies 1530 SSW Loop 323 Tyler, TX 75701 1.800.298.5226 www.tjc.edu/continuingstudies/mycaa

TYLER JUNIOR COLLEGE School of Continuing Studies 1530 SSW Loop 323 Tyler, TX 75701 1.800.298.5226 www.tjc.edu/continuingstudies/mycaa TYLER JUNIOR COLLEGE School of Continuing Studies 1530 SSW Loop 323 Tyler, TX 75701 1.800.298.5226 www.tjc.edu/continuingstudies/mycaa Education & Training Plan Marketing Professional Program Student Full

More information

1 Choosing the right data mining techniques for the job (8 minutes,

1 Choosing the right data mining techniques for the job (8 minutes, CS490D Spring 2004 Final Solutions, May 3, 2004 Prof. Chris Clifton Time will be tight. If you spend more than the recommended time on any question, go on to the next one. If you can t answer it in the

More information

Big Data. Donald Kossmann & Nesime Tatbul Systems Group ETH Zurich

Big Data. Donald Kossmann & Nesime Tatbul Systems Group ETH Zurich Big Data Donald Kossmann & Nesime Tatbul Systems Group ETH Zurich Goal of Today What is Big Data? introduce all major buzz words What is not Big Data? get a feeling for opportunities & limitations Answering

More information

CSCI6900 Assignment 2: Naïve Bayes on Hadoop

CSCI6900 Assignment 2: Naïve Bayes on Hadoop DEPARTMENT OF COMPUTER SCIENCE, UNIVERSITY OF GEORGIA CSCI6900 Assignment 2: Naïve Bayes on Hadoop DUE: Friday, September 18 by 11:59:59pm Out September 4, 2015 1 IMPORTANT NOTES You are expected to use

More information

16.1 MAPREDUCE. For personal use only, not for distribution. 333

16.1 MAPREDUCE. For personal use only, not for distribution. 333 For personal use only, not for distribution. 333 16.1 MAPREDUCE Initially designed by the Google labs and used internally by Google, the MAPREDUCE distributed programming model is now promoted by several

More information

Management Information System Prof. Biswajit Mahanty Department of Industrial Engineering & Management Indian Institute of Technology, Kharagpur

Management Information System Prof. Biswajit Mahanty Department of Industrial Engineering & Management Indian Institute of Technology, Kharagpur Management Information System Prof. Biswajit Mahanty Department of Industrial Engineering & Management Indian Institute of Technology, Kharagpur Lecture - 03 Introduction III Welcome to all. Today let

More information

Cloud Computing. Chapter 8. 8.1 Hadoop

Cloud Computing. Chapter 8. 8.1 Hadoop Chapter 8 Cloud Computing In cloud computing, the idea is that a large corporation that has many computers could sell time on them, for example to make profitable use of excess capacity. The typical customer

More information

How Big Is Big Data Adoption? Survey Results. Survey Results... 4. Big Data Company Strategy... 6

How Big Is Big Data Adoption? Survey Results. Survey Results... 4. Big Data Company Strategy... 6 Survey Results Table of Contents Survey Results... 4 Big Data Company Strategy... 6 Big Data Business Drivers and Benefits Received... 8 Big Data Integration... 10 Big Data Implementation Challenges...

More information

DEFINITELY. GAME CHANGER? EVOLUTION? Big Data

DEFINITELY. GAME CHANGER? EVOLUTION? Big Data Big Data EVOLUTION? GAME CHANGER? DEFINITELY. EMC s Bill Schmarzo and consultant Ben Woo weigh in on whether Big Data is revolutionary, evolutionary, or both. by Terry Brown EMC+ In a recent survey of

More information

Big Data Analytics. Lucas Rego Drumond

Big Data Analytics. Lucas Rego Drumond Big Data Analytics Lucas Rego Drumond Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany Big Data Analytics Big Data Analytics 1 / 36 Outline

More information

Language and Computation

Language and Computation Language and Computation week 13, Thursday, April 24 Tamás Biró Yale University [email protected] http://www.birot.hu/courses/2014-lc/ Tamás Biró, Yale U., Language and Computation p. 1 Practical matters

More information

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat ESS event: Big Data in Official Statistics Antonino Virgillito, Istat v erbi v is 1 About me Head of Unit Web and BI Technologies, IT Directorate of Istat Project manager and technical coordinator of Web

More information

Colleen s Interview With Ivan Kolev

Colleen s Interview With Ivan Kolev Colleen s Interview With Ivan Kolev COLLEEN: [TO MY READERS] Hello, everyone, today I d like to welcome you to my interview with Ivan Kolev (affectionately known as Coolice). Hi there, Ivan, and thank

More information

A Guide to Using HiFX

A Guide to Using HiFX A Guide to Using HiFX Welcome to HiFX Whatever your international payments needs, HiFX gives you access to consistent bank beating exchange rates and the ability to arrange international money transfers

More information

Big Data Integration: A Buyer's Guide

Big Data Integration: A Buyer's Guide SEPTEMBER 2013 Buyer s Guide to Big Data Integration Sponsored by Contents Introduction 1 Challenges of Big Data Integration: New and Old 1 What You Need for Big Data Integration 3 Preferred Technology

More information

The Impact of Big Data on Classic Machine Learning Algorithms. Thomas Jensen, Senior Business Analyst @ Expedia

The Impact of Big Data on Classic Machine Learning Algorithms. Thomas Jensen, Senior Business Analyst @ Expedia The Impact of Big Data on Classic Machine Learning Algorithms Thomas Jensen, Senior Business Analyst @ Expedia Who am I? Senior Business Analyst @ Expedia Working within the competitive intelligence unit

More information

Machine Translation. Agenda

Machine Translation. Agenda Agenda Introduction to Machine Translation Data-driven statistical machine translation Translation models Parallel corpora Document-, sentence-, word-alignment Phrase-based translation MT decoding algorithm

More information

Social Media Marketing for Small Business Success

Social Media Marketing for Small Business Success The Basics of Social Media Social Media Marketing for Small Business Success Social Media Revolution Constant Contact 2014 #ctctsocial @constantcontact http://youtu.be/0euel3n7fds 2 YOUR PHOTO HERE Catrine

More information

ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam

ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam ECLT 5810 E-Commerce Data Mining Techniques - Introduction Prof. Wai Lam Data Opportunities Business infrastructure have improved the ability to collect data Virtually every aspect of business is now open

More information

OPINION MINING IN PRODUCT REVIEW SYSTEM USING BIG DATA TECHNOLOGY HADOOP

OPINION MINING IN PRODUCT REVIEW SYSTEM USING BIG DATA TECHNOLOGY HADOOP OPINION MINING IN PRODUCT REVIEW SYSTEM USING BIG DATA TECHNOLOGY HADOOP 1 KALYANKUMAR B WADDAR, 2 K SRINIVASA 1 P G Student, S.I.T Tumkur, 2 Assistant Professor S.I.T Tumkur Abstract- Product Review System

More information

Big Data Big Deal? Salford Systems www.salford-systems.com

Big Data Big Deal? Salford Systems www.salford-systems.com Big Data Big Deal? Salford Systems www.salford-systems.com 2015 Copyright Salford Systems 2010-2015 Big Data Is The New In Thing Google trends as of September 24, 2015 Difficult to read trade press without

More information

7 WAYS HOW DESIGN THINKING CAN BOOST INSURANCE BUSINESS

7 WAYS HOW DESIGN THINKING CAN BOOST INSURANCE BUSINESS 7 WAYS HOW DESIGN THINKING CAN BOOST INSURANCE BUSINESS The definition of stupidity is doing the same thing everyday and yet expecting different results, as Einstein stated. To get different results, you

More information

Website Promotion for Voice Actors: How to get the Search Engines to give you Top Billing! By Jodi Krangle http://www.voiceoversandvocals.

Website Promotion for Voice Actors: How to get the Search Engines to give you Top Billing! By Jodi Krangle http://www.voiceoversandvocals. Website Promotion for Voice Actors: How to get the Search Engines to give you Top Billing! By Jodi Krangle http://www.voiceoversandvocals.com Why have a website? If you re busier than you d like to be

More information

LEAD CONVERSION SECRETS OF TOP ADVISORS

LEAD CONVERSION SECRETS OF TOP ADVISORS LEAD CONVERSION SECRETS OF TOP ADVISORS Introduction When you re in the insurance business, you re in the business of selling something that everyone needs: protection for their families and assets. As

More information

THE NEXT AD BIDDING GUIDE AN EASY GUIDE TO HELP YOU OPTIMISE YOUR BIDDING STRATEGY

THE NEXT AD BIDDING GUIDE AN EASY GUIDE TO HELP YOU OPTIMISE YOUR BIDDING STRATEGY THE NEXT AD BIDDING GUIDE AN EASY GUIDE TO HELP YOU OPTIMISE YOUR BIDDING STRATEGY Bidding strategy 3 steps for setting up your bidding strategy 1 Define your business goal 2 Choose your bidding strategy

More information

CS 40 Computing for the Web

CS 40 Computing for the Web CS 40 Computing for the Web Art Lee January 20, 2015 Announcements Course web on Sakai Homework assignments submit them on Sakai Email me the survey: See the Announcements page on the course web for instructions

More information

CS4025: Pragmatics. Resolving referring Expressions Interpreting intention in dialogue Conversational Implicature

CS4025: Pragmatics. Resolving referring Expressions Interpreting intention in dialogue Conversational Implicature CS4025: Pragmatics Resolving referring Expressions Interpreting intention in dialogue Conversational Implicature For more info: J&M, chap 18,19 in 1 st ed; 21,24 in 2 nd Computing Science, University of

More information

CPS221 Lecture: Cloud Computing last revised 10/22/14 Objectives

CPS221 Lecture: Cloud Computing last revised 10/22/14 Objectives CPS221 Lecture: Cloud Computing last revised 10/22/14 Objectives 1. To introduce the notion of cloud computing 2. To define the terms Software as a Service, Platform as a Service, and Infrastructure as

More information

Collecting Polish German Parallel Corpora in the Internet

Collecting Polish German Parallel Corpora in the Internet Proceedings of the International Multiconference on ISSN 1896 7094 Computer Science and Information Technology, pp. 285 292 2007 PIPS Collecting Polish German Parallel Corpora in the Internet Monika Rosińska

More information

Can you briefly describe, for those listening to the podcast, your role and your responsibilities at Facebook?

Can you briefly describe, for those listening to the podcast, your role and your responsibilities at Facebook? The Audience Measurement Event Speaker s Spotlight Series Featured Speaker: Fred Leach, Facebook Interviewer: Joel Rubinson, President, Rubinson Partners Can you briefly describe, for those listening to

More information

INDEX. Introduction Page 3. Methodology Page 4. Findings. Conclusion. Page 5. Page 10

INDEX. Introduction Page 3. Methodology Page 4. Findings. Conclusion. Page 5. Page 10 FINDINGS 1 INDEX 1 2 3 4 Introduction Page 3 Methodology Page 4 Findings Page 5 Conclusion Page 10 INTRODUCTION Our 2016 Data Scientist report is a follow up to last year s effort. Our aim was to survey

More information

Courtesy of: VREB Virtual Real Estate Brokerage

Courtesy of: VREB Virtual Real Estate Brokerage Courtesy of: VREB Virtual Real Estate Brokerage Why Go Mobile? In today s world almost every industry is becoming more mobile friendly because of the huge increase in tablet and smart phone usage. The

More information

ARE YOU SPENDING YOUR PPC BUDGET WISELY? BEST PRACTICES AND CREATIVE TIPS FOR PPC BUDGET MANAGEMENT

ARE YOU SPENDING YOUR PPC BUDGET WISELY? BEST PRACTICES AND CREATIVE TIPS FOR PPC BUDGET MANAGEMENT ARE YOU SPENDING YOUR PPC BUDGET WISELY? BEST PRACTICES AND CREATIVE TIPS FOR PPC BUDGET MANAGEMENT In pay-per-click marketing, as with so many things in life, you have to spend money to make money. But

More information

EAS Basic Outline. Overview

EAS Basic Outline. Overview EAS Basic Outline Overview This is the course outline for your English Language Basic Course. This course is delivered at pre intermediate level of English, and the course book that you will be using is

More information

Sentimental Analysis using Hadoop Phase 2: Week 2

Sentimental Analysis using Hadoop Phase 2: Week 2 Sentimental Analysis using Hadoop Phase 2: Week 2 MARKET / INDUSTRY, FUTURE SCOPE BY ANKUR UPRIT The key value type basically, uses a hash table in which there exists a unique key and a pointer to a particular

More information

Last time we had arrived at the following provisional interpretation of Aquinas second way:

Last time we had arrived at the following provisional interpretation of Aquinas second way: Aquinas Third Way Last time we had arrived at the following provisional interpretation of Aquinas second way: 1. 2. 3. 4. At least one thing has an efficient cause. Every causal chain must either be circular,

More information

A free guide for readers of Double Your Business. By Lee Duncan www.double Your Business.com

A free guide for readers of Double Your Business. By Lee Duncan www.double Your Business.com 7 Factors to increase the leads you generate from Pay Per Click advertising with Google Adwords. Exclusively for readers of Double Your Business. A free guide for readers of Double Your Business By Lee

More information

Effective Monetization of Music on Mobile

Effective Monetization of Music on Mobile Effective Monetization of Music on Mobile Every year, the music industry suffers huge losses from piracy and illegal downloads. To minimize these losses and achieve strong growth, the industry needs to

More information

Supply Chain Management 100 Success Secrets

Supply Chain Management 100 Success Secrets Supply Chain Management 100 Success Secrets Supply Chain Management 100 Success Secrets - 100 Most Asked Questions: The Missing SCM Software, Logistics, Solution, System and Process Guide Lance Batten

More information

Text Analytics with Ambiverse. Text to Knowledge. www.ambiverse.com

Text Analytics with Ambiverse. Text to Knowledge. www.ambiverse.com Text Analytics with Ambiverse Text to Knowledge www.ambiverse.com Version 1.0, February 2016 WWW.AMBIVERSE.COM Contents 1 Ambiverse: Text to Knowledge............................... 5 1.1 Text is all Around

More information

Busn 135 Syllabus. Business Math using Excel. (Syllabus subject to change)

Busn 135 Syllabus. Business Math using Excel. (Syllabus subject to change) Busn 135 Syllabus Business Math using Excel (Syllabus subject to change) To Get Started In This Class, Busn 135:... 2 Computer Skill Requirements For This Class:... 2 Computer Hardware & Software Requirements:...

More information

Big Data Storage, Management and challenges. Ahmed Ali-Eldin

Big Data Storage, Management and challenges. Ahmed Ali-Eldin Big Data Storage, Management and challenges Ahmed Ali-Eldin (Ambitious) Plan What is Big Data? And Why talk about Big Data? How to store Big Data? BigTables (Google) Dynamo (Amazon) How to process Big

More information

Social Media Mining. Data Mining Essentials

Social Media Mining. Data Mining Essentials Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers

More information

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc Big Data, Fast Data, Complex Data Jans Aasman Franz Inc Private, founded 1984 AI, Semantic Technology, professional services Now in Oakland Franz Inc Who We Are (1 (2 3) (4 5) (6 7) (8 9) (10 11) (12

More information

GRAPHIC DESIGN 1, ART 2541. Spring Semester, 2014 Washington Hall, 1st Floor, Lab/Room 158 Mondays and Wednesdays, 5:20 p.m. 7:50 p.m.

GRAPHIC DESIGN 1, ART 2541. Spring Semester, 2014 Washington Hall, 1st Floor, Lab/Room 158 Mondays and Wednesdays, 5:20 p.m. 7:50 p.m. GRAPHIC DESIGN 1, ART 2541 Spring Semester, 2014 Washington Hall, 1st Floor, Lab/Room 158 Mondays and Wednesdays, 5:20 p.m. 7:50 p.m. Instructor: Todd Beasley Office Location: Washington Hall, 2 nd Floor

More information

It s Time to Write Your Business Plan By Jim Mulligan

It s Time to Write Your Business Plan By Jim Mulligan It s Time to Write Your Business Plan By Jim Mulligan If you re looking to start a business, the thought of developing a business plan might seem daunting. Some even question the value of spending time

More information

Web Design & Development

Web Design & Development Web Design & Development In Simplicity, Lies Beauty. - DigitalKrafts About Us The Internet is an ever changing environment that demands that you keep up with the latest and greatest communication platforms.

More information

Statistical Machine Translation: IBM Models 1 and 2

Statistical Machine Translation: IBM Models 1 and 2 Statistical Machine Translation: IBM Models 1 and 2 Michael Collins 1 Introduction The next few lectures of the course will be focused on machine translation, and in particular on statistical machine translation

More information

How to Meet EDI Compliance with Cloud ERP

How to Meet EDI Compliance with Cloud ERP How to Meet EDI Compliance with Cloud ERP Lincoln: This is Trek Talk, the Cloud ERP podcast and today s topic is Advantages of an EDI Compliant Cloud ERP. With cloud ERP you can meet your goals for EDI

More information

Email: [email protected] Office: LSK 5045 Begin subject: [ISOM3360]...

Email: justinjia@ust.hk Office: LSK 5045 Begin subject: [ISOM3360]... Business Intelligence and Data Mining ISOM 3360: Spring 2015 Instructor Contact Office Hours Course Schedule and Classroom Course Webpage Jia Jia, ISOM Email: [email protected] Office: LSK 5045 Begin subject:

More information

Strategies For Setting Up Your Organisation For Success With Big Data. Kevin Long Business Development Director Teradata

Strategies For Setting Up Your Organisation For Success With Big Data. Kevin Long Business Development Director Teradata Strategies For Setting Up Your Organisation For Success With Big Data Kevin Long Business Development Director Teradata Agenda Developing a big data strategy and plan that is aligned with your organisation

More information

P1: OTA/XYZ P2: ABC c01 JWBT043/Goins December 4, 2008 14:53 Printer Name: Courier Westford, Westford, MA SECTION I

P1: OTA/XYZ P2: ABC c01 JWBT043/Goins December 4, 2008 14:53 Printer Name: Courier Westford, Westford, MA SECTION I SECTION I Real Estate Day Trading: ANewWaytoMakeBig Money Buying and Selling Houses the Same Day COPYRIGHTED MATERIAL CHAPTER 1 Click a Mouse, Sell a House: Real Estate Day Trading Is the Easiest and

More information

IBM Global Business Services Microsoft Dynamics CRM solutions from IBM

IBM Global Business Services Microsoft Dynamics CRM solutions from IBM IBM Global Business Services Microsoft Dynamics CRM solutions from IBM Power your productivity 2 Microsoft Dynamics CRM solutions from IBM Highlights Win more deals by spending more time on selling and

More information

Problem Solving Hands-on Labware for Teaching Big Data Cybersecurity Analysis

Problem Solving Hands-on Labware for Teaching Big Data Cybersecurity Analysis , 22-24 October, 2014, San Francisco, USA Problem Solving Hands-on Labware for Teaching Big Data Cybersecurity Analysis Teng Zhao, Kai Qian, Dan Lo, Minzhe Guo, Prabir Bhattacharya, Wei Chen, and Ying

More information

How to make the most of ebay Motors.

How to make the most of ebay Motors. autorevo.com 2013 Guide #06 How to make the most of ebay Motors. an ebay Motors guide by AutoRevo. Boost exposure and get more leads... With ebay Motors, even the smallest local dealer with a handful of

More information

The Need for Training in Big Data: Experiences and Case Studies

The Need for Training in Big Data: Experiences and Case Studies The Need for Training in Big Data: Experiences and Case Studies Guy Lebanon Amazon Background and Disclaimer All opinions are mine; other perspectives are legitimate. Based on my experience as a professor

More information

Tricks To Get The Click

Tricks To Get The Click Tricks To Get The Click 10 Tips for Writing Better PPC Text Ads If you want to sell products or generate leads online, a user- friendly, conversion- optimized website is Step 1. But when it comes to search

More information

"Breakthrough New Software Automates The Optimization Process To Get You A #1 Ranking - All With The Single Click Of A Button!"

Breakthrough New Software Automates The Optimization Process To Get You A #1 Ranking - All With The Single Click Of A Button! 7 Days To Massive Website Traffic - Day 5 "Breakthrough New Software Automates The Optimization Process To Get You A #1 Ranking - All With The Single Click Of A Button!" Let's get right to it... The software

More information

https://agency.governmentjobs.com/dakota/job_bulletin.cfm?jobid=1017820

https://agency.governmentjobs.com/dakota/job_bulletin.cfm?jobid=1017820 Page 1 of 5 DAKOTA COUNTY Employee Relations Administration Center, 1590 Highway 55 Hastings, MN 55033-2372 651.438.4435 http://www.dakotacounty.us INVITES APPLICATIONS FOR THE POSITION OF: Electronic

More information

You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required.

You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required. What is this course about? This course is an overview of Big Data tools and technologies. It establishes a strong working knowledge of the concepts, techniques, and products associated with Big Data. Attendees

More information

IMPORTANT NOTICE. This syllabus is provided only as an example of what you might find in my sixteen-week lecture course.

IMPORTANT NOTICE. This syllabus is provided only as an example of what you might find in my sixteen-week lecture course. 1 IMPORTANT NOTICE Each instructor has his/her own syllabus for a particular course and section. In addition, each instructor may alter a syllabus both during and between semesters. This syllabus is provided

More information

Brainstorm a bit with friends and colleagues and add in these ideas. You'll have thousands of keywords in a very short period of time.

Brainstorm a bit with friends and colleagues and add in these ideas. You'll have thousands of keywords in a very short period of time. MKKH Marketing & Consulting www.mkkhmarketing.com 1-888-324-3878 Adwords Survival Tips Advertising on Google's Adwords can best be described as operating in a hostile environment. Even though the search

More information

Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料

Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料 Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料 美 國 13 歲 學 生 用 Big Data 找 出 霸 淩 熱 點 Puri 架 設 網 站 Bullyvention, 藉 由 分 析 Twitter 上 找 出 提 到 跟 霸 凌 相 關 的 詞, 搭 配 地 理 位 置

More information

SCM, CRM, BI, and ICE

SCM, CRM, BI, and ICE I. SUPPLY CHAIN MANAGEMENT (SCM) For example, a company the size of Wal-Mart, with operations all over the world and tens of thousands of suppliers, supply chain management and ITbased supply chain management

More information

Big Data Analytics. An Introduction. Oliver Fuchsberger University of Paderborn 2014

Big Data Analytics. An Introduction. Oliver Fuchsberger University of Paderborn 2014 Big Data Analytics An Introduction Oliver Fuchsberger University of Paderborn 2014 Table of Contents I. Introduction & Motivation What is Big Data Analytics? Why is it so important? II. Techniques & Solutions

More information

Machine Learning using MapReduce

Machine Learning using MapReduce Machine Learning using MapReduce What is Machine Learning Machine learning is a subfield of artificial intelligence concerned with techniques that allow computers to improve their outputs based on previous

More information

Social Media 101. The Basics of Social Media

Social Media 101. The Basics of Social Media Social Media 101 The Basics of Social Media Constant Contact 2014 Constant Contact 2014 2 Why are we here today? Constant Contact 2014 3 You are not alone 54% 57% of small businesses of nonprofits need

More information

Technical challenges in web advertising

Technical challenges in web advertising Technical challenges in web advertising Andrei Broder Yahoo! Research 1 Disclaimer This talk presents the opinions of the author. It does not necessarily reflect the views of Yahoo! Inc. 2 Advertising

More information

Introduction. Principle 1: Architects focus on what is essential. A Pragmatic View on Enterprise Architecture

Introduction. Principle 1: Architects focus on what is essential. A Pragmatic View on Enterprise Architecture 1 A Pragmatic View on Enterprise Architecture by Danny Greefhorst Published: June 1, 2012 (Article URL: http://www.tdan.com/view-articles/16108) This article from Danny Greefhorst describes some principles

More information

The key to knowing the best price is to fully understand consumer behavior.

The key to knowing the best price is to fully understand consumer behavior. A price optimization tool designed for small to mid-size companies to optimize infrastructure and determine the perfect price point per item in any given week DEBORAH WEINSWIG Executive Director- Head,

More information

E-mail Marketing for Martial Arts Schools:

E-mail Marketing for Martial Arts Schools: E-mail Marketing for Martial Arts Schools: Tips, Tricks, and Strategies That Will Send a Flood of New Students into Your School Practically Overnight! By Michael Parrella CEO of Full Contact Online Marketing

More information

GUIDE TO GOOGLE ADWORDS

GUIDE TO GOOGLE ADWORDS GUIDE TO GOOGLE ADWORDS How to use Google Adwords to drive relevant traffic to your website 2 April 2012 Version 1.0 Contents Contents 2 Introduction 4 Skill Level 4 Terminology 4 Video Tutorials 5 What

More information

CIKM 2015 Melbourne Australia Oct. 22, 2015 Building a Better Connected World with Data Mining and Artificial Intelligence Technologies

CIKM 2015 Melbourne Australia Oct. 22, 2015 Building a Better Connected World with Data Mining and Artificial Intelligence Technologies CIKM 2015 Melbourne Australia Oct. 22, 2015 Building a Better Connected World with Data Mining and Artificial Intelligence Technologies Hang Li Noah s Ark Lab Huawei Technologies We want to build Intelligent

More information

Strategies for Effective Tweeting: A Statistical Review

Strategies for Effective Tweeting: A Statistical Review Strategies for Effective Tweeting: A Statistical Review DATA REPORT Introduction 3 Methodology 4 Weekends Are Good for Relaxing and Tweeting 5 Best Days to Tweet By Industry 6 When Followers Are Busy Give

More information

Can people find your business online easily?

Can people find your business online easily? Can people find your business online easily? Is your company competitively positioned to reach more than a billion people who have access to the Internet? Do you have the tools and resources to position

More information

To reduce or not to reduce, that is the question

To reduce or not to reduce, that is the question To reduce or not to reduce, that is the question 1 Running jobs on the Hadoop cluster For part 1 of assignment 8, you should have gotten the word counting example from class compiling. To start with, let

More information

1001ICT Introduction To Programming Lecture Notes

1001ICT Introduction To Programming Lecture Notes 1001ICT Introduction To Programming Lecture Notes School of Information and Communication Technology Griffith University Semester 2, 2015 1 3 A First MaSH Program In this section we will describe a very

More information

Using Data Mining and Machine Learning in Retail

Using Data Mining and Machine Learning in Retail Using Data Mining and Machine Learning in Retail Omeid Seide Senior Manager, Big Data Solutions Sears Holdings Bharat Prasad Big Data Solution Architect Sears Holdings Over a Century of Innovation A Fortune

More information

PAY-PER-CLICK CALL TRACKING. How Call Tracking Data Can Improve & Optimize Your PPC Strategy

PAY-PER-CLICK CALL TRACKING. How Call Tracking Data Can Improve & Optimize Your PPC Strategy PAY-PER-CLICK CALL TRACKING & How Call Tracking Data Can Improve & Optimize Your PPC Strategy Table of Contents Introduction 3 What is PPC? 4 Google AdWords Features 6 What is Call Tracking? 8 Using Call

More information

Defending Networks with Incomplete Information: A Machine Learning Approach. Alexandre Pinto [email protected] @alexcpsec @MLSecProject

Defending Networks with Incomplete Information: A Machine Learning Approach. Alexandre Pinto alexcp@mlsecproject.org @alexcpsec @MLSecProject Defending Networks with Incomplete Information: A Machine Learning Approach Alexandre Pinto [email protected] @alexcpsec @MLSecProject Agenda Security Monitoring: We are doing it wrong Machine Learning

More information