Jay Buckingham Dynamic Signal jbuckingham@dynamicsignal.com



Similar documents
INDEX. Introduction Page 3. Methodology Page 4. Findings. Conclusion. Page 5. Page 10

Insight Data Science: Bridging the gap between academia and industry. Josiah Walton Physics Careers Seminar UIUC Department of Physics April 23, 2015

COMP9321 Web Application Engineering

Become a developer in 15 short weeks without having. to quit your job!

What do we do at Cimigo?

Sunnie Chung. Cleveland State University

5 Point Social Media Action Plan.

Big Data Executive Survey

Big Data and Analytics: Challenges and Opportunities

BUDT 758B-0501: Big Data Analytics (Fall 2015) Decisions, Operations & Information Technologies Robert H. Smith School of Business

Recruit Using Github, Quora, Dribbble & More

Using LinkedIn & Other Forms of Social Media as Job Search Tools and Ways to Brand Yourself

Strategies For Setting Up Your Organisation For Success With Big Data. Kevin Long Business Development Director Teradata

Interviewing for Software Jobs. Matt Papakipos Brown Math/CS 93 Slides presented at Brown University on October 7, 2014

Louis Gudema: Founder and President of Revenue + Associates

the beginner s guide to SOCIAL MEDIA METRICS

PR Bootcamp. The U.S. Eyes to. Jessica Hasson pulp PR STARTUP

Social Media Influencer Survey 2014

LinkedIn Tutorial. An Introduction to Today s Leading Job-Search Social Network

Hiring Manager. Intake Questionnaire

DATA EXPERTS MINE ANALYZE VISUALIZE. We accelerate research and transform data to help you create actionable insights

Better profile Better prospects The value of a positive social media presence in career planning. hays.be/socialmedia

WHAT IS SOCIAL RECRUITING? A Starter Guide Powered by

Big Data and Data Science: Behind the Buzz Words

Digital Analytics Checkup:

HOW TO GET MORE WEBSITE TRAFFIC. laurence learntocodewith.me

CIS4930/6930 Data Science: Large-scale Advanced Data Analysis Fall Daisy Zhe Wang CISE Department University of Florida

Britepaper. How to grow your business through events 10 easy steps

Using Social Media as a Recruiting Tool. Sasha Louati Adecco Staffing

SOCIAL MEDIA SUCCESS IN 14 STEPS

How to Build Online Brand Authority

Using Twitter for Business

WSI White Paper. Prepared by: Francois Muscat Search Engine Optimization Expert, WSI

UC Small Farm Program Agritourism Webinar Series. Social Media Kristin York - Instructor June 2, 2016

Postgraduate Computing at Goldsmiths

ONLINE REPUTATION MANAGEMENT

How to Win More Customers with LinkedIn Marketing Edition

How To Become A Data Scientist

CAP4773/CIS6930 Projects in Data Science, Fall 2014 [Review] Overview of Data Science

Essential Communication Methods for Today s Public Social Media 101. Presented by Tom D. Trimble, CIO Tulsa County Government

Small Business Trends

Role Description. Position of a Data Scientist Machine Learning at Fractal Analytics

Big Data Analytics Process & Building Blocks

Take Advantage of Social Media. Monitoring.

What is Data Science? Girl Develop It! Meetup Renée M. P. Teate, March 2015

How To Learn To Use Big Data

Big Data: calling for a new scope in the curricula of Computer Science. Dr. Luis Alfonso Villa Vargas

Google Resume Search Engine Optimization (SEO) and. LinkedIn Profile Search Engine Optimization (SEO) Jonathan Duarte

SEO 2.0 ADVANCED SEO TIPS & TECHNIQUES ABSTRACT»

The? Data: Introduction and Future

COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK DEPARTMENT OF INDUSTRIAL ENGINEERING AND OPERATIONS RESEARCH

11 Ways to Get. Authority Links for. Your New Blog

Doing Multidisciplinary Research in Data Science

Online Marketing Module COMP. Certified Online Marketing Professional. v2.0

Demystifying The Data Scientist

SURVEY REPORT DATA SCIENCE SOCIETY 2014

Myths of Digital Marketing

HOW TO TURN A. Single E-Book Into a 101+ Piece Content Marketing Machine

SEO OVERVIEW. We want you educated about SEO before we start! By Mary Cary NewMediaLegalMarketing / Video Blog Marketing / Video Studio Sausalito

White Paper. What is SMO? Optimize For Success: Social Media Optimization PAGE 1

The Developer Hiring Landscape 2015

GETTING THE CONTENT MARKETING JOB YOU WANT

Website Analysis: The Boston Consulting Group + Recommendations for an Improved User Experience

Sources: Summary Data is exploding in volume, variety and velocity timely

Propel Careers Career Development Seminars

CHIPPERFIELD MEDIA LLC. P.O. BOX 2469 SAUSALITO, CA

The Five Biggest MISSED Internet Marketing Opportunities Most Lawyers Don't Know About

Integrating a Big Data Platform into Government:

Speak Up 2015 Grade 6-12 Survey

Inbound Marketing. The Big Idea

MARKETING. What is Online Reputation Marketing? Why is ORM Important to your Business? Netforce Performance Marketing - Call Us Today!

CPS 216: Advanced Database Systems (Data-intensive Computing Systems) Shivnath Babu

DIGITAL MARKETING STRATEGY. Setting up, Raising Awareness For and Monitoring Social Media

Is your LinkedIn profile 100% complete? Is your LinkedIn profile keyword optimized and formatted for recruiter searches?

Title/Description/Keywords & Various Other Meta Tags Development

Teleconference information: Call-in toll-free number: (US) Conference Code: Webinar call-in number:

BEYOND POINT AND CLICK THE EXPANDING DEMAND FOR CODING SKILLS BURNING GLASS TECHNOLOGIES JUNE 2016

Transcription:

Jay Buckingham Dynamic Signal jbuckingham@dynamicsignal.com

Financial Times PeHub.com Wall Street Journal Harvard Business Review

Making use of vast amounts of data to: Discover what we don t know Obtain predictive, actionable insight Better serve your customers Improve your products Invent new products customers love

LinkedIn: People you might know Amazon: Recommendation Google: Selecting ads for you Pandora: Selecting music you like Dynamic Signal: Who is influential? Commonality: Understanding users

Statistics Machine Learning Map/Reduce

Which users are most important? The ones that others interact with (comments, shares, retweets, ) Rank them Find their best posts Demographics Who are these people?

~500M Twitter users We analyze ~20M How much data is that? 5 TB How long will it take to process? 3.0 GHz Xeon 190000 sec/tb => 2 days

Map/Reduce Spread job across many machines With 48 cores: 2 days => 2 hours

Brands want to know more about these important people Demographics Twitter doesn t know We can figure it out

Machine Learning MaxEnt Support Vector Machines Boosting Decision Trees

Data preparation Data presentation Experimentation Observation Data products

Data science is a blend of hacker s arts, statistics and machine learning and the expertise in mathematics and the domain of the data for the analysis to be interpretable It requires creative decisions and openmindedness in a scientific context.

LinkedIn: Recommendation systems IR, Statistics, Java, Hadoop Velti: Mobile ads MS/PhD Statistics, R, Machine Learning,Hadoop, SQL Drawbridge: Mobile ads MS/PhD Statistics, Lucene/SOLR, Java, Hadoop Twitter: Social media MS/PhD Statistics, Math, CS; R, Hadoop Pandora: Recommender systems BS/BA CS or Statistics, R, Python, Java, Hadoop

We're looking for people who love turning data into gold. Are you someone who solves hard problems by creatively obtaining, scrubbing, exploring, modeling and interpreting big data? Do you know enough about information retrieval, machine learning, and statistics to be dangerous? Are you a hands-on implementer, ready to learn new languages and technologies to turn your ideas into solutions used by tens of millions of people around the world? If your answer is Yes, show me the data! then this is the job for you. As data scientists, we work on some of the most exciting areas of computer and information science, including information extraction, recommendation systems, social network analysis, and network visualization. Moreover, we support core business needs and drive innovation. Interested? Here is what we are looking for: Interest in solving problems with big data. A good knowledge of information retrieval, data mining, or related field. Technical skills (e.g., Java, Hadoop, Pig) and interest in new ones. Creativity and good taste in problems with business impact. You're in your final year of your Bachelor's, Master's, or PhD program.

Statistics Problem solving Curiosity and Creativity Computing

Graph Theory Machine Learning NLP (a little) Software tools

Get and process your own data: Experience with APIs (Twitter, LinkedIn, FB) Hadoop (Java) C++ (Google, FB) SQL Explore your ideas quickly: Python Data visualization/presentation R, javascript

Detail what you want to learn Gather and store data Compute trending topics Presentation github Extra points: Use Hadoop More extra points: Use AWS Hadoop

Tech Companies Key people: VC, angels, Y-Combinator Tech press: Hacker News, GigaOM

Hilary Mason DJ Patil Hacker News GigaOM Paul Graham John Doerr Kleiner Perkins Nate Silver hilarymason.com, @hmason @dpatil @HackerNews @gigaom @paulg @johndoerr @kpcb @fivethirtyeight

There's now 3 Tesla S's, 1 Ferrari, 2 SLS's, and 1 M5 in my building. Guess the economy is doing well.

Study job postings for data scientists Become familiar with companies Develop key skills they need

What do you really want? What are you good at? What are you likely to be good at?

What you want to learn What you want to become expert at What area you want to work in

Companies have lots of projects They really need help They really need to hire data scientists You just need to make it easy for them Show you have the skills and You Get Things Done! 26

You are building your brand. Think carefully about what you want people to think when they think of you. Right now: World knows little about you So you control the message Choose wisely 27

Resume Dice LinkedIn Blog Facebook Twitter Github Good recruiter

Email address: Yes: jonsmith@gmail.com jon@jonsmith.com Not: pbrshotgun@aol.com wildnights69@hotmail.com Facebook Cleanse your posts Use for professional purposes only Twitter: professional posts only 29

Do your homework: What company does, key products Technologies they use Where they are headed Be clear about why you are there Clothes: Look professional Dress like a VC Be early Relax, meditation 30

How you d approach data task Statistics Machine Learning Data structures Algorithms Eg: Graph problem Cracking the Coding Interview (Gayle McDowell) 31

Ask engineers: Key product areas What data do they have? What have they done so far? Key things they want to learn from data Technologies Ask CEO / VP: Show interest in the business What will this company achieve? Biz plan, product plan, revenue plan Competitors Tell them how you will help them achieve these goals 32

Relax! Practice interviews Do NOT talk money at interview Do NOT take offers at lame companies! Treat recruiter well 33

More opportunities Build skills faster Build gravitas Build network

Great companies: Established: Google, Facebook, LinkedIn, Twitter Startups: Drawbridge, Dynamic Signal Area saturated with tech buzz Great learning opportunities Engineers are really valuable So treated better than in Pacific NW 35

Bay Area companies are using new technologies way ahead of others Especially Big Data tools: Hadoop, (Google: MapReduce) Many more machine learning projects Great classes at Stanford, Berkeley 36

Working at Bay Area startups gives you credibility Because you ve developed exciting new products with great teams in fast-moving teams at awesome companies. You ve worked on products using technologies way ahead of others 37

VCs CEOs Best data scientists and engineers So many tech companies near each other 38

Work for a startup: learn a lot and kill it Start to build your epic story Choose startup in Bay Area, not a big company Choose based what you will learn, experience you will build. Don t worry about money, stock yet 39

Choose job to: Learn new skills / technologies Have an impact You want to be key engineer on some project Compensation Paid well Good stock options 40

Bing Gordon: Kill it This is the time: Focus on career Get on projects where you can learn what you need to learn Later: Go for impact 41

Learn your craft Ask for help from people with broad skills. Esp Dev, PM, Business, CEO, VC Early on, tell folks you'll want to schedule time for coffee with them periodically to talk technology, project management, biz dev, strategy 42

Be professional at work Don't burn bridges Communicate with your manager, GM, VP Get mentors in company not in company Get out to professional gatherings Build network 43

The sexy job in the next ten years will be statisticians Hal Varian, Chief Economist, Google http://www.slideshare.net/jaybuckingham/data-science-opportunities

Read this: http://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21stcentury/ar/pr What does a data scientist do? http://www.slideshare.net/datasciencelondon/bigdata-sorry-data-science-what-does-a-data-scientist-do Data scientists: http://www.forbes.com/pictures/lmm45emkh/tim-oreilly-is-thefounder-of-oreily-media/ Data Science 101 blog: http://datascience101.wordpress.com/ Hammerbacher data science course at Berkeley: http://datascienc.es/ Hammerbacher on data science: http://www.quora.com/jeff- Hammerbacher/answers/Data-Science Stanford data mining courses: http://scpd.stanford.edu/public/category/coursecategorycertificateprofile.do?m ethod=load&certificateid=1209602

Berkeley class on techniques to collect and analyze Big Data using Twitter as an example: http://www.ischool.berkeley.edu/courses/i290-abdt https://blogs.ischool.berkeley.edu/i290-abdt-s12/ How LinkedIn has improved its Big Data tools: http://gigaom.com/2013/03/03/how-and-why-linkedin-is-becoming-anengineering-powerhouse/ Paper on using social media data to predict demographic characteristics: http://www.pnas.org/content/early/2013/03/06/1218772110.full.pdf Mary Meeker on internet data trends: http://www.kpcb.com/partner/marymeeker Intro to getting Twitter data with the Twitter API: https://dev.twitter.com/start https://dev.twitter.com/docs Example: how to get info about a Twitter user: https://dev.twitter.com/docs/api/1/get/users/lookup

Hadoop history: http://gigaom.com/2013/03/04/the-history-of-hadoop-from-4- nodes-to-the-future-of-data/ Why use Hadoop: http://www.slideshare.net/digitald/web-tech-6626629 Hadoop overview: http://www.slideshare.net/hadoop/practical-problem-solvingwith-apache-hadoop-pig Google MapReduce: http://static.googleusercontent.com/external_content/untrusted_dlcp/research.go ogle.com/en/us/archive/mapreduce-osdi04.pdf Google file system: http://en.wikipedia.org/wiki/google_file_system Hadoop history: http://gigaom.com/2013/03/04/the-history-of-hadoop-from-4- nodes-to-the-future-of-data/ Why use Hadoop: http://www.slideshare.net/digitald/web-tech-6626629

Andrew Moore s tutorials: http://www.autonlab.org/tutorials/ NLTK: nltk.org WordNet: wordnet.princeton.edu