Big Data Challenges for Information Retrieval


Slide 1/8
Big Data Challenges for Information Retrieval
Christina Lioma
Department of Computer Science, Faculty of Science, University of Copenhagen

Slide 2/8
Information Retrieval: needles in haystacks

Branch of computer science behind search engines: find information among large, noisy, heterogeneous data.

- a known needle in a known haystack
- a known needle in an unknown haystack
- an unknown needle in an unknown haystack
- any needle in a haystack
- the sharpest needle in a haystack
- most of the sharpest needles in a haystack
- all the needles in a haystack
- affirmation of no needles in the haystack
- things like needles in any haystack
- let me know whenever a new needle shows up
- where are the haystacks?
- needles, haystacks - whatever

Slide 3/8
Search engines in a nutshell

Three main types of ingredients (features):
1. Words: text semantics can be approximated by word frequencies
2. Web structure: if enough people point to something, it must be good and (probably) relevant
3. Users: search behaviour, click behaviour, dwell behaviour

User queries: distribution over features (INPUT)
Indexed documents: distribution over features (INPUT)
Ranking: comparing distributions (OUTPUT)
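
The idea of ranking by comparing distributions can be illustrated with a minimal sketch. It assumes only the first feature type (words), represents the query and each document as a smoothed word-frequency distribution, and ranks documents by KL divergence from the query. The slides do not name a specific comparison measure, so KL divergence, add-one smoothing, and all function names and sample documents here are illustrative assumptions.

```python
import math
from collections import Counter


def word_distribution(text, vocabulary, smoothing=1.0):
    """Smoothed relative word frequencies over a fixed vocabulary (assumed model)."""
    counts = Counter(text.lower().split())
    total = sum(counts[w] for w in vocabulary) + smoothing * len(vocabulary)
    return {w: (counts[w] + smoothing) / total for w in vocabulary}


def kl_divergence(p, q):
    """KL(p || q): a smaller value means q is closer to p."""
    return sum(p[w] * math.log(p[w] / q[w]) for w in p if p[w] > 0)


def rank(query, documents):
    """Return document ids ordered from closest to farthest from the query."""
    texts = [query, *documents.values()]
    vocabulary = sorted({w for text in texts for w in text.lower().split()})
    query_dist = word_distribution(query, vocabulary)
    scores = {
        doc_id: kl_divergence(query_dist, word_distribution(text, vocabulary))
        for doc_id, text in documents.items()
    }
    return sorted(scores, key=scores.get)


# Hypothetical toy collection, for illustration only.
docs = {
    "d1": "needle in a haystack search engine",
    "d2": "cooking pasta with tomato sauce",
    "d3": "search engines index web pages",
}
ranking = rank("search engine", docs)
print(ranking)
```

A production engine would combine all three feature types and use far more careful smoothing; the sketch only shows the input/output shape described on the slide.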

Slide 4/8
Anno 2013

Realtime indexing: 20 billion pages crawled per day
Instant search: retrieval time < 0.3 sec, faster than human typing
Zero query search: try to retrieve information before you know what you are looking for, based on user profiling

In terms of scale:
50 billion indexed webpages
3 billion search requests per day, Google alone (world population: ca. 7 billion people)

Data-driven technology
Big Data challenges:
1. Long Data
2. Your Data
3. Small Data Thinking
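
To put the daily figures above in per-second terms, the following short calculation may help. It uses only the numbers quoted on the slide; the variable names are illustrative, and the results are averages that ignore daily peaks.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400

# Figures quoted on the slide (2013).
pages_crawled_per_day = 20e9
search_requests_per_day = 3e9
world_population = 7e9

crawl_rate = pages_crawled_per_day / SECONDS_PER_DAY
query_rate = search_requests_per_day / SECONDS_PER_DAY
queries_per_person = search_requests_per_day / world_population

print(f"{crawl_rate:,.0f} pages crawled per second on average")
print(f"{query_rate:,.0f} search requests per second on average")
print(f"{queries_per_person:.2f} search requests per person per day")
```

On average this works out to roughly 231,000 pages crawled and 35,000 queries answered every second.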

Slide 5/8
Big data challenge 1: long data

Long as in longitudinal: spanning over time.
The problem is not the range but the intervals: dynamic streams of data coming in with timestamps per < seconds

Implications to search engines:
- time-versioned indexing: fine-grained updates & threaded associations
- time-travel queries: what is relevant depends on when
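
A time-travel query can be sketched as a lookup against an index that keeps every timestamped version of a document. The class below is a toy, in-memory illustration under that assumption; the class name, method names, and sample data are hypothetical and not taken from the slides.

```python
import bisect


class TimeVersionedIndex:
    """Toy index that keeps every timestamped version of each document."""

    def __init__(self):
        self._timestamps = {}  # doc_id -> sorted list of version timestamps
        self._versions = {}    # doc_id -> texts aligned with the timestamps

    def add(self, doc_id, timestamp, text):
        """Record a new version, keeping versions ordered by timestamp."""
        times = self._timestamps.setdefault(doc_id, [])
        texts = self._versions.setdefault(doc_id, [])
        pos = bisect.bisect_right(times, timestamp)
        times.insert(pos, timestamp)
        texts.insert(pos, text)

    def as_of(self, doc_id, timestamp):
        """Return the latest version at or before timestamp, or None."""
        times = self._timestamps.get(doc_id, [])
        pos = bisect.bisect_right(times, timestamp)
        return self._versions[doc_id][pos - 1] if pos else None

    def search(self, term, timestamp):
        """Time-travel query: ids of documents containing term as of timestamp."""
        hits = []
        for doc_id in self._timestamps:
            text = self.as_of(doc_id, timestamp)
            if text is not None and term.lower() in text.lower().split():
                hits.append(doc_id)
        return sorted(hits)


index = TimeVersionedIndex()
index.add("page", 10, "election polls open")
index.add("page", 20, "election results announced")

print(index.search("results", 15))  # the version at time 10 is the one in force
print(index.search("results", 25))  # the version at time 20 is the one in force
```

The same query term matches or misses depending on the query time, which is the sense in which "what is relevant depends on when". A real engine would version postings lists rather than whole documents.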

Slide 6/8
Big data challenge 2: your data

Personalisation. Can of worms.
We can collect your data BUT it is safer not to personalise rather than annoy you...

Big data implications:
- Personalised data on two axes: individual (e.g. user click-through, preferences, history) and social (e.g. Twitter, Facebook, blogs)
- Search engines must translate all this data into a single user state reflecting user preferences
- This state needs to be updated dynamically with every new input, but also remain consistent and below the nuisance threshold
- The larger and noisier the input, the harder to keep this balance
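
One simple way to picture a user state that is updated with every new input yet stays consistent is an exponential moving average over topic weights. This is only a sketch under that assumption: the slides do not specify an update rule, and the function name, the `rate` parameter, and the click signals below are all hypothetical.

```python
def update_user_state(state, signal, rate=0.1):
    """Blend a new signal into the state.

    A small rate keeps the state stable (consistent); a large rate makes it
    follow each new input closely (responsive but noisy).
    """
    topics = set(state) | set(signal)
    return {
        topic: (1 - rate) * state.get(topic, 0.0) + rate * signal.get(topic, 0.0)
        for topic in topics
    }


# Hypothetical click signals: two football clicks, then one cooking click.
state = {}
for click in [{"football": 1.0}, {"football": 1.0}, {"cooking": 1.0}]:
    state = update_user_state(state, click)

print(state)
```

With a small `rate`, a single off-topic click shifts the state only slightly, which is one way to trade responsiveness against consistency when the input is large and noisy.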

Slide 7/8
Big data challenge 3: small data thinking

R&D in information retrieval: clear division between efficiency and effectiveness.

Efficiency: index compression, reducing lookup time, query caching... Is not always on-topic.
Effectiveness: accurate feature extraction, personalisation, relevance... Does not always scale.

Slide 8/8
Sources

Haystack image, page 2:
Needles in haystack metaphor, page 2: Matthew Koll, Bulletin of the American Society for Information Science, Vol. 2, No. 2, December/January 2000
Typewriter image, page 3: Copyright: Roberto Zilli,, ID: , available from
Distributions image, page 3: Source: Edgar Meij, Large-scale Data Processing for Information Retrieval, 2012
Tweets image, page 5: Source:
Can of worms image, page 6: Copyright: munchester2cool, available from
Efficiency vs. effectiveness image, page 7:


Tuning poor performing SQL s Using Oracle 10g Enterprise Manager s Automatic SQL Tuning Advisor Tuning poor performing SQL s Using Oracle 10g Enterprise Manager s Automatic SQL Tuning Advisor Version 1.0 Tuning poor performing SQL s using Oracle 10g Enterprise Manager s Automatic SQL Tuning Advisor.

More information

Quick Guide: Selecting ICT Tools for your Business

Quick Guide: Selecting ICT Tools for your Business Quick Guide: Selecting ICT Tools for your Business This Quick Guide is one of a series of information products targeted at small to medium sized businesses. It is designed to help businesses better understand,

More information

Small Business Internet Marketing. Just What You Want to Know (So, What Do You Want to Know?)

Small Business Internet Marketing. Just What You Want to Know (So, What Do You Want to Know?) Small Business Internet Marketing Just What You Want to Know (So, What Do You Want to Know?) During this presentation we re going to talk about the 3 Biggest Secrets and 3 Biggest Mistakes in marketing

More information

Finding a needle in Haystack: Facebook s photo storage IBM Haifa Research Storage Systems

Finding a needle in Haystack: Facebook s photo storage IBM Haifa Research Storage Systems Finding a needle in Haystack: Facebook s photo storage IBM Haifa Research Storage Systems 1 Some Numbers (2010) Over 260 Billion images (20 PB) 65 Billion X 4 different sizes for each image. 1 Billion

More information

Why Social Media Marketing?

Why Social Media Marketing? OMG LOL Why Social Media Marketing? 10 reasons to take your marketing to the next level 2011 Constant Contact, Inc. 11-2286 BEST PRACTICES GUIDE SOCIAL MEDIA MARKETING By now, you ve probably heard everyone

More information

Big Data Patterns. Ron Bodkin Founder and President, Think Big

Big Data Patterns. Ron Bodkin Founder and President, Think Big Big Data Patterns Ron Bodkin Founder and President, Think Big 1 About Me Ron Bodkin Founder and President, Think Big I have 9 years experience working with Big Data and Hadoop. In 2010, I founded Think

More information

Computational Advertising Andrei Broder Yahoo! Research. SCECR, May 30, 2009

Computational Advertising Andrei Broder Yahoo! Research. SCECR, May 30, 2009 Computational Advertising Andrei Broder Yahoo! Research SCECR, May 30, 2009 Disclaimers This talk presents the opinions of the author. It does not necessarily reflect the views of Yahoo! Inc or any other

More information

Cloud and Big Data Summer School, Stockholm, Aug. 2015 Jeffrey D. Ullman

Cloud and Big Data Summer School, Stockholm, Aug. 2015 Jeffrey D. Ullman Cloud and Big Data Summer School, Stockholm, Aug. 2015 Jeffrey D. Ullman 2 In a DBMS, input is under the control of the programming staff. SQL INSERT commands or bulk loaders. Stream management is important

More information

IT Tools for SMEs and Business Innovation

IT Tools for SMEs and Business Innovation Purpose This Quick Guide is one of a series of information products targeted at small to medium sized enterprises (SMEs). It is designed to help SMEs better understand, and take advantage of, new information

More information

A Study on the Collection Site Profiling and Issue-detection Methodology for Analysis of Customer Feedback on Social Big Data

A Study on the Collection Site Profiling and Issue-detection Methodology for Analysis of Customer Feedback on Social Big Data , pp. 169-178 http://dx.doi.org/10.14257/ijsh.2014.8.6.16 A Study on the Collection Site Profiling and Issue-detection Methodology for Analysis of Customer Feedback on Social Big Data Eun-Jee Song 1 and

More information

Authenticating and policing the internet for consumer confidence and security

Authenticating and policing the internet for consumer confidence and security Authenticating and policing the internet for consumer confidence and security Secure On-Line ID Introduction Unique zero intervention at a glance solution Built on positive site validation Allows policing

More information

Digital Marketing Capabilities

Digital Marketing Capabilities Digital Marketing Capabilities Version : 1.0 Date : 17-Apr-2015 Company Framework Focus on ROI 2 Introduction SPACECOS is a leading IT services and marketing solutions provider. We provide the winning

More information

Search Engine Optimization & Social Media

Search Engine Optimization & Social Media Search Engine Optimization & Social Media 1.10.2014, School of Management Fribourg Evelyn Thar, CEO Amazee Metrics Amazee Metrics AG / Förrlibuckstr. 30 / 8005 Zürich / evelyn.thar@amazeemetrics.com Agenda

More information

INTERSEC BENCHMARK. High Performance for Fast Data & Real-Time Analytics Part I: Vs Hadoop

INTERSEC BENCHMARK. High Performance for Fast Data & Real-Time Analytics Part I: Vs Hadoop INTERSEC BENCHMARK High Performance for Fast Data & Real-Time Analytics Part I: Vs Hadoop BENCHMARK VS HADOOP (STAND ALONE OR COMBINED) Intersec solution in a Redhat Openstack NFV framework complements

More information

Ashish R. Jagdale, Kavita V. Sonawane, Shamsuddin S. Khan

Ashish R. Jagdale, Kavita V. Sonawane, Shamsuddin S. Khan International Journal of Scientific & Engineering Research, Volume 5, Issue 7, July-2014 1156 Data Mining and Data Pre-processing for Big Data Ashish R. Jagdale, Kavita V. Sonawane, Shamsuddin S. Khan

More information

So what is this session all about?

So what is this session all about? 1 So what is this session all about? In this session we will be looking to understand the key aspects of the digital marketing mix with specific emphasis on digital communications techniques. This session

More information

Carbon Dating the Web

Carbon Dating the Web Carbon Dating the Web: Estimating the Age of Web Resources Hany M. SalahEldeen & Michael L. Nelson Old Dominion University Department of Computer Science Web Science and Digital Libraries Lab. Hany SalahEldeen

More information

User Documentation SEO EXPERT

User Documentation SEO EXPERT The SEO Expert module helps you quickly: User Documentation SEO EXPERT Create and personalize Friendly URLs Insert Meta tags for product pages, Facebook posts and Twitter Cards in your shop This module

More information

Get Social: Engage Everyone with Exceptional Experiences. Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Get Social: Engage Everyone with Exceptional Experiences. Copyright 2012, Oracle and/or its affiliates. All rights reserved. Get Social: Engage Everyone with Exceptional Experiences 1 Key Trends Impacting Your World Social Multi-Channel Mobile Self-Service Personalization Consumerization 2 Every 60 Seconds 695,000 new searches

More information