Best Practices for Scaling a Big Data Analytics Project

Similar documents
Big Data BI and analytics: Tips and best practices for managing bigdata

Skills shortage, training present pitfalls for big data analytics

Big Data and the Data Warehouse

Data warehouse software bundles: tips and tricks

Tips to ensuring the success of big data analytics initiatives

E-Guide GROWING CYBER THREATS CHALLENGING COST REDUCTION AS REASON TO USE MANAGED SERVICES

Big Data business intelligence and analytics: Strategies for gleaning meaning from large data volumes

E-Guide CLOUD COMPUTING FACTS MAY UNCLENCH SERVER HUGGERS HOLD

Hybrid cloud computing explained

Advanced analytics key component for decision management systems

Rethink defense-in-depth security model

E-Guide BRINGING BIG DATA INTO A DATA WAREHOUSE ENVIRONMENT

E-Guide THE CHALLENGES BEHIND DATA INTEGRATION IN A BIG DATA WORLD

E-Guide CONSIDERATIONS FOR EFFECTIVE SOFTWARE LICENSE MANAGEMENT

ios7: 3 rd party or platform-enabled MAM? Taking a look behind the scenes with Jack Madden

Does consolidating multiple ERP systems make sense?

Benefits of virtualizing your network

E-Guide HOW THE VMWARE SOFTWARE DEFINED DATA CENTER WORKS: AN IAAS EXAMPLE

E-Guide THE LATEST IN SAN AND NAS STORAGE TRENDS

Making the move from a tactical to a strategic supply chain

E-Guide NETWORKING MONITORING BEST PRACTICES: SETTING A NETWORK PERFORMANCE BASELINE

How to Develop Cloud Applications Based on Web App Security Lessons

GUIDELINES FOR EVALUATING PROCUREMENT SOFTWARE

The State of Desktop Virtualization in 2013: Brian Madden analyzes uses cases, preferred vendors and effective tools

6 Point SIEM Solution Evaluation Checklist

Social Media-based Customer Loyalty Programs

E-Guide SIX ENTERPRISE CLOUD STORAGE AND FILE-SHARING SERVICES TO CONSIDER

Order Management System Best Practices

Social channels changing contact center certification

Exchange Server 2010 backup and recovery tips and tricks

HOW MICROSOFT AZURE AD USERS CAN EMPLOY SSO

Securing the SIEM system: Control access, prioritize availability

HOW TO SELECT THE BEST SOLID- STATE STORAGE ARRAY FOR YOUR ENVIRONMENT

5 free Exchange add-ons you should consider Eliminating administration pain points on a budget

E-Guide MANAGING AND MONITORING HYBRID CLOUD RESOURCE POOLS: 3 STEPS TO ENSURE OPTIMUM APPLICATION PERFORMANCE

Strategies for Writing a HIPAA-Friendly BYOD Policy

2013 Cloud Storage Expectations

Unlocking data with document capture and imaging

E-Guide WHAT IT MANAGERS NEED TO KNOW ABOUT RISKY FILE-SHARING

A Guide to MAM and Planning for BYOD Security in the Enterprise

Aligning Public Cloud Strategies to Improve Server Efficiency

How SSL-Encrypted Web Connections are Intercepted

How to Define SIEM Strategy, Management and Success in the Enterprise

Supply Chain Management Tips and Best Practices

Social media driving CRM strategies

Managing Data Center Growth Explore Your Options

E-Guide to Mobile Application Development

Solution Spotlight KEY OPPORTUNITIES AND PITFALLS ON THE ROAD TO CONTINUOUS DELIVERY

E-Guide VIDEO CONFERENCING SOFTWARE AND HARDWARE: HYBRID APPROACH NEEDED

E-Guide UNDERSTANDING PCI MOBILE PAYMENT PROCESSING SECURITY GUIDELINES

E-Guide SHAREPOINT UPGRADE BEST PRACTICES

Hyper-V 3.0: Creating new virtual data center design options Top four methods for deployment

Customer data analytics best practices from top performers

E-Guide BEST PRACTICES FOR CLOUD BASED DISASTER RECOVERY

Evaluating SaaS vs. on premise for ERP systems

Solution Spotlight BEST PRACTICES FOR DEVELOPING MOBILE CLOUD APPS REVEALED

Is Your Data Safe in the Cloud?

BUYING PROCESS FOR ALL-FLASH SOLID-STATE STORAGE ARRAYS

Expert guide to achieving data center efficiency How to build an optimal data center cooling system

Software Defined Networking Goes Well Beyond the Data Center

The changing face of scale-out networkattached

WHAT S INSIDE NEW HYPER- CONVERGED SYSTEMS

Managing the supply chain for SAP

Streamlining the move to the cloud. Key tips for selecting the right cloud tools and preparing your infrastructure for migration

Essentials Guide CONSIDERATIONS FOR SELECTING ALL-FLASH STORAGE ARRAYS

CLOUD APPLICATION INTEGRATION AND DEPLOYMENT MADE SIMPLE

Advantages on Green Cloud Computing

The skinny on storage clusters

Cloud Business Intelligence Trends to Watch

MOBILE APP DEVELOPMENT LEAPS FORWARD

Desktop virtualization: Best practices for a seamless deployment

E-Guide CONSIDER SECURITY IN YOUR DAILY BUSINESS OPERATIONS

Managing Virtual Desktop Environments

CLOUD SECURITY CERTIFICATIONS: HOW IMPORTANT ARE THEY?

E-Guide CRM: THE INTEGRATION AND CONSOLIDATION PAYOFF

The state of cloud adoption in India The use cases, industry trends, business demands, and user expectations driving cloud adoption in Indian

TIPS TO HELP EVALUATE AND DEPLOY FLASH STORAGE

Cloud Security Certification Guide What certification is right for you?

5 ways to leverage the free VMware hypervisor Key tips for working around the VMware cost barrier

Best Practices for Database Security

Preparing for the cloud: Understanding the infrastructure impacts Eight essential tips for a successful cloud migration

Key Trends in the Identity and Access Management Market and How CA IAM R12 Suite Addresses These Trends

FIVE PERVASIVE FLASH-BASED STORAGE MYTHS

Virtualization backup tools: How the field stacks up

HR Managers Focus on Recruiting Experience as War for Talent Intensifies

Key best practices for cloud testing

3 common cloud challenges eradicated with hybrid cloud

Up your game with a Predictive Analytics Program

LTO tape technology continues to evolve with LTO 5

E-Guide HOW A TOP E-COMMERCE STRATEGY LEADS TO STRONG SALES

How To Protect Your Online Backup From Being Hacked

E-Business Risk: The Coming SaaS As a Service

CALCULATING ROI FOR STORAGE VIRTUALIZATION IS TRICKY

- Solution Spotlight ACCELERATING APPLICATION DEPLOYMENT WITH DEVOPS

MDM features vs. native mobile security

E-Guide HADOOP MYTHS BUSTED

How To Manage Big Data

Cloud Storage: Top Concerns, Provider Considerations, and Application Candidates

Solution Spotlight PREPARING A DATABASE STRATEGY FOR BIG DATA

Transcription:

Best Practices for Scaling a Big Data Analytics Project

Putting an effective "big data" analytics plan in place can be a challenging proposition; thankfully, many proven data management and business intelligence best practices translate well to big data analytics. Discover best practices for scaling your big data project once you get started. Familiar Disciplines By: Beth Stackpole, Contributor With new terms, new skill sets, new products and new providers, the world of big data analytics can seem unfamiliar, but tried-and-true data management best practices do hold up well in this still-emerging discipline. As with any business intelligence (BI) and data warehouse initiative, experts say it s critical to have a clear understanding of an organization s data management requirements and a well-defined strategy before venturing too far down the big data analytics path. Big data analytics is widely hyped, and companies across all sectors are being flooded with new data sources and ever-larger amounts of information. Yet, making a big investment to attack the big data problem without first figuring out how doing so can really add value to the business is one of the most serious missteps for would-be users. Don t get too hung up on the technology -- start from a business perspective and have the conversation between the CIO, data scientists and businesspeople to figure out what the business objectives are and what value can be derived, and drive backwards from there, said David Menninger, an analyst at Ventana Research Inc. who focuses on BI, analytics and information management technologies. Defining exactly what data is available and mapping out how an organization can best leverage those resources is a key part of that exercise. CIOs, IT managers and BI and data warehouse professionals need to examine what Page 2 of 5

data is being retained, aggregated and utilized and compare that with what data is being thrown away, Menninger said. It s also critical, he added, to consider external data sources that are currently not being tapped but could be a compelling addition to the mix. Even if companies aren t sure how and when they plan to jump into big data analytics, there are benefits to going through this kind of an evaluation sooner rather than later, according to Menninger. And beginning the process of capturing data can also make you better prepared for the eventual leap. Even if you don t know what you re going to use it for, start capturing the information, he said. Otherwise, there is a missed opportunity, because you won t have that rich history of information [to draw on]. Start small with big data Analyzing big data sets is yet another instance where it makes sense to define small, high-value opportunities and use them as a starting point. As companies expand the data sources and types of information they re looking to analyze, and start to create the all-important analytical models that can help them uncover patterns and correlations in both structured and unstructured data, they need to be vigilant about homing in on the findings that are most important to their stated business objectives. If you end up in a place where all you re doing is looking for new patterns and you can t do anything with them, you ve hit a dead spot, said Gartner Inc. analyst Yvonne Genovese. ComScore Inc., a Reston, Va.-based company that tracks Internet usage and provides Web analytics and marketing intelligence services to corporate customers, knew early on that it would need some sort of big data strategy. But it picked very targeted spots and built out its big data analytics program over time. We started with small bites -- taking individual [data] flows and migrating them into different systems, said Will Duckworth, comscore s vice president of software engineering. If you re working with any kind of scale, you can t roll something like this out overnight. Page 3 of 5

Scale is something comscore is very conscious of, given the amount of data the company processes. Back in 2009, when it started collecting 300 million records a day, Duckworth began searching in earnest for a new set of systems and a technology infrastructure that could handle comscore s data processing needs -- now totaling 23 billion records a day and still growing -- in a far more cost-efficient fashion. but don t forget to think big Leveraging open source Hadoop technologies and emerging packaged analytics tools, Duckworth has been able to make the open source environment more familiar to business analysts trained in using SQL. He says companies need to consider scale as a primary factor when mapping out a big data analytics roadmap. You have to consider what the ramp-up will look like -- how much data will you be putting in six months from now, how many more servers will you need to handle that, is the software up to the task, he explained. People don t think about how much it is going to grow or how popular the solution might be once it s rolled into production. The other thing companies commonly lose sight of as they get enveloped in the new normal that is big data is that the old normal rules around data management still apply. Information governance practices are just as important today with the notion of big data as they were yesterday with data warehousing, said Marcus Collins, another Gartner analyst. Even though companies want flexibility in terms of processing, remember that information is a corporate asset and should be treated as such. Page 4 of 5

Free resources for technology professionals TechTarget publishes targeted technology media that address your need for information and resources for researching products, developing strategy and making cost-effective purchase decisions. Our network of technology-specific Web sites gives you access to industry experts, independent content and analysis and the Web s largest library of vendor-provided white papers, webcasts, podcasts, videos, virtual trade shows, research reports and more drawing on the rich R&D resources of technology providers to address market trends, challenges and solutions. Our live events and virtual seminars give you access to vendor neutral, expert commentary and advice on the issues and challenges you face daily. Our social community IT Knowledge Exchange allows you to share real world information in real time with peers and experts. What makes TechTarget unique? TechTarget is squarely focused on the enterprise IT space. Our team of editors and network of industry experts provide the richest, most relevant content to IT professionals and management. We leverage the immediacy of the Web, the networking and face-to-face opportunities of events and virtual events, and the ability to interact with peers all to create compelling and actionable information for enterprise IT professionals across all industries and markets. Related TechTarget Websites Page 5 of 5