Big Data for Investment Research Management

Similar documents
Big Data for Investment Research Management

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

Three Open Blueprints For Big Data Success

Cloudera Enterprise Data Hub in Telecom:

How To Handle Big Data With A Data Scientist

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

Databricks. A Primer

Data Governance in the Hadoop Data Lake. Michael Lang May 2015

Cisco Data Preparation

Virtualizing Apache Hadoop. June, 2012

The Inside Scoop on Hadoop

Big Data on Microsoft Platform

The Future of Data Management

Testing Big data is one of the biggest

Databricks. A Primer

This Symposium brought to you by

Embedded Analytics Vendor Selection Guide. A holistic evaluation criteria for your OEM analytics project

ORACLE PROCUREMENT AND SPEND ANALYTICS

The Power of Pentaho and Hadoop in Action. Demonstrating MapReduce Performance at Scale

Hortonworks and ODP: Realizing the Future of Big Data, Now Manila, May 13, 2015

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

BIG DATA SOLUTION DATA SHEET

Luncheon Webinar Series May 13, 2013

IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE

How Companies are! Using Spark

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP

Cost-Effective Business Intelligence with Red Hat and Open Source

The Power of the Unica Marketing Platform

Introduction to Big data. Why Big data? Case Studies. Introduction to Hadoop. Understanding Features of Hadoop. Hadoop Architecture.

So What s the Big Deal?

Analance Data Integration Technical Whitepaper

BIG DATA TECHNOLOGY. Hadoop Ecosystem

Qsoft Inc

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Innovation. Simplifying BI. On-Demand. Mobility. Quality. Innovative

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January Website:

Business Intelligence for Big Data

Cloud Integration and the Big Data Journey - Common Use-Case Patterns

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Big Data Visualization with JReport

HADOOP SOLUTION USING EMC ISILON AND CLOUDERA ENTERPRISE Efficient, Flexible In-Place Hadoop Analytics

Big Data Analytics Nokia

QUICK FACTS. Delivering a Unified Data Architecture for Sony Computer Entertainment America TEKSYSTEMS GLOBAL SERVICES CUSTOMER SUCCESS STORIES

Bringing agility to Business Intelligence Metadata as key to Agile Data Warehousing. 1 P a g e.

Analance Data Integration Technical Whitepaper

Bringing Big Data to People

Please give me your feedback

Big Data Open Source Stack vs. Traditional Stack for BI and Analytics

A Brief Introduction to Apache Tez

Accenture Foundation Platform for Oracle

Ganzheitliches Datenmanagement

Deploying an Operational Data Store Designed for Big Data

ORACLE PROJECT MANAGEMENT

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth

Data Integration Checklist

HDP Hadoop From concept to deployment.

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru

Oracle Value Chain Planning Inventory Optimization

Native Connectivity to Big Data Sources in MSTR 10

Lecture 32 Big Data. 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop

Advanced Big Data Analytics with R and Hadoop

Beyond Web Application Log Analysis using Apache TM Hadoop. A Whitepaper by Orzota, Inc.

NetGain Business Intelligence. Nick Johnson, Media & Workflow Specialist

Integrate Big Data into Business Processes and Enterprise Systems. solution white paper

The Clear Path to Business Intelligence

QUICK FACTS. Implementing a Big Data Solution on Behalf of a Media House TEKSYSTEMS GLOBAL SERVICES CUSTOMER SUCCESS STORIES

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Overview, Goals, & Introductions

SQL Server 2012 PDW. Ryan Simpson Technical Solution Professional PDW Microsoft. Microsoft SQL Server 2012 Parallel Data Warehouse

Big Data. Value, use cases and architectures. Petar Torre Lead Architect Service Provider Group. Dubrovnik, Croatia, South East Europe May, 2013

White Paper: What You Need To Know About Hadoop

HDP Enabling the Modern Data Architecture

Data Governance in the Hadoop Data Lake. Kiran Kamreddy May 2015

Pentaho Reporting Overview

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

Performance and Scalability Overview

How To Turn Big Data Into An Insight

Inventory Optimization for the Consumer Products Industry

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data

Advanced In-Database Analytics

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM

Oracle Primavera P6 Enterprise Project Portfolio Management Performance and Sizing Guide. An Oracle White Paper October 2010

Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software. SC13, November, 2013

IBM BigInsights Has Potential If It Lives Up To Its Promise. InfoSphere BigInsights A Closer Look

Information Architecture

Midsize retailers can now relax the nightmare of trying to keep up with the

BIG DATA TRENDS AND TECHNOLOGIES

Are You Big Data Ready?

Big Data Use Cases. To Start Today. Paul Scholey Sales Director, EMEA. 2013, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866)

#mstrworld. Tapping into Hadoop and NoSQL Data Sources in MicroStrategy. Presented by: Trishla Maru. #mstrworld

[Hadoop, Storm and Couchbase: Faster Big Data]

Oracle Big Data SQL Technical Update

Interactive data analytics drive insights

Transcription:

IDT Partners www.idtpartners.com Big Data for Investment Research Management Discover how IDT Partners helps Financial Services, Market Research, and Investment firms turn big data into actionable research insights.

Executive Summary IDT Partners was retained by a financial client to investigate a legacy Research Management System (RMS) and provide its team with a more robust research dataset. The client s investment research process includes supply chain [1] research and ongoing investment portfolio monitoring. The research information is collected and aggregated through proprietary supply chain data and by conducting channel checks. [2] The IDT Partners team worked with the client to understand the business function dependencies on the legacy RMS, explored what business value potential system improvements could bring, and identified what functionality, such as reporting, researchers and executive management needed. A proposal offering options for a new system featuring improved data management, business process automation, and an intuitive user interface was put forth by IDT Partners and approved by the client. IDT Partners Solution The IDT Partners team worked with the client to develop and deploy a new system. The new system featured a much needed intuitive User Interface (UI), improved Workflows, Business Process Automation (BPA), Supply Chain Trend Spotting Tools, Dashboards, Reporting, and many other improvements. Additionally, dozens of new data sources were added and a scalable Data Warehouse was created to store and manage historical data. IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 2 of 10

IDT Partners increased the client s research dataset eightyfold, from 50GB to over 4TB, while decreasing the data processing time from ten hours to less than two hours. The IDT Partner s solution has provided the client s research team with a technology-enabled competitive edge by improving the client s operational efficiency by 20%, significantly improving its business processes management and automation capabilities, and using big data processing to provide actionable research insights needed to support its investment research process. IDT Partners Solution Highlights 8,000% Larger Dataset 1,000% Increase in number of Data Sources 500% Faster Data Processing and Up-to-date Information 100% ROI within 12 months 20% Increase in Operational Efficiency Automated Trend Spotting & Monitoring with Big Data Business Process Management & Automation (BPM, BPA) Consistent Data with Automated Data De-duplication Intuitive and Streamlined User Interface (UI) Highly Customized Automated Reports Executive Summary Dashboards IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 3 of 10

Solution Overview: Research Management System (RMS) The following is an overview of a Research Management System (RMS) built by IDT Partners for a financial client. The system facilitates investment research, provides actionable insights, and helps support investment decisions by analyzing the supply chain [1] for companies in the client s investment portfolio. System Architecture A proprietary framework was used to build the RMS as a Service-Oriented Architecture (SOA). This has allowed for an increased flexibility in tool choice and made it easy to exchange information with client s internal systems (Business Intelligence, Compliance, Vendor Management, and Billing) via XML-based Web Services. Big Data Hadoop and MapReduce technology was used to convert multi-terabyte supplychain-related data sets into actionable research data and Business Intelligence (BI) tools used by client to help facilitate investment research and decisions. The solution analyzes over a billion supply chain data points on ongoing basis to provide insights into historical supply chain changes. The Hadoop infrastructure breaks up large data sets into "chunks" and coordinates the processing of the data out into the distributed, clustered IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 4 of 10

environment. The Hadoop infrastructure included the following core components: HDFS containers, MapReduce processing infrastructure, Hive for immutable table data store, and HBase for mutable table data store. The following diagram highlights the process of converting various data sources into actionable insights within the RMS. Data Sources Dozens of Proprietary and Commercial Supply Chain-related Data Sources (multi-terabyte) Data Ingestion Data Processing Hadoop Compute Cluster Scheduler Classification Master 1 Map Task Repository Deduplication Master 2 Admin UI Normalization Slave 1 Slave 2 Map Slave n Map Research Dataset Reduce Current Dataset Custom Trends DW BI Platform Research Management System IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 5 of 10

Key Technologies The following are some of the key technologies used for this solution. PHP MapReduce Linux Lucene LAMP HDFS Apache Solr JAVA HBase MongoDB ETL jquery Pig MSSQL SSIS Hadoop Hive MarkLogic Pentaho User Interface (UI) The solution leveraged Rich Internet Application (RIA) technology and intuitive User Interface (UI) design to allow clients to easily navigate through the supply chain and to manage the underlying research processes. The following are some of the features of the system. Intuitive upstream and downstream supply chain navigation with rollover informational popups Analysis of companies within each node in the supply chain Competitor information for companies within the supply chain Supply chain information for suppliers, vendors, manufacturers, customers, distributors Comparison matrix for any two entities in the supply chain Supply chain filter with dozens of criteria to filter companies based on research criteria IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 6 of 10

Supply chain search using hundreds of criteria including historical trends Sharing of screens, graphs, charts, and dashboards and research feedback capture Automated Trend Spotting & Monitoring IDT Partners developed a data warehouse and tools to keep track of historical supply chain changes to help identify supply chain trends. Automated trend spotting and monitoring has proved to be invaluable for the client s research team by automating and streamlining what had previously been a highly time consuming and error-prone manual process. Reporting The solution featured dozens of highly customized reports leveraging a much larger data set and the newly built data warehouse. Some of the available features included Custom Dashboards, Trend and Portfolio Monitoring Alerts, dozens of Highly Customized Reports, and Data Export (Excel, PDF, Email). Definitions The following are definitions for some domain-specific terminology referenced in this white paper. IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 7 of 10

[1] Supply Chain A supply chain is a system of organizations, people, technology, activities, information and resources involved in moving a product or service from supplier to customer. Supply chain activities transform natural resources, raw materials and components into a finished product that is delivered to the end customer. The following is an overview of an electronics manufacturing supply chain. Manufacturing Marketing R&D Design Assembly Distribution Sales The following simplified supply chain diagram is an example of how NVIDIA, a global technology company and GPU manufacturer, supplies directly to the customer, to PC vendors, and retailers. Each point in the supply chain may include hundreds of companies across many tiers. IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 8 of 10

Semiconductor Materials Direct PC Vendors Component Makers / Semiconduct or Foundries Customers Semiconductor Manufacturing Equipment Indirect PC Vendors Distributors Resellers / Retailers [2] Channel Check In financial analysis, a channel check is third-party research on a company's business based on collecting information from the distribution channels of the company. It may be conducted in order to value the company or to perform due diligence in various contexts. Industries where channel checks are more often conducted include retail, technology, commodities, etc. Channel checks can give insights complementary to balance sheet analysis, such as distributor and retailer attitudes towards a product and its competitors, seasonal and geographic variation, and inventory levels (notably channel stuffing). IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 9 of 10

About IDT Partners IDT Partners is a New York City based technology solutions and web application development firm specializing in enterprise-level web application development, custom product development, and technology solutions that help customers accelerate growth, capitalize on new market opportunities, and optimize operational efficiency by leveraging the latest technology. Contact Us For additional information or to learn more about IDT Partners capabilities contact info@idtpartners.com IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 10 of 10