Big Data for Investment Research Management



Similar documents
Big Data for Investment Research Management

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

This Symposium brought to you by

MDM and Data Warehousing Complement Each Other

Cisco Data Preparation

The Power of Pentaho and Hadoop in Action. Demonstrating MapReduce Performance at Scale

How Companies are! Using Spark

Three Open Blueprints For Big Data Success

Databricks. A Primer

Big Data on Microsoft Platform

How To Handle Big Data With A Data Scientist

Cloudera Enterprise Data Hub in Telecom:

Virtualizing Apache Hadoop. June, 2012

Big Data Analytics Nokia

Databricks. A Primer

Ganzheitliches Datenmanagement

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

Analance Data Integration Technical Whitepaper

Testing Big data is one of the biggest

BIG DATA SOLUTION DATA SHEET

The Inside Scoop on Hadoop

Luncheon Webinar Series May 13, 2013

Analance Data Integration Technical Whitepaper

ORACLE PROJECT MANAGEMENT

Beyond Web Application Log Analysis using Apache TM Hadoop. A Whitepaper by Orzota, Inc.

Cost-Effective Business Intelligence with Red Hat and Open Source

The Future of Data Management

Big Data Open Source Stack vs. Traditional Stack for BI and Analytics

Next Generation Business Performance Management Solution

TURN YOUR DATA INTO KNOWLEDGE

NetGain Business Intelligence. Nick Johnson, Media & Workflow Specialist

BIG DATA TRENDS AND TECHNOLOGIES

PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP

So What s the Big Deal?

TAMING THE BIG CHALLENGE OF BIG DATA MICROSOFT HADOOP

Cloud Integration and the Big Data Journey - Common Use-Case Patterns

Has been into training Big Data Hadoop and MongoDB from more than a year now

IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems

Advanced In-Database Analytics

Advanced Big Data Analytics with R and Hadoop

ORACLE PROCUREMENT AND SPEND ANALYTICS

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January Website:

Reporting component for templates, reports and documents. Formerly XML Publisher.

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

Navigating Big Data business analytics

Lecture 32 Big Data. 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop

Data Integration Checklist

Data Governance in the Hadoop Data Lake. Michael Lang May 2015

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Overview, Goals, & Introductions

The Power of the Unica Marketing Platform

Embedded Analytics Vendor Selection Guide. A holistic evaluation criteria for your OEM analytics project

Please give me your feedback

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Testing 3Vs (Volume, Variety and Velocity) of Big Data

Native Connectivity to Big Data Sources in MSTR 10

Mastering Big Data. Steve Hoskin, VP and Chief Architect INFORMATICA MDM. October 2015

TURN YOUR DATA INTO KNOWLEDGE

Intro to Big Data and Business Intelligence

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

How To Turn Big Data Into An Insight

Bringing Big Data to People

Introduction to TIBCO MDM

Hortonworks and ODP: Realizing the Future of Big Data, Now Manila, May 13, 2015

SQL Server 2012 Business Intelligence Boot Camp

What's New in SAS Data Management

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Business Intelligence for Big Data

How To Choose A Business Intelligence Toolkit

Big Data Visualization with JReport

Innovation. Simplifying BI. On-Demand. Mobility. Quality. Innovative

Blueprints for Big Data Success

Deploying an Operational Data Store Designed for Big Data

The Clear Path to Business Intelligence

Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru

Tap into Hadoop and Other No SQL Sources

Inventory Optimization for the Consumer Products Industry

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence

Evolution from Big Data to Smart Data

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

Delivering new insights and value to consumer products companies through big data

Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce

XpoLog Competitive Comparison Sheet

SQL Server 2012 PDW. Ryan Simpson Technical Solution Professional PDW Microsoft. Microsoft SQL Server 2012 Parallel Data Warehouse

PROPOSAL To Develop an Enterprise Scale Disease Modeling Web Portal For Ascel Bio Updated March 2015

Extending the Enterprise Data Warehouse with Hadoop Robert Lancaster. Nov 7, 2012

Transcription:

IDT Partners www.idtpartners.com Big Data for Investment Research Management Discover how IDT Partners helps Financial Services, Market Research, and Investment Management firms turn big data into actionable research insights.

Executive Summary IDT Partners was retained by a financial client to investigate a legacy Research Management System (RMS) and provide its team with a more robust research dataset. The client s investment research process includes supply chain [1] research and ongoing investment portfolio monitoring. The research information is collected and aggregated through proprietary supply chain data and by conducting channel checks. [2] The IDT Partners team worked with the client to understand the business function dependencies on the legacy RMS, explored what business value potential system improvements could bring, and identified what functionality, such as reporting, researchers and executive management needed. A proposal offering options for a new system featuring improved data management, business process automation, and an intuitive user interface was put forth by IDT Partners and approved by the client. IDT Partners Solution The IDT Partners team worked with the client to develop and deploy a new system. The new system featured a much needed intuitive User Interface (UI), improved Workflows, Business Process Automation (BPA), Supply Chain Trend Spotting Tools, Dashboards, Reporting, and many other improvements. Additionally, dozens of new data sources were added and a scalable Data Warehouse was created to store and manage historical data. IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 2 of 12

IDT Partners increased the client s research dataset eightyfold, from 50GB to over 4TB, while decreasing the data processing time from ten hours to less than two hours. The IDT Partner s solution has provided the client s research team with a technology enabled competitive edge by improving the client s operational efficiency by 20%, significantly improving its business processes management and automation capabilities, and using big data processing to provide actionable research insights needed to support its investment research process. IDT Partners Solution Highlights 8,000% Larger Dataset 1,000% Increase in number of Data Sources 500% Faster Data Processing and Up to date Information 100% ROI within 12 months 20% Increase in Operational Efficiency Automated Trend Spotting & Monitoring with Big Data Business Process Management & Automation (BPM, BPA) Consistent Data with Automated Data De duplication Intuitive and Streamlined User Interface (UI) Highly Customized Automated Reports Executive Summary Dashboards IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 3 of 12

Solution Overview: Research Management System (RMS) The following is an overview of a Research Management System (RMS) built by IDT Partners for a financial client. The system facilitates investment research, provides actionable insights, and helps support investment decisions by analyzing the supply chain [1] for companies in the client s investment portfolio. System Architecture A proprietary framework was used to build the RMS as a Service Oriented Architecture (SOA). This has allowed for an increased flexibility in tool choice and made it easy to exchange information with client s internal systems (Business Intelligence, Compliance, Vendor Management, and Billing) via XML based Web Services. Data Sources IDT Partners assisted the client with data vendor selection, custom data collection, data aggregation, data processing, and Master Data Management (MDM) strategy for the following multi terabyte datasets. Supply Chain Data Company Data Product Data Company Taxonomies News Market Data Industry Contacts Supply Chain Taxonomies Social Media International Trade Data Custom Research Proprietary Datasets IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 4 of 12

The following diagram provides an overview of the data management process as it relates to extracting research insights from raw data sources. Data Source Management Data Ingestion & Integration Master Data Management (MDM) Market Data Industry Contacts Supply Chain Data Company Taxonomies Product Data Company Data Supply Chain Taxonomies News Social Media International Trade Data Custom Research Proprietary Datasets

Data Warehousing (DW) Custom Data Processing Analytics & Business Intelligence (BI) Research Management System (RMS) Business Process Automation (BPA) Actionable Research Insights Big Data Hadoop and MapReduce technology was used to convert multi terabyte supplychain related datasets into actionable research data and Business Intelligence (BI) tools used by client to help facilitate investment research and decisions. The solution analyzes over a billion supply chain data points on ongoing basis to provide insights into historical supply chain changes. The Hadoop infrastructure breaks up large datasets into "chunks" and coordinates the processing of the data out into the distributed, clustered environment. The Hadoop infrastructure included the following core components: HDFS containers, MapReduce processing infrastructure, Hive for immutable table data store, and HBase for mutable table data store. IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 6 of 12

The following diagram illustrates a simplified version of a data workflow that utilizes big data processing technology to provide the RMS with actionable data via an integrated BI platform. Data Sources Dozens of Proprietary and Commercial Supply Chain related Data Sources (multi terabyte) Data Ingestion Data Processing Hadoop Compute Cluster Scheduler Classification Master 1 Map Task Repository Deduplication Master 2 Admin UI Normalization Slave 1 Slave 2 Map Slave n Map Research Dataset Reduce Current Dataset Custom Trends DW BI Platform Research Management System IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 7 of 12

Key Technologies The following are some of the key technologies used for this solution. PHP MapReduce Linux Lucene LAMP HDFS Apache Solr JAVA HBase MongoDB ETL jquery Pig MSSQL SSIS Hadoop Hive MarkLogic Pentaho User Interface (UI) The solution leveraged a combination of Rich Internet Application (RIA) technology and intuitive User Interface (UI) design to simplify the supply chain navigation process, seamlessly manage the underlying research processes, and to provide a way to easily review large quantities of data using data visualization and dashboards. IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 8 of 12

The following are some of the UI features of the RMS. Innovative Data Visualization interfaces Intuitive upstream and downstream supply chain navigation with rollover information popups. Analysis of companies within each node in the supply chain. Supplier, vendor, manufacturer, customer, and distributor information. Competitor information for companies within the supply chain. Comparison matrix for any two entities in the supply chain. Supply chain company filter with research focused criteria. Supply chain search using hundreds of criteria including historical trends. Sharing of screens, graphs, charts, dashboards, and research feedback. Automated Trend Spotting & Monitoring IDT Partners developed a data warehouse and tools to keep track of historical supply chain changes to help identify supply chain trends. Automated trend spotting and monitoring automated and streamlined what had previously been a highly time consuming and error prone manual process. This has proved to be invaluable for the client s research team. Reporting The solution features dozens of highly customized reports leveraging a much larger dataset and the newly built data warehouse. IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 9 of 12

Some of the features include Custom Dashboards, Trend and Portfolio Monitoring Alerts, dozens of Highly Customized Reports, and Data Export (Excel, PDF, Email). Definitions The following are definitions for some domain specific terminology referenced in this white paper. [1] Supply Chain A supply chain is a system of organizations, people, technology, activities, information and resources involved in moving a product or service from supplier to customer. Supply chain activities transform natural resources, raw materials and components into a finished product that is delivered to the end customer. The following is an overview of an electronics manufacturing supply chain. Manufacturing Marketing R&D Design Assembly Distribution Sales The following simplified supply chain diagram is an example of how NVIDIA, a global technology company and GPU manufacturer, supplies IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 10 of 12

directly to the customer, to PC vendors, and retailers. Each point in the supply chain may include hundreds of companies across many tiers. Semiconductor Materials Direct PC Vendors Component Makers / Semiconduct or Foundries Customers Semiconductor Manufacturing Equipment Indirect PC Vendors Distributors Resellers / Retailers [2] Channel Check In financial analysis, a channel check is third party research on a company's business based on collecting information from the distribution channels of the company. It may be conducted in order to value the company or to perform due diligence in various contexts. Industries where channel checks are more often conducted include retail, technology, commodities, etc. Channel checks can give insights complementary to balance sheet analysis, such as distributor and retailer attitudes towards a product and its competitors, seasonal and geographic variation, and inventory levels (notably channel stuffing). IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 11 of 12

About IDT Partners IDT Partners is a New York City based technology solutions and web application development firm specializing in enterprise level web application development, custom product development, and technology solutions that help customers accelerate growth, capitalize on new market opportunities, and optimize operational efficiency by leveraging the latest technology. Contact IDT Partners at info@idtpartners.com IDT Partners New York, NY www.idtpartners.com info@idtpartners.com Page 12 of 12