Set Up Hortonworks Hadoop with SAP Sybase IQ



Similar documents
Set Up Hortonworks Hadoop with SQL Anywhere

SAP BW on HANA & HANA Smart Data Access Setup

Upgrade: SAP Mobile Platform Server for Windows SAP Mobile Platform 3.0 SP02

Installation Guide: Agentry Device Clients SAP Mobile Platform 2.3

R49 Using SAP Payment Engine for payment transactions. Process Diagram

LVS Troubleshooting Common issues and solutions

SAP HANA Big Data Intelligence rapiddeployment

Architecting the Future of Big Data

Landscape Design and Integration. SAP Mobile Platform 3.0 SP02

Sybase ASE Linux Installation Guide Installation and getting started guide for SAP Sybase ASE on Linux

SAP BusinessObjects Business Intelligence 4 Innovation and Implementation

SEPA in SAP CRM. Application Innovation, CRM & Service Industries. Customer

SAP HANA Client Installation and Update Guide

CUSTOMER Presentation of SAP Predictive Analytics

How-to guide: Monitoring of standalone Hosts. This guide explains how you can enable monitoring for standalone hosts in SAP Solution Manager

SAP 3D Visual Enterprise Rapid-Deployment Solution

SAP ERP E-Commerce and SAP CRM Web Channel Enablement versions available on the market

Software and Delivery Requirements

SFSF EC to 3 rd party payroll Integration Software and Delivery Requirements

Creating a universe on Hive with Hortonworks HDP 2.0

SAP Business Intelligence Suite Patch 10.x Update Guide

Architecting the Future of Big Data

Getting Started with the License Administration Workbench 2.0 (LAW 2.0)

Price and Revenue Management - Manual Price Changes. SAP Best Practices for Retail

SAP BusinessObjects Business Intelligence Suite Document Version: 4.1 Support Package Patch 3.x Update Guide

Leveraging SAP HANA & Hortonworks Data Platform to analyze Wikipedia Page Hit Data

SAP NetWeaver Identity Management Identity Services Configuration Guide

Information security guidelines

Architecting the Future of Big Data

How To Make Your Software More Secure

Installing and Configuring MySQL as StoreGrid Backend Database on Linux

Using SAP Crystal Reports with SAP Sybase SQL Anywhere

Secure MobiLink Synchronization using Microsoft IIS and the MobiLink Redirector

SAP Business One, version for SAP HANA Platform Support Matrix

Setting up Visual Enterprise Integration (WM6)

How-To Guide SAP NetWeaver Document Version: How To Guide - Configure SSL in ABAP System

StoreGrid Backup Server With MySQL As Backend Database:

How-To Guide SAP Cloud for Customer Document Version: How to replicate marketing attributes from SAP CRM to SAP Cloud for Customer

Setting up an Apache Server in Conjunction with the SAP Sybase OData Server

How to Install and Configure EBF15328 for MapR or with MapReduce v1

Partner Certification to Operate SAP Solutions and SAP Software Environments

Agentry and SMP Metadata Performance Testing Guidelines for executing performance testing with Agentry and SAP Mobile Platform Metadata based

Users Guide. Ribo 3.0

Data Integration using Integration Gateway. SAP Mobile Platform 3.0 SP02

How-To Guide SAP Cloud for Customer Document Version: How to Configure SAP HCI basic authentication for SAP Cloud for Customer

Run SAP Risk Management in Utilities to Get Business Value Fast

Configuration (X87) SAP Mobile Secure: SAP Afaria 7 SP5 September 2014 English. Building Block Configuration Guide

Rapid database migration of SAP Business Suite to SAP HANA (V4.10): Software and Delivery Requirements. SAP HANA November 2014 English

ODBC Driver User s Guide. Objectivity/SQL++ ODBC Driver User s Guide. Release 10.2

SAP Project Portfolio Monitoring Rapid- Deployment Solution: Software Requirements

SAP PartnerEdge program for Application Development Building Business Success for the Future

Use Advanced Analytics to Guide Your Business to Financial Success

SAP Best Practices for SAP Mobile Secure Cloud Configuration March 2015

Contents. About this Support Package / Patch...5. To install the EPM Add-in for Microsoft Office Support Package 15 / Patch XX...

CS WinOMS Practice Management Software Server Migration Help Guide

Supported Platforms. HP Vertica Analytic Database. Software Version: 7.1.x

Installing and Configuring the HANA Cloud Connector for On-premise OData Access

Manual to Access SAP Training Systems Technical Description for Customer On-Site Training

SimbaEngine SDK 9.4. Build a C++ ODBC Driver for SQL-Based Data Sources in 5 Days. Last Revised: October Simba Technologies Inc.

Dell Statistica Statistica Enterprise Installation Instructions

CA Workload Automation Agent for Databases

Enhance Customer Service with Integrated Scale Management Software from SAP

Connect to an SSL-Enabled Microsoft SQL Server Database from PowerCenter on UNIX/Linux

Ariba Procure-to-Pay Integration rapiddeployment

How to configure BusinessObjects Enterprise with Citrix Presentation Server 4.0

SAP Payroll Processing control center rapiddeployment

Cost-Effective Data Management and a Simplified Data Warehouse

How To Use An Automotive Consulting Solution In Ansap

Installation Guide Customized Installation of SQL Server 2008 for an SAP System with SQL4SAP.VBS

Optimize Retail Label and Poster Printing with SAP Software

Plug-In for Informatica Guide

CA Workload Automation Agent for Microsoft SQL Server

Harness the Power of Analytics Across Lines of Business with Speed and Ease

Supported Platforms HPE Vertica Analytic Database. Software Version: 7.2.x

Consumption of OData Services of Open Items Analytics Dashboard using SAP Predictive Analysis

The Edge Editions of SAP InfiniteInsight Overview

GR5 Access Request. Process Diagram

K75 SAP Payment Engine for Credit transfer (SWIFT & SEPA) Process Diagram

Streamline Processes and Gain Business Insights in the Cloud

Your Intelligent POS Solution: User-Friendly with Expert Analysis

Installation Guide. SAP Control Center 3.3

Simplify and Secure Cloud Access to Critical Business Data

Installing the BlackBerry Enterprise Server Management console with a remote database

SAP BusinessObjects Document Version: 4.1 Support Package Dashboards and Presentation Design Installation Guide

Inmagic ODBC Driver 8.00 Installation and Upgrade Notes

SuccessFactors HCM Suite November 2014 Release Version: December 5, SuccessFactors Learning Programs Administration Guide

Sharing Pictures, Music, and Videos on Windows Media Center Extender

HP Web Jetadmin Database Connector Plug-in reference manual

Automotive Consulting Solution. CHEP - EDI- Container Data

Installing RMFT on an MS Cluster

Multi Channel Sales Order Management: Mail Order. SAP Best Practices for Retail

Configuring a SQL Anywhere server to autorestart after a fatal error

SAP Business One Hardware Requirements Guide

Installing Microsoft SQL Server Linux ODBC Driver For Use With Kognitio Analytical Platform

Improve Business Efficiency by Automating Intercompany Transactions

Configuring Java IDoc Adapter (IDoc_AAE) in Process Integration. : SAP Labs India Pvt.Ltd

Understanding Security and Rights in SAP BusinessObjects Business Intelligence 4.1

Transcription:

Set Up Hortonworks Hadoop with SAP Sybase IQ

Table of Contents Introduction...3 Install Hadoop Environment...3 Set Up IQ Environment... 4 unixodbc... 4 DSN...5 Connect Using IQ... 8 Create Remote Server... 9 Access Hadoop from Server...10 Access Hadoop from Server...10 Create Table...11 2

Introduction Welcome to setting up Hadoop and Hive with SAP Sybase IQ! Hadoop is an open-source framework designed to handle big data. It allows large amounts of data to be distributed across clusters of computers. Then, when retrieving the data, a MapReduce algorithm is implemented to divide up the work across the clusters (map) and then recombining the data (reduce). Hive is an infrastructure which can be used on top of Hadoop. It provides the ability to access data without using MapReduce or code files. It provides HiveQL, which is similar to SQL, which allows for querying data in a Hadoop table. In previous versions of IQ, the best way to integrate Hadoop with an IQ database was to use user-defined functions and Java code. While this can be useful in certain cases and is still supported, connecting to a Hive server on top of the Hadoop data can be much simpler. This guide is intended to provide directions on the setup required to use IQ with the Hortonworks distribution of the Hive server and HiveQL. This is not a guide to help with IQ installation or setup. For instructions on IQ setup, please check the SAP website. Install Hadoop Environment Hortonworks Sandbox is a distribution of the Hadoop environment that installs as a standalone virtual machine. Download the product and follow the installation instructions at: http://hortonworks.com/products/hortonworks-sandbox/#install Note: This installation requires an installation of VMware Player, Virtualbox or equivalent software. Download links are available on the Hortonworks website. 3

Set Up IQ Environment unixodbc unixodbc is a driver manager that can be used to connect to data sources using specified drivers. We will use this driver manager to open the Hive server data source in IQ, instead of the default driver manager in a typical IQ install. Install the latest unixodbc driver manager by downloading the.tar.gz file from the following website: http://www.unixodbc.org/download.html Unzip and untar as specified on the website. Then, open a terminal in the unixodbc-2.3.1 folder (or corresponding folder), and run the following commands:./configure make makeinstall The command should finish without obvious errors. When installed correctly, you should be able to run the isql or the odbcinst commands. SAP Sybase IQ creates a soft link in its own directory to the driver manager that it will use, when connecting to data sources. By default, this is directed to the SQL Anywhere driver, so the connection expects a SQL Anywhere or IQ server at all times. SAP Sybase IQ is build using SQL Anywhere connectivity, so using a SQL Anywhere driver is expected.we will direct this to the newly installed unixodbc driver manager to allow us to connect to any ODBC driver. In the terminal, execute the following commands to view all driver manager shared object libraries. cd $SYBASE/IQ-16_0/lib64 ls -l This will display all library files in the folder. IQ uses the libodbc.so and libodbcinst.so files to find the driver manager. If a libodbc.so link already exists, remove it. Then, we will create links to the library files installed by the unixodbc installation. Ensure that the files exist at the /usr/local/lib location before running this command. Note: If you cannot find the libodbc.so and libodbcinst.so, run locate libodbc.so, which will specify the file location. rm libodbc.so ln s /usr/local/lib/libodbc.so libodbc.so ln s /usr/local/lib/libodbcinst.so libodbcinst.so ls l Now, a list should appear of all files. Check for the appropriate shared object files and ensure they link to the right place. The list should appear like the list on the next page: 4

Now the unixodbc driver manager will be used. Install Hortonworks ODBC Driver In the Linux environment with IQ installed, a Hortonworks ODBC driver is required to use to access the Hive server. This can be downloaded at the following link: http://hortonworks.com/products/hdp/hdp-1-3/#add_ons Save the.tar.gz file. Unzip it using the following command: tar zxvf hive-odbc-native.1.2.13.1018.tar.gz Install the.rpm file in the resulting folder, ensuring you choose the correct one (32 or 64-bit based on your IQ installation): rpm ivh hive-odbc-native-1.2.13.1018-1.el6.x86_64.rpm DSN To access connections, you can store attributes to the connection in a DSN. Typically, this connection data is stored in an.odbc.ini file. An.odbcinst.ini file is used to store driver information, including the newly installed driver. The Hortonworks distribution also includes a.hortonworks.hiveodbc.ini file which stores necessary additional information on the connection to the driver. 5

Open your home directory. Check if.odbc.ini or.odbcinst.ini files exist, by checking the folder with hidden files shown (ctrl-h): We will now update the existing files or create new ones, with the required DSN data for the Hive ODBC driver. Note: The Hive ODBC installation comes with default configuration files, which can be used, but may have newline errors. To avoid introducing errors and for simplicity, we will create our own. If you wish to use the provided files, ensure that no extra newline characters are introduced. The.odbc.ini file should be configured as follows: 6

The name of data source we are creating, in this case hive, should be specified with corresponding driver name (in.odbcinst.ini), under [ODBC Data Sources] Under [hive], the attributes for the data source should be specified. The HOST will be the IP address of the Hive server, as specified in the Hortonworks VM (the IP address used to access the web interface). 10000 is the default Hive listening port. Ensure that HS2AuthMech is specified if the HiveServerType is 2, to avoid connection issues. The Driver must contain a path to the actual driver.so file. The.odbcinst.ini file should be configured as follows: The driver field must contain the path to the actual.so file, as in the.odbc.ini file. Each entry under [ODBC Drivers] is the name of the driver, as referenced in the.odbc.ini file. In this case, the driver is named Hortonworks. 7

The.hortonworks.hiveodbc.ini file should be configured as follows: The ErrorMessagePath is the actual path to the error messages folder, which is created when the driver is installed. This is used to return accurate errors when accessing the server. Connect Using IQ Now that the environment and DSN are set up, we can use them to access the Hive server. We will access the Hive server from an existing IQ server. To begin, we will start an IQ server. Open a Linux terminal in the directory with the desired database and configuration files. Run a command similar to the follow, with the correct variable names: start_iq @iqdemo.cfg iqdemo.db Now we will open Interactive SQL and connect to that running server: dbisql 8

In the prompt, connect to the server which was started above. Note: You can connect to the Hive server directly, using the Connect with an ODBC Data Source option, if you want to access the Hive server as a server, without using proxy tables. This, however, is not supported and does not allow you to access both SA data and Hive data simultaneously. Create Remote Server Now, in the SQL Statements window, we will create a remote server. Run the following command, choosing a server name (hiveserver) and specifying the DSN as created above. CREATE server hiveserver class ODBC using dsn=hive 9

Access Hadoop from Server Data can be retrieved from the newly connected Hive server using normal SQL statements, provided that those statements also exist in HiveQL. Select statements can be run with the following commands: Forward to hiveserver; Select * from sample_07; Note: Existing data, in the Hadoop tables, can also be accessed by accessing the IP address specified in the Hortonworks VM in a web browser. JOIN also works in the same way. Access Hadoop from Proxy Table A proxy table can be created to access the data as well. Execute the following command: Forward to; Create existing table users At hiveserver.hive.default.users Note: The connection string hiveserver.hive.default.users is generated by servername.databasename.username. tablename. Now the users table is a proxy to the users table on the Hive server. You can access data from that table from IQ: Select * from users; 10

Create Table To create a table from IQ, run the following command: Forward to; Create table hive_t (bee int, nest int) At hiveserver.hive.default.hive_t Then you can access the table both from IQ and from the Hive server directly, with a proxy table already created. 11

www.sap.com 2013 SAP AG or an SAP affiliate company. All rights reserved. No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP AG. The information contained herein may be changed without prior notice. Some software products marketed by SAP AG and its distributors contain proprietary software components of other software vendors. National product specifications may vary. These materials are provided by SAP AG and its affiliated companies ( SAP Group ) for informational purposes only, without representation or warranty of any kind, and SAP Group shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP Group products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty. SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and other countries. Please see http://www.sap.com/corporate-en/legal/copyright/index. epx#trademark for additional trademark information and notices.