Set Up Hortonworks Hadoop with SQL Anywhere



Similar documents
SAP BW on HANA & HANA Smart Data Access Setup

Set Up Hortonworks Hadoop with SAP Sybase IQ

Using SAP Crystal Reports with SAP Sybase SQL Anywhere

SAP BusinessObjects Business Intelligence 4 Innovation and Implementation

SAP PartnerEdge Program: Opportunities for SAP-Authorized Resellers

How-to guide: Monitoring of standalone Hosts. This guide explains how you can enable monitoring for standalone hosts in SAP Solution Manager

Extend the SAP FIORI app HCM Timesheet Approval

Open Items Analytics Dashboard System Configuration

Agentry and SMP Metadata Performance Testing Guidelines for executing performance testing with Agentry and SAP Mobile Platform Metadata based

How to Extend a Fiori Application: Purchase Order Approval

SAP Landscape Transformation (SLT) Replication Server User Guide

LVS Troubleshooting Common issues and solutions

Sybase ASE Linux Installation Guide Installation and getting started guide for SAP Sybase ASE on Linux

Consumption of OData Services of Open Items Analytics Dashboard using SAP Predictive Analysis

Create and run apps on HANA Cloud in SAP Web IDE

Creating a Fiori Starter Application for sales order tracking

How to Implement a SAP HANA Database Procedure and consume it from an ABAP Program Step-by-Step Tutorial

Configuring Java IDoc Adapter (IDoc_AAE) in Process Integration. : SAP Labs India Pvt.Ltd

Using Database Performance Warehouse to Monitor Microsoft SQL Server Report Content

Compare & Adjust How to Guide for Compare & Adjust in SAP Solution Manager Application Lifecycle Management

Memory Management simplifications in ABAP Kernel 7.4*

Creating a universe on Hive with Hortonworks HDP 2.0

Installing and Configuring the HANA Cloud Connector for On-premise OData Access

LHI Leasing Simplifying and Automating the IT Landscape with SAP Software. SAP Customer Success Story Financial Services Provider LHI Leasing

Nine Reasons Why SAP Rapid Deployment Solutions Can Make Your Life Easier Get Where You Want to Be, One Step at a Time

SAP CRM Service Manager 3.1 Mobile App Extended Feature List An extended list of all the features included in the default delivery of the SAP CRM

Setting up Single Sign-On (SSO) with SAP HANA and SAP BusinessObjects XI 4.0

SAP Solution Manager - Content Transfer This document provides information on architectural and design questions, such as which SAP Solution Manager

Certification Guide Network Connectivity for SAP on Premise and Cloud Solutions Integration

BW Source System: Troubleshooting Guide

Training.sap.com User Guide

SAP BUSINESS PLANNING AND CONSOLIDATION 10.0, VERSION FOR SAP NETWEAVER, POWERED BY SAP HANA STARTER KIT FOR USGAAP

SAP Sybase Adaptive Server Enterprise Shrinking a Database for Storage Optimization 2013

Setting up Single Sign-On (SSO) with SAP HANA and SAP BusinessObjects XI 4.0

Active Quality Management

Additional Guide to Implementing the SAP CRM Service Management rapiddeployment

SAP Security Recommendations December Secure Software Development at SAP Embedding Security in the Product Innovation Lifecycle Version 1.

Information Design Tool User Guide SAP BusinessObjects Business Intelligence platform 4.0 Feature Pack 3

What's New in SAP BusinessObjects XI 3.1 Service Pack 5

Cloud Single Sign-On and On-Premise Identity Federation with SAP NetWeaver Cloud White Paper

Five Strategies Small and Medium Enterprises Can Use to Successfully Implement High Value Business Mobility

Quick Guide to the SAP Customer Relationship Management Rapid- Deployment Solution (based on EhP1) Demo/Evaluation Appliance

SAP BusinessObjects Dashboarding Strategy and Statement of Direction

How To Install The Sap Business Explorer 7.X 2.X (Sap) On A Windows 7.30 Computer (Windows 7)

Secure MobiLink Synchronization using Microsoft IIS and the MobiLink Redirector

How To... Master Data Governance for Material: Create Custom Print forms. Applicable Releases: MDG 7

SAP Thought Leadership Paper Engineering, Construction, and Operations. Beyond Enterprise Resource Planning Construction in the ipad Age

Setting up the Environment for Creating or Extending SAP Fiori Apps

Installing the BlackBerry Enterprise Server Management console with a remote database

SAP PartnerEdge Program Guide for Language Services Partners

Crystal Reports Server Embedded 2008 with Service Pack 7 for Windows Supported Platforms

Design Thinking for. Requirements Analysis

Streamlined Planning and Consolidation for Finance Teams in Any Organization

SAP BusinessObjects Query as a Web Service Designer SAP BusinessObjects Business Intelligence platform 4.0

Inmagic ODBC Driver 8.00 Installation and Upgrade Notes

SAP BusinessObjects Business Intelligence Suite Document Version: 4.1 Support Package Patch 3.x Update Guide

ForFarmers: SAP Business Communications Management for Call Center Workload Distribution

How To... Master Data Governance for Material: Maintenance for multiple Materials in one Change Request. Applicable Releases: all

SAP HANA Big Data Intelligence rapiddeployment

SAP Work Manager 6.0 Mobile App Extended Feature List

SAP Business One OnDemand. SAP Business One OnDemand Solution Overview

How to Implement the X.509 Certificate Based Single Sign-On Solution with SAP Netweaver Single Sign-On

SAP HANA. SAP HANA Performance Efficient Speed and Scale-Out for Real-Time Business Intelligence

Installation Guide: Agentry Device Clients SAP Mobile Platform 2.3

SAP White Paper Enterprise Information Management

Implementing an Enterprise Information Management Strategy An Approach That Mitigates Risk and Drives Down Costs

Installing the BlackBerry Enterprise Server Management Software on an administrator or remote computer

Installing SQL Express. For CribMaster 9.2 and Later

SAP BusinessObjects Edge BI, Standard Package Preferred Business Intelligence Choice for Growing Companies

The purpose of this document is to describe how to connect Crystal Reports with BMC Remedy AR System using ODBC.

Getting Started with the License Administration Workbench 2.0 (LAW 2.0)

2. Unzip the file using a program that supports long filenames, such as WinZip. Do not use DOS.

H2G Install SAP Web IDE locally for trial (Mac version)

SAP Business Intelligence Suite Patch 10.x Update Guide

Data Governance. Data Governance, Data Architecture, and Metadata Essentials Enabling Data Reuse Across the Enterprise

Guide to Installing BBL Crystal MIND on Windows 7

How To Create An Easybelle History Database On A Microsoft Powerbook (Windows)

Installing Cobra 4.7

Integration Option for Microsoft SharePoint Software Getting Started Guide SAP BusinessObjects 4.0 Support Package 4

HANA Input Parameters with Multi-Values to Filter Calculation Views

WhatsUp Gold v16.2 Installation and Configuration Guide

How To Use the ESR Eclipse Tool with the Enterprise Service Repository

How to... Master Data Governance for Material: Use the Data Import Framework for Material. Applicable Releases: EhP6, MDG 6.1, MDG 7.

Hadoop Basics with InfoSphere BigInsights

Setting Up ALERE with Client/Server Data

How to Move an SAP BusinessObjects BI Platform System Database and Audit Database

Millennium Drive. Installation Guide

Upgrade: SAP Mobile Platform Server for Windows SAP Mobile Platform 3.0 SP02

Crystal Reports Installation Guide

Installation Guide for Windows

FAS Asset Accounting FAS CIP Accounting FAS Asset Inventory SQL Server Installation & Administration Guide Version

FTP Server Configuration

Reconfiguring VMware vsphere Update Manager

Installing RMFT on an MS Cluster

CRM WebClient UI & Netweaver Enterprise Portal Integration

StarWind Virtual SAN Installation and Configuration of Hyper-Converged 2 Nodes with Hyper-V Cluster

Leveraging SAP HANA & Hortonworks Data Platform to analyze Wikipedia Page Hit Data

Guide to the SAP Extended Business Program

Integrating SAP BusinessObjects with Hadoop. Using a multi-node Hadoop Cluster

Transcription:

Set Up Hortonworks Hadoop with SQL Anywhere

TABLE OF CONTENTS 1 INTRODUCTION... 3 2 INSTALL HADOOP ENVIRONMENT... 3 3 SET UP WINDOWS ENVIRONMENT... 5 3.1 Install Hortonworks ODBC Driver... 5 3.2 ODBC Driver Setup... 5 4 CONNECT USING SYBASE CENTRAL... 6 4.1 Create Remote Server... 8 4.2 Create and Access a Proxy Table... 11 5 CONNECT USING INTERACTIVE SQL... 14 5.1 Create Remote Server... 14 5.2 Access Hadoop from Server... 15 5.3 Access Hadoop from Proxy Table... 16 5.4 Create Table... 16

1 INTRODUCTION Welcome to setting up Hadoop and Hive with SQL Anywhere! Hadoop is an open-source framework designed to handle big data. It allows large amounts of data to be distributed across clusters of computers. Then, when retrieving the data, a MapReduce algorithm is implemented to divide up the work across the clusters (map) and then recombine the data (reduce). Hive is an infrastructure which can be used on top of Hadoop. It provides the ability to access data without using MapReduce or code files. Hive allows the creation of tables, which are stored as directories in the Hadoop File System (HDFS). These tables can be accessed using HiveQL, a subset of SQL-92, which implicitly makes MapReduce calls. Note: Since a Hive query must access the server, wait for a MapReduce job and aggregate results, access to a proxy table may be quite slow. This guide provides directions to set up SQL Anywhere with a Hortonworks Hadoop Hive server and HiveQL. This is not a guide to help with installation, set-up or usage of SQL Anywhere. For SQL Anywhere documentation check: http://dcx.sybase.com/index.html The provided instructions assume an installation of SQL Anywhere on a Windows machine. 2 INSTALL HADOOP ENVIRONMENT Hortonworks Sandbox is a distribution of the Hadoop environment that installs as a standalone virtual machine. This environment is a portable Hadoop environment, which is smaller and easier to use, but does not allow for a multi-node cluster or some of the functionality of a complete Hadoop installation. For the full platform, consider the Hortonworks Data Platform. Download the product and follow the installation instructions at: http://hortonworks.com/products/hortonworks-sandbox/#install Note: This installation requires an installation of VMware Player, Virtualbox or equivalent software. Download links are available on the Hortonworks website. 3

The simplest installation process is to use VMware Player, and then open and import the provided file from the player. Once this is installed and running, the provided IP address can be used to connect to the Hive server or access a Hadoop UI in a web browser. To ensure the IP address is accessible, enter the given IP address in a web browser. You should be able to continue to the sandbox and access a page similar to the following: 4

Note: Network security can limit access to the IP address generated for the Hadoop VM. It may only be accessible from other devices in the same subnet. Ensure that the host which IQ is running from can access the IP. 3 SET UP WINDOWS ENVIRONMENT Note: Screenshots and further instructions for this set up can be found at: http://hortonworks.com/hadoop-tutorial/how-to-install-and-configure-the-hortonworks-odbc-driver-onwindows-7/ 3.1 Install Hortonworks ODBC Driver In the Windows environment with SQL Anywhere installed, a Hortonworks ODBC driver is required to use to access the Hive server. This can be downloaded at the following link: http://hortonworks.com/products/hdp/hdp-1-3/#add_ons Save the appropriate.msi file (32 or 64-bit). Double-click to run the file. Finish the installation allowing it to continue at all prompts. 3.2 ODBC Driver Setup To use the installed ODBC driver, you must set up the ODBC settings. Open the Control Panel. Then select Administrative Tools. Double-click Data Sources (ODBC). Click the System DSN tab. Select the Sample Hortonworks Hive DSN and click Configure. In the window that appears, fill out the information, using the IP address from the Hortonworks installation for Host and sandbox for User Name. 5

Click the Test button and ensure it completes successfully. 4 CONNECT USING SYBASE CENTRAL Now that the environment and DSN are set up, we can use them to access the Hive server. If you would rather use SQL statements executed in Interactive SQL, instead of the Sybase Central graphical tool, to access the Hive server, skip this section and go to section 5. We will access the Hive server from an existing SQL Anywhere server, by creating a remote server and then proxy tables, as necessary. Open Sybase Central. 6

If a server or database is not currently connected, then add a connection. Click Connections in the top menu and choose Connect with SQL Anywhere 16. Connect to any created or running database, as applicable. In this example, we will connect to the demo database, that comes with the SQL Anywhere installation. We will choose Connect with an ODBC Data Source. Choosing ODBC Data Source Name, click Browse to choose the data source. Choose SQL Anywhere 16 Demo and click OK. Note: You can connect to the Sample Hortonworks Hive DSN directly, if you want to access the Hive server as a server, without using proxy tables. This, however, does not allow you to access both SQL Anywhere data and Hive data simultaneously. 7

Now, click Connect, no User ID or Password is required to connect to the demo database. Click the Folders icon to change to Folder View. 4.1 Create Remote Server Now, we will create a remote server. 8

Right-click on Remote Servers and choose New, Remote Server Choose a name for the remote server and click Next. 9

Choose Generic for the type of server and click Next. Specify the name of the DSN created in the ODBC Driver Setup for connection information and click Next. Continue to click next, accepting the defaults until the remote server is created. 10

4.2 Create and Access a Proxy Table We can now create a proxy table, which is a table in SQL Anywhere pointing to a table on the Hive server. Right-click on Tables and click New, Proxy Tables 11

Choose the remote server that was just created and click Next. Highlight the table for which to create a proxy of, and choose a new name and user, if desired. Click Next. 12

Change column choices, if desired. Click Next. Continue to click next and finish until the proxy table is created. Now the newly created proxy table will show up in the list of tables. 13

Double-click on the table and choose the Data tab. All the data from the table in the Hive server will appear. 5 CONNECT USING INTERACTIVE SQL This section is an alternative to section 4. Instead of connecting using the Sybase Central graphical tool, this section describes how to connect using SQL statements executed from Interactive SQL. Now that the environment and DSN are set up, we can use them to access the Hive server. In the Start Menu, open SQL Anywhere 16 -> Administration Tools -> Interactive SQL. Connect to the demo database by choosing the SQL Anywhere 16 Demo ODBC Data Source Name from the Browse menu. Connect without any User ID or Password. Alternatively, connect to any existing database. Note: You can connect to the Sample Hortonworks Hive DSN directly, if you want to access the Hive server as a server, without using proxy tables. This, however, does not allow you to access both SA data and Hive data simultaneously. 5.1 Create Remote Server Now, in the SQL Statements window, we will create a remote server. Run the following command, choosing a server name (hiveserver) and specifying the DSN as created above. CREATE server hiveserver class ODBC using dsn=sample Hortonworks Hive DSN 14

5.2 Access Hadoop from Server Data can be retrieved from the newly connected Hive server using normal SQL statements, provided that those statements also exist in HiveQL. Select statements can be run with the following commands: Forward to hiveserver; Select * from sample_07; Note: Existing data, in the Hadoop tables, can also be accessed by accessing the IP address specified in the Hortonworks VM in a web browser. 15

JOIN also works in the same way. 5.3 Access Hadoop from Proxy Table A proxy table can be created to access the data as well. Execute the following command: Forward to; Create existing table users At hiveserver.hive.default.users Note: The connection string hiveserver.hive.default.users is generated by servername.databasename.username.tablename. The users table must already exist in Hive for this to work. Now the users table is a proxy to the users table on the Hive server. You can access data from that table from IQ: Select * from users; 5.4 Create Table To create a table from IQ, run the following command: Forward to; Create table hive_t (bee int, nest int) At hiveserver.hive.default.hive_t Then you can access the table both from IQ and from the Hive server directly, with a proxy table already created. 16

www.sap.com 2014 SAP AG. All rights reserved. SAP, R/3, SAP NetWeaver, Duet, PartnerEdge, ByDesign, SAP BusinessObjects Explorer, StreamWork, SAP HANA, and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and other countries. Business Objects and the Business Objects logo, BusinessObjects, Crystal Reports, Crystal Decisions, Web Intelligence, Xcelsius, and other Business Objects products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of Business Objects Software Ltd. Business Objects is an SAP company. Sybase and Adaptive Server, ianywhere, Sybase 365, SQL Anywhere, and other Sybase products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of Sybase Inc. Sybase is an SAP company. Crossgate, m@gic EDDY, B2B 360, and B2B 360 Services are registered trademarks of Crossgate AG in Germany and other countries. Crossgate is an SAP company. All other product and service names mentioned are the trademarks of their respective companies. Data contained in this document serves informational purposes only. National product specifications may vary. These materials are subject to change without notice. These materials are provided by SAP AG and its affiliated companies ("SAP Group") for informational purposes only, without representation or warranty of any kind, and SAP Group shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP Group products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty.