Open Source Business Intelligence Tools: A Review



Similar documents
Open Source Business Intelligence Intro

A DATA WAREHOUSE SOLUTION FOR E-GOVERNMENT

Open Source Business Intelligence

Business Intelligence for SUPRA. WHITE PAPER Cincom In-depth Analysis and Review

BUILDING OLAP TOOLS OVER LARGE DATABASES

Palo Open Source BI Suite

Business Benefits From Microsoft SQL Server Business Intelligence Solutions How Can Business Intelligence Help You? PTR Associates Limited

University of Gaziantep, Department of Business Administration

Open Source meets Business Intelligence Seminar Business Intelligence Winter Term 06/07

Business Intelligence & Product Analytics

Cincom Business Intelligence Solutions

Connecting Pentaho Suite applications with Microsoft SQL Server Analysis Services

Turnkey Hardware, Software and Cash Flow / Operational Analytics Framework

Business Intelligence, Analytics & Reporting: Glossary of Terms

Turkish Journal of Engineering, Science and Technology

Foundations of Business Intelligence: Databases and Information Management

DATA MINING USING PENTAHO / WEKA

Open Source Business Intelligence Platforms for Engineering Education

Pentaho BI Capability Profile

Monitoring Genebanks using Datamarts based in an Open Source Tool

IAF Business Intelligence Solutions Make the Most of Your Business Intelligence. White Paper November 2002

Building Geospatial Business Intelligence Solutions with Free and Open Source Components

Open source geospatial Business Intelligence (BI) in action!

The Microsoft Business Intelligence 2010 Stack Course 50511A; 5 Days, Instructor-led

MS 50511A The Microsoft Business Intelligence 2010 Stack

ElegantJ BI. White Paper. Considering the Alternatives Business Intelligence Solutions vs. Spreadsheets

Implementing Data Models and Reports with Microsoft SQL Server

End to End Microsoft BI with SQL 2008 R2 and SharePoint 2010

ORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process

SSIS Training: Introduction to SQL Server Integration Services Duration: 3 days

SAP BW Connector for BIRT Technical Overview

Business Intelligence Solutions for Gaming and Hospitality

CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University

Evaluating Business Intelligence Offerings: Oracle and Microsoft.

Model-Driven Data Warehousing

WHITE PAPER. Domo Advanced Architecture

Real-time Data Replication

How To Build A Business Intelligence System In Stock Exchange

Delivering Business Intelligence With Microsoft SQL Server 2005 or 2008 HDT922 Five Days

SQL Server 2012 Business Intelligence Boot Camp

Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers

Data W a Ware r house house and and OLAP II Week 6 1

Implementing Data Models and Reports with Microsoft SQL Server 20466C; 5 Days

Open Source Business Intelligence

Implementing Data Models and Reports with Microsoft SQL Server 2012 MOC 10778

What is Data Virtualization? Rick F. van der Lans, R20/Consultancy

NEMUG Feb Create Your Own Web Data Mart with MySQL

Microsoft Implementing Data Models and Reports with Microsoft SQL Server

What is Data Virtualization?

Business Intelligence on a Budget: Open Source BI. Paul O Rorke

Business Intelligence in E-Learning

Breadboard BI. Unlocking ERP Data Using Open Source Tools By Christopher Lavigne

Data Warehouse design

IDCORP Business Intelligence. Know More, Analyze Better, Decide Wiser

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2

Deriving Business Intelligence from Unstructured Data

Integrating Ingres in the Information System: An Open Source Approach

<Insert Picture Here> Introducing Data Modeling and Design with Oracle SQL Developer Data Modeler

The difference between. BI and CPM. A white paper prepared by Prophix Software

Integrating data in the Information System An Open Source approach

COURSE SYLLABUS COURSE TITLE:

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

CHAPTER 5: BUSINESS ANALYTICS

Turning your Warehouse Data into Business Intelligence: Reporting Trends and Visibility Michael Armanious; Vice President Sales and Marketing Datex,

White Paper. Comparison of Business Intelligence Stacks: Microsoft SQL Server Reporting Services and SAP Business Objects July 7, 2010

Integrating SAP and non-sap data for comprehensive Business Intelligence

Quick start. A project with SpagoBI 3.x

8. Business Intelligence Reference Architectures and Patterns

Microsoft Services Exceed your business with Microsoft SharePoint Server 2010

MicroStrategy Course Catalog

ETL tools for Data Warehousing: An empirical study of Open Source Talend Studio versus Microsoft SSIS

Pentaho Reporting Overview

Business Intelligence tools comparison MS SQL Server Vs Pentaho Open Source

Big Data Analytics with IBM Cognos BI Dynamic Query IBM Redbooks Solution Guide

SQL Server Administrator Introduction - 3 Days Objectives

When to consider OLAP?

Food & Beverage Industry Brief

Big Data for Investment Research Management

Outlines. Business Intelligence. What Is Business Intelligence? Data mining life cycle

Business Intelligence and Healthcare

Tracking System for GPS Devices and Mining of Spatial Data

CS2032 Data warehousing and Data Mining Unit II Page 1

Information management software solutions White paper. Powerful data warehousing performance with IBM Red Brick Warehouse

A Framework for Developing the Web-based Data Integration Tool for Web-Oriented Data Warehousing

Implementing Data Models and Reports with Microsoft SQL Server

Vendor briefing Business Intelligence and Analytics Platforms Gartner 15 capabilities

Business Intelligence: Effective Decision Making

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise

Transcription:

Open Source Business Intelligence Tools: A Review Amid Khatibi Bardsiri 1 Seyyed Mohsen Hashemi 2 1 Bardsir Branch, Islamic Azad University, Kerman, IRAN 2 Science and Research Branch, Islamic Azad University, Tehran, IRAN Abstract Nowadays, information and knowledge represent the fundamental wealth of an organization. The organization can business and requirements needed by its use of Business intelligence (BI) justification. Business intelligence is the process of transforming raw data into useful information for more effective strategic, operational insights, and decision-making purposes so that it yields real business benefit. Business intelligence joins as a key strategy for any organization to achieve a competitive advantage. Business intelligence tools are a type of application software designed to retrieve, analyze and report data. In the paper, we consider a number of open source BI tools. To understand the use of open source BI tools better, it is of interest to consider which tools are available and what they are capable of. This is the goal of this paper. Keywords: Business Intelligence (BI), Open Source, Tool 1. Introduction In the past, Decision-Support Systems (DSS) were independent systems within an organization and had a weak relationship with other systems. Now, enterprise systems are the foundation of an organization, and practitioners design and may implement BI to create a comprehensive decision-support environment for management [1, 2]. Business Intelligence (BI) is a broad category of applications and technologies for gathering, storing, analyzing, and providing access to data to help organizations make better business decisions [3]. In the organizations the information can be seen as a competitive advantage over its competitors, knowing the core business and discover knowledge patterns in the daily and operational processes is the way to achieve management effectiveness. Thomas Davenport [4] says BI more as a set of technologies and processes to collect and analyze data for decision making. Negash [5] explains Business Intelligence concept as BI systems combine data gathering, data storage, and knowledge management with tools to present complex internal and competitive information to decision makers. Today companies took advantage of Intelligence of its competitors to move their business to use and efficient. Intelligence can gives a good feature business case reports and data analysis and performance improvement company [6]. The value of BI for business is primarily said in the fact that such systems cast some light on information that may serve as the basis for carrying out fundamental changes in a specific organization, i.e. establishing new co-operation, acquiring new customers, making new markets, offering products to customers [7]. Software suppliers provide users with BI tools and it can be assumed that they have distinct views of what BI means. There are now mature and powerful open source tools available in all categories, so it is now for the first time possible to make a complete BI solution using only open source tools. To collect data about the tools, the Internet was searched for open source tools in each category. The presented data was found by inspecting the products official Amid Khatibi Bardsiri et.al. 164 www.irjcsea.org

homepages as well as their documentation (if any), mailing lists and forums. In this survey paper, we consider the capabilities of a number of open source tools for BI. 2. Tools As some organizations are looking to reduce costs in their large Business Intelligence (BI) deployments, they are hoping that open source gives them greater leverage for their money. Application developers that are looking for a way to embed BI functionality into their applications initiate other open source BI deployments. The open source tools are currently prevalent in business intelligent and those description have been summarized in Table1. Table 1: Survey on open source business intelligence tools Tool Category Platform Description Ingres MonetDB PostgreSQL MySQL Clover.ETL ETL Pequel ETL Perl Talend ETL Kettle ETL Apatar ETL A commercially supported, open-source SQL relational database management system intended to support large commercial and government applications. Ingres Database is fully open source with a growing global community of contributors [8]. An open source column-oriented database management system developed at the Centrum Wiskunde & Informatica (CWI) in the Netherlands. It was designed to provide high performance on complex queries against large databases, e.g. combining tables with hundreds of columns and multi-million rows [9]. An object-relational database management system (ORDBMS) available for many platforms including Linux, FreeBSD, Solaris, Microsoft and Mac OS X. It is released under the PostgreSQL License, which is an MIT-style license, and is thus free and open source software [10]. The world's most used open source relational database management system (RDBMS) as of 2008 that runs as a server providing multi-user access to a number of databases [11]. Used for processing and transforming large volumes of data and is usually employed in cases of data integration, data warehousing, data quality and cleansing, and data migration [12]. Comprehensive & high performance data distribution & transformation system. Features a simple, user-friendly event driven scripting interface transparently generates & execs highly efficient Perl/C code [13]. An open source software vendor that provides data integration, data management, enterprise application integration and big data software and solutions [14]. Pentaho Data Integration Community Edition (PDI CE) also known as Kettle. Pentaho Data Integration delivers powerful Extraction, Transformation and Loading (ETL) capabilities using an innovative, metadata-driven approach. Pentaho Data Integration is increasingly the choice for organizations over traditional, proprietary ETL or data integration tools [15]. An open source ETL (Extract-Transform-Load) and data integration software application. Apatar s user and customer Amid Khatibi Bardsiri et.al. 165 www.irjcsea.org

Scriptella ETL Octopus ETL ETL Integrator Mondrian Palo JPalo Client and Client JPivot OpenI REX FreeAnalysis ETL servers servers Linux, and base ranges from small companies and individuals to large organizations such as the World Bank Group, Thomson Reuters, John Wiley & Sons, R.R. Donnelley, Autodesk, and more [16]. An open source ETL and script execution tool written in. Its primary focus is simplicity. It doesn't require the user to learn another complex XML-based language to use it, but allows the use of SQL or another scripting language suitable for the data source to perform required transformations [17]. A simple -based Extraction, Transformation, and Loading (ETL) tool. It can connect to any JDBC data source and perform transformations defined in an XML file, and supports Ant and JUnit for creating databases and tables and extracting and loading data during a build or test process [18]. A Data Integration tool which can be used to perform extract - transform - load to build data warehouse] or data migration. This tool provide connectivity to a vast range of heterogeneous and diversified data sources including non database sources [19]. An open source (online analytical processing) server, written in. It supports the MDX (multidimensional expressions) query language and the XML for Analysis and olap4j interface specifications. It reads from SQL and other data sources and aggregates data in a memory cache [20]. A memory resident multidimensional (online analytical processing () or multidimensional online analytical processing (M)) database server and typically used as a business intelligence tool for controlling and budgeting purposes with spreadsheet software acting as the user interface [21]. Tools to visualize and model data of a Palo or XMLA database. Their main purpose is Business Intelligence Reporting and Planning. With the API, programmers can easily model their specific needs [22]. A JSP custom tag library that renders an table and let users perform typical navigations like slice and dice, drill down and roll up. It uses Mondrian as its Server. JPivot also supports XMLA data-source access [23]. Developed by the company Loyalty Matrix from which commercial support was also available. The company has now been acquired by another company and its technology is being integrated into a closed source application. It is therefore not known if the OpenI project will continue [24]. A java client that provides easy-to-use GUI for browsing multidimensional data sources that support XMLA protocol. Rex also includes MDX editor and MDX Builder Tool [25]. A complete java (Eclipse RCP) and 2.0 (Dojo) application that provide functions against Pentaho Mondrian Server and other MDX/XMLA compliant cubes data-sources such as Microsoft Analysis or Hyperion [26]. JRubik is an client developed in /Swing and based on Amid Khatibi Bardsiri et.al. 166 www.irjcsea.org

JRubik JMagallanes Olap & Reports JPivot project components. The queries could be issued using MDX language [27]. An open source end user application for and Dynamic Reports written in /J2EE. Combines static reports, pivot table and charts. It reads from SQL and other data sources [28]. 3. Conclusions In Organizations discovering business knowledge is the process to achieve competitive edge over their competitors and this is supported by Decisions Support Systems or Business Intelligence current term in use nowadays. After several years in existence, Open Source Business Intelligence tools are mature from the functional and business model point of view to become a solid option to meet and exceed the business intelligence needs of an organization. The use of Business Intelligence (BI) tools is popular in industry. However, the use of open source tools for BI is still quite limited compared to other types of software. From our analysis, we conclude that the Open Source Business Intelligence tools are growing. 4. References [1] Alter, S. A work system view of DSS in its fourth decade, Decision Support Systems, Vol. 38, No. 3, pp. 319 327, 2004. [2] Lönnqvist, A. and Pirttimäki, V. The measurement of business intelligence, Information Systems Management, Vol. 23, No. 1, pp. 32 40, 2006. [3] Power, D.J. A Brief History of Decision Support Systems, http://dssresources.com/history/dsshistory.html, version 4.1, November 10, 2012. [4] Davenport, T. Competing on analytics, Harvard business review, pp. 12, 2007. [5] Solomon, N. Communications of the Association for Information Systems, Vol.13, pp. 177-195, 2004. [6] Hostmann, B. Business intelligence trends, with Gartner analyst, available at: http://searchdatamanagement.techtarget.com/, 2012. [7] Olszak, C. M., and Ziemba, E. Business intelligence systems in the holistic infrastructure development supporting decision-making in organizations. Interdisciplinary Journal of Information, Knowledge and Management, Vol. 1, pp. 47-58, 2006. [8] Enterprise Open Source Database Ingres. Retrieved from http://ingres.com/, 2012. [9] Query Processing at Light Speed. available at: http://monetdb.cwi.nl/, 2012. [10] PostgreSQL: The world s most advanced open source database. Retrieved from http://postgresql.org/, 2012. [11] MySQL : The world s most popular open source database. available at: http://mysql.com/, 2012. [12] Clover.ETL open source data integration tool. Retrieved from http://cloveretl.org/, 2012. [13] Pequel ETL Data Transformation Engine. available at: http://sourceforge.net/projects/pequel/, 2012. [14] Talend first provider of open source data integration software. Retrieved from http://talend.com/, 2012. Amid Khatibi Bardsiri et.al. 167 www.irjcsea.org

[15] Pentaho Commercial Open Source Business Intelligence: Kettle Project. available at: http://kettle.pentaho.org/, 2012. [16] Apatar, Inc. is the developer of a commercial open source data integration product, Retrieved from http://apatar.com/, 2012. [17] Kupolov, F. Welcome to Scriptella ETL Project. available at: http://scriptella.javaforge.com/, 2012. [18] JDBC Data Transformations. Retrieved from http://www.enhydra.org/tech/octopus/, 2012. [19] JBIWiki: ETLSE. Retrieved from http://wiki.openesb. java.net/wiki.jsp?page=etlse, 2012. [20] Pentaho Analysis Services: Mondrian Project. available at: http://mondrian.pentaho.org/, 2012. [21] Palo Open Source for Excel Multidimensional Database for Budgeting, Forecasting, Planning, Reporting, M, Analysis Software Retrieved from http://www.jedox.com/en/enterprise spreadsheet server/excel olapserver/ palo server.html, 2012. [22] JPalo Palo World Business Intelligence Reporting. available at: http://www.jpalo.com/en/products/start_products.html, 2012. [23] JPivot Home. Retrieved from http://jpivot.sourceforge.net/, 2012. [24] Openi.org Open Source Application for Reporting. available at: http://openi.sourceforge.net/, 2012. [25] SourceForge.net: REX warehouse Explorer. Retrieved from http://sourceforge.net/projects/whex/, 2012. [26] BPM Conseil, available at: http://freeanalysis.org/, 2012. [27] Introduction to JRubik, Retrieved from http://rubik.sourceforge.net/jrubik/intro.html, 2012. [28] JMagallanes, available at: http://jmagallanes.sourceforge.net/en/, 2012. Amid Khatibi Bardsiri et.al. 168 www.irjcsea.org