Cloud Search Based Applications for Big Data - Challenges and Methodologies for Acceleration




George Suciu, Ana Maria Sticlan, Cristina Butca, Alexandru Vulpe, Alexandru Stancu and Simona Halunga
R&D Department, Beia Consult International
University POLITEHNICA of Bucharest
Workshop on Adaptive Resource Management and Scheduling for Cloud Computing, ARMS-CC 2015
Donostia-San Sebastián, Spain, July 20th, 2015
Held in conjunction with ACM Symposium on Principles of Distributed Computing, PODC 2015 (July 21-23, 2015)

Content
- Short Biography
- Introduction
- Business and Virtual Accelerators
- Challenges for Acceleration
- Methodologies and Techniques for Acceleration
- Conclusions and Discussions

Short Biography (1)
- Graduated from the Faculty of Electronics, Telecommunications and Information Technology at the University Politehnica of Bucharest (UPB), Romania (www.upb.ro)
- MBA in Informatics Project Management from the Faculty of Cybernetics, Statistics and Economic Informatics of the Academy of Economic Studies, Bucharest (www.ase.ro)
- Ph.D. Student / Researcher at Aalborg University
- Currently Ph.D. Eng., post-doc researcher focused on big data, cloud communications, open source, IPR and IoT/M2M
- Since 2008, IT&C Solutions Manager in the R&D Department of BEIA Consult International, a research-performing SME (www.beiaro.eu); employed there since 1998

Short Biography (2)
Projects: www.beiaro.eu / www.mobcomm.pub.ro
FP7 (2 on-going)
- REDICT: Regional Economic Development by ICT
- eWALL: Electronic Wall for Active Long Living
- NMSDMON: Network Management System Development and Monitoring
- FAIR: Friendly Application for Interactive Receiver
- Cloud Consulting: Cloud-based Automation of ERP and CRM software for Small Businesses
- ACCELERATE: A Platform for the Acceleration of go-to market in the ICT industry
H2020 (1 on-going, 2 accepted, 3 under review)
- SWITCH: Software Workbench for Interactive, Time Critical and Highly self-adaptive Cloud applications (ICT-9)
- Power2SME: Cloud Platform for intelligent energy use by SMEs
- Speech2Platform (S2P): Smart, natural language semantic analyser platform to process oriented back-ends

Short Biography (3)
National (more than 10 past projects, 5 on-going)
- MobiWay: Mobility Beyond Individualism: an Integrated Platform for Intelligent Transportation Systems of Tomorrow
- EV-BAT: Redox battery with fast charging capacity as a main source of energy for electric vehicles
- CarbaDetect: Immuno-biosensors for fast detection of carbamic pesticide residues (carbaryl, carbendazim) in horticultural products
- SARAT-IWSN: Scalable Radio Transceiver for Instrumental Wireless Sensor Networks
- COMM-CENTER: Development of a cloud communication center by integrating a call/contact center platform with unified communication technology, a CRM system, and text-to-speech and automatic speech recognition solutions in different languages (including Romanian)

Introduction
- The flow of innovation keeps getting faster, not only for technology but also for services and business models. Speed is becoming the key to market access.
- In today's hypercompetitive business environment, companies not only have to find and analyze the relevant data they need, they must do it quickly.
- There are several solutions for high-performance searches in a Big Data (BD) system:
  - Cloudera Enterprise, with Apache Hadoop at the core, is unified into one integrated system, bringing different users and application workloads to one pool of data on a common infrastructure.
  - MarkLogic is a solution whose search and query capability makes it easier to find better answers in BD.
  - Fusion is a solution that aims to make BD searches as simple as googling. It implements and extends the open source Apache Solr search framework.

Business and Virtual Accelerators (1)
Business Accelerators
- A business accelerator's main goal is to produce successful firms that leave the program financially viable and freestanding in the go-to-market process.
- Critical to the definition of an accelerator is the provision of management guidance, technical assistance and consulting tailored to young, growing companies.
- Accelerators vary widely in size and service offering. They may provide clients with, for example, access to appropriate rental space and flexible leases, shared basic business services and equipment, technology support services, and assistance in obtaining the financing necessary for company growth.

Business and Virtual Accelerators (2)
Virtual Accelerators
- Virtual acceleration is defined as the delivery of services solely through electronic means, eliminating the requirement for geographical proximity of the clients.
- Examples of virtual accelerators include the New Hampshire Virtual Business Incubator and, in Europe, the FIWARE platform.
- Several large ICT companies have set up proprietary venturing programs. These are often custom-made for the company, so details about how they deal with Validated Learning are not always publicly available.
- Except for this last category (corporate venturing programs), these initiatives are mainly focused on start-ups at a very early stage. They are not adapted to offering support to a large industry.
- These initiatives are often venture-driven and require equity; they are now emerging in Europe.

Challenges for Acceleration (1)
ICT companies have faced many kinds of challenges in bringing their products or services to market. We identify three major development waves, each bringing a very different breed of problems to ICT companies:
- Solving the engineering headache: Delivering software products on time, within budget and with acceptable quality has always been a challenge for software companies.
- Solving the innovation headache: As software and ICT became more and more an instrument for innovation
- Accelerating and monetizing innovations: ICT companies have realized that innovation is much more than technology itself, and they struggle to translate the scientific advances they make into marketable innovations.

Challenges for Acceleration (2)
The current state of the art can be classified into three categories:
- Tool support for following up the progress of validated learning
- Technological support / methodologies for defining and validating features of Minimal Viable Products: In addition to face-to-face interviews, some companies use prototypes to test specific assumptions
- Technological support for observing users and collecting metrics: Validated Learning can be achieved by studying the usage data collected from customers that use the applications
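The third category, learning from collected usage data, can be illustrated with a minimal sketch. The event format, the `feature_adoption` helper and the sample data below are all hypothetical and are not taken from the tools discussed in the talk; the sketch only shows one simple metric, the share of active users who used a given feature.

```python
from collections import defaultdict


def feature_adoption(events, feature):
    """Share of active users who triggered `feature` at least once."""
    actions_by_user = defaultdict(set)
    for user_id, action in events:
        actions_by_user[user_id].add(action)
    if not actions_by_user:
        return 0.0
    adopters = sum(1 for actions in actions_by_user.values() if feature in actions)
    return adopters / len(actions_by_user)


# Hypothetical usage log: (user id, action) pairs.
events = [
    ("u1", "login"), ("u1", "export_report"),
    ("u2", "login"),
    ("u3", "login"), ("u3", "export_report"),
    ("u4", "login"),
]

rate = feature_adoption(events, "export_report")
print(rate)  # 0.5: two of the four active users exported a report
```

A metric like this could be tracked per release to check whether an assumption about a Minimal Viable Product feature holds.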

Methodologies and Techniques for Acceleration (1)
- Multimedia content is the fastest growing type of user-generated content, with millions of photos, audio files and videos uploaded to the Web and enterprise servers daily.
- Recently, technologies like automatic speech-to-text transcription and object recognition have made it possible to structure this content from the inside out.
- A search system is therefore a Data Management System like its NoSQL and NewSQL counterparts, and it achieves massive scalability in much the same way, i.e. through distributed architectures, parallel processing, column-oriented data models etc.
- A Search-Based Application (SBA) based on EXALEAD CloudView combines web-scale semantic technologies, rapid drag-and-drop application development and hybrid quantitative/qualitative analytics to deliver a consumer-style information experience to mission-critical business processes.
- Acceleration techniques can be applied to meet demands for real-time, in-context, accurately delivered information that is accessible from diverse web and enterprise BD sources, yet delivered faster and at lower cost than with traditional application architectures.
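To make the point about distributed architectures and parallel processing concrete, here is a minimal sketch of a sharded inverted index queried in scatter-gather fashion. It is an illustration under simplifying assumptions (in-process shards, whitespace tokenization, AND-only queries); the class names `Shard` and `ShardedIndex` are hypothetical and do not represent how CloudView or any other product mentioned here is implemented.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


class Shard:
    """One partition of the index: maps each term to the ids of documents containing it."""

    def __init__(self):
        self.postings = defaultdict(set)

    def add(self, doc_id, text):
        for term in text.lower().split():
            self.postings[term].add(doc_id)

    def search(self, query):
        # AND semantics: a document must contain every query term.
        terms = query.lower().split()
        if not terms:
            return set()
        result = set(self.postings.get(terms[0], set()))
        for term in terms[1:]:
            result &= self.postings.get(term, set())
        return result


class ShardedIndex:
    """Routes documents to shards by id and fans queries out to all shards."""

    def __init__(self, num_shards=3):
        self.shards = [Shard() for _ in range(num_shards)]

    def add(self, doc_id, text):
        self.shards[doc_id % len(self.shards)].add(doc_id, text)

    def search(self, query):
        # Scatter: query every shard in parallel. Gather: merge the partial results.
        with ThreadPoolExecutor(max_workers=len(self.shards)) as pool:
            partials = list(pool.map(lambda shard: shard.search(query), self.shards))
        return sorted(set().union(*partials))


index = ShardedIndex(num_shards=3)
index.add(1, "big data search in the cloud")
index.add(2, "cloud search platform")
index.add(3, "sensor data for precision farming")

print(index.search("cloud search"))  # [1, 2]
```

In a real deployment each shard would live on a separate node, so adding nodes increases both index capacity and query throughput; the threads here only stand in for that parallelism.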

Methodologies and Techniques for Acceleration (2)
The SBA platform is composed of four core components administered via a Management and Monitoring console, as presented in Fig. 2.

Methodologies and Techniques for Acceleration (3)
- BD raises difficult issues of information system cost, scaling and performance, as well as data security, privacy and ownership.
- Furthermore, the advantages and challenges of public cloud versus private cloud need to be considered.
- BD also carries the potential for breakthrough insights and innovation in business, science, medicine and government, by bringing machines and data together to reveal the natural information intelligence locked inside mountains of BD, as depicted in Fig. 3.

Methodologies and Techniques for Acceleration (4)
We analyzed CloudView's usability, agility and performance for search and search-based applications (SBAs) in BD environments using the following criteria:
- Performance: CloudView is a search platform that offers wide access to information at the infrastructure level and is used for SBAs at both online and enterprise level.
- Connectivity: CloudView uses an advanced Web crawler to gather multi-format data from virtually any source, including intelligent extraction of complex structured data and associated rich metadata.
- Analytics: The platform supports query-time computation of complex numerical, geophysical and virtual aggregates and clusters, and supports dynamic 2D faceting for creating advanced pivot-style tables.
- Business application development: The SBA platform provides a drag-and-drop development framework.
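The Analytics criterion can be illustrated with a small sketch of query-time aggregation and 2D faceting over a set of search hits. The functions `facet_2d` and `sum_by`, the field names and the sample hits are hypothetical; CloudView's actual interface is not described in this talk, so this only shows the general idea of building a pivot-style table from results at query time.

```python
from collections import Counter


def facet_2d(hits, row_field, col_field):
    """Count matching documents for every (row value, column value) pair."""
    counts = Counter((hit[row_field], hit[col_field]) for hit in hits)
    rows = sorted({r for r, _ in counts})
    cols = sorted({c for _, c in counts})
    # Fill missing combinations with 0 so the result is a complete pivot table.
    return {r: {c: counts.get((r, c), 0) for c in cols} for r in rows}


def sum_by(hits, group_field, value_field):
    """Numerical aggregate: total of `value_field` per distinct `group_field` value."""
    totals = Counter()
    for hit in hits:
        totals[hit[group_field]] += hit[value_field]
    return dict(totals)


# Hypothetical search hits; field names are illustrative only.
hits = [
    {"source": "web", "type": "video", "size_mb": 120},
    {"source": "web", "type": "photo", "size_mb": 4},
    {"source": "enterprise", "type": "video", "size_mb": 300},
    {"source": "web", "type": "video", "size_mb": 80},
]

pivot = facet_2d(hits, "source", "type")
totals = sum_by(hits, "source", "size_mb")

print(pivot)   # {'enterprise': {'photo': 0, 'video': 1}, 'web': {'photo': 1, 'video': 2}}
print(totals)  # {'web': 204, 'enterprise': 300}
```

Because both functions run over the hits of a single query, the table reflects whatever filter the user applied, which is what distinguishes query-time faceting from a precomputed report.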

Conclusions
- We analyzed the challenges and methodologies for acceleration of Search Based Applications for Big Data and proposed a practical implementation using the EXALEAD CloudView platform.
- Furthermore, we presented a general overview of the techniques and interfaces of SBAs for Big Data and described how EXALEAD CloudView can be applied to accelerate SBAs using heterogeneous BD sources, thus revealing accurate information assets in time-critical applications.
- As future work, we plan to develop the proposed solution for analyzing environmental conditions from sensor data, which can be used in agriculture for precision farming.

"The future is already here. It is just not evenly distributed." William Gibson (speculative fiction novelist)
Any questions?
george@beia.ro
Beia Consult International
www.beiaro.eu
www.facebook.com/beiaconsult
