1 Big Data Solutions Portal Development with MongoDB and Liferay Solutions
2 Introduction Companies have made huge investments in Business Intelligence and analytics to better understand their clients and their business. Reporting infrastructure, query tools, dashboards and data visualization have all helped companies visualize and use their data to make better decisions. Now companies are hearing about Big Data. More than just a popular tag, Big Data represents a remarkable move in the data available to companies: Much more data, of different types, coming at a dramatic pace. Companies of all sizes and in all fields wonder if Big Data is a game changer for them as it is for others. To address the different issues, companies need robust solution to face these challenges, to take Big Data and use it to deliver more customer value, faster. They need a new approach and a new platform, one focused on using Big Data analytics to drive customer value. Know Key Drivers of Big Data In 2010, the world produced over 1 zettabyte of data; by 2014, we will produce approximately 7 zettabyte a year. Much of this data surge is caused by a significant increase in devices located at the periphery of the network including embedded sensors, tablet, and smartphones. Not just Volume 85% users using it for 3 TB or less No need to have large data to leverage the benefits of Big Data's modern architecture and solution Dimensions of Data Volume Data volume is the primary attribute of big data. Compared to non-traditional data, machine-generated data is generated in much larger amounts. Enterprises are awash with ever-growing data of all kinds, simply accumulating terabytes-even petabytes-of information. Velocity One factor that actually makes big data gigantic is the different sources. The daily addition of millions of connected devices (tablets and smartphones) will upsurge not only volume but also velocity. More and more data is produced each day that has to be collected in shorter time frame, as data becomes out-of-date very swiftly. Variety From various sources such as text, video, audios, log files, click streams, etc. both structured and unstructured data exhibit a dizzying rate of change. As new services are integrated, or new marketing campaigns carried out, new data types becomes essential to grab the resulting information.
3 Complexity Big Data is an opportunity to discover insights in fresh and growing types of content and data, to make your business more agile, and to resolve issues that were earlier considered challenging. It has become vital to reduce the complexity to improve accessibility of the data as per customer needs. MongoDB over RDBMS Typical, so-called relational, databases are made for a world in which data is scattered, and therefore can be and will be curated properly. It is a community in which the questions one would like to answer using the data must be clear at the start, to ensure that the database is created to answer them - and only them - successfully.
4 Limitations of RDBMS Its support is limited to Terabyte only. It does not support Petabyte and Manages only Structured Data; doesn't support Semi Structured and Unstructured Data RDBMS doesn't scale inherently - Scale up/scale out (Load Balancing & Replication) Hard to Shard/Partition Unable to manage large Data Files Both Read/Write not possible - With large set of data, it becomes quite difficult to bring data from the data layer into the analytics layer to perform computation Relational database theory is not ideal for geospatial applications and decent relational geospatial databases are expensive and normally tough to maintain. But with a flexible document database like MongoDB it simply becomes easier to iterate toward Big Data insights. Open-source technology like MongoDB can be easily applied to the Big Data problems. MongoDB is a scalable, high-performance, open-source, document-oriented database. MongoDB is an open-source document database, and the leading NoSQL database, written in C++. As soon as one remembers that Big Data is a question of both storage and processing, it makes sense that the top operational data store would be MongoDB, given its flexibility and scalability; it rejects the orthodox approach of relational databases. Speed, Scalability, and Performance to Handling Unstructured Data is the major reason why MongoDB has become a complimentary to RDBMS. Furthermore, it's much simpler to code against MongoDB compared to relational databases for developers.
6 NoSQL The Right Solution With an increasing amount of data, it has become inevitable to use the next generation database i.e. non-relational, distributed, open-source and horizontally scalable. NoSQL, often termed as 'not only sql', is a whole new way of thinking about a database. It is not a relational database and it does not provide full ACID (atomicity, consistency, isolation, durability) guarantees, but yet has a distributed and error tolerant architecture; and therefore, it is the best solution for all situations. Following characteristics relates to NoSQL: Chema-free Easy replication support Simple API Eventually consistent / BASE (not ACID) Gives high query performance Supports a huge amount of data and more Most NoSQL databases are open source
7 NoSQL Data Models 4 broad Data models 120+ variants available in the market Concept of Big Data Portal 90% of the data in this modern time has been created in the past two years alone and every day, we create 2.5 quintillion bytes of data that comes from everywhere: posts to social media sites, sensors used to collect weather information, digital images and videos, purchase transaction information, and mobile phone GPS signals to name a few. In such scenario, companies need a web based solution that integrates the robust presentation features of a portal such as rich user interfaces, collaboration, and secure access, with centralized & enormously scalable data storage as the back end, composing of different types of content (Images, Audio, Video, Documents, Metadata) in huge amount. Organizations are leveraging Big Data for its sophisticated architecture solution. Concept of Big Data Portal has been defined to harness the potential of Liferay & MongoDB.
8 Liferay Portal is a free and open source enterprise portal written in Java that allows users to set up features common to websites. Even though Liferay provides an advanced programming interface for developers, no programming expertise are required for basic website installation and management. MongoDB is a highly scalable, NoSQL and agile document-oriented database based on JSON-like document storage with dynamic schemas. The integration of Liferay and MongoDB result in a powerful, cost effective solution: i.e. "BIG DATA PORTAL" Big Data portal caters to the ever growing need to manage the huge amount of information available.
9 Benefits: a) How does MongoDB enhance Liferay? b) How does Liferay enhance MongoDB? How does MongoDB enhance Liferay? Scalability Leverages Auto sharding & replica set features Elasticity in scaling storage - go up or down Cost Effectiveness Commodity Hardware Eliminates need for high-end storage systems Agility & Performance Faster Development Easier Deployment Flexible & Schema less
10 Large Object Storage & Centralized Data Management GridFS enables large binary objects like images, Video or Audio Simplifies Management of data Single system to manage structured & unstructured data How does Liferay enhance MongoDB? Rich Front End Powerful websites consisting of Gadgets & Portlets - Portions of a Web page that may be Complete application Pages & Themes - Common, Consistent look & feel across multiple pages Navigation - Menu bar, Tabs, Links Secure Views to data Role based Site based Login status based Mobile Integration Data access on the go Different Themes for Mobile - HTML5, CSS3 Flexible Architecture and Lean Platform Use of Open standards, Web services and integration tools SOA Other Potential Enhancements Dynamic Data Lists End Users define custom entities to store into database, forms for capture and display Project Status Surveys/Polls Arbitrary, exensible, typed data entry CMS, Document Library Custom Content Types Huge volumes of unstructured, user-generated content
11 Solutions and features of web portal Portal provides secure user interface to manage all sorts of content such as texts, video and images. The portal provides administrators to configure fine grained access control to the site. The portal also provides a scalable & flexible data storage to scale for ever growing variety of content. Technical Architecture Portal provides secure user interface to manage all sorts of content such as texts, video and images. The portal provides administrators to configure fine grained access control to the site. The portal also provides a scalable & flexible data storage to scale for ever growing variety of content.
12 NoSQL is not always a replacement of RDBMS. Its rather complimentary to RDBMS. So it's important to understand your data first before you choose to implementating NoSQL. NoSQL is perfect when you need CMS, Document Library Handle unstructured data Schema-less persistence High performance Replication Low cost Conclusion MongoDB enables Portals for scalability (for huge volumes of data) and flexibility (schema-less content). Liferay's rich user interface, security, content management, social and mobile features complement MongoDB's powerful storage features. Big Data Portal with Liferay and MongoDB provide higher ROI to enterprises and lower TCO. About Softweb Solutions Software Solutions is a leading Software Development Company that serves clients worldwide of all sizes. We have a strong performance record in offering quality software development services demonstrated by ISO 9001:2008 certification and Microsoft Gold Certified partnership. We provide cutting edge Mobile App Development solutions for various platforms like Apple ios, Android OS, Blackberry and Windows Phone. We attempt to deal with constantly evolving business needs by incorporating all our strengths in our solutions. To know more about us and how we deliver mobility solutions to the organizations across the world. Contact our representative or us at You can also visit our corporate website
MASARYK UNIVERSITY FACULTY OF INFORMATICS Best Practices in Scalable Web Development MASTER THESIS Martin Novák May, 2014 Brno, Czech Republic Declaration Hereby I declare that this paper is my original
David Chappell October 2012 WINDOWS AZURE DATA MANAGEMENT CHOOSING THE RIGHT TECHNOLOGY Sponsored by Microsoft Corporation Copyright 2012 Chappell & Associates Contents Windows Azure Data Management: A
BUILDING AND OPTIMIZING MULTI-CHANNEL WEB EXPERIENCES PUBLISHED MAY 2013 WHITE PAPER Leading enterprises strive to achieve higher levels of customer engagement through online channels, and this means they
David Chappell Understanding NoSQL on Microsoft Azure Sponsored by Microsoft Corporation Copyright 2014 Chappell & Associates Contents Data on Azure: The Big Picture... 3 Relational Technology: A Quick
Convergence of Social, Mobile and Cloud: 7 Steps to Ensure Success June, 2013 Contents Executive Overview...4 Business Innovation & Transformation...5 Roadmap for Social, Mobile and Cloud Solutions...7
An Oracle White Paper June 2013 Oracle: Big Data for the Enterprise Executive Summary... 2 Introduction... 3 Defining Big Data... 3 The Importance of Big Data... 4 Building a Big Data Platform... 5 Infrastructure
SPECIAL REPORT W I N T E R C O R P O R A T I O N T h e L a r g e S c a l e Big Data What Does It Really Cost? D a t a M a n a g e m e n t Expe r t s W I N T E R C O R P O R A T I O N Big Data What Does
W I N T E R C O R P O R A T I O N Executive Report BIG DATA: BUSINESS OPPORTUNITIES, REQUIREMENTS AND ORACLE S APPROACH RICHARD WINTER December 2011 SUMMARY NEW SOURCES OF DATA and distinctive types of
A Fresh Graduate s Guide to Software Development Tools and Technologies Chapter 1 Cloud Computing CHAPTER AUTHORS Wong Tsz Lai Hoang Trancong Steven Goh PREVIOUS CONTRIBUTORS: Boa Ho Man; Goh Hao Yu Gerald;
OPEN DATA CENTER ALLIANCE : sm Big Data Consumer Guide SM Table of Contents Legal Notice...3 Executive Summary...4 Introduction...5 Objective...5 Big Data 101...5 Defining Big Data...5 Big Data Evolution...7
Trends in Cloud Computing and Big Data Nikita Bhagat, Ginni Bansal, Dr.Bikrampal Kaur firstname.lastname@example.org, email@example.com, firstname.lastname@example.org Abstract - BIG data refers to the
INTELLIGENT BUSINESS STRATEGIES W H I T E P A P E R Architecting A Big Data Platform for Analytics By Mike Ferguson Intelligent Business Strategies October 2012 Prepared for: Table of Contents Introduction...
3 Big Data: Challenges and Opportunities Roberto V. Zicari Contents Introduction... 104 The Story as it is Told from the Business Perspective... 104 The Story as it is Told from the Technology Perspective...
Three steps to put Predictive Analytics to Work The most powerful examples of analytic success use Decision Management to deploy analytic insight in day to day operations helping organizations make more
How to embrace Big Data A methodology to look at the new technology Contents 2 Big Data in a nutshell 3 Big data in Italy 3 Data volume is not an issue 4 Italian firms embrace Big Data 4 Big Data strategies
Plug Into The Cloud with Oracle Database 12c ORACLE WHITE PAPER DECEMBER 2014 Disclaimer The following is intended to outline our general product direction. It is intended for information purposes only,
Big Data: Beyond the Hype Why Big Data Matters to You White Paper BY DATASTAX CORPORATION October 2013 Table of Contents Abstract 3 Introduction 3 Big Data and You 5 Big Data Is More Prevalent Than You
Expert Reference Series of White Papers Big Data and Apache Hadoop Adoption: Key Challenges and Rewards 1-800-COURSES www.globalknowledge.com Big Data and Apache Hadoop Adoption: Key Challenges and Rewards
32 Big Data: present and future Big Data: present and future Mircea Răducu TRIFU, Mihaela Laura IVAN University of Economic Studies, Bucharest, Romania email@example.com, firstname.lastname@example.org
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 4, Number 1 (2014), pp. 33-40 International Research Publications House http://www. irphouse.com /ijict.htm Big Data
TABLE OF CONTENTS Introduction... 3 The Importance of Triplestores... 4 Why Triplestores... 5 The Top 8 Things You Should Know When Considering a Triplestore... 9 Inferencing... 9 Integration with Text
Introduction to Cloud Computing architecture White Paper 1st Edition, June 2009 Abstract Cloud computing promises to increase the velocity with which applications are deployed, increase innovation, and
QLIKVIEW AND BIG DATA: HAVE IT YOUR WAY A QlikView White Paper November 2012 qlikview.com Table of Contents Executive Summary 3 Introduction 3 The Two Sides of Big Data Analytics 3 How Big Data Flows from
For Big Data Analytics There s No Such Thing as Too Big The Compelling Economics and Technology of Big Data Computing March 2012 By: 4syth.com Emerging big data thought leaders Forsyth Communications 2012.