A Review on the Role of Big Data in Business
Available Online at International Journal of Computer Science and Mobile Computing
A Monthly Journal of Computer Science and Information Technology
ISSN X
IJCSMC, Vol. 3, Issue. 4, April 2014, pg

REVIEW ARTICLE

Jafar Raza Alam 1, Asma Sajid 2, Ramzan Talib 3, Muneeb Niaz 4
1,2,3,4 College of Computer Science and Information Studies, Government College University, Faisalabad, Pakistan

Abstract
Big data is a game changer. Successful organizations are gaining business advantages by analyzing big data. It has received significant attention in recent years, but several challenges are among the major causes holding back the growth of organizations. The main reason organizations have not begun the planning stage of a big data strategy is that they do not know enough about big data and do not understand its benefits. In this study, an attempt is made to review the role of big data in business.

Keywords: Big data; Cloud computing; Data Mining

I. INTRODUCTION

The aim of big data is to make better use of resources and storage, reduce computation time, and support good business decision making. The term big data refers to data in such huge volume that it cannot be processed by conventional database systems. Its basic characteristics are the 3 V's: volume, velocity and variety. Volume means data in large quantities. Velocity refers to the speed at which data arrives, as in social media streams, where data grows continuously through, for example, posts on Facebook and tweets on Twitter. Variety means data in many formats: structured, unstructured and semi-structured. Every day the world creates 2.5 quintillion bytes of data; 90% of the data in the world today has been created in the last two years alone [1]. Data is growing at an exponential rate, and data analytics experts do not yet have enough knowledge to analyze such an enormous amount of data.
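To make the growth claim concrete: if 90% of all data was created in the last two years, the total is ten times what it was two years earlier. The sketch below works out the annual growth factor this implies. It assumes smooth exponential growth, which [1] does not state, and the function name and parameters are illustrative only.

```python
import math

def implied_annual_growth(recent_share: float, window_years: float) -> float:
    """Annual growth factor implied by `recent_share` of all data
    having been created within the last `window_years` years.

    Assumes smooth exponential growth, which is a simplification.
    """
    if not 0.0 <= recent_share < 1.0:
        raise ValueError("recent_share must be in [0, 1)")
    if window_years <= 0:
        raise ValueError("window_years must be positive")
    # If a fraction s is new, the old total was (1 - s) of today's total,
    # so the total grew by a factor of 1 / (1 - s) over the window.
    growth_over_window = 1.0 / (1.0 - recent_share)
    return growth_over_window ** (1.0 / window_years)

# Figures from the text: 90% of data created in the last two years.
factor = implied_annual_growth(0.90, 2)
doubling_months = 12 * math.log(2) / math.log(factor)
print(f"annual growth factor: {factor:.2f}")
print(f"doubling time: {doubling_months:.1f} months")
```

Under that assumption the data volume roughly triples each year (a factor of about 3.16) and doubles in a little over seven months.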
Big data presents three main aspects of interest to any organization. First, the data lacks arrangement. Second, it produces new opportunities. Third, the technologies used for big data are low cost.

2014, IJCSMC All Rights Reserved
This review is an attempt to investigate the role of big data in business. The main aim of this research is to discuss the impact of using data mining techniques and big data frameworks as the optimal strategy for achieving accurate, timely and reliable business decision making. The key challenges that big data currently faces are also discussed.

II. MATERIALS AND METHODS

This paper is based on randomly selected articles in the field of big data. Twenty-five articles were identified for the study and read completely for the review. Calculations included in these articles were carefully rerun to verify their accuracy. This review poses one question:

Question: What is the role of big data in business?

The data required to answer this question is extracted from the papers.

III. BIG DATA FOR BUSINESS

Organizations are grappling with what big data is, how it affects them and how it benefits them. A survey found that only 12 percent of organizations are implementing or executing a big data strategy, and 71 percent of organizations are still to begin the planning stage [2]. Organizations clearly need good knowledge of customers, goods and rules; with the help of big data they can find new ways to compete with other organizations. Organizations around the world are using big data for their future decisions. The types of decisions that organizations can make from big data are smarter decisions, future decisions and decisions that make a difference [3]. Organizations make business decisions on the basis of past and present transactional data, but there is another kind of data, nontraditional and less structured (for example weblogs, social media and photographs), that can also be used for effective business decision making.
Oracle offers products to acquire and organize these data types and analyze them to find new insights. Oracle's big data solution has four steps: acquire big data, organize big data, analyze big data, and decide on the basis of these analyses [1]. Three models are also described for extracting value from big data: ETL (Extract, Transform, and Load), interactive queries, and predictive analytics. Intel is taking advantage of big data, which has helped it speed up its innovation process [4]. Organizations built around big data from the start include Google, eBay, LinkedIn and Facebook. These organizations did not need to integrate big data with their existing sources of data [5]. A process (Figure 1) has been described for organizations that are interested in adopting big data [6].

Figure 1: Research Framework of Technology Strategy Planning [6]
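The first of the three models named in this section, ETL (Extract, Transform, and Load), can be illustrated with a minimal sketch. It is not Oracle's product or any method from [1]; it uses only the Python standard library, and the CSV content, column names and table name are hypothetical. The closing query hints at the second model, interactive queries over the loaded data.

```python
import csv
import io
import sqlite3

# Hypothetical raw export; column names and values are made up for illustration.
RAW_CSV = """customer,amount,country
alice, 120.50 ,pk
bob,not-a-number,us
carol,75,PK
"""

def extract(text):
    """Extract: read rows from a CSV source as dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: trim whitespace, upper-case country codes, drop bad amounts."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"].strip())
        except ValueError:
            continue  # skip rows whose amount cannot be parsed
        clean.append((row["customer"].strip(), amount, row["country"].strip().upper()))
    return clean

def load(records, conn):
    """Load: write the cleaned records into a relational table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (customer TEXT, amount REAL, country TEXT)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", records)
    conn.commit()

conn = sqlite3.connect(":memory:")
load(transform(extract(RAW_CSV)), conn)

# An interactive query over the loaded data: total sales per country.
totals = conn.execute(
    "SELECT country, SUM(amount) FROM sales GROUP BY country ORDER BY country"
).fetchall()
print(totals)
```

Keeping the three stages as separate functions means each one can be replaced independently, for example swapping the in-memory SQLite database for a data warehouse connection.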
The steps of the process in Figure 1 are as follows:
- Decision criteria depend on the decision factors, which are social, technological and economic factors.
- Candidate scenarios: organizations can select among different scenarios for big data, e.g. big demand and cautiously optimistic.
- Candidate technologies are data warehouse, cloud analytics, embedded analytics and big data visualization.
- Technology assessment indicators are global market size, enterprise adoption ratio, entrance barrier and strength of industry.
- Technology planning implications: there are two types, one for the big demand scenario and one for the cautiously optimistic scenario.
By adopting this strategy, organizations can reap the benefits of big data.

IV. BIG DATA GOALS

Big data helps to achieve various goals [5], which are as follows.

4.1 Cost Reduction
Hadoop is a framework for storing huge amounts of data on distributed clusters. In a Hadoop cluster, the cost of storing one terabyte for one year is $2,000. That is reported to be 800 times less than with traditional relational databases.

4.2 Time Reduction
Macy's merchandise pricing optimization application processes data sets in seconds or minutes, calculations that previously took hours.

4.3 Support in Internal Business Decisions
The main idea of big data is to assist in internal company decisions such as: What kinds of new products should be offered to customers? How much stock should be held? What should the price of an item be?

4.4 Developing New Big Data-Based Offerings
Big data can be used to create new products and offerings. LinkedIn is the leading example; it has used big data to develop products and offerings including Jobs You May Be Interested In, Who's Viewed My Profile, People You May Know, and numerous others. These features have drawn people to LinkedIn.

V. DATA MINING WITH BIG DATA

In a production environment, the big data mining process never ends.
A good big data analytics platform offers speed of development, robustness and the ability to analyze huge amounts of data easily. Data has grown in both size and number of users over the past few years. In 2010 only 4 data analysts were working at Twitter, but now thousands of employees work with the Hadoop clusters in Twitter's data centers. Twitter increased its analytics capacity ahead of need; an organization that does not do so before the capacity is needed will find these issues turning into nightmares [7]. Extracting information from stream data in real time is a good way to learn what is happening at the moment. Stream data arrives at very high speed and is very difficult to analyze in real time; mining it requires algorithms that are both very efficient and accurate [8]. Online
news, social media and microblogs are examples of streams created by users. Existing solutions were not designed to deal with these streams. SAMOA is a platform for mining these streams, a tool for online mining in a cloud environment. SAMOA can run on different distributed stream processing engines such as Storm. SAMOA is planned to be released as open source, which would advance research in big data stream mining. SAMOA also provides an API for algorithm developers [9]. A data-driven model named HACE has also been proposed; it aggregates multiple sources of information and analyzes data from the data mining perspective [10]. Big data is helping Intel improve its business intelligence; 90 percent of its enterprise data was unstructured. Intel's aims were to make better decisions, increase business velocity, and discover and tap new markets. Intel wants to extend its business intelligence from descriptive analytics to predictive analytics by using data mining on big data [11].

VI. BIG DATA APPLICATIONS

Big data applications are being used in different industries. Fed BP Disaster Response was an application supporting the U.S. government's response to a disaster. It was built in 2010, when the oil flow rate was a key issue. BP and independent groups presented varying estimates, which hindered efforts to manage the level and range of the U.S. government's response. NIST analyzed the estimates and produced actionable intelligence on which to base the final response [12]. Applications in the oil and gas business include equipment maintenance to prevent failure, production optimization, price optimization, and safety and compliance. Oil and gas companies face strong competition and constant change. Firms need to increase their production volumes while also maintaining a healthy, safe and low-risk environment.
From exploration through production, leading oil companies are using big data to create new business value, reduce costs and increase production [13]. The White House website contains all the speeches of Barack Obama. The aim of one study was to find the influence of these speeches on the election. A scraper was built to collect all the speeches from the website, and Hadoop and MapReduce were used to process them in parallel. The study finds the most spoken or referenced words, the most referenced countries and personalities, and the focus on internal and foreign affairs. According to [14], all of these are the basis for the influence on the elections.

VII. BIG DATA FRAMEWORKS

Organizations process large amounts of data daily, which generates high network traffic. Researchers are trying to design data analytics systems that support complex analysis under high network traffic. CLAaaS stands for Cloud-based Analytics-as-a-Service. It is a conceptual architecture for big data analytics in the cloud environment, and its features are customization, collaboration and assistance. Data privacy can be achieved by implementing CLAaaS in a private cloud [15]. CamCube is a cluster design that uses a topology in which servers are connected directly to one another. Camdoop uses the servers' ability to process packets in the network to perform aggregation of data. In such workloads the output size is small compared to the input size. To exploit this, the authors of [16] decrease traffic instead of increasing bandwidth: Camdoop relies on the CamCube property that servers forward traffic in order to perform in-network aggregation of data [16]. Bigtable is used to store structured data at petabyte scale. The Bigtable data model is described in [17]. Bigtable stores the data of Google applications such as web indexing, Google Finance and Google Earth. These applications have different storage requirements.
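The data model described in [17] is, in essence, a sparse sorted map from (row key, column key, timestamp) to an uninterpreted value. The toy class below is a minimal in-memory sketch of that idea only; `TinyTable` and its methods are hypothetical names, not Bigtable's API, and the stored values are made up. The row key `com.cnn.www` and the column names follow the web-page example used in [17].

```python
from collections import defaultdict

class TinyTable:
    """In-memory toy of a (row, column, timestamp) -> value map.

    Illustrative only: no distribution, persistence or column-family
    management, all of which the real system provides.
    """

    def __init__(self):
        # row key -> column key -> {timestamp: value}
        self._rows = defaultdict(lambda: defaultdict(dict))

    def put(self, row, column, timestamp, value):
        """Store one version of a cell."""
        self._rows[row][column][timestamp] = value

    def get(self, row, column, timestamp=None):
        """Return the newest value at or before `timestamp` (newest overall if None)."""
        versions = self._rows.get(row, {}).get(column, {})
        candidates = [t for t in versions if timestamp is None or t <= timestamp]
        return versions[max(candidates)] if candidates else None

    def scan(self, start_row, end_row):
        """Yield row keys in lexicographic order within [start_row, end_row)."""
        for key in sorted(self._rows):
            if start_row <= key < end_row:
                yield key

table = TinyTable()
table.put("com.cnn.www", "contents:", 3, "<html>v1")
table.put("com.cnn.www", "contents:", 5, "<html>v2")
table.put("com.cnn.www", "anchor:cnnsi.com", 9, "CNN")
table.put("org.example", "contents:", 4, "<html>hello")

latest = table.get("com.cnn.www", "contents:")              # newest version
older = table.get("com.cnn.www", "contents:", timestamp=4)  # version as of time 4
com_rows = list(table.scan("com.", "com/"))                 # keys with prefix "com."
print(latest, older, com_rows)
```

Keeping rows in sorted order is what makes range scans over a key prefix cheap, and keeping several timestamped versions per cell lets each application choose how much history it needs, which is one way a single store can serve applications with different requirements.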
The storage, collection and use of data can also create new vulnerabilities and risks. After analyzing these risks, a framework has been proposed to support the effective use of data. The framework considers several domains: ethics, governance, science and technology. By using all these domains together, organizations can make more effective decisions and avoid failures in future projects [18].
Sampling is a method for fast and accurate analysis of large data sets; a sample is a subset that approximately represents the whole data set. In [19] the EARL framework was proposed, an accuracy inference method for speeding up big data analytics. It works by choosing an appropriate sample size. EARL allows mining algorithms to estimate their results and errors. The accuracy of the EARL system remains the same as sample sizes increase.

VIII. BIG DATA CHALLENGES

8.1 Security and Privacy
The Cloud Security Alliance Big Data Working Group identified the top security and privacy problems that need to be addressed to make big data computing and infrastructure more secure. Most of these issues relate to big data storage and computation. One of the challenges is secure data storage [20]. Various challenges related to data security and privacy are discussed in [21], including data breaches, data integrity, data availability and data backup.

8.2 Dynamic Provisioning
Infrastructure as a Service is a cloud computing service that provides computation resources on demand; many cloud companies are implementing this concept and making it easy for customers to access these services. Current frameworks do not support dynamic provisioning. One issue is that compute resources can be insufficient for a submitted job, as some processes may require more resources. Another issue concerns scheduling and protection algorithms; current algorithms do not consider these aspects [22].

8.3 Algorithms
Organizations were granting papers by capturing keywords from the abstracts and titles. Analyzing the science by hand was a difficult task. Later the work was done by program analysts, who used algorithms. These algorithms can vary from one another, and this variation can reduce the effectiveness and reliability of the final result.
Improvements in data management will result in better technology, but they will face many issues [23].

8.4 Misuse of Big Data
The challenges include the potential misuse of big data, because information is power. The types of data that people will produce in the future are unknown. To overcome these challenges we have to strengthen and increase our intent and capacity [24].

8.5 Data Management
Data management is also a critical issue for corporations and industries. Data warehouses have efficient data management techniques. In [25], two data warehouse management strategies are discussed: Immediate Incremental Management (IIM) and Deferred Incremental Management (DIM). IIM is favored because of its algorithmic implementation.

IX. BIG DATA FACTS AND FIGURES

In an organization everything depends on the decisions of the policymakers, and their decisions depend on data mining techniques, data mining algorithms and big data frameworks. By integrating data mining with big data frameworks we can achieve more accurate business decision making. To find true business value in big data, organizations require tools to analyze and organize different data types from different sources. If organizations have the power to analyze data of any size, any type
and from many sources, then the outcome is deeper and more reliable information about business trends, values and patterns. The data types used for business decisions [2] are shown in Figure 2.

Figure 2: Business Data Types for Decision Support

Figure 3 indicates the fraction of business data that is used for business decisions. In an organization, 76 percent of customer data and 70 percent of product data are required for decision support. Within customer data, 66 percent of business consumer data is used to make decisions; within product data, 62 percent of selling-side and 61 percent of buying-side data is used to make decisions [2].

Figure 3: Percentage of Business Decision Support Data

63 percent of organizations report that the use of big data is beneficial for them [3]. Analytic techniques go further than data mining: they determine not only what has happened but also predict what is going to happen, based on all the data.

X. CONCLUSION

Big data must be integrated into an organization's architecture, even if the organization has a large, well-established business. Countries around the world, IT companies and the relevant departments have started working on big data. Organizations built
around big data include Google, eBay, LinkedIn and Facebook. Large organizations are joining the data economy and combining big data analytics with traditional analytics. This will affect organizations' skills, leadership, structures and technologies. 63 percent of organizations report that the use of big data is beneficial for them. More than 70 percent of an organization's customer and product data is used for business decision making. The key challenges that appear are designing big data sampling and building prediction models from big data streams. Challenges also include the potential misuse of big data, because information is power. The types of data that people will produce in the future are unknown.

REFERENCES

[1] J. P. Dijcks, "Oracle: Big data for the enterprise," Oracle White Paper,
[2] Big Data Survey Research Brief, SAS White Paper, 2013.
[3] M. Schroeck, R. Shockley, J. Smart, D. Romero-Morales, and P. Tufano, "Analytics: the real-world use of big data: How innovative enterprises extract value from uncertain data, Executive Report," IBM Institute for Business Value and Said Business School at the University of Oxford,
[4] Turn Big Data into Big Value, A Practical Strategy, Intel White Paper, 2013.
[5] T. H. Davenport and J. Dyche, "Big Data in Big Companies," May 2013,
[6] W.-H. Weng and W.-T. Lin, "A Scenario Analysis Of Big Data Technology Portfolio Planning," in International Journal of Engineering Research and Technology,
[7] J. Lin and D. Ryaboy, "Scaling big data mining infrastructure: the twitter experience," ACM SIGKDD Explorations Newsletter, vol. 14, pp. 6-19,
[8] A. Bifet, "Mining Big Data in Real Time," Informatica, vol. 37,
[9] G. De Francisci Morales, "SAMOA: A platform for mining big data streams," in Proceedings of the 22nd international conference on World Wide Web companion, 2013, pp
[10] X. Wu, X. Zhu, G.-Q. Wu, and W.
Ding, "Data mining with big data," Knowledge and Data Engineering, IEEE Transactions on, vol. 26, pp ,
[11] Mining big data in the enterprise for better business intelligence, Intel White Paper, 2012.
[12] The Compelling Economics and Technology of Big Data Computing, For Big Data Analytics There's No Such Thing as Too Big, 4syth White Paper, 2012.
[13] A. Hems, A. Soofi, and E. Perez, "How innovative oil and gas companies are using big data to outmaneuver the competition,"
[14] A. Bensrhir, "Big data for geo-political analysis: Application on Barack Obama's remarks and speeches," in Computer Systems and Applications (AICCSA), 2013 ACS International Conference on, 2013, pp
[15] F. Zulkernine, P. Martin, Y. Zou, M. Bauer, F. Gwadry-Sridhar, and A. Aboulnaga, "Towards Cloud-Based Analytics-as-a-Service (CLAaaS) for Big Data Analytics in the Cloud," in Big Data (BigData Congress), 2013 IEEE International Congress on, 2013, pp
[16] P. Costa, A. Donnelly, A. Rowstron, and G. O'Shea, "Camdoop: Exploiting in-network aggregation for big data applications," in USENIX NSDI,
[17] F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, et al., "Bigtable: A distributed storage system for structured data," ACM Transactions on Computer Systems (TOCS), vol. 26, p. 4,
[18] K. Crawford, G. Faleiros, A. Luers, P. Meier, C. Perlich, and J. Thorp, Big Data, Communities and Ethical Resilience: A Framework for Action, White Paper, 2013.
[19] N. Laptev, K. Zeng, and C. Zaniolo, "Very fast estimation for result and accuracy of big data analytics: The EARL system," in Data Engineering (ICDE), 2013 IEEE 29th International Conference on, 2013, pp
[20] Top ten big data security and privacy challenges, Cloud Security Alliance White Paper, 2012.
[21] A. A. Soofi, M. I. Khan, R. Talib, and U. Sarwar, "Security Issues in SaaS Delivery Model of Cloud Computing,"
[22] M. N. Vijayaraj, M. D. Rajalakshmi, and M. C. Sanoj, "Issues and challenges of scheduling and protection algorithms for proficient parallel data processing in cloud."
[23] G. Halevi and H. Moed, "The evolution of big data as a research and scientific topic: overview of the literature," Research Trends, Special Issue on Big Data, vol. 30, pp. 3-6,
[24] U. G. Pulse, "Big Data for development: challenges & opportunities," Naciones Unidas, Nueva York, mayo,
[25] U. Rasheed, M. U. Sarwar, and R. Talib, "A Review on Data Warehouse Management."
BIG DATA FUNDAMENTALS
BIG DATA FUNDAMENTALS Timeframe Minimum of 30 hours Use the concepts of volume, velocity, variety, veracity and value to define big data Learning outcomes Critically evaluate the need for big data management
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK A SURVEY ON BIG DATA ISSUES AMRINDER KAUR Assistant Professor, Department of Computer
Ashish R. Jagdale, Kavita V. Sonawane, Shamsuddin S. Khan
International Journal of Scientific & Engineering Research, Volume 5, Issue 7, July-2014 1156 Data Mining and Data Pre-processing for Big Data Ashish R. Jagdale, Kavita V. Sonawane, Shamsuddin S. Khan
International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May-2014 18 ISSN 2278-7763. BIG DATA: A New Technology
International Journal of Advancements in Research & Technology, Volume 3, Issue 5, May-2014 18 BIG DATA: A New Technology Farah DeebaHasan Student, M.Tech.(IT) Anshul Kumar Sharma Student, M.Tech.(IT)
Paolo Costa costa@imperial.ac.uk
joint work with Ant Rowstron, Austin Donnelly, and Greg O Shea (MSR Cambridge) Hussam Abu-Libdeh, Simon Schubert (Interns) Paolo Costa costa@imperial.ac.uk Paolo Costa CamCube - Rethinking the Data Center
Managing Cloud Server with Big Data for Small, Medium Enterprises: Issues and Challenges
Managing Cloud Server with Big Data for Small, Medium Enterprises: Issues and Challenges Prerita Gupta Research Scholar, DAV College, Chandigarh Dr. Harmunish Taneja Department of Computer Science and
Big Data: Study in Structured and Unstructured Data
Big Data: Study in Structured and Unstructured Data Motashim Rasool 1, Wasim Khan 2 mail2motashim@gmail.com, khanwasim051@gmail.com Abstract With the overlay of digital world, Information is available
Keywords Big Data, NoSQL, Relational Databases, Decision Making using Big Data, Hadoop
Volume 4, Issue 1, January 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Transitioning
Big Data and Hadoop with Components like Flume, Pig, Hive and Jaql
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 7, July 2014, pg.759
Data Refinery with Big Data Aspects
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data
Hexaware E-book on Predictive Analytics
Hexaware E-book on Predictive Analytics Business Intelligence & Analytics Actionable Intelligence Enabled Published on : Feb 7, 2012 Hexaware E-book on Predictive Analytics What is Data mining? Data mining,
BIG DATA CHALLENGES AND PERSPECTIVES
BIG DATA CHALLENGES AND PERSPECTIVES Meenakshi Sharma 1, Keshav Kishore 2 1 Student of Master of Technology, 2 Head of Department, Department of Computer Science and Engineering, A P Goyal Shimla University,
W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop
Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop Kanchan A. Khedikar Department of Computer Science & Engineering Walchand Institute of Technoloy, Solapur, Maharashtra,
Big Data. Fast Forward. Putting data to productive use
Big Data Putting data to productive use Fast Forward What is big data, and why should you care? Get familiar with big data terminology, technologies, and techniques. Getting started with big data to realize
Outline. What is Big data and where they come from? How we deal with Big data?
What is Big Data Outline What is Big data and where they come from? How we deal with Big data? Big Data Everywhere! As a human, we generate a lot of data during our everyday activity. When you buy something,
Indexed Terms: Big Data, benefits, characteristics, definition, problems, unstructured data
Managing Data through Big Data: A Review Harsimran Singh Anand Assistant Professor, PG Dept of Computer Science & IT, DAV College, Amritsar Email id: harsimran_anand@yahoo.com A B S T R A C T Big Data
Cloud Computing and Big Data That s Why! Ray Walshe 14 th March 2013
Cloud Computing and Big Data That s Why! Ray Walshe 14 th March 2013 Data Centre Growth We need more data centres??? (c) Ray Walshe 2013 2 By 2015 By 2015, about 24% of all new business software purchases
Volume 3, Issue 6, June 2015 International Journal of Advance Research in Computer Science and Management Studies
Volume 3, Issue 6, June 2015 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online at: www.ijarcsms.com Image
Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2
Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue
International Journal of Engineering Research ISSN: 2348-4039 & Management Technology November-2015 Volume 2, Issue-6
International Journal of Engineering Research ISSN: 2348-4039 & Management Technology Email: editor@ijermt.org November-2015 Volume 2, Issue-6 www.ijermt.org Modeling Big Data Characteristics for Discovering
Beyond Watson: The Business Implications of Big Data
Beyond Watson: The Business Implications of Big Data Shankar Venkataraman IBM Program Director, STSM, Big Data August 10, 2011 The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT
Hadoop for Enterprises:
Hadoop for Enterprises: Overcoming the Major Challenges Introduction to Big Data Big Data are information assets that are high volume, velocity, and variety. Big Data demands cost-effective, innovative
How Big Data is Different
FALL 2012 VOL.54 NO.1 Thomas H. Davenport, Paul Barth and Randy Bean How Big Data is Different Brought to you by Please note that gray areas reflect artwork that has been intentionally removed. The substantive
AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW
AGENDA What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story Hadoop PDW Our BIG DATA Roadmap BIG DATA? Volume 59% growth in annual WW information 1.2M Zetabytes (10 21 bytes) this
www.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage
www.pwc.com/oracle Next presentation starting soon Business Analytics using Big Data to gain competitive advantage If every image made and every word written from the earliest stirring of civilization
Improving Data Processing Speed in Big Data Analytics Using. HDFS Method
Improving Data Processing Speed in Big Data Analytics Using HDFS Method M.R.Sundarakumar Assistant Professor, Department Of Computer Science and Engineering, R.V College of Engineering, Bangalore, India
A Study of Data Management Technology for Handling Big Data
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 9, September 2014,
Big Data Analytic and Mining with Machine Learning Algorithm
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 4, Number 1 (2014), pp. 33-40 International Research Publications House http://www. irphouse.com /ijict.htm Big Data
Volume 3, Issue 8, August 2015 International Journal of Advance Research in Computer Science and Management Studies
Volume 3, Issue 8, August 2015 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online at: www.ijarcsms.com An
End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ
End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,
Analytics: The real-world use of big data
Findings from the research collaboration of IBM Institute for Business Value and Saïd Business School, University of Oxford Analytics: The real-world use of big data How innovative enterprises extract
Are You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
Big Data and Hadoop with components like Flume, Pig, Hive and Jaql
Abstract- Today data is increasing in volume, variety and velocity. To manage this data, we have to use databases with massively parallel software running on tens, hundreds, or more than thousands of servers.
International Journal of Innovative Research in Computer and Communication Engineering
FP Tree Algorithm and Approaches in Big Data T.Rathika 1, J.Senthil Murugan 2 Assistant Professor, Department of CSE, SRM University, Ramapuram Campus, Chennai, Tamil Nadu,India 1 Assistant Professor,
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK A REVIEW ON HIGH PERFORMANCE DATA STORAGE ARCHITECTURE OF BIGDATA USING HDFS MS.
Finding Insights & Hadoop Cluster Performance Analysis over Census Dataset Using Big-Data Analytics
Finding Insights & Hadoop Cluster Performance Analysis over Census Dataset Using Big-Data Analytics Dharmendra Agawane 1, Rohit Pawar 2, Pavankumar Purohit 3, Gangadhar Agre 4 Guide: Prof. P B Jawade 2
Indian Journal of Science The International Journal for Science ISSN 2319 7730 EISSN 2319 7749 2016 Discovery Publication. All Rights Reserved
Indian Journal of Science The International Journal for Science ISSN 2319 7730 EISSN 2319 7749 2016 Discovery Publication. All Rights Reserved Perspective Big Data Framework for Healthcare using Hadoop
Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.
Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics
Executive Summary... 2 Introduction... 3. Defining Big Data... 3. The Importance of Big Data... 4 Building a Big Data Platform...
Executive Summary... 2 Introduction... 3 Defining Big Data... 3 The Importance of Big Data... 4 Building a Big Data Platform... 5 Infrastructure Requirements... 5 Solution Spectrum... 6 Oracle's Big Data
New Design Principles for Effective Knowledge Discovery from Big Data
New Design Principles for Effective Knowledge Discovery from Big Data Anjana Gosain USICT Guru Gobind Singh Indraprastha University Delhi, India Nikita Chugh USICT Guru Gobind Singh Indraprastha University
Data Centric Computing Revisited
Piyush Chaudhary Technical Computing Solutions Data Centric Computing Revisited SPXXL/SCICOMP Summer 2013 Bottom line: It is a time of Powerful Information Data volume is on the rise Dimensions of data
We are Big Data A Sonian Whitepaper
EXECUTIVE SUMMARY Big Data is not an uncommon term in the technology industry anymore. It's of big interest to many leading IT providers and archiving companies. But what is Big Data? While many have formed
Getting Started Practical Input For Your Roadmap
Getting Started Practical Input For Your Roadmap Mike Ferguson Managing Director, Intelligent Business Strategies BA4ALL Big Data & Analytics Insight Conference Stockholm, May 2015 About Mike Ferguson
EXECUTIVE REPORT. Big Data and the 3 V's: Volume, Variety and Velocity
EXECUTIVE REPORT Big Data and the 3 V's: Volume, Variety and Velocity The three V's are the defining properties of big data. It is critical to understand what these elements mean. The main point of the
Ramesh Bhashyam Teradata Fellow Teradata Corporation bhashyam.ramesh@teradata.com
Challenges of Handling Big Data Ramesh Bhashyam Teradata Fellow Teradata Corporation bhashyam.ramesh@teradata.com Trend Too much information is a storage issue, certainly, but too much information is also
How Big Is Big Data Adoption? Survey Results. Survey Results... 4. Big Data Company Strategy... 6
Survey Results Table of Contents Survey Results... 4 Big Data Company Strategy... 6 Big Data Business Drivers and Benefits Received... 8 Big Data Integration... 10 Big Data Implementation Challenges...
Hadoop Technology for Flow Analysis of the Internet Traffic
Hadoop Technology for Flow Analysis of the Internet Traffic Rakshitha Kiran P PG Scholar, Dept. of C.S, Shree Devi Institute of Technology, Mangalore, Karnataka, India ABSTRACT: Flow analysis of the internet
BIG DATA IN BUSINESS ENVIRONMENT
Scientific Bulletin Economic Sciences, Volume 14/ Issue 1 BIG DATA IN BUSINESS ENVIRONMENT Logica BANICA 1, Alina HAGIU 2 1 Faculty of Economics, University of Pitesti, Romania olga.banica@upit.ro 2 Faculty
Big Data Defined Introducing DataStack 3.0
Big Data Big Data Defined Introducing DataStack 3.0 Inside: Executive Summary... 1 Introduction... 2 Emergence of DataStack 3.0... 3 DataStack 1.0 to 2.0... 4 DataStack 2.0 Refined for Large Data & Analytics...
Manifest for Big Data Pig, Hive & Jaql
Manifest for Big Data Pig, Hive & Jaql Ajay Chotrani, Priyanka Punjabi, Prachi Ratnani, Rupali Hande Final Year Student, Dept. of Computer Engineering, V.E.S.I.T, Mumbai, India Faculty, Computer Engineering,
The 4 Pillars of Technosoft's Big Data Practice
beyond possible Big Use End-user applications Big Analytics Visualisation tools Big Analytical tools Big management systems The 4 Pillars of Technosoft's Big Data Practice Overview Businesses have long managed
Big Data on Microsoft Platform
Big Data on Microsoft Platform Prepared by GJ Srinivas Corporate TEG - Microsoft Page 1 Contents 1. What is Big Data?...3 2. Characteristics of Big Data...3 3. Enter Hadoop...3 4. Microsoft Big Data Solutions...4
Sunnie Chung. Cleveland State University
Sunnie Chung Cleveland State University Data Scientist Big Data Processing Data Mining 2 INTERSECT of Computer Scientists and Statisticians with Knowledge of Data Mining AND Big data Processing Skills:
An Oracle White Paper October 2011. Oracle: Big Data for the Enterprise
An Oracle White Paper October 2011 Oracle: Big Data for the Enterprise Executive Summary... 2 Introduction... 3 Defining Big Data... 3 The Importance of Big Data... 4 Building a Big Data Platform... 5
SOCIAL NETWORKS AND STUDENTS' FUTURE JOBS
SOCIAL NETWORKS AND STUDENTS' FUTURE JOBS Lorena Batagan 1 and Catalin Boja 2 1) 2) Bucharest University of Economic Studies, Romania E-mail: lorena.pocatilu@ie.ase.ro; E-mail: catalin.boja@ie.ase.ro Abstract
Packet Flow Analysis and Congestion Control of Big Data by Hadoop
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 6, June 2015, pg.456
15.00 15.30 30 XML enabled databases. Non relational databases. Guido Rotondi
Programme of the ESTP training course on BIG DATA EFFECTIVE PROCESSING AND ANALYSIS OF VERY LARGE AND UNSTRUCTURED DATA FOR OFFICIAL STATISTICS Rome, 5 9 May 2014 Istat Piazza Indipendenza 4, Room Vanoni
Introduction to Big Data the four V's
Chapter 1: Introduction to Big Data the four V's This chapter is mainly based on the Big Data script by Donald Kossmann and Nesime Tatbul (ETH Zürich) Big Data Management and Analytics 15 Goal of Today
BIG DATA TECHNOLOGY. Hadoop Ecosystem
BIG DATA TECHNOLOGY Hadoop Ecosystem Agenda Background What is Big Data Solution Objective Introduction to Hadoop Hadoop Ecosystem Hybrid EDW Model Predictive Analysis using Hadoop Conclusion What is Big
Big Data Components for Business Process Optimization
Big Data Components for Business Process Optimization Mircea Răducu TRIFU, Mihaela-Laura IVAN Bucharest University of Economic Studies, Bucharest, Romania trifumircearadu@yahoo.com, ivanmihaela88@gmail.com
Big Data Collection Study for Providing Efficient Information
pp. 41-50 http://dx.doi.org/10.14257/ijseia.2015.9.12.03 Big Data Collection Study for Providing Efficient Information Jun-soo Yun, Jin-tae Park, Hyun-seo Hwang and Il-young Moon Computer Science and
Are You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop
ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: simmibagga12@gmail.com
Chapter 7. Using Hadoop Cluster and MapReduce
Chapter 7 Using Hadoop Cluster and MapReduce Modeling and Prototyping of RMS for QoS Oriented Grid Page 152 7. Using Hadoop Cluster and MapReduce for Big Data Problems The size of the databases used in
A Novel Cloud Computing Data Fragmentation Service Design for Distributed Systems
A Novel Cloud Computing Data Fragmentation Service Design for Distributed Systems Ismail Hababeh School of Computer Engineering and Information Technology, German-Jordanian University Amman, Jordan Abstract-
Impact of Big Data in Oil & Gas Industry. Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India.
Impact of Big Data in Oil & Gas Industry Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India. New Age Information 2.92 billions Internet Users in 2014 Twitter processes 7 terabytes
Application Development. A Paradigm Shift
Application Development for the Cloud: A Paradigm Shift Ramesh Rangachar Intelsat © 2012 by Intelsat. Published by The Aerospace Corporation with permission. Motivation for the
Business Challenges and Research Directions of Management Analytics in the Big Data Era
Business Challenges and Research Directions of Management Analytics in the Big Data Era Abstract Big data analytics have been embraced as a disruptive technology that will reshape business intelligence,
USING BIG DATA FOR INTELLIGENT BUSINESSES
HENRI COANDA AIR FORCE ACADEMY ROMANIA INTERNATIONAL CONFERENCE of SCIENTIFIC PAPER AFASES 2015 Brasov, 28-30 May 2015 GENERAL M.R. STEFANIK ARMED FORCES ACADEMY SLOVAK REPUBLIC USING BIG DATA FOR INTELLIGENT
A Study on Effective Business Logic Approach for Big Data Mining
A Study on Effective Business Logic Approach for Big Data Mining T. Sathis Kumar Assistant Professor, Dept of C.S.E, Saranathan College of Engineering, Trichy, Tamil Nadu, India. ABSTRACT: Big data is
Role of Social Networking in Marketing using Data Mining
Role of Social Networking in Marketing using Data Mining Mrs. Saroj Junghare Astt. Professor, Department of Computer Science and Application St. Aloysius College, Jabalpur, Madhya Pradesh, India Abstract:
INTELLIGENT BUSINESS STRATEGIES WHITE PAPER
INTELLIGENT BUSINESS STRATEGIES WHITE PAPER Improving Access to Data for Successful Business Intelligence Part 2: Supporting Multiple Analytical Workloads in a Changing Analytical Landscape By Mike Ferguson
Bringing Big Data into the Enterprise
Bringing Big Data into the Enterprise Overview When evaluating Big Data applications in enterprise computing, one often-asked question is how does Big Data compare to the Enterprise Data Warehouse (EDW)?
REAL-TIME OPERATIONAL INTELLIGENCE. Competitive advantage from unstructured, high-velocity log and machine Big Data
REAL-TIME OPERATIONAL INTELLIGENCE Competitive advantage from unstructured, high-velocity log and machine Big Data SQLstream: Our s-streaming products unlock the value of high-velocity unstructured log
HOW TO MAKE SENSE OF BIG DATA TO BETTER DRIVE BUSINESS PROCESSES, IMPROVE DECISION-MAKING, AND SUCCESSFULLY COMPETE IN TODAY'S MARKETS.
HOW TO MAKE SENSE OF BIG DATA TO BETTER DRIVE BUSINESS PROCESSES, IMPROVE DECISION-MAKING, AND SUCCESSFULLY COMPETE IN TODAY'S MARKETS. ALTILIA turns Big Data into Smart Data and enables businesses to
Big Data: Tools and Technologies in Big Data
Big Data: Tools and Technologies in Big Data Jaskaran Singh Student Lovely Professional University, Punjab Varun Singla Assistant Professor Lovely Professional University, Punjab ABSTRACT Big data can
White Paper. How Streaming Data Analytics Enables Real-Time Decisions
White Paper How Streaming Data Analytics Enables Real-Time Decisions Contents Introduction... 1 What Is Streaming Analytics?... 1 How Does SAS Event Stream Processing Work?... 2 Overview...2 Event Stream
Keywords Big Data Analytic Tools, Data Mining, Hadoop and MapReduce, HBase and Hive tools, User-Friendly tools.
Volume 3, Issue 8, August 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Significant Trends
Chapter 6. Foundations of Business Intelligence: Databases and Information Management
Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
ANALYTICS BUILT FOR INTERNET OF THINGS
ANALYTICS BUILT FOR INTERNET OF THINGS Big Data Reporting is Out, Actionable Insights are In In recent years, it has become clear that data in itself has little relevance, it is the analysis of it that
Testing Big data is one of the biggest
Infosys Labs Briefings VOL 11 NO 1 2013 Big Data: Testing Approach to Overcome Quality Challenges By Mahesh Gudipati, Shanthi Rao, Naju D. Mohan and Naveen Kumar Gajja Validate data quality by employing
The Challenge of Big Data Benchmarking Large-Scale Data Management Insights from Benchmark Research
Benchmarking Large-Scale Data Management Insights from Presentation Confidentiality Statement The materials in this presentation are protected under the confidential agreement and/or are copyrighted materials
BSUIR Library. BIG DATA IN BANKING. Constantine DZIK. Dzmitry BALTUNOU. MOHAMMED Utech LLC location Madison, MS USA
BIG DATA IN BANKING Constantine DZIK Utech LLC location Madison, MS USA Utech LLC location Madison, MS USA Farooq MOHAMMED Utech LLC location Madison, MS USA Dzmitry BALTUNOU Utech LLC location Madison,
Why is BIG Data Important?
Why is BIG Data Important? March 2012 A Navint Partners White Paper May 2012 What is Big Data? Big data is a term that refers to data
INTELLIGENT BUSINESS STRATEGIES WHITE PAPER
INTELLIGENT BUSINESS STRATEGIES WHITE PAPER Improving Access to Data for Successful Business Intelligence Part 1: Meeting Today's Business Requirements in an Increasingly Complex Environment By Mike Ferguson
How to Enhance Traditional BI Architecture to Leverage Big Data
BIG DATA How to Enhance Traditional BI Architecture to Leverage Big Data Contents Executive Summary... 1 Traditional BI - DataStack 2.0 Architecture... 2 Benefits of Traditional BI - DataStack 2.0...
Concept and Project Objectives
3.1 Publishable summary Concept and Project Objectives Proactive and dynamic QoS management, network intrusion detection and early detection of network congestion problems among other applications in the
DATAOPT SOLUTIONS. What Is Big Data?
DATAOPT SOLUTIONS What Is Big Data? WHAT IS BIG DATA? It's more than just large amounts of data, though that's definitely one component. The more interesting dimension is about the types of data. So Big
BIG DATA TRENDS AND TECHNOLOGIES
BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using on-hand database management tools.
III Big Data Technologies
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
Big Data Analytics. An Introduction. Oliver Fuchsberger University of Paderborn 2014
Big Data Analytics An Introduction Oliver Fuchsberger University of Paderborn 2014 Table of Contents I. Introduction & Motivation What is Big Data Analytics? Why is it so important? II. Techniques & Solutions
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012. Viswa Sharma Solutions Architect Tata Consultancy Services
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the
Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce
Analytics in the Cloud Peter Sirota, GM Elastic MapReduce Data-Driven Decision Making Data is the new raw material for any business on par with capital, people, and labor. What is Big Data? Terabytes of
The Next Wave of Data Management. Is Big Data The New Normal?
The Next Wave of Data Management Is Big Data The New Normal? Table of Contents Introduction 3 Separating Reality and Hype 3 Why Are Firms Making IT Investments In Big Data? 4 Trends In Data Management
Big Data Explained. An introduction to Big Data Science.
Big Data Explained An introduction to Big Data Science. 1 Presentation Agenda What is Big Data Why learn Big Data Who is it for How to start learning Big Data When to learn it Objective and Benefits of
AN INTRO TO DATA MANAGEMENT
AN INTRO TO DATA MANAGEMENT HOW TO USE DATA TO SUCCEED AT YOUR JOB by: TABLE OF CONTENTS Chapter 1: The Rise of Global Data Chapter 2: Taking Advantage of Big Data Chapter 3: Data Projects from Start to
A Proposed Framework to Use Cloud Computing Power in Business Intelligence
International Academic Institute for Science and Technology International Academic Journal of Science and Engineering Vol. 3, No. 7, 2016, pp. 10-14. ISSN 2454-3896 International Academic Journal of Science
Big Data Use Cases Update
Big Data Use Cases Update Sanat Joshi Industry Solutions Manufacturing Industries Business Unit 1 Data Explosion Web & social networks experienced it first Infographic by Go-gulf.com 2 Number Of Connected