Inconsistencies in Big Data
|
|
|
- Felix Sharp
- 10 years ago
- Views:
Transcription
1 Inconsistencies in Big Data Du Zhang Department of Computer Science California State University Sacramento, CA Abstract We are faced with a torrent of data generated and captured in digital form as a result of the advancement of sciences, engineering and technologies, and various social, economical and human activities. This big data phenomenon ushers in a new era where human endeavors and scientific pursuits will be aided by not only human capital, and physical and financial assets, but also data assets. Research issues in big data and big data analysis are embedded in multi-dimensional scientific and technological spaces. In this paper, we first take a close look at the dimensions in big data and big data analysis, and then focus our attention on the issue of inconsistencies in big data and the impact of inconsistencies in big data analysis. We offer classifications of four types of inconsistencies in big data and point out the utility of inconsistency-induced learning as a tool for big data analysis. Keywords: big data, big data analysis, inconsistencies in big data, inconsistency-induced learning. 1. Introduction Today, the advancement of sciences, engineering and technologies, the human endeavors, and the social and economic activities have collectively created a torrent of data in digital form. This big data phenomenon will only get intensified and diversified in the years to come. How to turn this big data phenomenon into a positive force for good has drawn tremendous and intensified interest from an everincreasing set of big data stakeholders. As big data becomes an increasingly popular buzzword, we must not lose sight of the fact that research issues behind big data and big data analysis are embedded in multi-dimensional scientific and technological spaces. We must maintain a clear picture of what big data is and what big data analysis entails. The objectives of big data analysis are varied. They are largely aligned with the objectives of big data stakeholders. These can translate into creating values in healthcare, accelerating the pace of scientific discoveries for life and physical sciences, improving the productivity in manufacturing, developing a competitive edge for business, retail, or service industries, and innovating in education, media, transportation, or government. How to better utilize data assets, in addition to physical and financial assets, and human capital, to create value has become a fertile ground for enterprises to gain competitive advantages. As big data analysis becomes the next frontier for advancement of knowledge, innovation, and enhanced decision-making process, the significance of its impact on the society as a whole can never be underestimated. Many domains and economic sectors can benefit from the big data push: life and physical sciences, medicine, education, healthcare, location-based services, manufacturing, retail, communication and media, government, transportation, banking, insurance, financial services, utilities, environment, and energy industry [3, 14]. In this paper, we first take a close look at various dimensions in big data and big data analysis, and highlight some major issues in those dimensions. We then focus our attention on an important issue: inconsistencies in big data and their impact on the outcome of big data analysis. Inconsistencies are ubiquitous in the real world, manifesting themselves in a plethora of human behaviors and decisionmaking processes for which big data are acquired, integrated, analyzed, and utilized in an attempt to create values and accelerate scientific discoveries [4,12,15-16,18-19,22,23-27]. Once captured in big data, inconsistencies can occur at various granularities of knowledge content, from data, information, knowledge, meta-knowledge, to expertise [27]. If not handled properly, inconsistencies can have adverse impact on the quality of the outcomes in big data analysis process [1,7]. Inconsistencies can also exhibit in reasoning methods, heuristics, or problem-solving approaches of various analysis tasks, creating challenges for big data analysis. In the paper, we describe classifications for four types of inconsistencies in big data. It turns out that big data inconsistencies can be utilized as important heuristics for improving the performance in various analysis tasks and the quality of outcomes in big data analysis. The recently proposed inconsistency-induced learning, or i 2 Learning [28-30], offers a promising approach toward proper handling of big data inconsistencies.
2 The rest of the paper is organized as follows. Section 2 gives a panoramic view of big data and big data analysis in terms of the issues and challenges. In Section 3, we focus our attention on four types of inconsistencies in big data and how they impact on the big data analysis. Section 4 discusses how inconsistency-induced learning can be utilized as a tool to turn big data inconsistencies into helpful heuristics for better analysis results. Finally, Section 5 concludes the paper with remarks on future work. 2. Dimensions in Big Data After surveying the landscape, we summarize various dimensions of big data and big data analysis in Figure 1. As a technical term, big data generates many different interpretations and definitions. A meta-definition based on the size dimension is given in [13]: big data should be defined at any point in time as data whose size forces us to look beyond the tried-and-true methods that are prevalent at that time. The volume-variety-velocity definition [11] attempts to capture not only the size dimension, but also the types and speed (at which data are generated) dimensions of the datasets we encounter today. The survey results in [20] indicated a list of alternative definitions for big data. What has been glossed over in the literature is the following: what exactly does a dataset contain, primitive data elements, or pieces of information, knowledge, or meta-knowledge, or any combination of them? The terms of data and information have been used interchangeably in the literature, but there are distinct definitions for data, information, knowledge, meta-knowledge, and expertise, respectively. Figure 1. Dimensions in Big Data. In [1], big data analysis is defined to be a pipeline of acquisition and recording; extraction, cleaning and annotation; integration, aggregation and representation; analysis and modeling; and interpretation. Additionally, there are other alternative definitions on what big data analysis entails [14, 21]. Sources of big data include: transactions, scientific experiments, genomic investigations, logs, events, s, social media, sensors, RFID scans, texts, geospatial data, audio data, medical records, surveillance, images, and videos [3, 20]. These sources of big data can be semistructured (e.g., tabular, relational, categorical, or metadata), or unstructured (e.g., text, messages). Instances in a dataset can have many properties. For example, data instances may have the same or different probabilistic distributions. As keenly observed in [13], what makes big data big is repeated observations over time and/or space. Hence, most large datasets have inherent temporal or spatial dimensions, or both [13]. Recognizing this inherent temporal/spatial property is very important because this is where performance problems stem from when we try to conduct big data analysis using the prevailing database model (current RDBS model does not honor the order of rows in tables [13]). Another property is that most large datasets exhibit predictable characteristics in the following sense: the largest cardinalities of most datasets specifically, the number of
3 distinct entities about which observations are made are small compared with the total number of observations [13]. This is a very important heuristic in big data analysis. For scientific datasets, they are typically multi-dimensional, have embedded physical models, possess meta-data about experiments and their provenance, and have low update rates with most updates append-only [2]. The success of big data analysis depends critically on the following array of technologies: machine learning, cloud computing, crowd sourcing [21], data mining, time series analysis, stream processing, and visualization [5, 14]. Big data analysis faces many challenges. In addition to volume, variety and velocity that create challenges in storage, curation, search, retrieval, and visualization issues, veracity generates data uncertainty handling complications [20]. There are a whole host of inconsistent or conflicting circumstances during big data analysis [1, 7]. How to properly handle various types of inconsistencies during data pre-processing and analysis is another challenge. Additional challenges exist that include privacy, security, provenance, and modeling [1, 14]. In the pursuit of advancing knowledge or creating value out of data, there are potential pitfalls. While data are plentiful in today s digital society, we need to be mindful that data alone are not enough to advance knowledge or create value. Every learner must embody some knowledge or assumptions beyond the data it is given in order to generalize beyond it [8]. The curse of dimensionality is another potential snag. When utilizing machine learning algorithms to generalize beyond the input data, generalizing correctly becomes exponentially harder as the dimensionality (number of features) of the examples grows, because a fixed-size training set covers a dwindling fraction of the input space. Even with a moderate dimension of 100 and a huge training set of a trillion examples, the latter covers only a fraction of about of the input space [8]. A related issue is feature engineering [8], the large dataset in its raw format is not in a form that is amenable to learning, but you can construct features from it that are. In the depth of knowledge, there are layers of knowledge content. Noise can be described as items that carry no content of knowledge. Data denotes values drawn from some domain of discourse. Information defines the meanings of data values as understood by those who use them. Knowledge represents specialized information about some domain that allows one to make decision. Metaknowledge is knowledge about knowledge. Expertise is specialized operative knowledge that is inherently taskspecific and relatively inflexible. Knowledge content of large granularity has small connotations and knowledge content of small granularity has large connotations. Table 1 summarizes properties and structures in levels of knowledge content in the knowledge hierarchy. Table 2 indicates types of reasoning involved in the knowledge hierarchy: induction that goes from data to knowledge (bottom-up arrow), deduction that applies knowledge to obtain conclusions for data (top-down arrow), and transduction that allows conclusions to be drawn for new data directly from existing data without formulating the knowledge for new data (double arrow). Table 1. Levels of knowledge content in big data. Content Property Structure Expertise Metaknowledge Knowledge Information Data Specialized operative knowledge, taskspecific, relatively inflexible Control strategies Learning decisions Declarative Procedural Functional dependencies Associations Symbolic, numeric, text, graphic, categorical, waveform, video More structured Rich semantics Small connotations Advanced representation Less structured Simple semantics Large connotations Simple representation Table 2. Types of reasoning in big data analysis. Content Expertise Meta-knowledge Knowledge Information Data Decision Process Inductive Deductive Transductive As a categorical phrase, big data has been used to refer to large datasets. But what exactly such a large dataset contains is what we need to be precise and specific about. Data, information, and knowledge should not be regarded as interchangeable terms denoting the same entities and having the same set of connotations. Table 3 offers some examples illustrating the differences. We believe that bringing concepts in granularities of knowledge content explicitly into the big data analysis can lead to a better and more accurate curation. It is conducive to various tasks at different stages in the analysis process. For instance, depending on the circumstance of an input set (e.g., containing data elements only, or data elements plus domain knowledge), a learning algorithm that works best under the circumstance can be selected. With regard to the terminology, in addition to big data, terms of big information, big knowledge, or big meta-knowledge can be more pertinently utilized to accurately describe circumstances where an input set contains large volume of information, knowledge, or meta-knowledge, respectively.
4 Knowledge Information Data Table 3. Examples of knowledge content granularities in big data. Locationbased services Restaurant ratings Restaurants Latitudelongitude coordinates Social networks Healthcare Retail Social network structures People who tweet and people who follow other people Tweets 3. Big Data Inconsistencies Diagnoses Patients X-ray images Purchase patterns Groups of customers Transactions In circumstances where big data are produced, acquired, aggregated, transformed, or represented, inconsistencies invariably find their way into large datasets. This can be attributed to a number of factors in human behaviors and in decision-making process. Once captured in big data, inconsistent or conflicting phenomena can occur at various granularities of knowledge content, from data, information, knowledge, meta-knowledge, to expertise, and can adversely affect the quality of the outcomes in big data analysis process [1, 7]. Inconsistencies can also arise in reasoning methods, heuristics, or problem-solving approaches deployed for various analysis tasks, resulting in complications for big data analysis. Before deciding on how to go about with the inconsistencies found in big data, we need to recognize different types of inconsistencies for different types of big data. For instance, for location-based data, temporal or spatial inconsistencies [27] will dominate, whereas for unstructured text data, inconsistencies stemming from syntactic, semantic, and pragmatic circumstances of a natural language will occupy a commanding position. How we identify and differentiate categories of inconsistent phenomena at levels of data, information, knowledge, metaknowledge will help pave the way for subsequent handling tasks. Inconsistencies at data level involve various types of values (symbolic, numeric, categorical, waveform, etc.). Inconsistencies at information level manifest in terms of functional dependencies or associations. At knowledge level, inconsistencies display in declarative or procedural beliefs. Meta-knowledge inconsistencies are demonstrated through control strategies or learning decisions [27]. Analysis is needed to establish correspondence between big data analysis tasks and types of inconsistencies impacting or affecting those tasks. Finally, we can use the analysis task to inconsistent phenomena correspondence to guide the development of task-inconsistency specific methods or tools to help assist various tasks in big data analysis. One example is inconsistency-induced learning, or i 2 Learning in [28-30], that allows inconsistencies to be utilized as stimuli to initiate learning episodes. The outcome of such perpetual learning mechanism will result in refined or augmented knowledge that in turn improves the big data analysis performance. In the rest of this section, we will elaborate on four important types of inconsistencies in big data. They include: temporal inconsistencies, spatial inconsistencies, inconsistencies found in unstructured natural language text, and inconsistencies stemming from violations of functional dependencies Temporal Inconsistencies When datasets contain a temporal attribute, data items with conflicting circumstances may coincide or overlap in time. The time interval relationships between conflicting data items can result in partial temporal inconsistency or complete temporal inconsistency [23]. Temporal inconsistencies have been utilized as problem-solving heuristics in IBM Watson open-domain QA system where temporal reasoning is deployed to detect inconsistencies between dates in the clue and those associated with a candidate answer [10]. In a temperature time-series data, a temperature recording of 35 in July in New Orleans would be inconsistent with the context. In human electrocardiogram time-series data, a prolonged period of low value output in the ECG is inconsistent with the normal heart rhythm pattern, an indication for atrial premature contraction [6]. Table 4 gives a list of temporal inconsistencies. Partial Complete Anomalous value Contextual Motif Table 4. Temporal inconsistencies Spatial Inconsistencies Time intervals of two inconsistent events are partially overlapping. Time intervals of two inconsistent events coincide or satisfy containment. A time-series data has an anomalous value. A time-series data has an anomalous instance in a given context. Time-series data has a segment of data values that reoccurs and is anomalous When datasets include geometric or spatial dimension, data items are often about objects in space that have geometric properties (location, shape) and that observe spatial relations (topological, directional and distance relations) (see Table 5). Spatial inconsistencies can arise from the geometric representation of objects (a spatial object having multiple conflicting geometric locations), spatial relations between objects (violations of spatial constraints with regard to some spatial relation), or aggregation of composite objects (different representations of the same object from different sources resulting in
5 violation of the constraint that objects must have unique geometric representation) [19]. Table 5. Spatial relations. Rotation Translation Scaling Topological invariant invariant invariant Directional changing invariant invariant Distance invariant invariant changing Again in IBM Watson system, geospatial reasoning is deployed that is capable of detecting spatial inconsistencies stemming from conflicting spatial relations such as directionality, borders, and containment between geoentities [10]. Table 6 is a list of possible spatial inconsistencies based on [19]. Geometric location Geometric shape Topological Directional Distance Scaling induced Semantic constraint Structural constraint Integration induced Table 6. Spatial inconsistencies. A spatial object has conflicting geometric locations. A spatial object has conflicting geometric shapes. Violation of topological constraints. Violation of directional properties. Violation of distance properties. Different geometric representations of a spatial object at different scales. Violation of semantic integrity constraints. Violation of structural integrity constraints of geometric primitives. Different representations of the same spatial object from different sources resulting in violation of the constraint that objects must have unique geometric representation. Another example is the work in [22] that defined a concept called conflation that reconciles spatial inconsistencies arising from combining information from diverse sources. The work deals with conflating road maps with aerial images using road intersections as conjugate features and is based on a three-process workflow that includes preprocessing for road candidate identification, spatial inconsistency removal, and shape disagreement removal. The results indicated that the conflated approach yields a 36.6% accuracy improvement over the nonconflated approach for the experiments Text Inconsistencies As big datasets are increasingly generated from social media, blogs, s, crowd-sourced ratings, inconsistencies in unstructured text and messages become an important research topic [15,18]. If two texts are referring to the same event or entity, then they are said to be of co-reference [15]. Event or entity co-referencing is a necessary condition for text inconsistencies [15]. Table 7 summarizes a list of text inconsistencies. Table 7. Text inconsistencies. Complementary - Miami Heat was in the 2012 NBA final. - Miami Heat was not in the 2012 NBA final. Mutual exclusive - Sea cucumber is animal. - Sea cucumber is vegetable. Inheritance - Penguin cannot fly. - Penguin is bird and bird can fly, hence penguin can fly. Synonym - The system has a fast response time. - The system s response time is not rapid. Antonym - The system has a fast response time. - The system has a slow response time. Anti-subsumption - John is a surgeon. - John is not a doctor. Anti-supertype - BigDog is not a robot. - BigDog is a legged squad support system. Asymmetric - John is married to Jane. - Jane is not married to John. Anti-inverse - John is parent of Mike. - Mike is not child of John. Mismatching - M 5 is a mobile agent that runs in both Android and ios environments. - M 5 does not run in Android environment. Disagreeing - M 5 has a memory of 10 GB. - M 5 has a memory of 5000 MB. Contradictory - M 5 was developed in March M 5 was deployed in December Precedence - Obama succeeded Bush as president in Bush succeeded Obama as president in Factive [15] - The bombers had not managed to enter the embassy. - The bombers entered the embassy. Lexical [15] - John said Jane did nothing wrong. World knowledge [15] Functional [18] - John accuses Jane. - Microsoft Israel, one of the first Microsoft branches outside the USA, was founded in Microsoft was established in Mozart was born in Salzburg. - Mozart was born in Vienna Functional Dependency Inconsistencies Many big datasets are stored, aggregated, and cleaned through the help of relational database systems where functional dependencies (FD) [16] or conditional functional dependencies [9] play a critically important role in enforcing the integrity constraints for the database. Violations of such functional dependencies or conditional functional dependencies will result in inconsistencies in data and information (Table 8) [9,16]. Table 8. Functional dependency inconsistencies. Single FD Multiple FD Conditional FD Violation of single functional dependency Violation of multiple functional dependencies. Violation of conditional functional dependencies
6 3.5. Occurrence in Knowledge Content Levels The aforementioned types of inconsistencies can manifest themselves at various levels of knowledge content. Table 9 summarizes possible occurrences of various types of inconsistencies at different levels of knowledge content. Table 9. Occurrence in knowledge content levels. Data Information Knowledge Meta-K Temporal Spatial Text FD 4. Inconsistency-Induced Learning A framework for inconsistency-induced learning, or i 2 Learning, has been proposed in [28-30]. i 2 Learning accommodates perpetual or lifelong learning by allowing successive learning episodes to be triggered through inconsistencies an agent encounters during its problemsolving episodes. Learning in the framework is accomplished through the continuous knowledge refinement and/or augmentation in order to overcome encountered inconsistencies. An agent s performance at tasks can be incrementally improved with each learning episode. i 2 Learning offers an overarching structure that facilitates the growth and expansion of various inconsistency-specific learning strategies. The essential idea behind i 2 Learning is to identify the cause of inconsistency and then apply cause-specific heuristics to resolve inconsistencies. For instance, if an inconsistent phenomenon stems from irrelevant features, then we can deploy a search algorithm that discerns relevant features from irrelevant ones [29]. We can then overcome inconsistencies by excluding irrelevant features from participating in the analysis process. If an inconsistent case arises as a result of property inheritance, then the heuristic rules of subclass-specificity superseding superclassgenerality can be utilized to resolve inheritance inconsistencies [30]. In the context of big data and big data analysis, i 2 Learning can also play an active role in improving the data quality by reconciling the inconsistencies found in the datasets, in refining or augmenting knowledge for analysis, modeling or interpretation of big data, and in helping enhance big data applications. For instance, in a crowdsourced customer ranking application for goods or services, comments made by customers invariably contain inconsistencies (text, temporal, or spatial). Treating customers comments as a knowledge base that contains pockets of incompatible opinions, we can apply i 2 Learning algorithms to resolve or overcome the inconsistencies in customers comments, which in turn refines or augments this knowledge base to render a more coherent and accurate ratings of the goods or services. This process is continuous and perpetual, with each new inconsistent customer comment on things triggering the next learning episode. 5. Conclusion In this paper, we highlight the multi-dimensional issues and challenges in big data and big data analysis. We then focus our attention on one of the challenges, inconsistencies in big data and their impact on big data analysis. We examine four types of inconsistencies in big data, namely, temporal inconsistencies, spatial inconsistencies, text inconsistencies, and functional dependency inconsistencies. The contribution of this work lies in the fact that articulating explicitly the types of inconsistent phenomena in big data can help pave the way to improve the quality of big data analysis. Future work can be carried out in the following directions. Details of other frequently encountered types of inconsistencies in big data and their impact on big data analysis still need to be fleshed out. Empirical study is planned on utilizing i 2 Learning algorithms with some real world dataset to improve the analysis results. Acknowledgements. We express our appreciation to four anonymous reviewers for their comments that help improve the content and the presentation of this paper. References [1] D. Agrawal, P. Bernstein, E. Bertino, S. Davidson, and U. Dayal, Challenges and opportunities with big data, Cyber Center Technical Report , Purdue University, January 1, [2] A. Ailamaki, V. Kantere, and D. Dash, Managing scientific data, Communications of the ACM, Vol.53, No.6, (2010) [3] Big Data, [4] R.J. Brachman, and H.J. Levesque, Knowledge Representation and Reasoning. Morgan Kaufmann Publishers, [5] S. Bryson, D. Kenwright, M. Cox, D. Ellsworth, and R. Haimes, Visually exploring gigabyte data sets in real time, Communications of the ACM, Vol.42, No.8, (1999) [6] V. Chandola, A. Banerjee, and V. Kumar, Anomaly detection: a survey, ACM Computing Surveys 41 (3) 2009, pp [7] S. Chaudhuri, U. Dayal, and V. Narasayya, An overview of business intelligence technology, Communications of the ACM, Vol.54, No.8, (2011) [8] P. Domingos, A few useful things to know about machine learning, Communications of the ACM, Vol.55, No.10, (2012) [9] W. Fan, F. Geerts, X. Jia, and A. Kementsietsidis, Conditional functional dependencies for capturing data inconsistencies, ACM Transactions on Database Systems, Vol. 33, Issue 2, June [10] D. Ferrucci, et al, Building Watson: An Overview of the DeepQA Project, AI Magazine, Fall 2010, pp
7 [11] Gartner Group press release, Pattern-based strategy: getting value from big data, July [12] R. Gotesky, The Uses of Inconsistency. Philosophy and Phenomenological Research. Vol. 28, No. 4, 1968, pp [13] A. Jacobs, The pathologies of big data, Communications of the ACM, Vol.52, No.8, (2009) [14] J. Manyika, M.Chui, B. Brown, J. Bughin, R. Dobbs, C. Roxburgh, and A. H. Byers, Big Data: the next frontier for innovation, competition, and productivity, McKinsey Global Institute, June [15] M-C de Marneffe, A. N. Rafferty and C. D. Manning, Finding Contradictions in Text, Proc. of 46 th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2008, pp [16] M. V. Martinez, A. Pugliese, G. I. Simari, V. S. Subrahmanian, and H. Prade, How dirty is your relational database? An axiomatic approach, in Proc. 9th European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty, Hammamet, Tunisia, LNAI 4724, 2007, pp [17] G. Mone, Beyond Hadoop, Communications of the ACM, Vol.56, No.1, (2013) [18] A. Ritter, D. Downey, S. Soderland and O. Etzioni, It s a Contradiction-No, It s Not: A Case Study Using Functional Relations, Proc. of Conference on Empirical Methods in Natural Language Processing, [19] A. Rodriguez, Inconsistency issues in spatial databases, in L. Bertossi et al (eds.) Inconsistency Tolerance, LNCS 3300, Springer-Verlag, 2004, pp [20] M. Schroeck, R. Shockley, J. Smart, D. Romero-Morales, and P. Tufano, Analytics: the real-world use of big data: how innovative enterprises extract value from uncertain data, Executive Report, IBM Institute for Business Value and Said Business School at the University of Oxford, [21] The White House Big Data Research and Development Initiative, big_data_press_release_final_2.pdf [22] S. Yang, C. Kim, Y. Huh, and K. Yu, Removal of spatial inconsistency and shape disagreement for conflation of road maps with aerial images, Canadian Journal of Remote Sensing, 38(06), 2012 pp [23] D. Zhang, On Temporal Properties of Knowledge Base Inconsistency. Springer Transactions on Computational Science V, LNCS 5540, 2009, pp [24] D. Zhang, Toward a classification of antagonistic manifestations of knowledge. Proc. of Twenty Second International Conference on Tools with Artificial Intelligence, Arras, France, 2010, pp [25] D. Zhang, Inconsistency: the good, the bad, and the ugly. International Transactions on Systems Science and Applications, Vol.6, No.2/3, August 2010, pp [26] D. Zhang, The utility of inconsistencies in information security and digital forensics. In T. Özyer et al (ed.) Recent Trends in Information Reuse and Integration, Springer- Verlag, 2011, pp [27] D. Zhang and E. Gregoire, The landscape of inconsistency: a perspective, International Journal of Semantic Computing, Vol. 5, No.3, 2011, pp [28] Du Zhang and M. Lu, Inconsistency-induced learning for perpetual learners, International Journal of Software Science and Computational Intelligence, Vol.3, No.4, 2011, pp [29] D. Zhang, i 2 Learning: perpetual learning through bias shifting, in Proc. of the 24 th International Conference on Software Engineering and Knowledge Engineering, July 2012, pp [30] D. Zhang and M. Lu, Learning through Overcoming Inheritance Inconsistencies, in Proc. of the 13 th IEEE International Conference on Information Reuse and Integration, August 2012, pp
GRANULARITIES AND INCONSISTENCIES IN BIG DATA ANALYSIS
International Journal of Software Engineering and Knowledge Engineering World Scientific Publishing Company GRANULARITIES AND INCONSISTENCIES IN BIG DATA ANALYSIS DU ZHANG Department of Computer Science,
Big Data: Study in Structured and Unstructured Data
Big Data: Study in Structured and Unstructured Data Motashim Rasool 1, Wasim Khan 2 [email protected], [email protected] Abstract With the overlay of digital world, Information is available
Data Refinery with Big Data Aspects
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
Information Visualization WS 2013/14 11 Visual Analytics
1 11.1 Definitions and Motivation Lot of research and papers in this emerging field: Visual Analytics: Scope and Challenges of Keim et al. Illuminating the path of Thomas and Cook 2 11.1 Definitions and
Draft Martin Doerr ICS-FORTH, Heraklion, Crete Oct 4, 2001
A comparison of the OpenGIS TM Abstract Specification with the CIDOC CRM 3.2 Draft Martin Doerr ICS-FORTH, Heraklion, Crete Oct 4, 2001 1 Introduction This Mapping has the purpose to identify, if the OpenGIS
Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.
Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics
Text Analytics with Ambiverse. Text to Knowledge. www.ambiverse.com
Text Analytics with Ambiverse Text to Knowledge www.ambiverse.com Version 1.0, February 2016 WWW.AMBIVERSE.COM Contents 1 Ambiverse: Text to Knowledge............................... 5 1.1 Text is all Around
SPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
Formal Methods for Preserving Privacy for Big Data Extraction Software
Formal Methods for Preserving Privacy for Big Data Extraction Software M. Brian Blake and Iman Saleh Abstract University of Miami, Coral Gables, FL Given the inexpensive nature and increasing availability
CHAPTER 1 INTRODUCTION
CHAPTER 1 INTRODUCTION 1.1 Research Motivation In today s modern digital environment with or without our notice we are leaving our digital footprints in various data repositories through our daily activities,
Healthcare Measurement Analysis Using Data mining Techniques
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 03 Issue 07 July, 2014 Page No. 7058-7064 Healthcare Measurement Analysis Using Data mining Techniques 1 Dr.A.Shaik
Dynamic Data in terms of Data Mining Streams
International Journal of Computer Science and Software Engineering Volume 2, Number 1 (2015), pp. 1-6 International Research Publication House http://www.irphouse.com Dynamic Data in terms of Data Mining
How To Understand The Benefits Of Big Data
Findings from the research collaboration of IBM Institute for Business Value and Saïd Business School, University of Oxford Analytics: The real-world use of big data How innovative enterprises extract
BIG DATA & ANALYTICS. Transforming the business and driving revenue through big data and analytics
BIG DATA & ANALYTICS Transforming the business and driving revenue through big data and analytics Collection, storage and extraction of business value from data generated from a variety of sources are
Hexaware E-book on Predictive Analytics
Hexaware E-book on Predictive Analytics Business Intelligence & Analytics Actionable Intelligence Enabled Published on : Feb 7, 2012 Hexaware E-book on Predictive Analytics What is Data mining? Data mining,
Report on the Dagstuhl Seminar Data Quality on the Web
Report on the Dagstuhl Seminar Data Quality on the Web Michael Gertz M. Tamer Özsu Gunter Saake Kai-Uwe Sattler U of California at Davis, U.S.A. U of Waterloo, Canada U of Magdeburg, Germany TU Ilmenau,
Government Technology Trends to Watch in 2014: Big Data
Government Technology Trends to Watch in 2014: Big Data OVERVIEW The federal government manages a wide variety of civilian, defense and intelligence programs and services, which both produce and require
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING Ramesh Babu Palepu 1, Dr K V Sambasiva Rao 2 Dept of IT, Amrita Sai Institute of Science & Technology 1 MVR College of Engineering 2 [email protected]
Data Quality Mining: Employing Classifiers for Assuring consistent Datasets
Data Quality Mining: Employing Classifiers for Assuring consistent Datasets Fabian Grüning Carl von Ossietzky Universität Oldenburg, Germany, [email protected] Abstract: Independent
A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS
A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS Stacey Franklin Jones, D.Sc. ProTech Global Solutions Annapolis, MD Abstract The use of Social Media as a resource to characterize
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria
Data Isn't Everything
June 17, 2015 Innovate Forward Data Isn't Everything The Challenges of Big Data, Advanced Analytics, and Advance Computation Devices for Transportation Agencies. Using Data to Support Mission, Administration,
COMP9321 Web Application Engineering
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 11 (Part II) http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411
How To Make Sense Of Data With Altilia
HOW TO MAKE SENSE OF BIG DATA TO BETTER DRIVE BUSINESS PROCESSES, IMPROVE DECISION-MAKING, AND SUCCESSFULLY COMPETE IN TODAY S MARKETS. ALTILIA turns Big Data into Smart Data and enables businesses to
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
Database Marketing, Business Intelligence and Knowledge Discovery
Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski
CHAPTER 1 INTRODUCTION
1 CHAPTER 1 INTRODUCTION Exploration is a process of discovery. In the database exploration process, an analyst executes a sequence of transformations over a collection of data structures to discover useful
Information Services for Smart Grids
Smart Grid and Renewable Energy, 2009, 8 12 Published Online September 2009 (http://www.scirp.org/journal/sgre/). ABSTRACT Interconnected and integrated electrical power systems, by their very dynamic
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK A SURVEY ON BIG DATA ISSUES AMRINDER KAUR Assistant Professor, Department of Computer
International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 Over viewing issues of data mining with highlights of data warehousing Rushabh H. Baldaniya, Prof H.J.Baldaniya,
Anuradha Bhatia, Faculty, Computer Technology Department, Mumbai, India
Volume 3, Issue 9, September 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Real Time
A Big Data Analytical Framework For Portfolio Optimization Abstract. Keywords. 1. Introduction
A Big Data Analytical Framework For Portfolio Optimization Dhanya Jothimani, Ravi Shankar and Surendra S. Yadav Department of Management Studies, Indian Institute of Technology Delhi {dhanya.jothimani,
Big Data a threat or a chance?
Big Data a threat or a chance? Helwig Hauser University of Bergen, Dept. of Informatics Big Data What is Big Data? well, lots of data, right? we come back to this in a moment. certainly, a buzz-word but
Big Data and Analytics: Challenges and Opportunities
Big Data and Analytics: Challenges and Opportunities Dr. Amin Beheshti Lecturer and Senior Research Associate University of New South Wales, Australia (Service Oriented Computing Group, CSE) Talk: Sharif
Big Data Introduction, Importance and Current Perspective of Challenges
International Journal of Advances in Engineering Science and Technology 221 Available online at www.ijaestonline.com ISSN: 2319-1120 Big Data Introduction, Importance and Current Perspective of Challenges
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC
Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep Neil Raden Hired Brains Research, LLC Traditionally, the job of gathering and integrating data for analytics fell on data warehouses.
Appendix B Data Quality Dimensions
Appendix B Data Quality Dimensions Purpose Dimensions of data quality are fundamental to understanding how to improve data. This appendix summarizes, in chronological order of publication, three foundational
Investigative Research on Big Data: An Analysis
Investigative Research on Big Data: An Analysis Shyam J. Dhoble 1, Prof. Nitin Shelke 2 ME Scholar, Department of CSE, Raisoni college of Engineering and Management Amravati, Maharashtra, India. 1 Assistant
5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014
5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for
ICT Perspectives on Big Data: Well Sorted Materials
ICT Perspectives on Big Data: Well Sorted Materials 3 March 2015 Contents Introduction 1 Dendrogram 2 Tree Map 3 Heat Map 4 Raw Group Data 5 For an online, interactive version of the visualisations in
TECHNOLOGY ANALYSIS FOR INTERNET OF THINGS USING BIG DATA LEARNING
TECHNOLOGY ANALYSIS FOR INTERNET OF THINGS USING BIG DATA LEARNING Sunghae Jun 1 1 Professor, Department of Statistics, Cheongju University, Chungbuk, Korea Abstract The internet of things (IoT) is an
IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper
IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper CAST-2015 provides an opportunity for researchers, academicians, scientists and
Healthcare, transportation,
Smart IT Argus456 Dreamstime.com From Data to Decisions: A Value Chain for Big Data H. Gilbert Miller and Peter Mork, Noblis Healthcare, transportation, finance, energy and resource conservation, environmental
KNOWLEDGE-BASED IN MEDICAL DECISION SUPPORT SYSTEM BASED ON SUBJECTIVE INTELLIGENCE
JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 22/2013, ISSN 1642-6037 medical diagnosis, ontology, subjective intelligence, reasoning, fuzzy rules Hamido FUJITA 1 KNOWLEDGE-BASED IN MEDICAL DECISION
Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2
Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue
Dr. John E. Kelly III Senior Vice President, Director of Research. Differentiating IBM: Research
Dr. John E. Kelly III Senior Vice President, Director of Research Differentiating IBM: Research IBM Research Priorities Impact on IBM and the Marketplace Globalization and Leverage Balanced Research Agenda
Big Data Classification: Problems and Challenges in Network Intrusion Prediction with Machine Learning
Big Data Classification: Problems and Challenges in Network Intrusion Prediction with Machine Learning By: Shan Suthaharan Suthaharan, S. (2014). Big data classification: Problems and challenges in network
Data Mining and Database Systems: Where is the Intersection?
Data Mining and Database Systems: Where is the Intersection? Surajit Chaudhuri Microsoft Research Email: [email protected] 1 Introduction The promise of decision support systems is to exploit enterprise
Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems
Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems Volker Markl [email protected] dima.tu-berlin.de dfki.de/web/research/iam/ bbdc.berlin Based on my 2014 Vision Paper On
What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy
What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy Much higher Volumes. Processed with more Velocity. With much more Variety. Is Big Data so big? Big Data Smart Data Project HAVEn: Adaptive Intelligence
A Strategic Approach to Unlock the Opportunities from Big Data
A Strategic Approach to Unlock the Opportunities from Big Data Yue Pan, Chief Scientist for Information Management and Healthcare IBM Research - China [contacts: [email protected] ] Big Data or Big Illusion?
Facilitating Business Process Discovery using Email Analysis
Facilitating Business Process Discovery using Email Analysis Matin Mavaddat [email protected] Stewart Green Stewart.Green Ian Beeson Ian.Beeson Jin Sa Jin.Sa Abstract Extracting business process
Developing the SMEs Innovative Capacity Using a Big Data Approach
Economy Informatics vol. 14, no. 1/2014 55 Developing the SMEs Innovative Capacity Using a Big Data Approach Alexandra Elena RUSĂNEANU, Victor LAVRIC The Bucharest University of Economic Studies, Romania
From Data to Insight: Big Data and Analytics for Smart Manufacturing Systems
From Data to Insight: Big Data and Analytics for Smart Manufacturing Systems Dr. Sudarsan Rachuri Program Manager Smart Manufacturing Systems Design and Analysis Systems Integration Division Engineering
Search Result Optimization using Annotators
Search Result Optimization using Annotators Vishal A. Kamble 1, Amit B. Chougule 2 1 Department of Computer Science and Engineering, D Y Patil College of engineering, Kolhapur, Maharashtra, India 2 Professor,
International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop
ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: [email protected]
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM
TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM Thanh-Nghi Do College of Information Technology, Cantho University 1 Ly Tu Trong Street, Ninh Kieu District Cantho City, Vietnam
Training for Big Data
Training for Big Data Learnings from the CATS Workshop Raghu Ramakrishnan Technical Fellow, Microsoft Head, Big Data Engineering Head, Cloud Information Services Lab Store any kind of data What is Big
Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum
Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum Siva Ravada Senior Director of Development Oracle Spatial and MapViewer 2 Evolving Technology Platforms
2 Associating Facts with Time
TEMPORAL DATABASES Richard Thomas Snodgrass A temporal database (see Temporal Database) contains time-varying data. Time is an important aspect of all real-world phenomena. Events occur at specific points
Knowledge Discovery and Data Mining. Structured vs. Non-Structured Data
Knowledge Discovery and Data Mining Unit # 2 1 Structured vs. Non-Structured Data Most business databases contain structured data consisting of well-defined fields with numeric or alphanumeric values.
redesigning the data landscape to deliver true business intelligence Your business technologists. Powering progress
redesigning the data landscape to deliver true business intelligence Your business technologists. Powering progress The changing face of data complexity The storage, retrieval and management of data has
MULTI AGENT-BASED DISTRIBUTED DATA MINING
MULTI AGENT-BASED DISTRIBUTED DATA MINING REECHA B. PRAJAPATI 1, SUMITRA MENARIA 2 Department of Computer Science and Engineering, Parul Institute of Technology, Gujarat Technology University Abstract:
Research of Postal Data mining system based on big data
3rd International Conference on Mechatronics, Robotics and Automation (ICMRA 2015) Research of Postal Data mining system based on big data Xia Hu 1, Yanfeng Jin 1, Fan Wang 1 1 Shi Jiazhuang Post & Telecommunication
Mining Big Data. Pang-Ning Tan. Associate Professor Dept of Computer Science & Engineering Michigan State University
Mining Big Data Pang-Ning Tan Associate Professor Dept of Computer Science & Engineering Michigan State University Website: http://www.cse.msu.edu/~ptan Google Trends Big Data Smart Cities Big Data and
Keywords: Big Data, HDFS, Map Reduce, Hadoop
Volume 5, Issue 7, July 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Configuration Tuning
Information Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli ([email protected])
Enterprise Solutions. Data Warehouse & Business Intelligence Chapter-8
Enterprise Solutions Data Warehouse & Business Intelligence Chapter-8 Learning Objectives Concepts of Data Warehouse Business Intelligence, Analytics & Big Data Tools for DWH & BI Concepts of Data Warehouse
From Data to Foresight:
Laura Haas, IBM Fellow IBM Research - Almaden From Data to Foresight: Leveraging Data and Analytics for Materials Research 1 2011 IBM Corporation The road from data to foresight is long? Consumer Reports
IEEE JAVA Project 2012
IEEE JAVA Project 2012 Powered by Cloud Computing Cloud Computing Security from Single to Multi-Clouds. Reliable Re-encryption in Unreliable Clouds. Cloud Data Production for Masses. Costing of Cloud Computing
Big Data Analytics in Mobile Environments
1 Big Data Analytics in Mobile Environments 熊 辉 教 授 罗 格 斯 - 新 泽 西 州 立 大 学 2012-10-2 Rutgers, the State University of New Jersey Why big data: historical view? Productivity versus Complexity (interrelatedness,
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING USING NEURAL NETWORKS JothiKumar.R 1, Sivabalan.R.V 2 1 Research scholar, Noorul Islam University, Nagercoil, India Assistant Professor, Adhiparasakthi College
Self-Service Big Data Analytics for Line of Business
I D C A N A L Y S T C O N N E C T I O N Dan Vesset Program Vice President, Business Analytics and Big Data Self-Service Big Data Analytics for Line of Business March 2015 Big data, in all its forms, is
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant
Toward Effective Big Data Analysis in Continuous Auditing. By Juan Zhang, Xiongsheng Yang, and Deniz Appelbaum
Toward Effective Big Data Analysis in Continuous Auditing By Juan Zhang, Xiongsheng Yang, and Deniz Appelbaum Introduction New sources: emails, phone calls, click stream traffic, social media, news media,
TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS
9 8 TRENDS IN THE DEVELOPMENT OF BUSINESS INTELLIGENCE SYSTEMS Assist. Prof. Latinka Todoranova Econ Lit C 810 Information technology is a highly dynamic field of research. As part of it, business intelligence
A Hurwitz white paper. Inventing the Future. Judith Hurwitz President and CEO. Sponsored by Hitachi
Judith Hurwitz President and CEO Sponsored by Hitachi Introduction Only a few years ago, the greatest concern for businesses was being able to link traditional IT with the requirements of business units.
CHAPTER 1 INTRODUCTION
CHAPTER 1 INTRODUCTION 1.1 Background The command over cloud computing infrastructure is increasing with the growing demands of IT infrastructure during the changed business scenario of the 21 st Century.
Big Data Mining: Challenges and Opportunities to Forecast Future Scenario
Big Data Mining: Challenges and Opportunities to Forecast Future Scenario Poonam G. Sawant, Dr. B.L.Desai Assist. Professor, Dept. of MCA, SIMCA, Savitribai Phule Pune University, Pune, Maharashtra, India
Flattening Enterprise Knowledge
Flattening Enterprise Knowledge Do you Control Your Content or Does Your Content Control You? 1 Executive Summary: Enterprise Content Management (ECM) is a common buzz term and every IT manager knows it
Analyzing survey text: a brief overview
IBM SPSS Text Analytics for Surveys Analyzing survey text: a brief overview Learn how gives you greater insight Contents 1 Introduction 2 The role of text in survey research 2 Approaches to text mining
Big Data R&D Initiative
Big Data R&D Initiative Howard Wactlar CISE Directorate National Science Foundation NIST Big Data Meeting June, 2012 Image Credit: Exploratorium. The Landscape: Smart Sensing, Reasoning and Decision Environment
Big Data & Analytics: Your concise guide (note the irony) Wednesday 27th November 2013
Big Data & Analytics: Your concise guide (note the irony) Wednesday 27th November 2013 Housekeeping 1. Any questions coming out of today s presentation can be discussed in the bar this evening 2. OCF is
Data Mining Solutions for the Business Environment
Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania [email protected] Over
FUTURE RESEARCH DIRECTIONS OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING *
International Journal of Software Engineering and Knowledge Engineering World Scientific Publishing Company FUTURE RESEARCH DIRECTIONS OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING * HAIPING XU Computer
BIG DATA: CHALLENGES AND OPPORTUNITIES IN LOGISTICS SYSTEMS
BIG DATA: CHALLENGES AND OPPORTUNITIES IN LOGISTICS SYSTEMS Branka Mikavica a*, Aleksandra Kostić-Ljubisavljević a*, Vesna Radonjić Đogatović a a University of Belgrade, Faculty of Transport and Traffic
Big Data Text Mining and Visualization. Anton Heijs
Copyright 2007 by Treparel Information Solutions BV. This report nor any part of it may be copied, circulated, quoted without prior written approval from Treparel7 Treparel Information Solutions BV Delftechpark
CONNECTING DATA WITH BUSINESS
CONNECTING DATA WITH BUSINESS Big Data and Data Science consulting Business Value through Data Knowledge Synergic Partners is a specialized Big Data, Data Science and Data Engineering consultancy firm
Boarding to Big data
Database Systems Journal vol. VI, no. 4/2015 11 Boarding to Big data Oana Claudia BRATOSIN University of Economic Studies, Bucharest, Romania [email protected] Today Big data is an emerging topic,
Anatomy of a Decision
[email protected] @BlueHillBoston 617.624.3600 Anatomy of a Decision BI Platform vs. Tool: Choosing Birst Over Tableau for Enterprise Business Intelligence Needs What You Need To Know The demand
ASSESSMENT OF VISUALIZATION SOFTWARE FOR SUPPORT OF CONSTRUCTION SITE INSPECTION TASKS USING DATA COLLECTED FROM REALITY CAPTURE TECHNOLOGIES
ASSESSMENT OF VISUALIZATION SOFTWARE FOR SUPPORT OF CONSTRUCTION SITE INSPECTION TASKS USING DATA COLLECTED FROM REALITY CAPTURE TECHNOLOGIES ABSTRACT Chris Gordon 1, Burcu Akinci 2, Frank Boukamp 3, and
