Automated Diagnosis of Chronic Performance Problems in Production Systems
|
|
|
- Ursula Gilmore
- 10 years ago
- Views:
Transcription
1 Automated Diagnosis of Chronic Performance Problems in Production Systems Soila P. Kavulya Christos Faloutsos, CMU Greg Ganger, CMU Matti Hiltunen, AT&T Labs Priya Narasimhan, CMU (Advisor)
2 Motivation Major outages are rare in production systems Such blackouts also detected by alarms Chronic performance problems (brownouts) System still works, but with degraded performance Problem typically affects subset of users/requests Admins often unaware until the user complains Chronics are fairly common Production VoIP system: Source of 42% of failed calls in worst month of outages Production Hadoop cluster (OpenCloud): Source of 78% of reported problems in 11-month period 2
3 Challenges (1) Scale: Thousands of network elements, millions of calls Under radar: Symptoms often not severe enough to trigger system alarms Application Servers Gateway Servers Customers with Non-IP service ISP IP Base Elements PSTN Multiple ongoing problems: Difficult to disambiguate Never seen before: First indication is cryptic customer complaint Customers with IP service 3
4 Challenges (2) Labeled failure-data not always available Difficult to diagnose problems not encountered before Desired level of instrumentation might not be possible Existing vendor instrumentation with limited control Cost of adding instrumentation might be high Instrumentation might be diverse, at different sampling rate 4
5 5
6 Thesis statement Approach Instrumentation Anomaly detection Problem localization Experimental evaluation Fault injection Case studies Extensions Conclusion Outline 6
7 Thesis Statement Diagnosis of chronic performance problems in production systems is possible through the analysis of common white-box logs to extract local behavior and system-wide dependencies, coupled with the analysis of common black-box metrics to identify the resource at fault. 7
8 Goals and Non-goals Goals of approach Diagnosis using existing instrumentation in production systems Anomaly detection in the absence of labeled failure-data Differentiation of workload changes from anomalies Non-goals Diagnosis of system-wide outages Diagnosis of value faults and transient faults Root-cause analysis at code-level 8
9 Assumptions Majority of the system is working correctly Problems manifest as observable behavioral changes Exceptions or performance degradations Visible to the end-user All instrumentation is locally time-stamped Clocks are synchronized to enable system-wide correlation of data Instrumentation faithfully captures system behavior 9
10 Target Systems for Validation VoIP system at large ISP 10s of millions of calls per day 1000s of network elements with heterogeneous hardware 24x7 Ops team uses alarm correlation to diagnose outages Separate team troubleshoots long-term chronics Labeled traces available Hadoop: Open-source implementation of MapReduce Diverse kinds of data-intensive workloads Graph mining, language translation Hadoop clusters have homogeneous hardware 400-node Yahoo! M45, 64-node OpenCloud clusters Controlled experiments in Amazon EC2 cluster Long running jobs (> 100s): Hard to label failures 10
11 Contributions VoIP HADOOP Anomaly Detection Problem Localization Types of chronics Experimental Evaluation Heuristics-based Localize to customer/networkelement/resource/error-code Exceptions, performance degradation, single-source, multiple-source Production VoIP system, 1000s of network elements Peer comparison without labeled data Localize to node/task/ resource Exceptions, performance degradation, single-source, multiple-source OpenCloud, 64 nodes Publications SLAML 11, OSR 11, DSN 12 WASL 08, HotMetrics 09, ISSRE 09, ICDCS 10, NOMS 10, CCGRID
12 Thesis statement Approach Instrumentation Anomaly detection Problem localization Experimental evaluation Fault injection Case studies Extensions Conclusion Outline 12
13 Overview of Approach visualizations to support root-cause inference Anomalous nodes Visualization white-box instrumentation black-box instrumentation White-box Analysis Black-box Analysis End-to-end flows Anomaly Detection Normalized blackbox metrics Labeled End-to-end flows Problem Localization list of problems ranked by severity 13
14 Black-Box Instrumentation For both Hadoop and VoIP Resource-usage metrics collected periodically from OS Monitoring interval varies from 1s to 15min Examples of metrics CPU utilization, CPU run-queue size Pages in, pages out Memory used, memory free Context-switches Packets received, packets sent Disk blocks read, disk blocks written 14
15 White-Box Instrumentation Each node logs each request that passes through it Timestamp, IP address, request duration/size, phone no., Log formats vary across components Application-specific parsers extract relevant attributes Construction of end-to-end traces Extract control flow information Control flow captures sequence of events executed E.g., dependencies between Maps and Reduces in Hadoop Extract data flow information Data flow captures transfer of data between components Stitch end-to-end flows using control and data flow information 15
16 Target Systems Instrumentation Hadoop Clusters (OpenCloud, Yahoo! M45) ISP s VoIP System Master JobTracker NameNode Application Servers Application Servers Application Servers ISP Gateways Job Job IP Base Elements Slaves Maps Reduces Maps TaskTracker Reduces PSTN DataNode HDFS Customers Customers 16
17 White-Box Analysis visualizations to support root-cause inference Anomalous nodes Visualization white-box instrumentation black-box instrumentation White-box Analysis Black-box Analysis End-to-end flows Anomaly Detection Normalized blackbox metrics Labeled End-to-end flows Problem Localization list of problems ranked by severity Questions How do we extract local control- and data-flow? How do we infer dependencies with other components? How do we deal with missing dependency information? 17
18 White-Box Logs Hadoop Logs control flow :59:55,625 INFO org.apache.hadoop.mapred.tasktracker: attempt_ _9630_m_013846_ % Records R/W=208/1! :59:55,943 INFO org.apache.hadoop.mapred.tasktracker: Sent out bytes for reduce: 37 from map: attempt_ _9630_m_013677_0 given 43000! VoIP Call Detail Record Mon Apr 29 07:30: ! NAS-Identifier = other! Acct-Status-Type = Accounting-off! NAS-IP-Address = ! Client-IP-Address = ! Acct-Input-Packets = 5! Acct-Output-Packets = 5! Acct-Input-Octets = 100! Acct-Output-Octets = 2789! control and data flow control flow data flow 18
19 White-Box Analysis: Hadoop 1 Each node logs task (or block) info. locally TaskTracker 6 10:03:59, MAP LaunchTaskAction! task_188_m_98! 2 Domain-specific knowledge used to extract attributes of interest 3 Infer dependency using exact key match on task ID. TaskTracker 8 10:03:59, SHUFFLE task_656_r_900 copying task_188_m_98!! 10:03:59, REDUCE task_656_r_ ! 4 Match on IP address within given time window Datanode 34 10:04:01, BLOCK_WRITE blk_ ! to ! 19
20 White-Box Analysis: VoIP 1 Each node logs call outcomes locally in Call Detail Record IP Base Element 3 2 IPBE CDR! 10:03:59, START! to ! to ! 10:04:02, STOP! Domain-specific knowledge used to extract attributes of interest 3 Infer dependency using exact key match on phone no. Application Server 5 AS CDR! 10:04:05, ATTEMPT! to \! 4 Approx match on partial phone no. and time Gateway Server 12 GS CDR! 10:04:10, ATTEMPT! xxxx to xxxx! 20
21 Output of White-box Analysis Inferred dependencies between components Reduce! Block Write! Timestamp: 10:04:01 Type: task_656_r_900! From IP: From IP: ! Timestamp: 10:04:01 Type: BLOCK_WRITE! BlockID: blk_ ! From IP: To IP: ! Unstructured logs transformed into structured log 21
22 Anomaly Detection visualizations to support root-cause inference Anomalous nodes Visualization white-box instrumentation black-box instrumentation White-box Analysis Black-box Analysis End-to-end flows Anomaly Detection Normalized blackbox metrics Labeled End-to-end flows Problem Localization list of problems ranked by severity Questions How to detect performance problems in the absence of labeled data? How to distinguish legitimate application behavior vs. problems? 22
23 Anomaly Detection Reduce! Block Write! Some user-visible problems manifest as errors Detected by extracting error codes from failed flows, or Apply domain-specific heuristics Performance problems can be harder to detect Exploit the notions of peers to detect performance problems Determine what system behaviors can be considered equivalent ( peers ) under normal conditions Significant deviation from peers is regarded anomalous 23
24 24 rika (Swahili), noun. peer, contemporary, age-set, undergoing rites of passage (marriage) at similar times.
25 Anomaly Detection: Hadoop (1) Reduce! Block Write! Extract exceptions from failed and canceled tasks Detect performance problems using peers Empirical analysis of production data to identify peers 219,961 successful jobs (Yahoo! M45 and OpenCloud) 89% of jobs had low variance in their Map durations 65% of jobs had low variance in their Reduce durations Designate tasks belonging to the same job as peers 25
26 Anomaly Detection: Hadoop (2) At the same time, behavior amongst peers can legitimately diverge due to various application factors Identified 12 such factors on OpenCloud Example: HDFS bytes written/read Flag tasks which do not fit regression model as anomalous Exploit regression to automatically learn factors influencing task durations 26
27 Anomaly Detection: VoIP IP Base Element! App. Server! Gateway Server! Detecting blocked or dropped calls Extract defect codes (e.g., timeout) from failed calls Exploit domain-specific heuristics to detect problems Examples: Callback soon after call end, zero talk-time Detecting performance problems Designate peers to be calls belonging to the same service Simple statistical technique to detect peers when deviate Packet loss exceeds 90 th percentile of calls 27
28 Output of Anomaly Detection Labeled end-to-end flows FAIL Reduce! Block Write! Block Read! SUCCESS Map! Block Write! 28
29 Problem Localization visualizations to support root-cause inference Anomalous nodes Visualization white-box instrumentation black-box instrumentation White-box Analysis Black-box Analysis End-to-end flows Anomaly Detection Normalized blackbox metrics Labeled End-to-end flows Problem Localization list of problems ranked by severity Questions How to identify problems due to combination of factors? How to distinguish multiple ongoing problems? How to find resource that caused the problem? How to handle noise due to flawed anomaly detection? 29
30 Identify Suspect Attributes (1) STEP 1: Find individual attributes most likely to occur in failed flows Labeled end-to-end traces generated by anomaly-detection Task1: 09:31am,SUCCESS, Server1, Map2, BlockRead3,! Task2: 09:32am,FAIL, Server1, Map3, Server8, BlockRead5,!!:!!:! 10s of millions of flows 10s of thousands of attributes Server1! Server8! Map2! Map3! Outcome! Task1! 1! 0! 1! 0! SUCCESS! Task2! 1! 1! 0! 1! FAIL! Binary representation supports scalable representation of attributes as sparse table. 30
31 Identify Suspect Attributes (2) STEP 1: (contd) Estimate conditional probability distributions Prob(Success Attribute) vs. Prob(Failure Attribute) Update belief on distribution with each flow seen Success Server8! Failure Server8! Degree of Belief Anomaly score: Distance between distributions Probability 31
32 Find Attribute Combinations STEP 2: Determine if chronic is triggered by combination of factors Find attribute combinations that maximize anomaly score Greedy, iterative search limits combinations explored Step 1: All flows 120! 350! Step 2: Remove all flows that match Signature1. Repeat. 290! Server6! Server8! Server6! 450! Block Read! Map 65! Shuffles! Indict path with highest anomaly score 32
33 Rank Problems by Severity STEP 3: Rank list of identified problems by severity Rank problems based on number of flows affected Spurious attributes introduced by noise receive low-rank UI: Ranked list of chronics identified 1. Chronic signature1 Server8! BlockRead! 2. Chronic signature2 Server64! Failed Flows Time of Day (GMT) Visualization allows operators to identify recurrent problems 33
34 Fusing Black-box Metrics STEP 4: Determine if resource-usage metrics affected Annotate flows associated with culprit nodes (and peers) Culprit Node Peer Peer Server 8 Time: 10:03:59,! Map ID: task_188_m_98 Bytes Read: 7867! Duration: 25 seconds! Status: FAILED! Server 10 Time: 10:03:59,! Map ID: task_188_m_76 Bytes Read: 7867! Duration: 3 seconds! Status: SUCCESS! Server 13 Time: 10:03:59,! Map ID: task_188_m_85 Bytes Read: 6863! Duration: 2 seconds! Status: SUCCESS! Mean CPU: 70.4% Mean Memory: 500MB Mean DiskUtil: 30KB Mean CPU: 12.4% Mean Memory: 430MB Mean DiskUtil: 32KB Mean CPU: 15.4% Mean Memory: 480MB Mean DiskUtil: 23KB Mean resource-usage on node during event duration 34
35 Culprit Black-Box Metrics STEP 4: (contd) Compare distribution of each black-box metric for successful/failed flows Successful Flows Failed Flows Degree of Belief CPU usage (%) Indict metric if difference between distributions is statistically significant 35
36 Thesis statement Approach Instrumentation Anomaly detection Problem localization Experimental evaluation Fault injection Case studies Extensions Conclusion Outline 36
37 Experimental Evaluation CASE STUDIES FAULT INJECTION HADOOP VOIP Workload Gridmix cluster benchmark One-week of ISP s call logs Injected faults Experimental setup Production Sytem Status Resource hogs/task hangs 10 iterations per fault 10-node EC2 cluster 2 1.2GHz cores, 7GB RAM OpenCloud Post-mortem offline analysis of real incidents Server/Customer problems 1000 simulated faults in total 1 simulation node 2 2.4GHz cores, 16GB RAM Production VoIP system at ISP Used in production for 2 years since
38 Expt #1: Impact of Dependencies QUESTION: Does knowledge of dependencies affect diagnosis? METHOD: Hadoop EC2 cluster, 10 nodes, fault injection. Apply problem localization with white-box metrics. Compare against approach without knowledge of dependencies. Disk Hog Map hang Reduce hang Flow-level 38 Without dependencies With dependencies Packet loss Knowledge of dependencies improves diagnosis
39 Expt #2: Impact of Fusion QUESTION: Does fusion of metrics provide insight on root-cause? METHOD: Hadoop EC2 cluster, 10 nodes, fault injection. Apply problem localization with fused white/black-box metrics. Top Metrics Indicted Fault Injected White box Black-box Insight on root-cause Disk hog Maps Disk Packet-loss Shuffles - Map hang (Hang1036) Maps - Reduce hang (Hang1152) Reduces - Fusion of metrics provides insight on most injected faults 39
40 Expt #3: Impact of Fault Probability QUESTION: Can we effectively diagnose low-probability faults? METHOD: One week of ISP s call logs, 1 node, fault simulation. Randomly label 1-10% of flows with attributes of interest as faulty. Apply problem localization with white-box metrics. Robust to changes in fault probability 40
41 Expt #4: Impact of Noise QUESTION: Does flawed anomaly-detection (noise) impact diagnosis? METHOD: One week of ISP s call logs, 1 node, fault simulation. Mislabel 5-20% of failed flows. Apply problem localization with white-box metrics 41 Recall robust to noisy labels. Precision drops due to spurious attributes.
42 Case #1: Multiple Hardware Issues INCIDENT: Multiple hardware problems in OpenCloud cluster User experiences multiple job failures with cryptic exceptions. Administrators initially suspected memory configuration issue. Took a week to resolve. Bad disk and bad NIC on two nodes. DIAGNOSIS APPLIED Apply problem-localization approach with white-box metrics. Correctly identified nodes with bad hardware in top-10 ranked list Identified multiple simultaneous problems affecting user s job. 42
43 Case #2: Quality (QoS) Violations INCIDENT: Calls in VoIP system experiencing high packet-loss Operators suspect issue with network elements at ISP. Took a weeks to resolve. DIAGNOSIS APPLIED Flag calls whose packet-loss exceeds 85 th percentile as faulty. Apply problem-localization approach with white-box metrics. Showed most QoS issues were tied to specific customers. Operators alerted customers of problem on their site. Customer fixed problem and QoS violation resolved. 43
44 Case #3: Performance Problem INCIDENT: Intermittent performance problem in VoIP system Intermittent performance problem with two application servers. Affected 0.1% of all calls passing through application servers. DIAGNOSIS APPLIED Apply problem-localization using white/black-box metrics. Post-mortem analysis confirmed issue with application servers. Also flagged anomalous CPU and memory usage. Successfully identified low-severity fault affecting multiple servers. 44
45 Thesis statement Approach Instrumentation Anomaly detection Problem localization Experimental evaluation Fault injection Case studies Extensions Conclusion Outline 45
46 Lessons Learned (1) Synthesis of end-to-end causal traces possible Local logs capture local control- and data-flow info Approximate-matching infers implicit dependencies In absence of labeled data, peer-comparison is feasible approach for anomaly detection Peers can be tasks (Hadoop), end-to-end flows, calls within the same service (VoIP) Regression can help to differentiate between Legitimate application behavior (more bytes read/written) vs. Anomalous behavior (task taking longer to run for other unexplained reasons) 46
47 Lessons Learned (2) Important to analyze both successful and failed flows Limiting analysis to only failed flows might elevate common elements over causal elements Fusion of white+black-box data can provide more insight into source of problem Ranking problems by severity helps tolerate noise Spurious labels receive lower ranking 47
48 Limitations No diagnosis for the Master node of a Hadoop cluster Problems at master typically result in system-wide issues Peer-groups are defined statically Need to automate identification of peers False positives occur if root-cause not in logs Algorithm tends to implicate adjacent network elements Need to incorporate more data to improve visibility Does not detect dormant problems that do not impact user-perceived system behavior Examples: Blacklisted nodes in Hadoop 48
49 Extensions (Future Work) Visualization in heterogeneous systems User study on diagnosis interfaces in Hadoop [CHIMIT11] Visual signatures of problems in Hadoop [LISA12] Visual signatures of problems in heterogeneous systems Extensible visualization framework for diagnosis Future Work Online monitoring and diagnosis Generic framework for monitoring and diagnosis [WADS09] Streaming implementation of problem-localization [DSN12] Scalable monitoring and diagnostic framework 49
50 Visualization visualizations to support root-cause inference Visualization white-box instrumentation White-box Analysis Black-box Analysis Anomaly Detection Problem Localization list of problems ranked by severity black-box instrumentation Questions How to develop compact visualizations for large clusters? Can visualizations help spot/discriminate different anomalies? 50
51 Theia: Visual Signatures of Problems Maps anomalies observed to broad problem classes Hardware failures, application issue, data skew Supports interactive data exploration Users drill-down from cluster- to job-level displays Hovering over the visualization gives more context Compact representation for scalability Can support clusters with 100s of nodes Performance degradation due to failing disk *USENIX LISA 2012 Best Student-Paper Award 1 26 Node ID Job ID (sorted by start time) 51
52 Conclusion Approach for diagnosis of chronic problems Amenable for use in production systems Infers dependencies from existing white-box logs Uses heuristics and peer-comparison to detect anomalies Localizes source of problem using statistical approach Incorporates both white-box and black-box logs Demonstrated for two production systems VoIP system at ISP (approach deployed for 2 years now) OpenCloud Hadoop cluster Initial progress on extensions (visualization) 52
53 Collaborators & Thanks VoIP (AT&T) Matti Hiltunen, Kaustubh Joshi, Scott Daniels Hadoop visualization Christos Faloutsos, U Kang, Elmer Garduno, Jason Campbell (Intel), HCI team OpenCloud Greg Ganger, Garth Gibson, Julio Lopez, Kai Ren, Mitch Franzos, Michael Stroucken Hadoop diagnosis Jiaqi Tan, Xinghao Pan, Rajeev Gandhi, Keith Bare, Michael Kasick, Eugene Marinelli Soila March 2012" 53
54 Publications (1) Diagnosis in VoIP Visualization, User studies, Surveys White-box diagnosis Black-box diagnosis 1. S. P. Kavulya, S. Daniels, K. Joshi, M. Hiltunen, R. Gandhi, P. Narasimhan. Draco: Statistical Diagnosis of Chronic Problems in Large Distributed Systems. IEEE Dependable Systems and networks (DSN 12), Boston, MA, Jun S. P. Kavulya, K. Joshi, M. Hiltunen, S. Daniels, R. Gandhi, P. Narasimhan. Practical Experiences with Chronics Discovery in Large Telecommunications Systems. Best Papers from SLAML in Operating Systems Review (OSR 12), S. P. Kavulya, K. Joshi, M. Hiltunen, S. Daniels, R. Gandhi, P. Narasimhan. Practical Experiences with Chronics Discovery in Large Telecommunications Systems. Workshop on Managing Large-Scale Systems via the Analysis of System Logs and the Application of Machine Learning Techniques (SLAML 11), E. Garduno, S. Kavulya, J. Tan, R. Gandhi and P. Narasimhan. Theia: Visual Signatures for Problem Diagnosis in Large Hadoop Clusters. In Large Installation System Administration Conference (LISA) 2012, San Diego, CA, Dec Best Student Paper Award. 5. S. P. Kavulya, K. Joshi, F. Di Giandomenico, P. Narasimhan. Failure Diagnosis of Complex Systems.Book on Resilience Assessment and Evaluation (RAE 12). Wolter, J. Campbell, A. Ganesan, B. Gotow, S. Kavulya, J. Mulholland, P. Narasimhan, S. Ramasubramanian, M. Shuster, and J. Tan. Understanding and Improving the Diagnostic Workflow of MapReduce Users. In 5th ACM Symposium on Computer Human Interaction for Management of Information Technology (CHIMIT), Boston, MA, Dec S. Kavulya, J. Tan, R. Gandhi, P. Narasimhan. An Analysis of Traces from a Production MapReduce Cluster. 10th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) 2010, Melbourne, Victoria, Australia, May J. Tan, S. Kavulya, R. Gandhi, P. Narasimhan. Visual, Log-based Causal Tracing for Performance Debugging of MapReduce Systems. 30th IEEE International Conference on Distributed Computing Systems (ICDCS) 2010, Genoa, Italy, Jun J. Tan, X. Pan, S. Kavulya, R. Gandhi, P. Narasimhan. Mochi: Visual Log-Analysis Based Tools for Debugging Hadoop. USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '09), San Diego, CA, Jun J. Tan, X. Pan, S. Kavulya, R. Gandhi, P. Narasimhan. SALSA: Analyzing Logs as State Machines. USENIX Workshop on Analysis of System Logs (WASL 08), San Diego, CA, Dec J. Tan, S. Kavulya, R. Gandhi, P. Narasimhan. Lightweight Black-box Failure Detection for Distributed Systems. In Workshop on Management of Big Data systems (MBDS) 2012, co-located with the International Conference on Autonomic Computing, San Jose, SA, Sep X. Pan, S. Kavulya, J. Tan, R. Gandhi, P. Narasimhan. Ganesha: Black-Box Diagnosis for MapReduce Systems. Workshop on Hot Topics in Measurement & Modeling of Computer Systems (HotMetrics), Seattle, WA, Jun
55 Publications (2) Black-box + White box diagnosis 13. J. Tan, X. Pan, S. Kavulya, E. Marinelli, R. Gandhi, P. Narasimhan. Kahuna: Problem Diagnosis for MapReduce-based Cloud Computing Environments. 12th IEEE/IFIP Network Operations and Management Symposium (NOMS) 2010, Osaka, Japan, Apr X. Pan, J. Tan, S. Kavulya, R. Gandhi, P. Narasimhan. Blind Men and the Elephant: Piecing Together Hadoop for Diagnosis. 20th IEEE International Symposium on Software Reliability Engineering (ISSRE), Industrial Track, Mysuru, India, Nov S. Kavulya, R. Gandhi, P. Narasimhan. Gumshoe: Diagnosing Performance Problems in Replicated File-Systems. IEEE Symposium on Reliable Distributed systems (SRDS 08), Naples, Italy, October S. Pertet, R. Gandhi, P. Narasimhan. Fingerpointing Correlated Failures in Replicated Systems. SysML, April
56 Questions? 56 Soila March 2012"
57 Related Work (1) M. Attariyan, M. Chow, and J. Flinn, X-ray: Automating root-cause diagnosis of perfor- mance anomalies in production software. USENIX Symposium on Operating Systems Design and Implementation (OSDI 12), Hollywood, CA, October P. Bodík, M. Goldszmidt, A. Fox, D. B. Woodard, H. Andersen. Fingerprinting the datacenter: automated classification of performance crises. EuroSys [Cohen05]: Capturing, indexing, clustering and retrieving system history. Ira Cohen, Steve Zhang, Moises Goldszmidt, Julie Symons, Terence Kelly, Armando Fox. SOSP, [Kandula09]: Detailed diagnosis in enterprise networks. Srikanth Kandula, Ratul Mahajan, Patrick Verkaik, Sharad Agarwal, Jitendra Padhye, Paramvir Bahl. SIGCOMM [Kasick10]: Black-Box Problem Diagnosis in Parallel File Systems. Michael P. Kasick, Jiaqi Tan, Rajeev Gandhi, Priya Narasimhan. FAST
58 Related Work (2) [Kiciman05]: Detecting application-level failures in component-based Internet Services. Emre Kiciman, Armando Fox. IEEE Trans. on Neural Networks [Mahimkar09]: Towards automated performance diagnosis in a large IPTV network. Ajay Anil Mahimkar, Zihui Ge, Aman Shaikh, Jia Wang, Jennifer Yates, Yin Zhang, Qi Zhao. SIGCOMM [Sambasivan11]: Diagnosing Performance Changes by Comparing Request Flows. Raja R. Sambasivan, Alice X. Zheng, Michael De Rosa, Elie Krevat, Spencer Whitman, Michael Stroucken, William Wang, Lianghong Xu, and Gregory R. Ganger. NSDI [Tati12] S. Tati, B.-J. Ko, G. Cao, A. Swami, and T. F. L. Porta, Adaptive algorithms for diagnosing large-scale failures in computer networks. IEEE Conference on Dependable Systems and Networks (DSN), Boston, MA, Jun
Ganesha: Black-Box Diagnosis of MapReduce Systems
Ganesha: Black-Box Diagnosis of MapReduce Systems Xinghao Pan, Jiaqi Tan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan Electrical & Computer Engineering Department, Carnegie Mellon University Pittsburgh,
Abstract. 1 Introduction. 2 Problem Statement. 1 Mochi: Visual Log-Analysis Based Tools for Debugging Hadoop
1 Mochi: Visual Log-Analysis Based Tools for Debugging Hadoop Jiaqi Tan, Xinghao Pan, Soila Kavulya, Rajeev Gandhi, Priya Narasimhan Electrical & Computer Engineering Department Carnegie Mellon University,
Fingerprinting the datacenter: automated classification of performance crises
Fingerprinting the datacenter: automated classification of performance crises Peter Bodík 1,3, Moises Goldszmidt 3, Armando Fox 1, Dawn Woodard 4, Hans Andersen 2 1 RAD Lab, UC Berkeley 2 Microsoft 3 Research
Diagnosing Heterogeneous Hadoop Clusters
Diagnosing Heterogeneous Hadoop Clusters Shekhar Gupta, Christian Fritz, Johan de Kleer, and Cees Witteveen 2 Palo Alto Research Center, Palo Alto, CA 930, USA {ShekharGupta, cfritz, dekleer}@parccom 2
Hadoop Performance Monitoring Tools
Hadoop Performance Monitoring Tools Kai Ren, Lianghong Xu, Zongwei Zhou Abstract Diagnosing performance problems in large-scale data-intensive programs is inherently difficult due to its scale and distributed
SALSA: Analyzing Logs as StAte Machines 1
SALSA Analyzing s as StAte Machines 1 Jiaqi Tan, Xinghao Pan, Soila Kavulya, Rajeev Gandhi and Priya Narasimhan Electrical & Computer Engineering Department, Carnegie Mellon University {jiaqit, xinghaop,
Hadoop: A Framework for Data- Intensive Distributed Computing. CS561-Spring 2012 WPI, Mohamed Y. Eltabakh
1 Hadoop: A Framework for Data- Intensive Distributed Computing CS561-Spring 2012 WPI, Mohamed Y. Eltabakh 2 What is Hadoop? Hadoop is a software framework for distributed processing of large datasets
ASDF: Automated, Online Fingerpointing for Hadoop (CMU-PDL-08-104)
Research Centers and Institutes Parallel Data Laboratory Carnegie Mellon University Year 2008 ASDF: Automated, Online Fingerpointing for Hadoop (CMU-PDL-08-104) Keith Bare Michael P. Kasick Soila Kavulya
Mobile Cloud Computing for Data-Intensive Applications
Mobile Cloud Computing for Data-Intensive Applications Senior Thesis Final Report Vincent Teo, [email protected] Advisor: Professor Priya Narasimhan, [email protected] Abstract The computational and storage
Hadoop Architecture. Part 1
Hadoop Architecture Part 1 Node, Rack and Cluster: A node is simply a computer, typically non-enterprise, commodity hardware for nodes that contain data. Consider we have Node 1.Then we can add more nodes,
Fault Tolerance in Hadoop for Work Migration
1 Fault Tolerance in Hadoop for Work Migration Shivaraman Janakiraman Indiana University Bloomington ABSTRACT Hadoop is a framework that runs applications on large clusters which are built on numerous
Chapter 7. Using Hadoop Cluster and MapReduce
Chapter 7 Using Hadoop Cluster and MapReduce Modeling and Prototyping of RMS for QoS Oriented Grid Page 152 7. Using Hadoop Cluster and MapReduce for Big Data Problems The size of the databases used in
Draco: Top-Down Statistical Diagnosis of Large-scale VoIP Networks
Draco: Top-Down Statistical Diagnosis of Large-scale VoIP Networks Soila P. Kavulya, Kaustubh Joshi, Matti Hiltunen, Scott Daniels, Rajeev Gandhi,Priya Narasimhan Carnegie Mellon University, AT&T Labs
GraySort on Apache Spark by Databricks
GraySort on Apache Spark by Databricks Reynold Xin, Parviz Deyhim, Ali Ghodsi, Xiangrui Meng, Matei Zaharia Databricks Inc. Apache Spark Sorting in Spark Overview Sorting Within a Partition Range Partitioner
Introduction to Hadoop
Introduction to Hadoop 1 What is Hadoop? the big data revolution extracting value from data cloud computing 2 Understanding MapReduce the word count problem more examples MCS 572 Lecture 24 Introduction
Energy Efficient MapReduce
Energy Efficient MapReduce Motivation: Energy consumption is an important aspect of datacenters efficiency, the total power consumption in the united states has doubled from 2000 to 2005, representing
Apache Hadoop. Alexandru Costan
1 Apache Hadoop Alexandru Costan Big Data Landscape No one-size-fits-all solution: SQL, NoSQL, MapReduce, No standard, except Hadoop 2 Outline What is Hadoop? Who uses it? Architecture HDFS MapReduce Open
An Experimental Approach Towards Big Data for Analyzing Memory Utilization on a Hadoop cluster using HDFS and MapReduce.
An Experimental Approach Towards Big Data for Analyzing Memory Utilization on a Hadoop cluster using HDFS and MapReduce. Amrit Pal Stdt, Dept of Computer Engineering and Application, National Institute
Hadoop & its Usage at Facebook
Hadoop & its Usage at Facebook Dhruba Borthakur Project Lead, Hadoop Distributed File System [email protected] Presented at the Storage Developer Conference, Santa Clara September 15, 2009 Outline Introduction
System Behavior Analysis by Machine Learning
CSC456 OS Survey Yuncheng Li [email protected] December 6, 2012 Table of contents 1 Motivation Background 2 3 4 Table of Contents Motivation Background 1 Motivation Background 2 3 4 Scenarios Motivation
Exploiting Cloud Heterogeneity for Optimized Cost/Performance MapReduce Processing
Exploiting Cloud Heterogeneity for Optimized Cost/Performance MapReduce Processing Zhuoyao Zhang University of Pennsylvania, USA [email protected] Ludmila Cherkasova Hewlett-Packard Labs, USA [email protected]
Telecom Data processing and analysis based on Hadoop
COMPUTER MODELLING & NEW TECHNOLOGIES 214 18(12B) 658-664 Abstract Telecom Data processing and analysis based on Hadoop Guofan Lu, Qingnian Zhang *, Zhao Chen Wuhan University of Technology, Wuhan 4363,China
1.1 Difficulty in Fault Localization in Large-Scale Computing Systems
Chapter 1 Introduction System failures have been one of the biggest obstacles in operating today s largescale computing systems. Fault localization, i.e., identifying direct or indirect causes of failures,
Adaptive Probing: A Monitoring-Based Probing Approach for Fault Localization in Networks
Adaptive Probing: A Monitoring-Based Probing Approach for Fault Localization in Networks Akshay Kumar *, R. K. Ghosh, Maitreya Natu *Student author Indian Institute of Technology, Kanpur, India Tata Research
Task Scheduling in Hadoop
Task Scheduling in Hadoop Sagar Mamdapure Munira Ginwala Neha Papat SAE,Kondhwa SAE,Kondhwa SAE,Kondhwa Abstract Hadoop is widely used for storing large datasets and processing them efficiently under distributed
Large-Scale Data Sets Clustering Based on MapReduce and Hadoop
Journal of Computational Information Systems 7: 16 (2011) 5956-5963 Available at http://www.jofcis.com Large-Scale Data Sets Clustering Based on MapReduce and Hadoop Ping ZHOU, Jingsheng LEI, Wenjun YE
International Journal of Advancements in Research & Technology, Volume 3, Issue 2, February-2014 10 ISSN 2278-7763
International Journal of Advancements in Research & Technology, Volume 3, Issue 2, February-2014 10 A Discussion on Testing Hadoop Applications Sevuga Perumal Chidambaram ABSTRACT The purpose of analysing
Introduction to Hadoop. New York Oracle User Group Vikas Sawhney
Introduction to Hadoop New York Oracle User Group Vikas Sawhney GENERAL AGENDA Driving Factors behind BIG-DATA NOSQL Database 2014 Database Landscape Hadoop Architecture Map/Reduce Hadoop Eco-system Hadoop
A Cost-Evaluation of MapReduce Applications in the Cloud
1/23 A Cost-Evaluation of MapReduce Applications in the Cloud Diana Moise, Alexandra Carpen-Amarie Gabriel Antoniu, Luc Bougé KerData team 2/23 1 MapReduce applications - case study 2 3 4 5 3/23 MapReduce
Network-Aware Scheduling of MapReduce Framework on Distributed Clusters over High Speed Networks
Network-Aware Scheduling of MapReduce Framework on Distributed Clusters over High Speed Networks Praveenkumar Kondikoppa, Chui-Hui Chiu, Cheng Cui, Lin Xue and Seung-Jong Park Department of Computer Science,
Final Project Proposal. CSCI.6500 Distributed Computing over the Internet
Final Project Proposal CSCI.6500 Distributed Computing over the Internet Qingling Wang 660795696 1. Purpose Implement an application layer on Hybrid Grid Cloud Infrastructure to automatically or at least
The Performance Characteristics of MapReduce Applications on Scalable Clusters
The Performance Characteristics of MapReduce Applications on Scalable Clusters Kenneth Wottrich Denison University Granville, OH 43023 [email protected] ABSTRACT Many cluster owners and operators have
Research on Job Scheduling Algorithm in Hadoop
Journal of Computational Information Systems 7: 6 () 5769-5775 Available at http://www.jofcis.com Research on Job Scheduling Algorithm in Hadoop Yang XIA, Lei WANG, Qiang ZHAO, Gongxuan ZHANG School of
Lecture 32 Big Data. 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop
Lecture 32 Big Data 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop 1 2 Big Data Problems Data explosion Data from users on social
NetFlow Analysis with MapReduce
NetFlow Analysis with MapReduce Wonchul Kang, Yeonhee Lee, Youngseok Lee Chungnam National University {teshi85, yhlee06, lee}@cnu.ac.kr 2010.04.24(Sat) based on "An Internet Traffic Analysis Method with
Hadoop Scheduler w i t h Deadline Constraint
Hadoop Scheduler w i t h Deadline Constraint Geetha J 1, N UdayBhaskar 2, P ChennaReddy 3,Neha Sniha 4 1,4 Department of Computer Science and Engineering, M S Ramaiah Institute of Technology, Bangalore,
Detection of Distributed Denial of Service Attack with Hadoop on Live Network
Detection of Distributed Denial of Service Attack with Hadoop on Live Network Suchita Korad 1, Shubhada Kadam 2, Prajakta Deore 3, Madhuri Jadhav 4, Prof.Rahul Patil 5 Students, Dept. of Computer, PCCOE,
Distributed Framework for Data Mining As a Service on Private Cloud
RESEARCH ARTICLE OPEN ACCESS Distributed Framework for Data Mining As a Service on Private Cloud Shraddha Masih *, Sanjay Tanwani** *Research Scholar & Associate Professor, School of Computer Science &
http://www.wordle.net/
Hadoop & MapReduce http://www.wordle.net/ http://www.wordle.net/ Hadoop is an open-source software framework (or platform) for Reliable + Scalable + Distributed Storage/Computational unit Failures completely
Machine Learning Algorithm for Pro-active Fault Detection in Hadoop Cluster
Malaviya National Institute of Technology Department of Computer Science & Engineering Machine Learning Algorithm for Pro-active Fault Detection in Hadoop Cluster Winter Internship of: Agrima Seth Advisor:
Hadoop Performance Diagnosis By Post-execution Log Analysis
Hadoop Performance Diagnosis By Post-execution Log Analysis cs598 Project Proposal Zhijin Li, Chris Cai, Haitao Liu ([email protected],[email protected],[email protected]) University of Illinois
Towards Combining Online & Offline Management for Big Data Applications
Towards Combining Online & Offline Management for Big Data Applications Brian Laub, Chengwei Wang, Karsten Schwan, and Chad Huneycutt, Georgia Institute of Technology https://www.usenix.org/conference/icac14/technical-sessions/presentation/laub
Context-Aware Time Series Anomaly Detection for Complex Systems
Context-Aware Time Series Anomaly Detection for Complex Systems Manish Gupta 1, Abhishek B. Sharma 2, Haifeng Chen 2, Guofei Jiang 2 1 UIUC, 2 NEC Labs, America Abstract Systems with several components
Finding Insights & Hadoop Cluster Performance Analysis over Census Dataset Using Big-Data Analytics
Finding Insights & Hadoop Cluster Performance Analysis over Census Dataset Using Big-Data Analytics Dharmendra Agawane 1, Rohit Pawar 2, Pavankumar Purohit 3, Gangadhar Agre 4 Guide: Prof. P B Jawade 2
Research Article Hadoop-Based Distributed Sensor Node Management System
Distributed Networks, Article ID 61868, 7 pages http://dx.doi.org/1.1155/214/61868 Research Article Hadoop-Based Distributed Node Management System In-Yong Jung, Ki-Hyun Kim, Byong-John Han, and Chang-Sung
Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data
Introduction to Hadoop HDFS and Ecosystems ANSHUL MITTAL Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data Topics The goal of this presentation is to give
Welcome to the unit of Hadoop Fundamentals on Hadoop architecture. I will begin with a terminology review and then cover the major components
Welcome to the unit of Hadoop Fundamentals on Hadoop architecture. I will begin with a terminology review and then cover the major components of Hadoop. We will see what types of nodes can exist in a Hadoop
Self-Compressive Approach for Distributed System Monitoring
Self-Compressive Approach for Distributed System Monitoring Akshada T Bhondave Dr D.Y Patil COE Computer Department, Pune University, India Santoshkumar Biradar Assistant Prof. Dr D.Y Patil COE, Computer
Analysing Large Web Log Files in a Hadoop Distributed Cluster Environment
Analysing Large Files in a Hadoop Distributed Cluster Environment S Saravanan, B Uma Maheswari Department of Computer Science and Engineering, Amrita School of Engineering, Amrita Vishwa Vidyapeetham,
Failure Diagnosis of Complex Systems
Failure Diagnosis of Complex Systems Soila P. Kavulya 1, Kaustubh Joshi 2, Felicita Di Giandomenico 3, Priya Narasimhan 1 1 Carnegie Mellon University, PA, USA [email protected],[email protected] 2 AT&T
Scalable NetFlow Analysis with Hadoop Yeonhee Lee and Youngseok Lee
Scalable NetFlow Analysis with Hadoop Yeonhee Lee and Youngseok Lee {yhlee06, lee}@cnu.ac.kr http://networks.cnu.ac.kr/~yhlee Chungnam National University, Korea January 8, 2013 FloCon 2013 Contents Introduction
America s Most Wanted a metric to detect persistently faulty machines in Hadoop
America s Most Wanted a metric to detect persistently faulty machines in Hadoop Dhruba Borthakur and Andrew Ryan dhruba,[email protected] Presented at IFIP Workshop on Failure Diagnosis, Chicago June
Deploying Hadoop with Manager
Deploying Hadoop with Manager SUSE Big Data Made Easier Peter Linnell / Sales Engineer [email protected] Alejandro Bonilla / Sales Engineer [email protected] 2 Hadoop Core Components 3 Typical Hadoop Distribution
Keywords: Big Data, HDFS, Map Reduce, Hadoop
Volume 5, Issue 7, July 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Configuration Tuning
MapReduce and Hadoop Distributed File System
MapReduce and Hadoop Distributed File System 1 B. RAMAMURTHY Contact: Dr. Bina Ramamurthy CSE Department University at Buffalo (SUNY) [email protected] http://www.cse.buffalo.edu/faculty/bina Partially
Big Data - Infrastructure Considerations
April 2014, HAPPIEST MINDS TECHNOLOGIES Big Data - Infrastructure Considerations Author Anand Veeramani / Deepak Shivamurthy SHARING. MINDFUL. INTEGRITY. LEARNING. EXCELLENCE. SOCIAL RESPONSIBILITY. Copyright
Analysis and Modeling of MapReduce s Performance on Hadoop YARN
Analysis and Modeling of MapReduce s Performance on Hadoop YARN Qiuyi Tang Dept. of Mathematics and Computer Science Denison University [email protected] Dr. Thomas C. Bressoud Dept. of Mathematics and
Text Mining Approach for Big Data Analysis Using Clustering and Classification Methodologies
Text Mining Approach for Big Data Analysis Using Clustering and Classification Methodologies Somesh S Chavadi 1, Dr. Asha T 2 1 PG Student, 2 Professor, Department of Computer Science and Engineering,
A Study on Workload Imbalance Issues in Data Intensive Distributed Computing
A Study on Workload Imbalance Issues in Data Intensive Distributed Computing Sven Groot 1, Kazuo Goda 1, and Masaru Kitsuregawa 1 University of Tokyo, 4-6-1 Komaba, Meguro-ku, Tokyo 153-8505, Japan Abstract.
CSE 590: Special Topics Course ( Supercomputing ) Lecture 10 ( MapReduce& Hadoop)
CSE 590: Special Topics Course ( Supercomputing ) Lecture 10 ( MapReduce& Hadoop) Rezaul A. Chowdhury Department of Computer Science SUNY Stony Brook Spring 2016 MapReduce MapReduce is a programming model
Figure 1. The cloud scales: Amazon EC2 growth [2].
- Chung-Cheng Li and Kuochen Wang Department of Computer Science National Chiao Tung University Hsinchu, Taiwan 300 [email protected], [email protected] Abstract One of the most important issues
Cloud Platforms, Challenges & Hadoop. Aditee Rele Karpagam Venkataraman Janani Ravi
Cloud Platforms, Challenges & Hadoop Aditee Rele Karpagam Venkataraman Janani Ravi Cloud Platform Models Aditee Rele Microsoft Corporation Dec 8, 2010 IT CAPACITY Provisioning IT Capacity Under-supply
Lecture 5: GFS & HDFS! Claudia Hauff (Web Information Systems)! [email protected]
Big Data Processing, 2014/15 Lecture 5: GFS & HDFS!! Claudia Hauff (Web Information Systems)! [email protected] 1 Course content Introduction Data streams 1 & 2 The MapReduce paradigm Looking behind
Introduction to Hadoop
1 What is Hadoop? Introduction to Hadoop We are living in an era where large volumes of data are available and the problem is to extract meaning from the data avalanche. The goal of the software tools
Reference Architecture and Best Practices for Virtualizing Hadoop Workloads Justin Murray VMware
Reference Architecture and Best Practices for Virtualizing Hadoop Workloads Justin Murray ware 2 Agenda The Hadoop Journey Why Virtualize Hadoop? Elasticity and Scalability Performance Tests Storage Reference
This exam contains 13 pages (including this cover page) and 18 questions. Check to see if any pages are missing.
Big Data Processing 2013-2014 Q2 April 7, 2014 (Resit) Lecturer: Claudia Hauff Time Limit: 180 Minutes Name: Answer the questions in the spaces provided on this exam. If you run out of room for an answer,
CSE-E5430 Scalable Cloud Computing Lecture 2
CSE-E5430 Scalable Cloud Computing Lecture 2 Keijo Heljanko Department of Computer Science School of Science Aalto University [email protected] 14.9-2015 1/36 Google MapReduce A scalable batch processing
Comparative analysis of mapreduce job by keeping data constant and varying cluster size technique
Comparative analysis of mapreduce job by keeping data constant and varying cluster size technique Mahesh Maurya a, Sunita Mahajan b * a Research Scholar, JJT University, MPSTME, Mumbai, India,[email protected]
CiteSeer x in the Cloud
Published in the 2nd USENIX Workshop on Hot Topics in Cloud Computing 2010 CiteSeer x in the Cloud Pradeep B. Teregowda Pennsylvania State University C. Lee Giles Pennsylvania State University Bhuvan Urgaonkar
Efficacy of Live DDoS Detection with Hadoop
Efficacy of Live DDoS Detection with Hadoop Sufian Hameed IT Security Labs, NUCES, Pakistan Email: [email protected] Usman Ali IT Security Labs, NUCES, Pakistan Email: [email protected] Abstract
Hadoop Technology for Flow Analysis of the Internet Traffic
Hadoop Technology for Flow Analysis of the Internet Traffic Rakshitha Kiran P PG Scholar, Dept. of C.S, Shree Devi Institute of Technology, Mangalore, Karnataka, India ABSTRACT: Flow analysis of the internet
Hadoop & its Usage at Facebook
Hadoop & its Usage at Facebook Dhruba Borthakur Project Lead, Hadoop Distributed File System [email protected] Presented at the The Israeli Association of Grid Technologies July 15, 2009 Outline Architecture
Challenges for Data Driven Systems
Challenges for Data Driven Systems Eiko Yoneki University of Cambridge Computer Laboratory Quick History of Data Management 4000 B C Manual recording From tablets to papyrus to paper A. Payberah 2014 2
Software System Performance Debugging with Kernel Events Feature Guidance
Software System Performance Debugging with Kernel Events Feature Guidance Junghwan Rhee, Hui Zhang, Nipun Arora, Guofei Jiang, Kenji Yoshihira NEC Laboratories America Princeton, New Jersey 08540 USA Email:
Group Based Load Balancing Algorithm in Cloud Computing Virtualization
Group Based Load Balancing Algorithm in Cloud Computing Virtualization Rishi Bhardwaj, 2 Sangeeta Mittal, Student, 2 Assistant Professor, Department of Computer Science, Jaypee Institute of Information
Virtualizing Apache Hadoop. June, 2012
June, 2012 Table of Contents EXECUTIVE SUMMARY... 3 INTRODUCTION... 3 VIRTUALIZING APACHE HADOOP... 4 INTRODUCTION TO VSPHERE TM... 4 USE CASES AND ADVANTAGES OF VIRTUALIZING HADOOP... 4 MYTHS ABOUT RUNNING
A Framework for Performance Analysis and Tuning in Hadoop Based Clusters
A Framework for Performance Analysis and Tuning in Hadoop Based Clusters Garvit Bansal Anshul Gupta Utkarsh Pyne LNMIIT, Jaipur, India Email: [garvit.bansal anshul.gupta utkarsh.pyne] @lnmiit.ac.in Manish
Fingerprinting the Datacenter: Automated Classification of Performance Crises
Fingerprinting the Datacenter: Automated Classification of Performance Crises Peter Bodík University of California, Berkeley Armando Fox University of California, Berkeley Moises Goldszmidt Microsoft Research
Introduction to Cloud Computing
Introduction to Cloud Computing Cloud Computing I (intro) 15 319, spring 2010 2 nd Lecture, Jan 14 th Majd F. Sakr Lecture Motivation General overview on cloud computing What is cloud computing Services
Overview. Big Data in Apache Hadoop. - HDFS - MapReduce in Hadoop - YARN. https://hadoop.apache.org. Big Data Management and Analytics
Overview Big Data in Apache Hadoop - HDFS - MapReduce in Hadoop - YARN https://hadoop.apache.org 138 Apache Hadoop - Historical Background - 2003: Google publishes its cluster architecture & DFS (GFS)
A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM
A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM Sneha D.Borkar 1, Prof.Chaitali S.Surtakar 2 Student of B.E., Information Technology, J.D.I.E.T, [email protected] Assistant Professor, Information
Hadoop MapReduce and Spark. Giorgio Pedrazzi, CINECA-SCAI School of Data Analytics and Visualisation Milan, 10/06/2015
Hadoop MapReduce and Spark Giorgio Pedrazzi, CINECA-SCAI School of Data Analytics and Visualisation Milan, 10/06/2015 Outline Hadoop Hadoop Import data on Hadoop Spark Spark features Scala MLlib MLlib
BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB
BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB Planet Size Data!? Gartner s 10 key IT trends for 2012 unstructured data will grow some 80% over the course of the next
Testing Big data is one of the biggest
Infosys Labs Briefings VOL 11 NO 1 2013 Big Data: Testing Approach to Overcome Quality Challenges By Mahesh Gudipati, Shanthi Rao, Naju D. Mohan and Naveen Kumar Gajja Validate data quality by employing
Performance Comparison of SQL based Big Data Analytics with Lustre and HDFS file systems
Performance Comparison of SQL based Big Data Analytics with Lustre and HDFS file systems Rekha Singhal and Gabriele Pacciucci * Other names and brands may be claimed as the property of others. Lustre File
Snapshots in Hadoop Distributed File System
Snapshots in Hadoop Distributed File System Sameer Agarwal UC Berkeley Dhruba Borthakur Facebook Inc. Ion Stoica UC Berkeley Abstract The ability to take snapshots is an essential functionality of any
GraySort and MinuteSort at Yahoo on Hadoop 0.23
GraySort and at Yahoo on Hadoop.23 Thomas Graves Yahoo! May, 213 The Apache Hadoop[1] software library is an open source framework that allows for the distributed processing of large data sets across clusters
Big Data With Hadoop
With Saurabh Singh [email protected] The Ohio State University February 11, 2016 Overview 1 2 3 Requirements Ecosystem Resilient Distributed Datasets (RDDs) Example Code vs Mapreduce 4 5 Source: [Tutorials
