lprof : A Non- intrusive Request Flow Pro5iler for Distributed Systems

Size: px

Start display at page:

Download "lprof : A Non- intrusive Request Flow Pro5iler for Distributed Systems"

Brian Gaines
10 years ago
Views:

1 lprof : A Non- intrusive Request Flow Pro5iler for Distributed Systems Xu Zhao, Yongle Zhang, David Lion, Muhammad FaizanUllah, Yu Luo, Ding Yuan, Michael Stumm Oct. 8th 2014

2 Performance analysis tools are needed Poor performance of distributed systems leads to o Increase of user latency o Increase of data center cost Distributed system behavior is hard to understand o Concurrent requests being processed by mullple nodes To diagnose poor performance, tools are needed to o Reconstruct the request control flow o Understand system behavior 2

to understand o Concurrent requests being processed by mullple nodes To diagnose poor

3 Existing tools are intrusive Instrument systems to infer request control flow o E.g. MagPie, Project 5, X- Trace, Dapper, etc. o Incur performance overhead o InstrumentaLons are oqen system specific 3

4 System logs contain rich information Rich informalon in logs is not coincidence o Developers rely on logs to perform manual debugging Distributed systems generate lots of logs o During normal execulon 7TB/month [Cloudera 13] 25TB/day [Rothschild 09] 4

debugging Distributed systems generate lots of logs o During

5 Existing log analyzers are limited Cannot infer request control flow o Machine learning based log analyzers E.g. [Xu 09], DISTALYZER, SynopGc, etc. Only detect system anomalies o Commercial tools E.g. splunk, VMWare LogInsight Require users to perform key- word based searches 5

Only detect system anomalies o Commercial tools E.g.

6 lprof: a non- intrusive pro5iler Infers request control flow from system logs o Along with Lming informalon o Group logs printed by the same request on mullple nodes o Use informagon generated by stagc analysis System logs Request control 5low Node 2 Node 1 request Node1 Node2 Thread1 Thread3 Thread2 Thread4 6

mullple nodes o Use informagon generated by stagc analysis System logs

7 Outline IntroducLon Case Study Design EvaluaLon 7

8 A real- world example Performance regression HDFS writeblock Latency for each type of request readblock (ms) Latency :00 08:00 12:00 16:00 Time 0 writeblock is suspecious 8

request readblock (ms) 1500 1000 Latency 500

9 (ms) 1000 Zoom into per- node latency Per- node Latency DN1 Latency DN3 DN2 DN1 DN3 DN pre- update writeblock Node post- update writeblock o Intra- node latency doesn t increases while inter- node does o Conclusion: unnecessary network communicalon 9

Node post- update writeblock o Intra- node latency doesn t

10 Outline IntroducLon Case Study Design EvaluaLon 10

11 Overview Byte code StaLc analysis lprof Model VisualizaLon Logs Log analysis Request database 11

12 Challenges Goal: to sltch log messages with respeclve requests DataNode2 Receiving block blk_01 DataNode1 Receiving block blk_01 Receiving block blk_02 Received block blk_01 HDFS_READ block blk_01 blk_01 terminating log snippet from HDFS data nodes writeblock1 writeblock2 readblock Logs are interleaved From different request types From different request instances of the same type Perfect idenlfiers doesn t always exist Distributed across mullple nodes 12

snippet from HDFS data nodes writeblock1 writeblock2 readblock Logs are interleaved From different request types

13 Code snippet in HDFS 1 dataxceiver() { 2 switch(opcode){ 3 case WRITE_BLOCK: 4 blk_id = getblock(); 5 writeblock(blk_id, ); 6 break; 7 case READ_BLOCK: 8 blk_id = getblock(); 9 readblock(blk_id, ); 10 break; 11 } 12} 13 writeblock(blk_id, ) { 14 log( Receiving block + blk_id); 15 new PacketResponder().start(); 16 } 17 readblock(blk_id, ) { 18 log( HDFS_READ block + blk_id); 19 } 20 PacketResponder.run() { 21 log( Received block + blk_id); log(blk_id + terminating ); 24 } Top level method - starlng method to process a request Request idenlfier - logged variable not modified in one request Log temporal order - possible order between log statements CommunicaLon behavior - communicalon between threads 13

$start(); 16 } 17 readblock(blk_id, ) { 18 log( HDFS_READ block + blk_id); 19 } 20 PacketResponder.$

14 Find top level method Find request idenlfiers IntuiLon Request analysis o Request idenlfiers already exist for manual debugging o Not modified within one request o Once modified, outside of the request 14

already exist for manual debugging o Not modified

15 Bokom- up analysis on call graph o Logged variables idenlfier candidates (IC) o Number of Lmes they got printed count Call Graph 1 dataxceiver () { 2 switch(opcode){ 3 case WRITE_BLOCK: Request analysis example 4 blk_id = getblock(); 5 writeblock(blk_id, ); 6 break; 7 case READ_BLOCK: 8 blk_id = getblock(); 9 readblock(blk_id, ); 10 break; 11 } 12} writeblock() IC: {blk_id} count: 8 dataxceiver() IC: {} count:0 readblock() IC: {blk_id} count: 7 top level method idengfiers writeblock readblock blk_id blk_id o Once count decreases, pick top level method and idenlfier 15

$8 blk_id = getblock(); 9 readblock(blk_id, ); 10 break; 11 } 12} writeblock() IC: {blk_id} count: 8 dataxceiver() IC: {} count:0 readblock()$

16 Temporal order analysis Control flow analysis in each top level method 20 PacketResponder.run() { 21 log( Received block + blk_id); log(blk_id + terminating ); 24 } temporal order 16

$run() { 21 log( Received block + blk_id); 22$

17 Communication pair analysis CommunicaLon between request top level methods o Intra- node: thread crealon, shared objects o Network: socket, RPC Pair serializing and de- serializing methods 17

18 Summary of static analysis output Top level method & request idenlfier top level method idengfiers writeblock() blk_id readblock() blk_id Log temporal order PacketResponder.run() CommunicaLon pair writeblock() type: Network id: blk_id writeblock() type: Thread CreaLon id: blk_id writeblock() PacketResponder.run() 18

19 Distributed log stitching Implemented as a MapReduce job Node 3 Node 2 Node 1 1 Receiving blk_01 2 Received blk_01 3 blk_01 terminating Node 1 s log writeblock [log1],blk_01 PktRsp.run [log2,3],blk_01 [log1,2,3],blk_01 writeblock [9 logs from 3 nodes], blk_01 Map Combine Reduce 19

1 s log writeblock [log1],blk_01 PktRsp.

20 Outline IntroducLon Case Study Design EvaluaLon 20

21 Evaluation methodology Evaluated on logs from 4 distributed systems o HDFS, Yarn, HBase, Cassandra o Logs generated on 200 Amazon EC2 nodes o HiBench, YCSB workload Authors manually verified each unique log sequence logs Receiving block Received block terminating writeblock log sequence 21

22 Request attribution accuracy System Correct Incomplete Failed Incorrect HDFS Yarn Cassandra HBase accuracy for all the log messages 97.0% 79.6% 95.3% 90.6% 0.1% 19.2% 0.1% 2.5% 2.6% 1.2% 4.6% 3.4% 0.3% 0.0% 0.0% 3.5% Average 90.4% 5.7% 3.0% 1.0% 22

23 Real- world performance anomalies Randomly selected 23 anomalies o Reproduced each one to collect logs lprof is helpful for idenlfying the root cause for 65% Reasons for the cases lprof cannot help o Abnormal requests don t print any logs o The abnormal request only print 1 log But latency is needed for debugging 23

24 Related work Intrusive tools o E.g. MagPie, Project 5, X- Trace, Dapper, etc. ExisLng log analyzers o E.g. [Xu 09], DISTALYZER, SynopGc, etc. The Mystery Machine [Chow 14] o Infers request flow across soqware layers o Analyzes crilcal path and slack o But requires instrumenlng IDs into logs 24

25 Conclusions lprof: a profiler for distributed system o Infers request control flow along with Lming informalon o Non- intrusive because enlrely from system logs o Analyzes logs with informalon generated by stalc analysis lprof leverages the natural way developers do logging Demo 25

26 Limitations lprof benefits from good logging praclce o lprof cannot help when there s no log o Timestamp is required for latency analysis o Good idenlfier can improve the accuracy lprof cannot infer request across soqware layers lprof currently works on Java byte code 26

Benchmarking Hadoop & HBase on Violin

Technical White Paper Report Technical Report Benchmarking Hadoop & HBase on Violin Harnessing Big Data Analytics at the Speed of Memory Version 1.0 Abstract The purpose of benchmarking is to show advantages