Targeted Resource Management in Multi-tenant Distributed Systems


Jonathan Mace (Brown University), Peter Bodik (MSR Redmond), Rodrigo Fonseca (Brown University), Madanlal Musuvathi (MSR Redmond)

Resource Management in Multi-Tenant Systems

A failure in one availability zone cascades to the shared control plane, causing thread pool starvation for all zones (April 2011, Amazon EBS failure)

An aggressive background task responds to increased hardware capacity with a deluge of warnings and logging (Aug., Azure Storage outage)

A code change increases database usage, shifting the bottleneck to an unmanaged application-level lock (Nov., Visual Studio Online outage)

Shared storage layer bottlenecks circumvent the resource management layer (2014, communication with Cloudera)

The result: degraded performance, violated SLOs, system outages

Containers and VMs isolate tenants from one another, but tenants still share common systems: storage, database, queueing, etc.

Retro monitors the resource usage of each tenant in near real-time and actively schedules tenants and activities. High-level, centralized policies encapsulate the resource management logic. Its abstractions are not specific to a resource type or system, and can achieve different goals: guarantee average latencies, fair-share a resource, etc.

Hadoop Distributed File System (HDFS)

HDFS NameNode: filesystem metadata. Metadata operations such as Rename are handled entirely by the NameNode.
HDFS DataNodes: replicated block storage. A Read first contacts the NameNode for metadata, then fetches data from the DataNodes.
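For concreteness, a minimal sketch of issuing both kinds of request through the standard Hadoop FileSystem client API; the paths are placeholders and the configuration is assumed to point at the cluster:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class HdfsClientExample {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();   // reads core-site.xml / hdfs-site.xml
        FileSystem fs = FileSystem.get(conf);

        // Metadata operation: handled entirely by the NameNode.
        fs.rename(new Path("/tmp/old-name"), new Path("/tmp/new-name"));

        // Read: block locations come from the NameNode, data from the DataNodes.
        try (FSDataInputStream in = fs.open(new Path("/tmp/new-name"))) {
            byte[] buf = new byte[8192];
            int n = in.read(buf);                   // an 8 KB read, as in the experiments later
            System.out.println("read " + n + " bytes");
        }
    }
}
```

Both kinds of request end up competing for shared server-side resources (RPC queues, locks, disks), which is the contention the rest of the talk targets.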

(Figures: an HDFS cluster with one NameNode and three DataNodes; time-series plots over time with a 0 to 500 y-axis.)

Many processes on the same machine share its resources: the HDFS DataNode sits on local storage, the HBase RegionServer runs on top of it, and the Hadoop YARN NodeManager hosts YARN containers running MapReduce tasks alongside the MapReduce Shuffler. (Figure: this stack repeated across three machines.)

Goals

Co-ordinated control across processes, machines, and services
Handle system-level and application-level resources
Principals: tenants and background tasks
Real-time and reactive
Efficient: only control what is needed

Architecture

(Figure: tenant requests entering the shared system.)

Workflows

Purpose: identify requests from different users and background activities
    e.g., all requests from a tenant over time
    e.g., data balancing in HDFS
The unit of resource measurement, attribution, and enforcement
Tracks a request across varying levels of granularity; orthogonal to threads, processes, network flows, etc.

Resources

Purpose: cope with the diversity of resources. Two things are needed:

1. Identify overloaded resources. Slowdown: the ratio of how slow the resource is now compared to its baseline performance with no contention, computed as (queue time + execute time) / execute time. E.g., 100ms queued plus 10ms executing gives a slowdown of 11.

2. Identify culprit workflows. Load: the fraction of current utilization that we can attribute to each workflow, measured as time spent executing. E.g., 10ms executing gives a load of 10.
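As a concrete illustration, here is a minimal sketch of how slowdown and per-workflow load could be aggregated from queue-time and execute-time measurements within one reporting interval; the class and its aggregation scheme are assumptions, not Retro's actual instrumentation library:

```java
import java.util.HashMap;
import java.util.Map;

// Aggregates one resource's measurements for a single reporting interval.
class ResourceStats {
    private long totalQueueNanos = 0;
    private long totalExecNanos = 0;
    private final Map<String, Long> execNanosByWorkflow = new HashMap<>();

    // Record one operation on this resource, attributed to a workflow.
    void record(String workflowId, long queueNanos, long execNanos) {
        totalQueueNanos += queueNanos;
        totalExecNanos += execNanos;
        execNanosByWorkflow.merge(workflowId, execNanos, Long::sum);
    }

    // Slowdown: (queue time + execute time) / execute time.
    // 100ms queued plus 10ms executing gives a slowdown of 11.
    double slowdown() {
        return totalExecNanos == 0 ? 1.0
                : (double) (totalQueueNanos + totalExecNanos) / totalExecNanos;
    }

    // Load of one workflow: the time it spent executing on this resource.
    long load(String workflowId) {
        return execNanosByWorkflow.getOrDefault(workflowId, 0L);
    }

    // Total load across all workflows, used to normalize per-workflow load.
    long totalLoad() {
        return totalExecNanos;
    }
}
```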

Control Points

Goal: enforce resource management decisions
Decoupled from resources
Rate-limit workflows, agnostic to the underlying implementation (e.g., token bucket, priority queue)
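For example, a minimal per-workflow token bucket: requests for a workflow call acquire() before proceeding, and the controller adjusts the refill rate. The class and method names are illustrative, not Retro's API, and the sketch assumes a positive rate:

```java
// Simple token bucket: 'rate' tokens per second, bursting up to 'burst' tokens.
class TokenBucket {
    private double rate;          // tokens per second, set by the controller (assumed > 0)
    private final double burst;   // maximum number of accumulated tokens
    private double tokens;
    private long lastRefillNanos = System.nanoTime();

    TokenBucket(double rate, double burst) {
        this.rate = rate;
        this.burst = burst;
        this.tokens = burst;
    }

    synchronized void setRate(double newRate) {
        refill();
        this.rate = newRate;
        notifyAll();              // wake waiters so they recompute their wait time
    }

    // Block the calling request until 'permits' tokens are available.
    synchronized void acquire(double permits) throws InterruptedException {
        while (true) {
            refill();
            if (tokens >= permits) {
                tokens -= permits;
                return;
            }
            long waitMillis = (long) Math.ceil(1000.0 * (permits - tokens) / rate);
            wait(Math.max(1, waitMillis));   // re-check after waiting
        }
    }

    private void refill() {
        long now = System.nanoTime();
        tokens = Math.min(burst, tokens + rate * (now - lastRefillNanos) / 1e9);
        lastRefillNanos = now;
    }
}
```

Because the decision is just "admit or delay this workflow's request", the same control point can sit in front of an RPC queue, a disk, or an application-level lock without knowing anything about the resource behind it.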

Putting it together, Retro has three parts:

1. Pervasive measurement: measurements of workflows, resources, and control points are aggregated locally, then reported centrally once per second.
2. Centralized controller: exposes a global, abstracted view of the system through the Retro Controller API; policies run in a continuous control loop.
3. Distributed enforcement: enforcement decisions are co-ordinated across control points using a distributed token bucket.

Retro acts as a control plane for resource management: because policies see a global, abstracted view of the system, they are easier to write and reusable.
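The measurement/controller/enforcement split suggests a natural shape for the controller's code. The sketch below uses assumed interface names (SystemView, Policy) whose methods mirror the calls made by the LatencySLO pseudocode that follows; they are illustrative, not Retro's actual API:

```java
import java.util.List;

// Global, abstracted view of the system that the controller exposes to policies.
interface SystemView {
    List<String> resources();
    List<String> controlPoints();
    double slowdown(String resource);                   // current slowdown of a resource
    double load(String resource);                       // total load on a resource
    double load(String workflow, String resource);      // load attributed to one workflow
    double latency(String workflow);                    // average end-to-end latency
    double latency(String workflow, String resource);   // latency spent at one resource
    double getRate(String controlPoint, String workflow);
    void setRate(String controlPoint, String workflow, double rate);
}

// A policy performs one step of its control loop against the current view.
interface Policy {
    void run(SystemView view);
}

class RetroControllerLoop {
    private final List<Policy> policies;
    private final SystemView view;

    RetroControllerLoop(List<Policy> policies, SystemView view) {
        this.policies = policies;
        this.view = view;
    }

    // Continuous control loop: one iteration per one-second reporting interval.
    void runForever() throws InterruptedException {
        while (true) {
            for (Policy p : policies) {
                p.run(view);     // read the latest measurements, adjust control point rates
            }
            Thread.sleep(1000);
        }
    }
}
```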

Policy Example: LatencySLO

H: high priority workflows, guaranteed 200ms average request latency
L: low priority workflows (use spare capacity)

The policy monitors latencies, attributes interference to low priority workflows, and throttles the interfering workflows.

Select the high priority workflow W with the worst performance:

    foreach candidate in H
        miss[candidate] = latency(candidate) / guarantee[candidate]
    W = candidate in H with max miss[candidate]

Weight low priority workflows by their interference with W:

    foreach rsrc in resources()              // importance of each resource for W
        importance[rsrc] = latency(W, rsrc) * log(slowdown(rsrc))
    foreach lopri in L                       // low priority workflow interference
        interference[lopri] = Σ_rsrc importance[rsrc] * load(lopri, rsrc) / load(rsrc)
    foreach lopri in L                       // normalize interference
        interference[lopri] /= Σ_k interference[k]

Throttle low priority workflows proportionally to their weight:

    foreach lopri in L
        if miss[W] > 1                       // throttle
            scalefactor = 1 - α * (miss[W] - 1) * interference[lopri]
        else                                 // release
            scalefactor = 1 + β
        foreach cpoint in controlpoints()    // apply new rates
            set_rate(cpoint, lopri, scalefactor * get_rate(cpoint, lopri))
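As a rough Java transcription of the pseudocode above, written against the SystemView and Policy interfaces sketched earlier (the guarantee map and the α and β constants are the policy's parameters; this is a sketch under those assumptions, not the paper's implementation):

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class LatencySloPolicy implements Policy {
    private final List<String> hipri;              // H: high priority workflows
    private final List<String> lopri;              // L: low priority workflows
    private final Map<String, Double> guarantee;   // latency SLO per high priority workflow
    private final double alpha;                    // throttle aggressiveness
    private final double beta;                     // release rate when the SLO is met

    LatencySloPolicy(List<String> hipri, List<String> lopri,
                     Map<String, Double> guarantee, double alpha, double beta) {
        this.hipri = hipri;
        this.lopri = lopri;
        this.guarantee = guarantee;
        this.alpha = alpha;
        this.beta = beta;
    }

    @Override
    public void run(SystemView view) {
        // Select the high priority workflow W with the worst SLO miss.
        String w = null;
        double worstMiss = Double.NEGATIVE_INFINITY;
        for (String c : hipri) {
            double miss = view.latency(c) / guarantee.get(c);
            if (miss > worstMiss) { worstMiss = miss; w = c; }
        }
        if (w == null) return;   // no high priority workflows configured

        // Importance of each resource for W.
        Map<String, Double> importance = new HashMap<>();
        for (String r : view.resources()) {
            importance.put(r, view.latency(w, r) * Math.log(view.slowdown(r)));
        }

        // Interference of each low priority workflow with W, then normalize.
        Map<String, Double> interference = new HashMap<>();
        double total = 0;
        for (String lo : lopri) {
            double i = 0;
            for (String r : view.resources()) {
                i += importance.get(r) * view.load(lo, r) / view.load(r);
            }
            interference.put(lo, i);
            total += i;
        }
        if (total == 0) total = 1;   // nothing to attribute; avoid dividing by zero

        // Throttle proportionally to interference, or release spare capacity.
        for (String lo : lopri) {
            double scale = worstMiss > 1
                    ? 1 - alpha * (worstMiss - 1) * (interference.get(lo) / total)
                    : 1 + beta;
            for (String cp : view.controlPoints()) {
                view.setRate(cp, lo, scale * view.getRate(cp, lo));
            }
        }
    }
}
```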

Other types of policy

Bottleneck Fairness: detect the most overloaded resource, then fair-share it between the tenants using it.
Dominant Resource Fairness: estimate demands and capacities from measurements.

The policies are concise, any resource can be the bottleneck (the policy doesn't care), and they are not system specific.
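To give a sense of how short such policies stay against the same abstractions, here is a sketch of the bottleneck fairness idea; the threshold and the 0.9/1.1 adjustment factors are arbitrary illustrative choices, and the paper's policy differs in detail:

```java
import java.util.List;

// Sketch: find the most overloaded resource, then nudge each workflow toward
// an equal share of the load on that resource.
class BottleneckFairnessPolicy implements Policy {
    private final List<String> workflows;
    private final double slowdownThreshold;   // only act when a resource is overloaded

    BottleneckFairnessPolicy(List<String> workflows, double slowdownThreshold) {
        this.workflows = workflows;
        this.slowdownThreshold = slowdownThreshold;
    }

    @Override
    public void run(SystemView view) {
        // Detect the most overloaded resource; the policy does not care which one it is.
        String bottleneck = null;
        double worst = slowdownThreshold;
        for (String r : view.resources()) {
            if (view.slowdown(r) > worst) { worst = view.slowdown(r); bottleneck = r; }
        }
        if (bottleneck == null) return;   // nothing overloaded: leave rates alone

        // Fair-share the bottleneck: throttle workflows above the average load on it,
        // release workflows below it.
        double fairShare = view.load(bottleneck) / workflows.size();
        for (String wf : workflows) {
            double scale = view.load(wf, bottleneck) > fairShare ? 0.9 : 1.1;
            for (String cp : view.controlPoints()) {
                view.setRate(cp, wf, scale * view.getRate(cp, wf));
            }
        }
    }
}
```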

Evaluation

Instrumentation

Retro is implemented in Java: an instrumentation library plus a central controller implementation.

To enable Retro: propagate a workflow ID within the application (like X-Trace, Dapper) and instrument resources with wrapper classes.

Overheads: resource instrumentation is automatic using AspectJ; modifying RPCs requires a modest number of lines per system; Retro's overhead is at most 1-2% on throughput and latency.
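Workflow ID propagation follows the same pattern as X-Trace and Dapper: keep the identifier in thread-local storage inside a process, copy it across thread hand-offs, and attach it to RPC metadata at process boundaries. A minimal sketch of the in-process half (names are illustrative, not Retro's library):

```java
// Thread-local carrier for the current workflow ID, in the style of X-Trace / Dapper.
final class WorkflowContext {
    private static final ThreadLocal<String> CURRENT = new ThreadLocal<>();

    static void set(String workflowId) { CURRENT.set(workflowId); }
    static String get() { return CURRENT.get(); }
    static void clear() { CURRENT.remove(); }

    // Wrap a task so it runs under the caller's workflow ID on another thread.
    static Runnable wrap(Runnable task) {
        String id = get();               // capture at hand-off time
        return () -> {
            set(id);
            try {
                task.run();
            } finally {
                clear();
            }
        };
    }
}
```

Resource wrapper classes can then read the current workflow ID at each operation to attribute measured queue and execute times; the AspectJ instrumentation mentioned above automates inserting those wrappers.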

Experiments

Systems: HDFS, YARN, MapReduce, HBase, ZooKeeper
Workflows: MapReduce jobs (HiBench), HBase (YCSB), HDFS clients, background data replication, background heartbeats
Resources: CPU, disk, network (all systems); locks, queues (HDFS, HBase)
Policies: LatencySLO, Bottleneck Fairness, Dominant Resource Fairness

Policies were evaluated for a mixture of systems, workflows, and resources, with results on clusters of up to 200 nodes; see the paper for full experiment results. This talk shows the LatencySLO policy results.

LatencySLO policy results

(Figure: SLO-normalized latency over time, in minutes, for six workflows (HBase table scan, HDFS mkdir, HBase cached table scan, HDFS read 8k, HBase read 1 row, HBase read 1 cached row), plotted against the SLO target.)

(Figure: slowdown over time for the contended resources: disk, CPU, HDFS NameNode lock, HDFS NameNode queue, HBase queue.)

(Figure: SLO-normalized latency over time with the SLO policy enabled, plotted against the SLO target.)

Conclusion

Resource management for shared distributed systems
Centralized resource management
Comprehensive: resources, processes, tenants, background tasks
Abstractions for writing concise, general-purpose policies: workflows, resources (slowdown, load), control points


More information

Realtime Apache Hadoop at Facebook. Jonathan Gray & Dhruba Borthakur June 14, 2011 at SIGMOD, Athens

Realtime Apache Hadoop at Facebook. Jonathan Gray & Dhruba Borthakur June 14, 2011 at SIGMOD, Athens Realtime Apache Hadoop at Facebook Jonathan Gray & Dhruba Borthakur June 14, 2011 at SIGMOD, Athens Agenda 1 Why Apache Hadoop and HBase? 2 Quick Introduction to Apache HBase 3 Applications of HBase at

More information

Hadoop. Apache Hadoop is an open-source software framework for storage and large scale processing of data-sets on clusters of commodity hardware.

Hadoop. Apache Hadoop is an open-source software framework for storage and large scale processing of data-sets on clusters of commodity hardware. Hadoop Source Alessandro Rezzani, Big Data - Architettura, tecnologie e metodi per l utilizzo di grandi basi di dati, Apogeo Education, ottobre 2013 wikipedia Hadoop Apache Hadoop is an open-source software

More information

Some quick definitions regarding the Tablet states in this state machine:

Some quick definitions regarding the Tablet states in this state machine: HBase Terminology Translation Legend: tablet == region Accumulo Master == HBase HMaster Metadata table == META TabletServer (or tserver) == RegionServer Assignment is driven solely by the Master process.

More information

Hadoop Distributed File System. Dhruba Borthakur June, 2007

Hadoop Distributed File System. Dhruba Borthakur June, 2007 Hadoop Distributed File System Dhruba Borthakur June, 2007 Goals of HDFS Very Large Distributed File System 10K nodes, 100 million files, 10 PB Assumes Commodity Hardware Files are replicated to handle

More information

Qsoft Inc www.qsoft-inc.com

Qsoft Inc www.qsoft-inc.com Big Data & Hadoop Qsoft Inc www.qsoft-inc.com Course Topics 1 2 3 4 5 6 Week 1: Introduction to Big Data, Hadoop Architecture and HDFS Week 2: Setting up Hadoop Cluster Week 3: MapReduce Part 1 Week 4:

More information

How To Scale Out Of A Nosql Database

How To Scale Out Of A Nosql Database Firebird meets NoSQL (Apache HBase) Case Study Firebird Conference 2011 Luxembourg 25.11.2011 26.11.2011 Thomas Steinmaurer DI +43 7236 3343 896 thomas.steinmaurer@scch.at www.scch.at Michael Zwick DI

More information

Hadoop & Spark Using Amazon EMR

Hadoop & Spark Using Amazon EMR Hadoop & Spark Using Amazon EMR Michael Hanisch, AWS Solutions Architecture 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda Why did we build Amazon EMR? What is Amazon EMR?

More information

Journal of science STUDY ON REPLICA MANAGEMENT AND HIGH AVAILABILITY IN HADOOP DISTRIBUTED FILE SYSTEM (HDFS)

Journal of science STUDY ON REPLICA MANAGEMENT AND HIGH AVAILABILITY IN HADOOP DISTRIBUTED FILE SYSTEM (HDFS) Journal of science e ISSN 2277-3290 Print ISSN 2277-3282 Information Technology www.journalofscience.net STUDY ON REPLICA MANAGEMENT AND HIGH AVAILABILITY IN HADOOP DISTRIBUTED FILE SYSTEM (HDFS) S. Chandra

More information

Forensic Clusters: Advanced Processing with Open Source Software. Jon Stewart Geoff Black

Forensic Clusters: Advanced Processing with Open Source Software. Jon Stewart Geoff Black Forensic Clusters: Advanced Processing with Open Source Software Jon Stewart Geoff Black Who We Are Mac Lightbox Guidance alum Mr. EnScript C++ & Java Developer Fortune 100 Financial NCIS (DDK/ManTech)

More information

Accelerating and Simplifying Apache

Accelerating and Simplifying Apache Accelerating and Simplifying Apache Hadoop with Panasas ActiveStor White paper NOvember 2012 1.888.PANASAS www.panasas.com Executive Overview The technology requirements for big data vary significantly

More information

Network-Aware Scheduling of MapReduce Framework on Distributed Clusters over High Speed Networks

Network-Aware Scheduling of MapReduce Framework on Distributed Clusters over High Speed Networks Network-Aware Scheduling of MapReduce Framework on Distributed Clusters over High Speed Networks Praveenkumar Kondikoppa, Chui-Hui Chiu, Cheng Cui, Lin Xue and Seung-Jong Park Department of Computer Science,

More information

Introduction to Cloud Computing

Introduction to Cloud Computing Introduction to Cloud Computing Cloud Computing I (intro) 15 319, spring 2010 2 nd Lecture, Jan 14 th Majd F. Sakr Lecture Motivation General overview on cloud computing What is cloud computing Services

More information

Introduction to Big data. Why Big data? Case Studies. Introduction to Hadoop. Understanding Features of Hadoop. Hadoop Architecture.

Introduction to Big data. Why Big data? Case Studies. Introduction to Hadoop. Understanding Features of Hadoop. Hadoop Architecture. Big Data Hadoop Administration and Developer Course This course is designed to understand and implement the concepts of Big data and Hadoop. This will cover right from setting up Hadoop environment in

More information

Storage Architectures for Big Data in the Cloud

Storage Architectures for Big Data in the Cloud Storage Architectures for Big Data in the Cloud Sam Fineberg HP Storage CT Office/ May 2013 Overview Introduction What is big data? Big Data I/O Hadoop/HDFS SAN Distributed FS Cloud Summary Research Areas

More information

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2 DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing Slide 1 Slide 3 A style of computing in which dynamically scalable and often virtualized resources are provided as a service over the Internet.

More information

Introduction to Cloud Computing

Introduction to Cloud Computing Discovery 2015: Cloud Computing Workshop June 20-24, 2011 Berkeley, CA Introduction to Cloud Computing Keith R. Jackson Lawrence Berkeley National Lab What is it? NIST Definition Cloud computing is a model

More information

BookKeeper. Flavio Junqueira Yahoo! Research, Barcelona. Hadoop in China 2011

BookKeeper. Flavio Junqueira Yahoo! Research, Barcelona. Hadoop in China 2011 BookKeeper Flavio Junqueira Yahoo! Research, Barcelona Hadoop in China 2011 What s BookKeeper? Shared storage for writing fast sequences of byte arrays Data is replicated Writes are striped Many processes

More information

Practical Performance Understanding the Performance of Your Application

Practical Performance Understanding the Performance of Your Application Neil Masson IBM Java Service Technical Lead 25 th September 2012 Practical Performance Understanding the Performance of Your Application 1 WebSphere User Group: Practical Performance Understand the Performance

More information

Pro Apache Hadoop. Second Edition. Sameer Wadkar. Madhu Siddalingaiah

Pro Apache Hadoop. Second Edition. Sameer Wadkar. Madhu Siddalingaiah Pro Apache Hadoop Second Edition Sameer Wadkar Madhu Siddalingaiah Contents J About the Authors About the Technical Reviewer Acknowledgments Introduction xix xxi xxiii xxv Chapter 1: Motivation for Big

More information

Multi-Datacenter Replication

Multi-Datacenter Replication www.basho.com Multi-Datacenter Replication A Technical Overview & Use Cases Table of Contents Table of Contents... 1 Introduction... 1 How It Works... 1 Default Mode...1 Advanced Mode...2 Architectural

More information

Matt Benton, Mike Bull, Josh Fyne, Georgie Mackender, Richard Meal, Louise Millard, Bogdan Paunescu, Chencheng Zhang

Matt Benton, Mike Bull, Josh Fyne, Georgie Mackender, Richard Meal, Louise Millard, Bogdan Paunescu, Chencheng Zhang GROUP 5 Hadoop Revision Report Matt Benton, Mike Bull, Josh Fyne, Georgie Mackender, Richard Meal, Louise Millard, Bogdan Paunescu, Chencheng Zhang 9 Dec 2010 Table of Contents 1. Introduction... 2 1.1.

More information

Datacenter Operating Systems

Datacenter Operating Systems Datacenter Operating Systems CSE451 Simon Peter With thanks to Timothy Roscoe (ETH Zurich) Autumn 2015 This Lecture What s a datacenter Why datacenters Types of datacenters Hyperscale datacenters Major

More information

Take An Internal Look at Hadoop. Hairong Kuang Grid Team, Yahoo! Inc hairong@yahoo-inc.com

Take An Internal Look at Hadoop. Hairong Kuang Grid Team, Yahoo! Inc hairong@yahoo-inc.com Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc hairong@yahoo-inc.com What s Hadoop Framework for running applications on large clusters of commodity hardware Scale: petabytes of data

More information

Distributed File Systems

Distributed File Systems Distributed File Systems Paul Krzyzanowski Rutgers University October 28, 2012 1 Introduction The classic network file systems we examined, NFS, CIFS, AFS, Coda, were designed as client-server applications.

More information