Operating System Support for Java Virtual Machine: A New Methodology of Boosting Java Runtime Performance
|
|
|
- Ethan Martin
- 10 years ago
- Views:
Transcription
1 Operating System Support for Java Virtual Machine: A New Methodology of Boosting Java Runtime Performance Shui Fu Hainan Telcom Corp. Hainan, China Steve Zhang Korea University Seoul, South Korea Abstract Java virtual machine (JVM) provides a virtualized execution environment for Java applications. To maintain safety and efficiency of the execution environment, JVM employs some major runtime sub-systems such as garbage collection, just-in-time compilation and lock management. Recent work found that the runtime overheads increase significantly when workload of Java applications increase. To solve this problem, researchers of academia and industry have developed various approaches of utilizing VM-OS-hardware interactions to improve Java runtime performance. In this article, I discussed recent representative works in this research area. Keywords-Java Virtual Machine; Garbage Collection I. INTRODUCTION Software systems are becoming more and more complex due to multiple software layers that include application servers, shared libraries and virtual machines. Moreover, the widespread adoption of chip-multiprocessor systems also promotes the use of thread-level parallelism in today s software development. For example, the execution of an EJB application needs to go through several layers from Java application server, Java virtual machine to operating system. As a middle layer, Java virtual machine needs several major runtime sub-systems to maintain efficiency and safeness of the execution environment, such as JIT (Just-In-Compiler), GC (Garbage Collector), lock management (or thread synchronization), etc. For large EJB applications, it is common for them to employ hundreds to thousands of threads. Xian et al. studied SPECjAppServer2004 (as shown in Figure 1) [11][24][25]and found that the overhead of JVM runtime system increases significantly when workload and number of threads increase. To reduce overhead of Java runtime systems, some techniques have been implemented inside Java virtual machines to tune performance through utilizing runtime information. For example, method invocation information has been used by the VMs to select the complexity of dynamic compilation optimizations. Heap usage information can be used for dynamically resizing heap to reduce the frequency of garbage collections. However, the above traditional approaches have limited power of improving virtual machine performance. To support highly scalable and efficient Java application servers, a new technology trend has emerged which makes Java virtual machine interact with operating systems and hardware supports to improve efficiency of runtime systems. Recently, Azul Systems Inc. developed Azul Virtual Machine (AVM) [1] which removes the limitations of general-purposed Java virtual machines. AVM runs on top of customized operating system and Azul Vegas processors and it can scales up to 864 processor cores and 670GB memory heap. Similarly, there are some other research efforts to improve Java virtual machine performance by making operating systems aware of virtual machine runtime activities. In the following sections, I will present recent works on utilizing OS and architecutal support to reduce Java runtime overheads. Figure 1. Accumulative time spent in major runtime functions[25] /10/$26.00 C 2010 IEEE V4-423
2 II. INTERACTING WITH VIRTUAL MEMORY MANAGER (VMM) Some researchers extended virtual memory manager to interact with Java virtual machine to improve garbage collection performance. The GC performance is dependent on the heap size: if heap size is too large, it could incur paging overhead; if heap size is too small, garbage collection will be invoked frequently which causes long pause time. Since page information can be retrieved from VMM, recent GC systems introduce ways to better coordinate with VMM to guide heap resizing or GC triggering decisions. Bookmarking Collector (BC)[5], Cooperative Robust Automatic Memory Management (CRAMM)[21][20], Yield Predictor (YP)[13] are representative works which extended virtual memory manager of operating system for GC runtimes. The Bookmarking Collector (BC) proposed by Hertz et al. is a garbage collection approach which is tightly coupled with VMM of the operating system to minimize paging overhead when memory pressure is very high. The BC enables interaction between the Java virtual machine and operating system to guide heap sizing decisions. Furthermore, the BC records summary information of evicted pages and performs GC only on pages that are in physical memory without touching already evicted pages. In addition, BC guides operating system to make better decision on which pages need to be evicted. The CRAMM proposed by Yang et al. [20][21][22] is a framework which dynamically adjusts heap size through coordination between VMM and Java virtual machine. In CRAMM, a new VMM is implemented to compute workingset size information of each process. The information is passed to a heap sizing analytical model [20] to predict an appropriate heap size that can improve performance as well as minimizing page faults. The heap sizing decisions from VMM is passed through to Java virtual machine and guides it to dynamically adjust heap size. The Yield Predictor (YP) developed by Wegiel et al. provides a simple solution to improve GC efficiency by estimating GC yield (i.e., number of dead objects) using hardware page reference bits used by OS virtual memory manager. The YP exploits the prior empirical result that objects with similar life spans tend to be clustered in the heap and tend to die together[2][3][4][6][7][8]. These dead objects are likely in one or multiple virtual pages. Therefore, pages that have not been recently referenced by the application are likely to be dead. The YP uses a dedicated polling thread to consult the OS kernel to obtain reference bits of each page. This information is fed into a simple analytical model to estimate total number of dead pages in the heap. When the number of dead pages is small, an impending garbage collection is likely to be ineffective (i.e., unable to reclaim sufficient space) and YP will skip the ineffective collection. By reducing number of ineffictive collections, YP can reduce GC pause time. III. MAKING OS SCHEDULER AWARE OF JVM ACTIVITIES Modern OS s are designed to maintain responsiveness and fairness, and thus, they cannot distinguish whether a thread to be scheduled is executing the application logic or performing JVM functions (GC, JIT and lock management). So, they simply treat them equally. However, JVM functions often have much different resource requirements and execution phases that often require different scheduling strategies to gain optimal performance. To solve the mismatch between resource scheduling of the OS and resource requirements of JVM functions, some researchers have built an specific OS framework which is aware of JVM activities. A. Allocation-aware : improving GC performance Xian et al. presented an allocation-phase aware [10], which exploits object allocation information to guide thread scheduling decisions. The principle of the is based on the following two insights[23][26]: Objects are created and then die in phases. Invoking garbage collection in the middle of an allocation phase usually results in ineffective collection performance. Furthermore, a long allocation pause often indicates that an allocation phase is ending and a mortality phase (i.e., a phase that most objects get used and die) is starting. Commonly used thread scheduling policies degrade garbage collection performance. In large multithreaded Java applications, existing s tend to schedule threads in such a way that leaves many incomplete or partial allocation phases in the heap. Because most collectors trigger garbage collection when the heap is full, having a large number partial phases mean that garbage collection is ineffective because these objects are still alive. In allocation-phase aware, time-based round robin is replaced with memory-based round robin, a policy that regulates the amount of memory each thread can allocate during its turn on the CPU. For example, if the memory quantum is set to 200 KB, a thread can stay on the processor until it allocates 200 KB of heap memory. At that point, the thread is suspended, and the next successive thread is scheduled. If the memory quantum is tuned to be slightly larger than the most common heap working set of a thread, it can ensure that in most cases, a thread has enough time on the processor to allocate its current working set and then executes the subsequent object mortality phase. B. Contention-aware : reducing synchronization overhead The emerging popularity of chip-multiprocessor systems has instigated a widespread adoption of thread-level V4-424
3 parallelism. In applications exploiting thread level parallelism, locks are frequently used as a mechanism to prevent multiple threads from having concurrent accesses to a shared resource and to synchronize the execution order of threads. With locks, whenever a thread attempts to acquire a lock held by another thread (i.e., lock contention), it either retries to obtain the lock (spin-locking) or is suspended until the lock becomes available. In JVM, locks are maintained through the monitor mechanism. A monitor is a mutual exclusive lock. In language level, Java provides two semantics to identify monitor regions: synchronized statements and synchronized methods. These two mechanisms are supported by two special bytecode instructions (monitor enter and monitor exit) in the Java virtual machine s instruction set. The OS does not know monitor activities in JVM. This leads to early preemption of threads in critical sections. If a thread is preempted during a critical section, other contending threads would fail acquiring the lock associated with the critical section and relinquish their time quantums. This phenomena is referred as lock convoy. The consequence of lock convoy is the overhead increase of lock contentions and context switches. To reduce such overhead, Xian et al. developed a Contention-Aware Scheduler (CA-Scheduler) [12] which exploits object synchronization information to reduce the occurrences of lock convoys in large application servers running in multiprocessor systems. CA-Scheduler relies on collaborations between operating system kernels and JVM to improve the efficiency of Java runtime environments. The basic notion of CA-Scheduler is (i) to collect the necessary information as part of JVM execution, and (ii) to pass on that information to the operating system to guide its scheduling decisions. C. JIT-aware : improving Just-In-Compilation In Java, there is a separate compilation thread which compiles hot (or frequently-used) Java methods. Kulkarni et al. [17] found that round-robin scheduling affects compilation performance significantly. In round-robin scheduling, each thread including the compilation thread has a fixed amount of timeslice. If the timeslice is too small, the compilation thread cannot complete its task in one timeslice. Then the execution of that method has to be delayed until the compiler thread is done. This can take several timeslices. Even worse, as the number of threads in the application increases, the resources given to the compilation thread are reduced and the performance of applications degrades. To solve this problem, they modified to give higher priority and a longer timeslice to compiler thread so that it can have enough time to complete the compilation, avoiding extensive execution delay. This solution can significantly reduce start-up time and improve application s performance. IV. HARDWARE/OS SUPPORT The Azul VM (AVM) [1] of Azul Systems Inc. is the first JVM which utilizes a synergistic hardware and software approach to improve scalability and performance of Java runtime systems. The AVM runs on the Vegas processor, a new general purpose processor designed for running virtual machines. The AVM utilizes the new architecture support to implement a highly scalable concurrent garbage collector and lock management. Due to the strong correlation between GC pause time and the heap size, Java application servers cannot scale well in large memory footprint. Azul Pauseless Garbage Collector overcomes this by decoupling the pause time from the heap size. It is achieved by the following architectural support. Dedicated processor cores for garbage collection. Hardware-assisted read barrier instructions The AVM implements a lock management scheme called Optimistic Thread Concurrency (OTC) which improves throughput by letting multiple threads execute synchronized code concurrently, detecting data contentions and rolling back any conflicted threads. The OTC uses similar technology as optimistic locking which is widely used in database systems and transactional memory. Since in most cases threads contending locks do not contend data that locks protect, the OTC has rare roll-back events. The OTC uses hardware-assisted write/read barrier instructions to monitor write/read accesses to data. With barrier instructions, AVM is able to detect possible data contention. If a thread is writing data and another thread has read the data, or a thread is reading data and another thread has written the data, data contention is deemed to have occurred. If a data contention has been detected for a thread, the execution of this thread will be rolled back to the state of lock acquisition point. To improve JIT performance, the AVM implements multiple compilation threads to allow compilation to be performed in parallel. The AVM spawns on the order of 50 compilation threads during the ramp-up period of large Java application servers. Since a Vegas processor usually has hundreds of processor cores, using multiple compilation threads is an effective solution to exploit the additional resources available on the multicore architecture.. V. CONCLUSIONS Java virtual machine provides a virtualized execution environment for Java applications. To maintain safety and efficiency of the execution environment, JVM needs employ runtime systems such as garbage collection, just-in-time compilation and lock management. Recent work found that the runtime overheads increase significantly when workload of Java applications increase. To solve this problem, researchers of academia and industry start studying various approaches which use VM-OS-hardware interactions to improve runtime performance. Table 1 summarizes the toxonamy of recent works on this direction. From the summary, most of recent works are focused on reducing GC V4-425
4 overheads since GC is the major bottleneck which limits the overall performance and scalability of Java applications. Researchers explored many ways of OS support such as extending virtual memory manager, making aware of JVM activities, and implementing some GC instructions. There are also a few works to reduce lock contention and JIT compilation overhead. TABLE I. Categorization of technologies using VM-OS interactions to improve runtime performance VM functions OS subsystems Virtual Memory Manager OS Hardware/OS support Garbage Collection -CRAMM - Bookmarking Collector -Yield Predictor Allocationaware Azul Pauseless collector REFERENCES Just-In-Time Compilation N/A JIT-aware Azul s Parallel JITs on multicores Lock Management N/A Contentionaware Azul OTC [1] Azul Virtual Machine. [2] M. Hertz, S.M. Blackburn, J. E. B. Moss, K. S. Mckliney, D. Stefanovi c. Generatingobject lifetime traces with Merlin. Transactions on Programming Languages And Systems (TOPLAS) 28, 3(May), [3] M. Hertz. Quantifying and Improving the Performance of Garbage Collection. VDM Verlag, Saarbr ucken, Germany. [4] M. Hertz, E. D. Berger. The performance of automatic vs. explicit memory management. In Proceedings of the 2005 ACM SIGPLAN Conference on Object-Oriented Programming Systems, Languages & Applications (OOPSLA 2005), Volume 40(11) of ACM SIGPLAN Notices (San Diego, CA, Oct. 2005), pp [5] M. Hertz, Y. Feng, E. D. Berger. Garbage collection without paging. In Proceedings of the 2005 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2005), Volume 40(7) of ACM SIGPLAN Notices (Chicago, IL, June 2005), pp [6] M. Hertz, N. Immerman, J. E. B. Moss. Framework for analyzing garbage collection. In R. BAEZA-YATES, U. MONTANARI, AND N. SANTORO Eds., Foundations of Information Technology in the Era of Network and Mobile Computing: IFIP 17th World Computer Congress - TC1 Stream (TCS 2002), Volume 223 of IFIP Conference Proceedings (Montreal, Canada, 2002), pp [7] M. Hertz, S.M. Blackburn, J. E. B. Moss, K. S. Mckliney, D. Stefanovi c. Error free garbage collection traces: How to cheat and not get caught. In Proceedings of the International Conference on Measurement and Modeling of Computer Systems, Volume 30(1) of ACM SIGMETRICS Performance Evaluation Review (Marina Del Rey, CA, June 2002), pp [8] M. Hertz, J. Bard, S. Kane., E. Keudel, T. Bai, X. Gu, C. Ding. Waste not, want not: Resource-based garbage collection in a shared environment. Technical report TR 951, University of Rochester [9] M. Hertz. Quantifying and Improving the Performance of Garbage Collection. Ph.D Dissertation. University of Massachusetts Amherst. [10] F. Xian, W. Srisa-an, H. Jiang. Allocation-phase aware thread scheduling policies to improve garbage collection performance. In Proceedings of the 6th international Symposium on Memory Management (Montreal, Quebec, Canada, October 21-22, 2007). ACM, New York, NY, [11] F. Xian, W. Srisa-an, H. Jiang. Garbage collection: Java application servers' Achilles heel. Sci. Comput. Program. 70, 2-3 (Feb. 2008), [12] F. Xian, W. Srisa-an, H. Jiang. Contention-aware : unlocking execution parallelism in multithreaded java programs. In Proceedings of the 23rd ACM SIGPLAN Conference on Object- Oriented Programming Systems Languages and Applications (Nashville, TN, USA, October 19-23, 2008). ACM, New York, NY, [13] M. Wegiel, C. Krintz. Dynamic prediction of collection yield for managed runtimes. In Proceeding of the 14th international Conference on Architectural Support For Programming Languages and Operating Systems (Washington, DC, USA, March 07-11, 2009). ACM, New York, NY, [14] M. Wegiel and C. Krintz, XMem: Type-Safe, Transparent, Shared Memory for Cross-Runtime Communication and Coordination, ACM Conference Programming Language Design and Implementation (PLDI), Jun, 2008 (PLDI), Mar, [15] M. Wegiel and C. Krintz, The Mapping Collector: Virtual Memory Support for Generational, Parallel, and Concurrent Compaction, ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Mar, [16] M. Weigel and C. Krintz, Cross-Language, Type-Safe, and Transparent Object Sharing For Co-Located Managed Runtimes ACM Conference on Object-Oriented Programming Systems, Languages, and Applications (OOPSLA), [17] P. Kulkarni, M. Arnold, M. Hind. Dynamic compilation: the benefits of early investing. In Proceedings of the 3rd international Conference on Virtual Execution Environments (San Diego, California, USA, June 13-15, 2007). ACM, New York, NY, [18] L. Zhang, C. Krintz, and P. Nagpurkar, Supporting Exception Handling for Futures in Java, ACM International Conference on the Principles and Practice on Programming in Java (PPPJ), Sep, [19] L. Zhang, C. Krintz, and P. Nagpurkar, Language and Virtual Machine Support for Efficient Fine-Grained Futures in Java, The International Conference on Parallel Architectures and Compilation Techniques, (PACT) Sep, [20] T. Yang, T. Liu, E. D. Berger, S. F. Kaplan, and J. E. B. Moss. "Redline: First Class Support of Interactivity in Commodity Operating Systems". In Proceedings of the 8th USENIX Symposium on Operating Systems Design and Implementation. [21] T. Yang, E. D. Berger, S. F. Kaplan, J. E. B. Moss. CRAMM: virtual memory support for garbage-collected applications. In Proceedings of the 7th Symposium on Operating Systems Design and Implementation (Seattle, Washington, November 06-08, 2006). USENIX Association, Berkeley, CA, [22] T. Yang, M. Hertz, E. D. Berger, S. F. Kaplan, J. E. B. Moss: Automatic heap sizing: taking real memory into account. In V4-426
5 Proceedings of the 4th international Symposium on Memory Management. Vancouver, BC, Canada, Oct 24-25, [23] F. Xian, W. Srisa-an, H. Jiang. MicroPhase: Proactively Invoking Garbage Collection for Improved Performance. In Proceedings of ACM SIGPLAN Conference on Object-Oriented Programming, Systems, Languages and Applications (OOPSLA'07), Montreal, Canada. Oct [24] F. Xian, W. Srisa-an, C. Jia, H. Jiang. "AS-GC: An Efficient Generational Garbage Collector for Java Application Servers". In Proceedings of 21st European Conference on Object-Oriented Programming (ECOOP'07), Berlin, Germany. July 30-Aug 03, [25] F. Xian, W. Srisa-an, H. Jiang. "Investigating Throughput Degradation Behavior of Java Application Servers: A View from Inside a Virtual Machine". In Proceedings of ACM International Conference on Principles and Practices of Programming In Java (PPPJ'06), Mannheim, Germany, Aug 30-Sep 1, 2006, Page [26] F. Xian, W. Srisa-an, H. Jiang. "Fortune Teller: Improving Garbage Collection Performance in Server Environment using Live Objects Prediction". In Proceedings of ACM SIGPLAN Conference on Object-Oriented Programming, Systems, Languages and Applications, San Diego, CA. Oct 17, ACM Press, Page , V4-427
PART IV Performance oriented design, Performance testing, Performance tuning & Performance solutions. Outline. Performance oriented design
PART IV Performance oriented design, Performance testing, Performance tuning & Performance solutions Slide 1 Outline Principles for performance oriented design Performance testing Performance tuning General
Multi-core Programming System Overview
Multi-core Programming System Overview Based on slides from Intel Software College and Multi-Core Programming increasing performance through software multi-threading by Shameem Akhter and Jason Roberts,
Garbage Collection in the Java HotSpot Virtual Machine
http://www.devx.com Printed from http://www.devx.com/java/article/21977/1954 Garbage Collection in the Java HotSpot Virtual Machine Gain a better understanding of how garbage collection in the Java HotSpot
Full and Para Virtualization
Full and Para Virtualization Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF x86 Hardware Virtualization The x86 architecture offers four levels
Azul Pauseless Garbage Collection
TECHNOLOGY WHITE PAPER Azul Pauseless Garbage Collection Providing continuous, pauseless operation for Java applications Executive Summary Conventional garbage collection approaches limit the scalability
Azul Compute Appliances
W H I T E P A P E R Azul Compute Appliances Ultra-high Capacity Building Blocks for Scalable Compute Pools WP_ACA0209 2009 Azul Systems, Inc. W H I T E P A P E R : A z u l C o m p u t e A p p l i a n c
Understanding Hardware Transactional Memory
Understanding Hardware Transactional Memory Gil Tene, CTO & co-founder, Azul Systems @giltene 2015 Azul Systems, Inc. Agenda Brief introduction What is Hardware Transactional Memory (HTM)? Cache coherence
Architecture Support for Big Data Analytics
Architecture Support for Big Data Analytics Ahsan Javed Awan EMJD-DC (KTH-UPC) (http://uk.linkedin.com/in/ahsanjavedawan/) Supervisors: Mats Brorsson(KTH), Eduard Ayguade(UPC), Vladimir Vlassov(KTH) 1
Replication on Virtual Machines
Replication on Virtual Machines Siggi Cherem CS 717 November 23rd, 2004 Outline 1 Introduction The Java Virtual Machine 2 Napper, Alvisi, Vin - DSN 2003 Introduction JVM as state machine Addressing non-determinism
Virtual Machine Learning: Thinking Like a Computer Architect
Virtual Machine Learning: Thinking Like a Computer Architect Michael Hind IBM T.J. Watson Research Center March 21, 2005 CGO 05 Keynote 2005 IBM Corporation What is this talk about? Virtual Machines? 2
General Introduction
Managed Runtime Technology: General Introduction Xiao-Feng Li ([email protected]) 2012-10-10 Agenda Virtual machines Managed runtime systems EE and MM (JIT and GC) Summary 10/10/2012 Managed Runtime
PC Based Escape Analysis in the Java Virtual Machine
PC Based Escape Analysis in the Java Virtual Machine Manfred Jendrosch, Gerhard W. Dueck, Charlie Gracie, and AndréHinkenjann Abstract Current computer architectures are multi-threaded and make use of
Validating Java for Safety-Critical Applications
Validating Java for Safety-Critical Applications Jean-Marie Dautelle * Raytheon Company, Marlborough, MA, 01752 With the real-time extensions, Java can now be used for safety critical systems. It is therefore
Monitoring Java enviroment / applications
Monitoring Java enviroment / applications Uroš Majcen [email protected] Java is Everywhere You Can Expect More. Java in Mars Rover With the help of Java Technology, and the Jet Propulsion Laboratory (JPL),
Characterizing Java Virtual Machine for More Efficient Processor Power Management. Abstract
Characterizing Java Virtual Machine for More Efficient Processor Power Management Marcelo S. Quijano, Lide Duan Department of Electrical and Computer Engineering University of Texas at San Antonio [email protected],
Investigating the Throughput Degradation Behavior of Java Application Servers: A View from Inside a Virtual Machine
Investigating the Throughput Degradation Behavior of Java Application Servers: A View from Inside a Virtual Machine Feng Xian Computer Science & Engineering University of Nebraska-Lincoln Lincoln, NE 68588-0115
10.04.2008. Thomas Fahrig Senior Developer Hypervisor Team. Hypervisor Architecture Terminology Goals Basics Details
Thomas Fahrig Senior Developer Hypervisor Team Hypervisor Architecture Terminology Goals Basics Details Scheduling Interval External Interrupt Handling Reserves, Weights and Caps Context Switch Waiting
Java Virtual Machine: the key for accurated memory prefetching
Java Virtual Machine: the key for accurated memory prefetching Yolanda Becerra Jordi Garcia Toni Cortes Nacho Navarro Computer Architecture Department Universitat Politècnica de Catalunya Barcelona, Spain
Optimizing Shared Resource Contention in HPC Clusters
Optimizing Shared Resource Contention in HPC Clusters Sergey Blagodurov Simon Fraser University Alexandra Fedorova Simon Fraser University Abstract Contention for shared resources in HPC clusters occurs
Mission-Critical Java. An Oracle White Paper Updated October 2008
Mission-Critical Java An Oracle White Paper Updated October 2008 Mission-Critical Java The Oracle JRockit family of products is a comprehensive portfolio of Java runtime solutions that leverages the base
Web Performance, Inc. Testing Services Sample Performance Analysis
Web Performance, Inc. Testing Services Sample Performance Analysis Overview This document contains two performance analysis reports created for actual web testing clients, and are a good example of the
Cloud Computing and Robotics for Disaster Management
2016 7th International Conference on Intelligent Systems, Modelling and Simulation Cloud Computing and Robotics for Disaster Management Nitesh Jangid Information Technology Department Green Research IT
2 2011 Oracle Corporation Proprietary and Confidential
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,
Fachbereich Informatik und Elektrotechnik SunSPOT. Ubiquitous Computing. Ubiquitous Computing, Helmut Dispert
Ubiquitous Computing Ubiquitous Computing The Sensor Network System Sun SPOT: The Sun Small Programmable Object Technology Technology-Based Wireless Sensor Networks a Java Platform for Developing Applications
CSCI E 98: Managed Environments for the Execution of Programs
CSCI E 98: Managed Environments for the Execution of Programs Draft Syllabus Instructor Phil McGachey, PhD Class Time: Mondays beginning Sept. 8, 5:30-7:30 pm Location: 1 Story Street, Room 304. Office
Tuning Your GlassFish Performance Tips. Deep Singh Enterprise Java Performance Team Sun Microsystems, Inc.
Tuning Your GlassFish Performance Tips Deep Singh Enterprise Java Performance Team Sun Microsystems, Inc. 1 Presentation Goal Learn tips and techniques on how to improve performance of GlassFish Application
A Thread Monitoring System for Multithreaded Java Programs
A Thread Monitoring System for Multithreaded Java Programs Sewon Moon and Byeong-Mo Chang Department of Computer Science Sookmyung Women s University, Seoul 140-742, Korea [email protected], [email protected]
A Dynamic Resource Management with Energy Saving Mechanism for Supporting Cloud Computing
A Dynamic Resource Management with Energy Saving Mechanism for Supporting Cloud Computing Liang-Teh Lee, Kang-Yuan Liu, Hui-Yang Huang and Chia-Ying Tseng Department of Computer Science and Engineering,
LBPerf: An Open Toolkit to Empirically Evaluate the Quality of Service of Middleware Load Balancing Services
LBPerf: An Open Toolkit to Empirically Evaluate the Quality of Service of Middleware Load Balancing Services Ossama Othman Jaiganesh Balasubramanian Dr. Douglas C. Schmidt {jai, ossama, schmidt}@dre.vanderbilt.edu
Operating Systems, 6 th ed. Test Bank Chapter 7
True / False Questions: Chapter 7 Memory Management 1. T / F In a multiprogramming system, main memory is divided into multiple sections: one for the operating system (resident monitor, kernel) and one
Control 2004, University of Bath, UK, September 2004
Control, University of Bath, UK, September ID- IMPACT OF DEPENDENCY AND LOAD BALANCING IN MULTITHREADING REAL-TIME CONTROL ALGORITHMS M A Hossain and M O Tokhi Department of Computing, The University of
Understanding Java Garbage Collection
TECHNOLOGY WHITE PAPER Understanding Java Garbage Collection And What You Can Do About It Table of Contents Executive Summary... 3 Introduction.... 4 Why Care About the Java Garbage Collector?.... 5 Classifying
PERFORMANCE ANALYSIS OF KERNEL-BASED VIRTUAL MACHINE
PERFORMANCE ANALYSIS OF KERNEL-BASED VIRTUAL MACHINE Sudha M 1, Harish G M 2, Nandan A 3, Usha J 4 1 Department of MCA, R V College of Engineering, Bangalore : 560059, India [email protected] 2 Department
Stream Processing on GPUs Using Distributed Multimedia Middleware
Stream Processing on GPUs Using Distributed Multimedia Middleware Michael Repplinger 1,2, and Philipp Slusallek 1,2 1 Computer Graphics Lab, Saarland University, Saarbrücken, Germany 2 German Research
Efficient Architectures for Low Latency and High Throughput Trading Systems on the JVM
60 Informatica Economică vol. 17, no. 3/2013 Efficient Architectures for Low Latency and High Throughput Trading Systems on the JVM Alexandru LIXANDRU Bucharest University of Economic Studies [email protected]
Java Performance Tuning
Summer 08 Java Performance Tuning Michael Finocchiaro This white paper presents the basics of Java Performance Tuning for large Application Servers. h t t p : / / m f i n o c c h i a r o. w o r d p r e
Transaction Performance Maximizer InterMax
Transaction Performance Maximizer InterMax A-1208 Woorim Business Center, YeomChang-Dong, GangSeo-Gu, Seoul Korea Republic. TEL 82.2.6230.6300 l FAX 80.2.6203.6301 l www.ex-em.com Transaction Performance
MapReduce on GPUs. Amit Sabne, Ahmad Mujahid Mohammed Razip, Kun Xu
1 MapReduce on GPUs Amit Sabne, Ahmad Mujahid Mohammed Razip, Kun Xu 2 MapReduce MAP Shuffle Reduce 3 Hadoop Open-source MapReduce framework from Apache, written in Java Used by Yahoo!, Facebook, Ebay,
Delivering Quality in Software Performance and Scalability Testing
Delivering Quality in Software Performance and Scalability Testing Abstract Khun Ban, Robert Scott, Kingsum Chow, and Huijun Yan Software and Services Group, Intel Corporation {khun.ban, robert.l.scott,
Practical Performance Understanding the Performance of Your Application
Neil Masson IBM Java Service Technical Lead 25 th September 2012 Practical Performance Understanding the Performance of Your Application 1 WebSphere User Group: Practical Performance Understand the Performance
Java Garbage Collection Basics
Java Garbage Collection Basics Overview Purpose This tutorial covers the basics of how Garbage Collection works with the Hotspot JVM. Once you have learned how the garbage collector functions, learn how
Using In-Memory Computing to Simplify Big Data Analytics
SCALEOUT SOFTWARE Using In-Memory Computing to Simplify Big Data Analytics by Dr. William Bain, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T he big data revolution is upon us, fed
Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.
Agenda Enterprise Performance Factors Overall Enterprise Performance Factors Best Practice for generic Enterprise Best Practice for 3-tiers Enterprise Hardware Load Balancer Basic Unix Tuning Performance
JVM Performance Study Comparing Oracle HotSpot and Azul Zing Using Apache Cassandra
JVM Performance Study Comparing Oracle HotSpot and Azul Zing Using Apache Cassandra January 2014 Legal Notices Apache Cassandra, Spark and Solr and their respective logos are trademarks or registered trademarks
BridgeWays Management Pack for VMware ESX
Bridgeways White Paper: Management Pack for VMware ESX BridgeWays Management Pack for VMware ESX Ensuring smooth virtual operations while maximizing your ROI. Published: July 2009 For the latest information,
Garbage Collection in NonStop Server for Java
Garbage Collection in NonStop Server for Java Technical white paper Table of contents 1. Introduction... 2 2. Garbage Collection Concepts... 2 3. Garbage Collection in NSJ... 3 4. NSJ Garbage Collection
SEER PROBABILISTIC SCHEDULING FOR COMMODITY HARDWARE TRANSACTIONAL MEMORY. 27 th Symposium on Parallel Architectures and Algorithms
27 th Symposium on Parallel Architectures and Algorithms SEER PROBABILISTIC SCHEDULING FOR COMMODITY HARDWARE TRANSACTIONAL MEMORY Nuno Diegues, Paolo Romano and Stoyan Garbatov Seer: Scheduling for Commodity
Angelika Langer www.angelikalanger.com. The Art of Garbage Collection Tuning
Angelika Langer www.angelikalanger.com The Art of Garbage Collection Tuning objective discuss garbage collection algorithms in Sun/Oracle's JVM give brief overview of GC tuning strategies GC tuning (2)
Four Keys to Successful Multicore Optimization for Machine Vision. White Paper
Four Keys to Successful Multicore Optimization for Machine Vision White Paper Optimizing a machine vision application for multicore PCs can be a complex process with unpredictable results. Developers need
Multiprocessor Scheduling and Scheduling in Linux Kernel 2.6
Multiprocessor Scheduling and Scheduling in Linux Kernel 2.6 Winter Term 2008 / 2009 Jun.-Prof. Dr. André Brinkmann [email protected] Universität Paderborn PC² Agenda Multiprocessor and
JBoss Data Grid Performance Study Comparing Java HotSpot to Azul Zing
JBoss Data Grid Performance Study Comparing Java HotSpot to Azul Zing January 2014 Legal Notices JBoss, Red Hat and their respective logos are trademarks or registered trademarks of Red Hat, Inc. Azul
Performance Analysis of Web based Applications on Single and Multi Core Servers
Performance Analysis of Web based Applications on Single and Multi Core Servers Gitika Khare, Diptikant Pathy, Alpana Rajan, Alok Jain, Anil Rawat Raja Ramanna Centre for Advanced Technology Department
Performance Tools for Parallel Java Environments
Performance Tools for Parallel Java Environments Sameer Shende and Allen D. Malony Department of Computer and Information Science, University of Oregon {sameer,malony}@cs.uoregon.edu http://www.cs.uoregon.edu/research/paracomp/tau
Zing Vision. Answering your toughest production Java performance questions
Zing Vision Answering your toughest production Java performance questions Outline What is Zing Vision? Where does Zing Vision fit in your Java environment? Key features How it works Using ZVRobot Q & A
The Lagopus SDN Software Switch. 3.1 SDN and OpenFlow. 3. Cloud Computing Technology
3. The Lagopus SDN Software Switch Here we explain the capabilities of the new Lagopus software switch in detail, starting with the basics of SDN and OpenFlow. 3.1 SDN and OpenFlow Those engaged in network-related
Java Performance. Adrian Dozsa TM-JUG 18.09.2014
Java Performance Adrian Dozsa TM-JUG 18.09.2014 Agenda Requirements Performance Testing Micro-benchmarks Concurrency GC Tools Why is performance important? We hate slow web pages/apps We hate timeouts
JBoss Seam Performance and Scalability on Dell PowerEdge 1855 Blade Servers
JBoss Seam Performance and Scalability on Dell PowerEdge 1855 Blade Servers Dave Jaffe, PhD, Dell Inc. Michael Yuan, PhD, JBoss / RedHat June 14th, 2006 JBoss Inc. 2006 About us Dave Jaffe Works for Dell
Berlin Mainframe Summit. Java on z/os. 2006 IBM Corporation
Java on z/os Martina Schmidt Agenda Berlin Mainframe Summit About the mainframe Java runtime environments under z/os For which applications should I use a mainframe? Java on z/os cost and performance Java
An Oracle White Paper July 2011. Oracle Primavera Contract Management, Business Intelligence Publisher Edition-Sizing Guide
Oracle Primavera Contract Management, Business Intelligence Publisher Edition-Sizing Guide An Oracle White Paper July 2011 1 Disclaimer The following is intended to outline our general product direction.
Using an In-Memory Data Grid for Near Real-Time Data Analysis
SCALEOUT SOFTWARE Using an In-Memory Data Grid for Near Real-Time Data Analysis by Dr. William Bain, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 IN today s competitive world, businesses
How To Write A Multi Threaded Software On A Single Core (Or Multi Threaded) System
Multicore Systems Challenges for the Real-Time Software Developer Dr. Fridtjof Siebert aicas GmbH Haid-und-Neu-Str. 18 76131 Karlsruhe, Germany [email protected] Abstract Multicore systems have become
Performance Characteristics of VMFS and RDM VMware ESX Server 3.0.1
Performance Study Performance Characteristics of and RDM VMware ESX Server 3.0.1 VMware ESX Server offers three choices for managing disk access in a virtual machine VMware Virtual Machine File System
Cloud Computing. Up until now
Cloud Computing Lecture 11 Virtualization 2011-2012 Up until now Introduction. Definition of Cloud Computing Grid Computing Content Distribution Networks Map Reduce Cycle-Sharing 1 Process Virtual Machines
WHITE PAPER Guide to 50% Faster VMs No Hardware Required
WHITE PAPER Guide to 50% Faster VMs No Hardware Required Think Faster. Visit us at Condusiv.com GUIDE TO 50% FASTER VMS NO HARDWARE REQUIRED 2 Executive Summary As much as everyone has bought into the
A Deduplication File System & Course Review
A Deduplication File System & Course Review Kai Li 12/13/12 Topics A Deduplication File System Review 12/13/12 2 Traditional Data Center Storage Hierarchy Clients Network Server SAN Storage Remote mirror
A Content-Based Load Balancing Algorithm for Metadata Servers in Cluster File Systems*
A Content-Based Load Balancing Algorithm for Metadata Servers in Cluster File Systems* Junho Jang, Saeyoung Han, Sungyong Park, and Jihoon Yang Department of Computer Science and Interdisciplinary Program
An Oracle White Paper September 2013. Advanced Java Diagnostics and Monitoring Without Performance Overhead
An Oracle White Paper September 2013 Advanced Java Diagnostics and Monitoring Without Performance Overhead Introduction... 1 Non-Intrusive Profiling and Diagnostics... 2 JMX Console... 2 Java Flight Recorder...
BSPCloud: A Hybrid Programming Library for Cloud Computing *
BSPCloud: A Hybrid Programming Library for Cloud Computing * Xiaodong Liu, Weiqin Tong and Yan Hou Department of Computer Engineering and Science Shanghai University, Shanghai, China [email protected],
Java Garbage Collection Characteristics and Tuning Guidelines for Apache Hadoop TeraSort Workload
Java Garbage Collection Characteristics and Tuning Guidelines for Apache Hadoop TeraSort Workload Shrinivas Joshi, Software Performance Engineer Vasileios Liaskovitis, Performance Engineer 1. Introduction
Enabling Technologies for Distributed and Cloud Computing
Enabling Technologies for Distributed and Cloud Computing Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF Multi-core CPUs and Multithreading
Performance Management for Cloudbased STC 2012
Performance Management for Cloudbased Applications STC 2012 1 Agenda Context Problem Statement Cloud Architecture Need for Performance in Cloud Performance Challenges in Cloud Generic IaaS / PaaS / SaaS
Reconfigurable Architecture Requirements for Co-Designed Virtual Machines
Reconfigurable Architecture Requirements for Co-Designed Virtual Machines Kenneth B. Kent University of New Brunswick Faculty of Computer Science Fredericton, New Brunswick, Canada [email protected] Micaela Serra
Distributed and Cloud Computing
Distributed and Cloud Computing K. Hwang, G. Fox and J. Dongarra Chapter 3: Virtual Machines and Virtualization of Clusters and datacenters Adapted from Kai Hwang University of Southern California March
Virtualization Technologies and Blackboard: The Future of Blackboard Software on Multi-Core Technologies
Virtualization Technologies and Blackboard: The Future of Blackboard Software on Multi-Core Technologies Kurt Klemperer, Principal System Performance Engineer [email protected] Agenda Session Length:
HeapStats: Your Dependable Helper for Java Applications, from Development to Operation
: Technologies for Promoting Use of Open Source Software that Contribute to Reducing TCO of IT Platform HeapStats: Your Dependable Helper for Java Applications, from Development to Operation Shinji Takao,
Setting deadlines and priorities to the tasks to improve energy efficiency in cloud computing
Setting deadlines and priorities to the tasks to improve energy efficiency in cloud computing Problem description Cloud computing is a technology used more and more every day, requiring an important amount
Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging
Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging In some markets and scenarios where competitive advantage is all about speed, speed is measured in micro- and even nano-seconds.
CHAPTER 1 INTRODUCTION
1 CHAPTER 1 INTRODUCTION 1.1 MOTIVATION OF RESEARCH Multicore processors have two or more execution cores (processors) implemented on a single chip having their own set of execution and architectural recourses.
MID-TIER DEPLOYMENT KB
MID-TIER DEPLOYMENT KB Author: BMC Software, Inc. Date: 23 Dec 2011 PAGE 1 OF 16 23/12/2011 Table of Contents 1. Overview 3 2. Sizing guidelines 3 3. Virtual Environment Notes 4 4. Physical Environment
Deploying De-Duplication on Ext4 File System
Deploying De-Duplication on Ext4 File System Usha A. Joglekar 1, Bhushan M. Jagtap 2, Koninika B. Patil 3, 1. Asst. Prof., 2, 3 Students Department of Computer Engineering Smt. Kashibai Navale College
Operatin g Systems: Internals and Design Principle s. Chapter 10 Multiprocessor and Real-Time Scheduling Seventh Edition By William Stallings
Operatin g Systems: Internals and Design Principle s Chapter 10 Multiprocessor and Real-Time Scheduling Seventh Edition By William Stallings Operating Systems: Internals and Design Principles Bear in mind,
An Exception Monitoring System for Java
An Exception Monitoring System for Java Heejung Ohe and Byeong-Mo Chang Department of Computer Science, Sookmyung Women s University, Seoul 140-742, Korea {lutino, [email protected] Abstract. Exception
Chapter 6, The Operating System Machine Level
Chapter 6, The Operating System Machine Level 6.1 Virtual Memory 6.2 Virtual I/O Instructions 6.3 Virtual Instructions For Parallel Processing 6.4 Example Operating Systems 6.5 Summary Virtual Memory General
Dynamic resource management for energy saving in the cloud computing environment
Dynamic resource management for energy saving in the cloud computing environment Liang-Teh Lee, Kang-Yuan Liu, and Hui-Yang Huang Department of Computer Science and Engineering, Tatung University, Taiwan
How To Understand And Understand An Operating System In C Programming
ELEC 377 Operating Systems Thomas R. Dean Instructor Tom Dean Office:! WLH 421 Email:! [email protected] Hours:! Wed 14:30 16:00 (Tentative)! and by appointment! 6 years industrial experience ECE Rep
Design Issues in a Bare PC Web Server
Design Issues in a Bare PC Web Server Long He, Ramesh K. Karne, Alexander L. Wijesinha, Sandeep Girumala, and Gholam H. Khaksari Department of Computer & Information Sciences, Towson University, 78 York
Facing the Challenges for Real-Time Software Development on Multi-Cores
Facing the Challenges for Real-Time Software Development on Multi-Cores Dr. Fridtjof Siebert aicas GmbH Haid-und-Neu-Str. 18 76131 Karlsruhe, Germany [email protected] Abstract Multicore systems introduce
The Key Technology Research of Virtual Laboratory based On Cloud Computing Ling Zhang
International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2015) The Key Technology Research of Virtual Laboratory based On Cloud Computing Ling Zhang Nanjing Communications
Virtual Machine Monitors. Dr. Marc E. Fiuczynski Research Scholar Princeton University
Virtual Machine Monitors Dr. Marc E. Fiuczynski Research Scholar Princeton University Introduction Have been around since 1960 s on mainframes used for multitasking Good example VM/370 Have resurfaced
