2012 IEEE International Conference on Cluster Computing (CLUSTER 2012) Beijing, China 24 28 September 2012. IEEE Catalog Number: ISBN:



Similar documents
The Board of Directors

Workshop on Internet and BigData Finance (WIBF)

How To Balance In Cloud Computing

Open Access Research on Database Massive Data Processing and Mining Method based on Hadoop Cloud Platform

Big Data Management in the Clouds and HPC Systems

Big Data Storage Architecture Design in Cloud Computing

Write a technical report Present your results Write a workshop/conference paper (optional) Could be a real system, simulation and/or theoretical

Increasing Scalability & Performance of Cloud Storage by Improving Load Balancer

Program 2014 Tsinghua-Sanya Workshop on Big Data: Opportunities, Challenges and Innovations Dec. 27 Dec. 30, 2014 Sanya, China

UPS battery remote monitoring system in cloud computing

CONCEPTUAL MODEL OF MULTI-AGENT BUSINESS COLLABORATION BASED ON CLOUD WORKFLOW

AN EFFICIENT LOAD BALANCING ALGORITHM FOR CLOUD ENVIRONMENT

2014 Second International Conference on Advanced Cloud and Big Data

1 Study of City-Level Management System of Air-Conditioning Information System Scm Rationality in the Decision Making... 9

Journal of Chemical and Pharmaceutical Research, 2015, 7(3): Research Article. E-commerce recommendation system on cloud computing

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015

Load Balancing of virtual machines on multiple hosts

A REVIEW: Distributed File System

en fi lm af Zhang Yang instruktøren bag den kinesiske fi lmperle Badeanstalten Premiere 31. oktober

Introduction. Research methodology

SCHEDULING IN CLOUD COMPUTING

Cost Effective Selection of Data Center in Cloud Environment

Analysis and Optimization of Massive Data Processing on High Performance Computing Architecture

Professor and Director, Division of Biological Sciences & Computation Institute, University of Chicago

Mobile Storage and Search Engine of Information Oriented to Food Cloud

HPC Programming Framework Research Team

Information Security Practice and Experience

Technical Program 研 讨 会 程 序 手 册

Enhanced Load Balancing Approach to Avoid Deadlocks in Cloud

Lifetime Management of Cache Memory using Hadoop Snehal Deshmukh 1 Computer, PGMCOE, Wagholi, Pune, India

Modeling Risk Management in Sustainable Construction

Capability Service Management System for Manufacturing Equipments in

Hailong SUN. Associate Professor, School of Computer Science and Engineering, Beihang University

Zhejiang University SPIE Student Chapter Annual Report September 2014

Distributed communication-aware load balancing with TreeMatch in Charm++

Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control

A STUDY ON THE NARRATOR S VOICE IN THE CHINESE TRANSLATION OF A ROOM OF ONE S OWN (1929) LAW TSZ SANG

A COGNITIVE NETWORK BASED ADAPTIVE LOAD BALANCING ALGORITHM FOR EMERGING TECHNOLOGY APPLICATIONS *

A Novel Way of Deduplication Approach for Cloud Backup Services Using Block Index Caching Technique

Dynamic Load Balancing: Improve Efficiency in Cloud Computing Argha Roy * M.Tech CSE Netaji Subhash Engineering College West Bengal, India.

Advances in Natural and Applied Sciences

ASEM Seminar on Capacity Building on Air Pollution Prevention and Control LIST OF PARTICIPANTS

Towards Autonomic Grid Data Management with Virtualized Distributed File Systems

Profit-driven Cloud Service Request Scheduling Under SLA Constraints

Group Based Load Balancing Algorithm in Cloud Computing Virtualization

Cloud Based Adaptive Overlapped Data Chained Declustering

The Comprehensive Performance Rating for Hadoop Clusters on Cloud Computing Platform

Do You Feel the Lag of Your Hadoop?

Proceedings of the 7 th Euro-Asia Conference on Environment and CSR: Tourism, MICE, Hospitality Management and Education Session (Part I)

Index Terms : Load rebalance, distributed file systems, clouds, movement cost, load imbalance, chunk.

Study on Tacit Knowledge Identification of Independent Documentary Film Directors

A Novel User-Preference-Driven Service Selection Strategy in Cloud

International Journal of Advanced Research in Computer Science and Software Engineering

A Method of Cloud Resource Load Balancing Scheduling Based on Improved Adaptive Genetic Algorithm

Autonomic Data Replication in Cloud Environment

THURSDAY, January 24, 2013

ISSN: Keywords: HDFS, Replication, Map-Reduce I Introduction:

DESIGN ARCHITECTURE-BASED ON WEB SERVER AND APPLICATION CLUSTER IN CLOUD ENVIRONMENT

Scientific Computing Programming with Parallel Objects

Method of Fault Detection in Cloud Computing Systems

Modeling on Energy Consumption of Cloud Computing Based on Data Center Yu Yang 1, a Jiang Wei 2, a Guan Wei 1, a Li Ping 1, a Zhou Yongmin 1, a

Appendix 1: Search strategies for randomized controlled trials on Chinese herbal medicine for symptom management in cancer palliative care

Masters Project Proposal

Unleashing the Performance Potential of GPUs for Atmospheric Dynamic Solvers

International Conference on Education, Management and Computing Technology

Research on the UHF RFID Channel Coding Technology based on Simulink

Development of a Web-based Information Service Platform for Protected Crop Pests

Enhancing Cloud-based Servers by GPU/CPU Virtualization Management

2015 CMB Request for Proposals (RFP) Innovations in e-learning for Health Professional Education

MapReduce on GPUs. Amit Sabne, Ahmad Mujahid Mohammed Razip, Kun Xu

Keywords: Load testing, testing tools, test script, Open-source Software, web applications.

Principles of Cross-selling Page Design Based on Consumer Online Purchasing Behaviors --Take C to C E-commerce Website as an Example

ANALYSING THE FEATURES OF JAVA AND MAP/REDUCE ON HADOOP

Efficient Qos Based Resource Scheduling Using PAPRIKA Method for Cloud Computing

Big Application Execution on Cloud using Hadoop Distributed File System

Pre-Stack Kirchhoff Time Migration on Hadoop and Spark

Sino-German Workshop on Cloud-based High Performance Computing. September 26-October 1, 2011, Shanghai, China

Problems in Science and Engineering

Adaptive Overlapped Data Chained Declustering In Distributed File Systems

The Study of a Hierarchical Hadoop Architecture in Multiple Data Centers Environment

I/O intensive applications: what are the main differences in the design of the HPC filesystems vs the MapReduce ones?

A Survey on Load Balancing Algorithms in Cloud Environment

QoS EVALUATION OF CLOUD SERVICE ARCHITECTURE BASED ON ANP

International Conference on Advances in Energy, Environment and Chemical Engineering (AEECE-2015)

Research and Improvement on A Lightweight VOD Cloud Storage Engine

Survey on Scheduling Algorithm in MapReduce Framework

LIST OF CONFERENCE PARTICIPANTS

Chapter 18: Database System Architectures. Centralized Systems

BPOE Research Highlights

Overlapping Data Transfer With Application Execution on Clusters

A B S T R A C T I. INTRODUCTION

Flexible Configuration Mechanism of Control and Data Management

BlobSeer: Towards efficient data storage management on large-scale, distributed systems

A Load Balancing Algorithm based on the Variation Trend of Entropy in Homogeneous Cluster

DESIGN AN EFFICIENT BIG DATA ANALYTIC ARCHITECTURE FOR RETRIEVAL OF DATA BASED ON

Modeling and Prediction of Network Traffic Based on Hybrid Covariance Function Gaussian Regressive

High Performance Computing. Course Notes HPC Fundamentals

How To Write A Novel Data Replication Strategy For Cloud Data Centres

Migration Improved Scheduling Approach In Cloud Environment

Accelerating BIRCH for Clustering Large Scale Streaming Data Using CUDA Dynamic Parallelism

Transcription:

2012 IEEE International Conference on Cluster Computing (CLUSTER 2012) Beijing, China 24 28 September 2012 IEEE Catalog Number: ISBN: CFP12235-PRT 978-1-4673-2422-9

2012 IEEE International Conference on Cluster Computing Cluster 2012 Table of Contents Greetings from the General Chairs...xii Message from the Program Co-Chairs...xiii Cluster 2012 Organization...xiv Reviewers...xviii Keynote Abstracts...xix Session 1: Applications and Algorithms for GPUs Adapting Irregular Computations to Large CPU-GPU Clusters in the MADNESS Framework...1 Vlad Slavici, Raghu Varier, Gene Cooperman, and Robert J. Harrison A GPU-accelerated Branch-and-Bound Algorithm for the Flow-Shop Scheduling Problem...10 N. Melab, I. Chakroun, M. Mezmaz, and D. Tuyttens A Node-based Parallel Game Tree Algorithm Using GPUs...18 Liang Li, Hong Liu, Peiyu Liu, Taoying Liu, Wei Li, and Hao Wang Using 1000+ GPUs and 10000+ CPUs for Sedimentary Basin Simulations...27 Mei Wen, Huayou Su, Wenjie Wei, Nan Wu, Xing Cai, and Chunyuan Zhang Session 2: Conserving Energy Evaluating Power-Monitoring Capabilities on IBM Blue Gene/P and Blue Gene/Q...36 Kazutomo Yoshii, Kamil Iskra, Rinku Gupta, Pete Beckman, Venkatram Vishwanath, Chenjie Yu, and Susan Coghlan eco-idc: Trade Delay for Energy Cost with Service Delay Guarantee for Internet Data Centers...45 Jianying Luo, Lei Rao, and Xue Liu Synergy: A Middleware for Energy Conservation in Mobile Devices...54 Harshit Kharbanda, Manoj Krishnan, and Roy H. Campbell v

Session 3: MapReduce and Services Affinity-aware Virtual Cluster Optimization for MapReduce Applications...63 Cairong Yan, Ming Zhu, Xin Yang, Ze Yu, Min Li, Youqun Shi, and Xiaolin Li Mastiff: A MapReduce-based System for Time-Based Big Data Analytics...72 Sijie Guo, Jin Xiong, Weiping Wang, and Rubao Lee Community Clustering for Distributed Publish/Subscribe Systems...81 Wei Li, Songlin Hu, Jintao Li, and Hans-Arno Jacobsen D2T: Doubly Distributed Transactions for High Performance and Distributed Computing...90 Jay Lofstead, Jai Dayal, Karsten Schwan, and Ron Oldfield Session 4: Distributed File Systems Cx: Concurrent Execution for the Cross-Server Operations in a Distributed File System...99 Letian Yi, Jiwu Shu, Jiaxin Ou, and Ying Zhao Multicore-Enabled Smart Storage for Clusters...108 Zhiyang Ding, Xunfei Jiang, Shu Yin, Xiao Qin, Kai-Hsiung Chang, Xiaojun Ruan, Mohammed I. Alghamdi, and Meikang Qiu The Load Rebalancing Problem in Distributed File Systems...117 Hsueh-Yi Chung, Che-Wei Chang, Hung-Chang Hsiao, and Yu-Chang Chao Clover: A Distributed File System of Expandable Metadata Service Derived from HDFS...126 Youwei Wang, Jiang Zhou, Can Ma, Weiping Wang, Dan Meng, and Jason Kei Session 5: I/O Challenges sebp: Event Based Polling for Efficient I/O Virtualization...135 Kun Tian, Yaozu Dong, Xiang Mi, and Haibing Guan The Power and Challenges of Transformative I/O...144 Adam Manzanares, John Bent, Meghan Wingate, and Garth Gibson Damaris: How to Efficiently Leverage Multicore Parallelism to Achieve Scalable, Jitter-free I/O...155 Matthieu Dorier, Gabriel Antoniu, Franck Cappello, Marc Snir, and Leigh Orf Session 6: Performance and QoS DOSAS: Mitigating the Resource Contention in Active Storage Systems...164 Chao Chen, Yong Chen, and Philip C. Roth vi

High Performance and High Capacity Hybrid Shingled-Recording Disk System...173 Jiguang Wan, Nannan Zhao, Yifeng Zhu, Jibin Wang, Yu Mao, Peng Chen, and Changsheng Xie Replication Based QoS Framework for Flash Arrays...182 Nihat Altiparmak and Ali Şaman Tosun Session 7: Applications and Algorithms for Data and I/O Data Partitioning on Heterogeneous Multicore and Multi-GPU Systems Using Functional Performance Models of Data-Parallel Applications...191 Ziming Zhong, Vladimir Rychkov, and Alexey Lastovetsky A Decoupled Execution Paradigm for Data-Intensive High-End Computing...200 Yong Chen, Chao Chen, Xian-He Sun, William D. Gropp, and Rajeev Thakur Improving I/O Throughput with PRIMACY: Preconditioning ID-Mapper for Compressing Incompressibility...209 Neil Shah, Eric R. Schendel, Sriram Lakshminarasimhan, Saurabh V. Pendse, Terry Rogers, and Nagiza F. Samatova Session 8: Performance and Visualization Performance Analysis of Parallel Processing Systems with Horizontal Decomposition...220 Hanwei Chen, Jianwei Yin, and Calton Pu Characterization and Comparison of Cloud versus Grid Workloads...230 Sheng Di, Derrick Kondo, and Walfredo Cirne DisplayCluster: An Interactive Visualization Environment for Tiled Displays...239 Gregory P. Johnson, Gregory D. Abram, Brandt Westing, Paul Navrátil, and Kelly Gaither Session 9: Applications and Algorithms for Scientific Computing An Out-of-Core Eigensolver on SSD-equipped Clusters...248 Zheng Zhou, Erik Saule, Hasan Metin Aktulga, Chao Yang, Esmond G. Ng, Pieter Maris, James P. Vary, and Umit V. Çatalyürek On Optimal and Balanced Sparse Matrix Partitioning Problems...257 Anael Grandjean, Johannes Langguth, and Bora Uçar Autotuning Stencil-Based Computations on GPUs...266 Azamat Mametjanov, Daniel Lowell, Ching-Chen Ma, and Boyana Norris Accelerating Expectation-Maximization Algorithms with Frequent Updates...275 Jiangtao Yin, Yanfeng Zhang, and Lixin Gao vii

Session 10: Reliability and Consistency SDM: A Stripe-Based Data Migration Scheme to Improve the Scalability of RAID-6...284 Chentao Wu, Xubin He, Jizhong Han, Huailiang Tan, and Changsheng Xie Harmony: Towards Automated Self-Adaptive Consistency in Cloud Storage...293 Houssem-Eddine Chihoub, Shadi Ibrahim, Gabriel Antoniu, and María S. Pérez Accelerating Distributed Updates with Asynchronous Ordered Writes in a Parallel File System...302 Youyou Lu, Jiwu Shu, Shuai Li, and Letian Yi HerpRap: A Hybrid Array Architecture Providing Any Point-in-Time Data Tracking for Datacenter...311 Lingfang Zeng, Dan Feng, Bo Mao, Jianxi Chen, Qingsong Wei, and Wenguo Liu Session 11: Communications A New End-to-End Flow-Control Mechanism for High Performance Computing Clusters...320 Javier Prades, Federico Silla, José Duato, Holger Fröning, and Mondrian Nüssle Minimizing Network Contention in InfiniBand Clusters with a QoS-Aware Data-Staging Framework...329 Raghunath Rajachandrasekar, Jai Jaswani, Hari Subramoni, and Dhabaleswar K. Panda Adjustable Credit Scheduling for High Performance Network Virtualization...337 Zhibo Chang, Jian Li, Ruhui Ma, Zhiqiang Huang, and Haibing Guan Low-Latency Collectives for the Intel SCC...346 Adan Kohler, Martin Radetzki, Philipp Gschwandtner, and Thomas Fahringer Session 12: Resilience and Load Balancing Hierarchical Clustering Strategies for Fault Tolerance in Large Scale HPC Systems...355 Leonardo Bautista Gomez, Thomas Ropars, Naoya Maruyama, Franck Cappello, and Satoshi Matsuoka Hiding Checkpoint Overhead in HPC Applications with a Semi-Blocking Algorithm...364 Xiang Ni, Esteban Meneses, and Laxmikant V. Kalé Automated Load Balancing Invocation Based on Application Characteristics...373 Harshitha Menon, Nikhil Jain, Gengbin Zheng, and Laxmikant Kalé Overlay-Centric Load Balancing: Applications to UTS and B&B...382 Trong-Tuan Vu, Bilel Derbel, Asim Ali, Ahcène Bendjoudi, and Nouredine Melab viii

Session 13: Applications and Algorithms PIC: Partitioned Iterative Convergence for Clusters...391 Reza Farivar, Anand Raghunathan, Srimat Chakradhar, Harshit Kharbanda, and Roy H. Campbell Improving Resource Utilization in MapReduce...402 Zhenhua Guo, Geoffrey Fox, Mo Zhou, and Yang Ruan Differentiating Your Friends for Scaling Online Social Networks...411 Yewei Huang, Qianni Deng, and Yanmin Zhu Generic Parallel Programming for Massive Remote Sensing Data Processing...420 Yan Ma, Lizhe Wang, Dingsheng Liu, Peng Liu, Jun Wang, and Jie Tao Session 14: Memory and I/O KNOWAC: I/O Prefetch via Accumulated Knowledge...429 Jun He, Xian-He Sun, and Rajeev Thakur Evaluation and Optimization of Breadth-First Search on NUMA Cluster...438 Zehan Cui, Licheng Chen, Mingyu Chen, Yungang Bao, Yongbing Huang, and Huiwei Lv GPGPU Memory Estimation and Optimization Targeting OpenCL Architecture...449 Junfeng Zhu, Gang Chen, and Baifeng Wu Adaptive and Scalable Optimizations for High Performance SR-IOV...459 Zhiqiang Huang, Ruhui Ma, Jian Li, Zhibo Chang, and Haibing Guan Session 15: MPI Enabling Fast, Noncontiguous GPU Data Movement in Hybrid MPI+GPU Environments...468 John Jenkins, James Dinan, Pavan Balaji, Nagiza F. Samatova, and Rajeev Thakur Designing an Offloaded Nonblocking MPI_Allgather Collective Using CORE-Direct...477 Grigori Inozemtsev and Ahmad Afsahi Optimizing Process-to-Core Mappings for Application Level Multi-dimensional MPI Communications...486 Christer Karlsson, Teresa Davies, and Zizhong Chen On the Effects of CPU Caches on MPI Point-to-Point Communications...495 Simone Pellegrini, Torsten Hoefler, and Thomas Fahringer ix

Session 16: Task and Job Scheduling ANOLE: A Profiling-Driven Adaptive Lock Waiter Detection Scheme for Efficient MP-guest Scheduling...504 Jian Zhang, Yaozu Dong, and Jiangang Duan Backfilling under Two-tier Virtual Machines...514 Xiaocheng Liu, Chen Wang, Xiaogang Qiu, Bing Bing Zhou, Bin Chen, and Albert Y. Zomaya A Job Scheduling Design for Visualization Services Using GPU Clusters...523 Wei-Hsien Hsu, Chun-Fu Wang, Kwan-Liu Ma, Hongfeng Yu, and Jacqueline H. Chen Poster Session Cache Promotion Policy Using Re-reference Interval Prediction...534 Gangyong Jia, Xi Li, Chao Wang, Xuehai Zhou, and Zongwei Zhu Built-in Device Simulator for OS Performance Evaluation...538 Junjie Mao, Yu Chen, and Yaozu Dong Dynamic Network Forecasting Using SimGrid Simulations...542 Matthieu Imbert and Eddy Caron BWCC: A FS-Cache Based Cooperative Caching System for Network Storage System...546 Liu Shi, Zhenjun Liu, and Lu Xu Towards a Cost-Aware Data Migration Approach for Key-Value Stores...551 Xiulei Qin, Wenbo Zhang, Wei Wang, Jun Wei, Xin Zhao, and Tao Huang DFG-base Dynamic Operation Partitioning for Heterogeneous Multicluster VLIW DSP Processor...557 Yangzhao Yang, Zeng Zhao, and Naijie Gu Memvisor: Application Level Memory Mirroring via Binary Translation...562 Haoliang Dong, Wei Sun, Bin Wang, Haiyang Sun, Zhengwei Qi, Haibing Guan, and Yaozu Dong A Locality-based Performance Model for Load-and-Compute Style Computation...566 Liang Yuan and Yunquan Zhang Transactional Multi-row Access Guarantee in the Key-Value Store...572 Yaoguang Wang, Weiming Lu, and Baogang Wei Building a TaaS Platform for Web Service Load Testing...576 Minzhi Yan, Hailong Sun, Xu Wang, and Xudong Liu Design of Hardware-Based Communication Performance Measurement Tool...580 Zhan Wang, Zheng Cao, Xiaoli Liu, Yong Su, Feilong Liu, and Xuejun An Phase Detection for Loop-Based Programs on Multicore Architectures...584 Chao Wang, Xi Li, Dong Dai, Gangyong Jia, and Xuehai Zhou x

SOME: Selective Offloading for a Mobile Computing Environment...588 Sehoon Park, Youngil Choi, Qichen Chen, and Heon Y. Yeom clone_n(): Parallel Thread Creation for Upcoming Many-Core Architectures...592 Balazs Gerofi, Atsushi Hori, and Yutaka Ishikawa D4D: Inter-datacenter Bulk Transfers with ISP Friendliness...597 Yangyang Li, Hongbo Wang, Peng Zhang, Jiankang Dong, and Shiduan Cheng Cloud Based Short Read Mapping Service...601 Dong Dai, Xi Li, Chao Wang, and Xuehai Zhou Memory Affinity: Balancing Performance, Power, Thermal and Fairness for Multi-core Systems...605 Gangyong Jia, Xi Li, Chao Wang, Xuehai Zhou, and Zongwei Zhu ME2: Efficient Live Migration of Virtual Machine with Memory Exploration and Encoding...610 Yanqing Ma, Hongbo Wang, Jiankang Dong, Yangyang Li, and Shiduan Cheng A Reliable and High-Performance Distributed Storage System for P2P-VoD Service...614 Jing Zhao, Hongbo Wang, Jiankang Dong, and Shiduan Cheng HAaaS: Towards Highly Available Distributed Systems...618 Yaoguang Wang, Weiming Lu, Bin Yu, and Baogang Wei Towards Fault-Tolerant Energy-Efficient High Performance Computing in the Cloud...622 Kurt L. Keville, Rohan Garg, David J. Yates, Kapil Arya, and Gene Cooperman Author Index...627 xi