Manuel E. Acacio - RESUME - (July 2014)



Similar documents
José L. Abellán, PhD - Assistant Professor - Résumé (November 2015)

Curriculum Vitae (January 2016)

Software Distributed Shared Memory Scalability and New Applications

Performance Evaluation of 2D-Mesh, Ring, and Crossbar Interconnects for Chip Multi- Processors. NoCArc 09

BSc in Computer Engineering, University of Cyprus

Outline. Introduction. Multiprocessor Systems on Chip. A MPSoC Example: Nexperia DVP. A New Paradigm: Network on Chip

Introduction to System-on-Chip

Next Generation GPU Architecture Code-named Fermi

Lecture 23: Multiprocessors

RUNAHEAD EXECUTION: AN EFFECTIVE ALTERNATIVE TO LARGE INSTRUCTION WINDOWS

Introduction to Cloud Computing

Friday 1 st August, 2014

Big data management with IBM General Parallel File System

In-network Monitoring and Control Policy for DVFS of CMP Networkson-Chip and Last Level Caches

Parallel Programming Survey

Gerald Roth. Department of Electrical Engineering and Computer Science School of Engineering Vanderbilt University Nashville, TN

CHAPTER 1 INTRODUCTION

A Robust Dynamic Load-balancing Scheme for Data Parallel Application on Message Passing Architecture

Efficient Data Replication Scheme based on Hadoop Distributed File System

Low-Overhead Hard Real-time Aware Interconnect Network Router

Stream Processing on GPUs Using Distributed Multimedia Middleware

Research Statement. Hung-Wei Tseng

How To Build A Cloud Computer

High Performance Computer Architecture

Performance Modeling and Analysis of a Database Server with Write-Heavy Workload

Optimizing a 3D-FWT code in a cluster of CPUs+GPUs

How To Understand The Concept Of A Distributed System

Trends in High-Performance Computing for Power Grid Applications

Min Si. Argonne National Laboratory Mathematics and Computer Science Division

SMCV: a Methodology for Detecting Transient Faults in Multicore Clusters

Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging

CURRICULUM VITÆ RICARDO CHAVES. December

Memory Architecture and Management in a NoC Platform

Center for Programming Models for Scalable Parallel Computing

Multi-core Curriculum Development at Georgia Tech: Experience and Future Steps

3D On-chip Data Center Networks Using Circuit Switches and Packet Switches

Secured Embedded Many-Core Accelerator for Big Data Processing

Making Multicore Work and Measuring its Benefits. Markus Levy, president EEMBC and Multicore Association

Multi-core architectures. Jernej Barbic , Spring 2007 May 3, 2007

MapReduce on GPUs. Amit Sabne, Ahmad Mujahid Mohammed Razip, Kun Xu

Study Plan Masters of Science in Computer Engineering and Networks (Thesis Track)

GPU System Architecture. Alan Gray EPCC The University of Edinburgh

Chapter 18: Database System Architectures. Centralized Systems

Sustainability and Energy Efficiency in Data Centers Design and Operation

Optimizing Shared Resource Contention in HPC Clusters

High Performance Computing. Course Notes HPC Fundamentals

A Dynamic Resource Management with Energy Saving Mechanism for Supporting Cloud Computing

Virtualization benefits in High Performance Computing Applications

Multi-Threading Performance on Commodity Multi-Core Processors

Preserving Message Integrity in Dynamic Process Migration

Symmetric Multiprocessing

Data Deduplication Scheme for Cloud Storage

Keywords: Cloudsim, MIPS, Gridlet, Virtual machine, Data center, Simulation, SaaS, PaaS, IaaS, VM. Introduction

Design and Implementation of an On-Chip timing based Permutation Network for Multiprocessor system on Chip

Parallel Computing. Benson Muite. benson.

Multiprocessor System-on-Chip

An Adaptive Task-Core Ratio Load Balancing Strategy for Multi-core Processors

Bob Boothe. Education. Research Interests. Teaching Experience

A Novel Way of Deduplication Approach for Cloud Backup Services Using Block Index Caching Technique

Vorlesung Rechnerarchitektur 2 Seite 178 DASH

STUDY AND SIMULATION OF A DISTRIBUTED REAL-TIME FAULT-TOLERANCE WEB MONITORING SYSTEM

Seeking Opportunities for Hardware Acceleration in Big Data Analytics

Index Terms : Load rebalance, distributed file systems, clouds, movement cost, load imbalance, chunk.

Hari Subramoni. Education: Employment: Research Interests: Projects:

Unleashing the Performance Potential of GPUs for Atmospheric Dynamic Solvers

How To Balance A Web Server With Remaining Capacity

Operating System Multilevel Load Balancing

Accelerating the Data Plane With the TILE-Mx Manycore Processor

Ming-Hsing Chiu. Home: (985) Chapel Loop Office: (985) EDUCATION

Data Center and Cloud Computing Market Landscape and Challenges

OpenSoC Fabric: On-Chip Network Generator

Operating Systems 4 th Class

Centralized Systems. A Centralized Computer System. Chapter 18: Database System Architectures

Evolutionary Prefetching and Caching in an Independent Storage Units Model

Curriculum Vitae. Vladimír Guzma. Flat 16, Moore Court, 2 Dodd Road, WD24 5DA, Watford, United Kingdom Telephone(s) +44 (0)

Power-Aware High-Performance Scientific Computing

~ Greetings from WSU CAPPLab ~

Infrastructure Matters: POWER8 vs. Xeon x86

benchmarking Amazon EC2 for high-performance scientific computing

Transcription:

Manuel E. Acacio - RESUME - (July 2014) ADDRESS Departamento de Ingeniería y Tecnología de Computadores Campus de Espinardo. Facultad de Informática Office: 34-86-888-3983 30100 Murcia (SPAIN) Fax: 34-86-888-5151 http://www.ditec.um.es/ meacacio E-mail: meacacio@um.es RESEARCH INTERESTS Broad Category: Computer Architecture, Multiprocessor Architecture Specific Areas Include: Chip-multiprocessor architectures (CMPs), Cache coherence protocols, Reduced energy consumption, Fault tolerance in CMPs, Hardware transactional memory, Synchronization primitives, Multiprocessor architectures, Cache coherence protocols for SMPs and cc-numas, Parallel programming environments, Performance evaluation EDUCATION Sept 98 Mar 03 Sept 96 Jul 98 Sept 93 Jul 96 Ph.D. Degree in Computer Science Improving the Performance and Scalability of Directory-based Shared Memory Multiprocessors Facultad de Informática - Universidad de Murcia Advisors: José Duato (UPV) and José M. García (UM) Best Thesis Award (Jan. 2004 Facultad de Informática UM) M.S., Computer Science Facultad de Informática - Universidad de Murcia B.S., Electrical Engineering Facultad de Informática - Universidad de Murcia EXPERIENCE Oct 99 Present Jul 1, 02 Aug 30, 02 Universidad de Murcia Departamento de Ingeniería y Tecnología de Computadores Position: Associate Professor Courses: Introduction to Computer Organization Computer Architecture Advanced Computer Architecture Embedded and Real-Time Systems Summer Internship at IBM T.J. Watson Research Center Yorktown Heights (NY) BlueGene System Development Dept Verification of the BlueGene/L compute chip Page 1 of 9

PUBLICATIONS JOURNALS Alberto Ros, Polychronis Xekalakis, Marcelo Cintra, Manuel E. Acacio y José M. García. Adaptive Selection of Cache Indexing Bits for Removing Conflict Misses. Accepted in IEEE Transactions on Computers, 2014. Epifanio Gaona, Rubén Titos-Gil, Juan Fernández and Manuel E. Acacio. Selective Dynamic Serialization for Reducing Energy Consumption in Hardware Transactional Memory Systems. Journal of Supercomputing, Vol. 68, No. 2, May 2014 Rubén Titos-Gil, Anurag Negi, Manuel E. Acacio, José M. García and Per Stenström. ZEBRA: Data-Centric Contention Management in Hardware Transactional Memory. IEEE Transactions on Parallel and Distributed Systems, Vol. 25, No. 5, May 2014. Rubén Titos-Gil, Anurag Negi, Manuel E. Acacio, José M. García and Per Stenström. Eager beats Lazy: Improving Store Management in Eager Hardware Transactional Memory. IEEE Transactions on Parallel and Distributed Systems, Vol. 24, No. 11, November 2013. José L. Abellán, Juan Fernández and Manuel E. Acacio. Desing of an Efficient Communication Infrastructure for Highly-Contended Locks in Many-Core CMPs. Journal of Parallel and Distributed Computing, Vol. 73, No. 7, July 2013. Epifanio Gaona, Rubén Titos-Gil, Juan Fernández and Manuel E. Acacio. On the Design of Energy-Efficient Hardware Transactional Memory Systems. Concurrency and Computation: Practice & Experience, Vol. 25, No. 6, April 2013. Rubén Titos-Gil, Manuel E. Acacio y José M. García. Efficient Eager Management of Conflicts for Scalable Hardware Transactional Memory. IEEE Transactions on Parallel and Distributed Systems, Vol. 24, No. 1, January 2013. José M. Cecilia, José L. Abellán, Juan Fernández, Manuel E. Acacio, José M. García y Manuel Ujaldón. Stencil Computations on Heterogeneous Platforms for the Jacobi Method: GPUs versus Cell BE. Journal of Supercomputing, Vol. 62, No. 2, November 2012. José L. Abellán, Juan Fernández and Manuel E. Acacio. Efficient Hardware Barrier Synchronization in Many-Core CMPs. IEEE Transactions on Parallel and Distributed Systems, Vol. 23, No. 8, August 2012. Alberto Ros, Blas Cuesta, Ricardo Fernández, María E. Gómez, Manuel E. Acacio, Antonio Robles, José M. García and José Duato. "Extending Magny-Cours Cache Coherence". IEEE Transactions on Computers, Vol. 61, No. 5, pp. 593-206, May 2012. Rubén Titos-Gil, Manuel E. Acacio, José M. García, Tim Harris, Adrián Cristal, Osman Unsal, Ibrahim Hur and Mateo Valero. Hardware Transactional Memory with Software-Defined Conflicts. ACM Transactions on Architecture and Code Optimization, Vol. 8, No. 4, pp. 31:1-31:20, January 2012. Alberto Ros, Manuel E. Acacio and José M. García. A Direct Coherence Protocol for Many-Core Chip Multiprocessors. IEEE Transactions on Parallel and Distributed Systems, Vol. 21, No. 12, December 2010. Antonio Flores, Juan L. Aragón and Manuel E. Acacio. Exploiting Address Compression and Heterogeneous Interconnects for Efficient Message Management in Tiled CMPs. Euromicro Journal of Systems Architecture (JSA), Vol. 56, Issue 9, pp. 429-441, September 2010. Page 2 of 9

Ricardo Fernández-Pascual, José M. García, Manuel E. Acacio and José Duato. Dealing with Transient Faults in the Interconnection Network of CMPs at the Cache Coherence Level. IEEE Transactions on Parallel and Distributed Systems, Vol. 21, No. 8, August 2010. José L. Abellán, Juan Fernández and Manuel E. Acacio. Characterizing the Basic Synchronization and Communication Operations in Dual Cell-based Blades through CellStats. Journal of Supercomputing, Vol. 53, No. 2, August 2010. Alberto Ros, Manuel E. Acacio and José M. García. A Scalable Organization for Distributed Directories. Euromicro Journal of Systems Architecture (JSA), Vol. 56, Issue 2-3, pp. 77-87, February 2010. Antonio Flores, Juan L. Aragón and Manuel E. Acacio. Heterogeneous Interconnects for Energy- Efficient Message Management in CMPs. IEEE Transactions on Computers, Vol. 59, Issue 1, January 2010. Alberto Ros, Ricardo Fernández, Manuel E. Acacio and José M. García. Two Proposals for the Inclusion of Directory Information in the Last-Level Private Caches of Glueless Shared-Memory Multiprocessors". Journal of Parallel Distributed Computing, Vol. 68, No. 11, pp. 1413-1424, November 2008. Antonio Flores, Juan L. Aragón and Manuel E. Acacio. An Energy Consumption Characterization of On-Chip Interconnection Networks for Tiled CMP Architectures. The Journal of Supercomputing, Vol. 45, Issue 3, pp. 341-364, September 2008. Ricardo Fernández-Pascual, José M. García, Manuel E. Acacio and José Duato. Extending the TokenCMP Cache Coherence Protocol for Low Overhead Fault Tolerance in CMP Architectures. IEEE Transactions on Parallel and Distributed Systems, Vol. 19, No. 8, August 2008. Gregorio Bernabé, Ricardo Fernández, José M. García, Manuel E. Acacio and José González. An Efficient Implementation of a 3D Wavelet Transform Based Encoder on Hyper-Threading Technology. Parallel Computing Vol. 33, No. 1, pp. 54-72, February 2007. Francisco J. Villa, Manuel E. Acacio and José M. García. Toward Energy-Efficient High- Performance Organizations of the Memory Hierarchy in Chip-Multiprocessor Architectures". Journal of Computer Science & Technology, Vol. 6, No. 1, pp. 1-7, April 2006 (Invited paper). Francisco J. Villa, Manuel E. Acacio and José M. García. Evaluating IA-32 Web Servers through Simics: A Practical Experience. Euromicro Journal of Systems Architecture (JSA) 51, Elsevier, January 2005. Manuel E. Acacio, José González, José M. García and José Duato. A Two-level Directory Architecture for Highly Scalable cc-numa Multiprocessors. IEEE Transactions on Parallel and Distributed Systems, Vol. 16, No. 1, pp. 67-79, January 2005. Manuel E. Acacio, José González, José M. García and José Duato. An Architecture for High- Performance Scalable Shared-Memory Multiprocessors Exploiting On-chip Integration. IEEE Transactions on Parallel and Distributed Systems, Vol. 15, No. 8, pp. 755-768, August 2004. Manuel E. Acacio and José M. García. Techniques for Improving the Performance and Scalability of Directory-based Shared-Memory Multiprocessors: A Survey". Journal of Computer Science & Technology, Vol. 3, No. 2, pp. 1-8, October 2003 (Invited paper). M. E. Acacio, O. Cánovas, J. M. García, P. E. López de Teruel. MPI-Delphi: An MPI Implementation for Visual Programming Environments and Heterogeneous Computing. Journal of Future Generation Computer System, 18 (2002), pp. 317-333, Elsevier Science Publishers, North-Holland, 2002. Page 3 of 9

CONFERENCES Mario Lodge, José Flich and Manuel E. Acacio. Towards Efficient Dynamic LLC Home Bank Mapping with NoC-Level Support. Proc. of the Euro-Par 2013 Int l Conference, August 2013. José L. Abellán, Alberto Ros, Juan Fernández and Manuel E. Acacio. ECONO: Express Coherence Notifications for Efficient Cache Coherency in Many-Core CMPs. Proc. of the 2013 Int l Conference on Embedded Computer Systems: Architectures, MOdeling, and Simulation (IC- SAMOS XIII), July 2013. José L. Abellán, Alberto Ros, Juan Fernández and Manuel E. Acacio. Efficient Dir0B Cache Coherency for Many-Core CMP. Proc. of the 13 th Int l Conference on Computational Science (ICCS-2013), June 2013. Epifanio Gaona, José L. Abellán, Juan Fernández and Manuel E. Acacio. Deploying Hardware Locks to Improve Performance and Energy Efficiency of Hardware Transactional Memory. Proc. of the 26 th Int l Conference on Architecture of Computing Systems (ARCS-2013), February 2013. Mario Lodge, José Flich and Manuel E. Acacio. Dynamic Last-Level Cache Allocation to Reduce Area and Power Overhead in Directory Coherence Protocols. Proc. of the Euro-Par 2012 Int l Conference, August 2012. Alberto Ros, Polychronis Xekalakis, Marcelo Cintra, Manuel E. Acacio y José M. García. ASCIB: Adaptive Selection of Cache Indexing Bits for Removing Conflict Misses. Proc. of the Int l Symposium on Low Power Electronics and Design (ISLPED 2012), August 2012. Mario Lodge, José Flich and Manuel E. Acacio. Heterogeneous NoC Design for Efficient Broadcast-based Coherence Protocol Support. Proc. of the 6 th ACM/IEEE Int l Symposium on Networks-on-Chip (NOCS 2012), May 2012. José L. Abellán, Juan Fernández, Daniele Bortolotti, Manuel E. Acacio, Andrea Marongiu, Davide Bertozzi and Luca Benini. "Design of a Collective Communication Infrastructure for Barrier Synchronization in Cluster-Based Nanoscale MPSoCs". Proc. of DATE 2012, March 2012. Anurag Negi, Rubén Titos-Gil, Manuel E. Acacio, José M. García and Per Stenstrom. π-tm: Pessimistic Invalidation for Scalable Lazy Hardware Transactional Memory. Proc. of the 18 th Int l Symposium on High Performance Computer Architecture (HPCA-2012), February 2012. Epifanio Gaona, Rubén Titos-Gil, Juan Fernández and Manuel E. Acacio. Dynamic Serialization: Improving Energy Consumption in Eager-Eager Hardware Transactional Memory Systems. Proc. of the 20 th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP-2012), February 2012. Rubén Titos-Gil, Manuel E. Acacio, José M. García, Tim Harris, Adrián Cristal, Osman Unsal, Ibrahim Hur and Mateo Valero. Hardware Transactional Memory with Software-Defined Conflicts. In 7 th Int l Conference on High-Performance and Embedded Architectures and Compilers (HiPEAC 2012), January 2012. Anurag Negi, Rubén Titos-Gil, Manuel E. Acacio, José M. García and Per Stenstrom. Improving Commit Scalability in Lazy Hardware Transactional Memory. Proc. of the 4 th Swedish Workshop on Multicore Computing (MCC-2011), November 2011. Anurag Negi, Rubén Titos-Gil, Manuel E. Acacio, José M. García and Per Stenstrom. Eager Meets Lazy: The Impact of Write-Buffering on Hardware Transactional Memory. Proc. of the 40 th Int l Conference on Parallel Processing (ICPP-2011), September 2011. Page 4 of 9

Rubén Titos-Gil, Anurag Negi, Manuel E. Acacio, José M. García and Per Stenstrom. ZEBRA: A Data-Centric, Hybrid-Policy Hardware Transactional Memory Design. Proc. of the 25 th Int l Conference on Supercomputing (ICS-2011), May 2011. Anurag Negi, Rubén Titos-Gil, Manuel E. Acacio, José M. García and Per Stenstrom. The Impact of Non-coherent Buffers on Lazy Hardware Transactional Memory Systems. Proc. of the 13 th Workshop on Advances on Parallel and Distributed Processing Symposium (APDCM 2011), in conjunction with IPDPS 2011, May 2011. José L. Abellán, Juan Fernández and Manuel E. Acacio. "GLocks: Efficient Support for Highly- Contended Locks in Many-Core CMPs". Proc. of the 25 th IEEE Int l Parallel and Distributed Processing Symposium (IPDPS 2011), May 2011 (Best Paper in the Architecture Track). Alberto Ros, Blas Cuesta, Ricardo Fernández, María E. Gómez, Manuel E. Acacio, Antonio Robles, José M. García and José Duato. "EMC2: Extending Magny-Cours Coherence for Large- Scale Servers". Proc. of 17 th Annual IEEE International Conference on High Performance Computing (HiPC 2010), December 2010. Epifanio Gaona, Rubén Titos, Juan Fernández and Manuel E. Acacio. Characterizing Energy Consumption in Hardware Transactional Memory Systems. Proc. of the 22 nd Int l Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2010), October 2010. José L. Abellán, Juan Fernández and Manuel E. Acacio. "A G-line-based Network for Fast and Efficient Barrier Synchronization in Many-Core CMPs". Proc. of the 39 th Int l Conference on Parallel Processing (ICPP 2010), September 2010. Alberto Ros and Manuel E. Acacio. "Low-Overhead Organizations for the Directory in Future Many-Core CMPs". Proc. of the 4 th Workshop on Highly Parallel Processing on a Chip (HPPC 2010), September 2010. José L. Abellán, Juan Fernández and Manuel E. Acacio. "Efficient and Scalable Barrier Synchronization for Many-Core CMPs". Proc. of the ACM Int l Conference on Computing Frontiers 2010, May 2010. Rubén Titos-Gil, Manuel E. Acacio, José M. García, Tim Harris, Adrián Cristal, Osman Unsal, Ibrahim Hur and Mateo Valero. Hardware Transactional Memory with Software-Defined Conflicts. 5 th ACM SIGPLAN Workshop on Transactional Computing (TRANSACT 2010), April 2010. Antonio Flores, Juan L. Aragón and Manuel E. Acacio. Energy-Efficient Hardware Prefetching for CMPs using Heterogeneous Interconnects. Proc. of the 18 th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP-2010), February 2010. José L. Abellán, Juan Fernández and Manuel E. Acacio. "A Novel Hardware-based Barrier Synchronization for Many-Core CMPs". 4 th Workshop on Interconnection Network Architectures: On-Chip, Multi-Chip (INA-OCMC 2010), January 2010. Alberto Ros, Marcelo Cintra, Manuel E. Acacio y José M. García. Distance-Aware Round-Robin Mapping for Large NUCA Caches. Proc. of the 16 th Annual IEEE International Conference on High Performance Computing (HiPC 2009), December 2009. Alberto Ros, Manuel E. Acacio y José M. García. Dealing with Traffic-Area Trade-Off in Cache Coherence Protocols for Many-Core CMPs. Proc. of the 8 th International Conference on Advanced Parallel Processing Technologies (APPT 2009), August 2009. Epifanio Gaona, Juan Fernández and Manuel E. Acacio. Fast and Efficient Synchronization and Communication Collective Primitives for Dual Cell-based Blades. Proc. of Euro-Par 2009 Int l Conference, August 2009. Page 5 of 9

Rubén Titos, Manuel E. Acacio and José M. García. Speculation-Based Conflict Resolution in Hardware Transactional Memory. Proc. of the 23 rd IEEE Int l Parallel and Distributed Processing Symposium (IPDPS 2009), May 2009. Joaquín Franco, Gregorio Bernabé, Manuel E. Acacio and Juan Fernández. A Parallel Implementation of the 2D Wavelet Transform Using CUDA. Proc. of the 17 th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP-09), February 2009. Ricardo Fernández-Pascual, José M. García, Manuel E. Acacio and José Duato. Fault-Tolerant Cache Coherence Protocols for CMPs: Evaluation and Trade-offs". Proc. of the 15 th Annual IEEE International Conference on High Performance Computing (HiPC 2008), December 2008. Rubén Titos, Manuel E. Acacio and José M. García. Directory-Based Conflict Detection in Hardware Transactional Memory. Proc. of the 15 th Annual IEEE International Conference on High Performance Computing (HiPC 2008), December 2008. Antonio Flores, Manuel E. Acacio and Juan L. Aragón. Address Compression and Heterogeneous Interconnects for Energy-Efficient High-Performance Tiled CMPs. Proc. of the 37th International Conference on Parallel Processing (ICPP-08), September 2008. Alberto Ros, Manuel E. Acacio y José M. García. Scalable Directory Organization for Tiled CMP Architectures. Proc. of the Int l Conference on Computer Design (CDES), July 2008. José L. Abellán, Juan Fernández and Manuel E. Acacio. Characterizing the Basic Synchronization and Communication Operations in Dual Cell-Based Blades. Proc. of the Int l Conference on Computational Science (ICCS 2008), June 2008. Ricardo Fernández-Pascual, José M. García, Manuel E. Acacio and José Duato. A Fault-Tolerant Directory-Based Cache Coherence Protocol for CMP Architectures". Proc. of the 38th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN-DCCS), June 2008. Alberto Ros, Manuel E. Acacio and José M. García. DiCo-CMP: Efficient Cache Coherency in Tiled CMP Architectures". Proc. of the 22nd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2008), April 2008. José L. Abellán, Juan Fernández and Manuel E. Acacio. CellStats: a Tool to Evaluate the Basic Synchronization and Communication Operations of the Cell BE. Proc. of the 16 th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP-08), February 2008. Rubén Titos, Manuel E. Acacio and José M. García. A Characterization of Conflicts in Log-Based Transactional Memory (LogTM). Proc. of the 16 th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP-08), February 2008. Alberto Ros, Manuel E. Acacio and José M. García. Direct Coherence: Bringing Together Performance and Scalability in Shared-Memory Multiprocessors. Proc. of the 14 th Int l Conference on High Performance Computing (HiPC 2007), December 2007. Antonio Flores, Juan L. Aragón and Manuel E. Acacio. Efficient Message Management in Tiled CMP Architectures using a Heterogeneous Interconnection Network. Proc. of the 14 th Int l Conference on High Performance Computing (HiPC 2007), December 2007. Antonio Flores, Juan L. Aragón and Manuel E. Acacio. Sim-PowerCMP: A Detailed Simulator for Energy Consumption Analysis in Future Embedded CMP Architectures. Proc. of the 4 th IEEE Int l Symposium on Embedded Computing (SEC-07), May 2007. Ricardo Fernández-Pascual, José M. García, Manuel E. Acacio and José Duato. A Low Overhead Fault Tolerant Coherence Protocol for CMP Architectures. Proc. of the 13 th Int l Symposium on High Performance Computer Architecture (HPCA-13), February 2007. Page 6 of 9

Francisco J. Villa, Manuel E. Acacio and José M. García. On the Evaluation of Dense Chip- Multiprocessor Architectures. Proc. of the 2006 Int l Conference on Embedded Computer Systems: Architectures, MOdeling, and Simulation (IC-SAMOS VI), July 2006. Ricardo Fernández-Pascual, José M. García and Manuel E. Acacio. Validating a Token Coherence Protocol for Scientific Workloads. Proc. of the 5 th Annual Workshop on Duplicating, Deconstructing, and Debunking (WDDD 2006), held in conjunction with ISCA-33, June 2006. Alberto Ros, Manuel E. Acacio and José M. García. An Efficient Cache Design for Scalable Glueless Shared-Memory Multiprocessors. Proc. of the 2006 ACM Int l Conference on Computing Frontiers, May 2006. Francisco J. Villa, Manuel E. Acacio and José M. García. Memory Subsystem Characterization in a 16-core Snoop-Based Chip-Multiprocessor Architecture. Proc. of the 2005 International Conference on High Performance Computing and Communications (HPCC), LNCS 3726, September 2005. Alberto Ros, Manuel E. Acacio and José M. García. A Novel Lightweight Directory Architecture for Scalable Shared-Memory Multiprocessors. Proc. of the Euro-Par 2005 Int l Conference, LNCS 3648, September 2005. Ricardo Fernández, Gregorio Bernabé, José M. García and Manuel E. Acacio. Optimizing a 3D- FWT Video Encoder for SMPs and HyperThreading Architectures. Proc. of the 13 th Euromicro Conference on Parallel, Distributed and Network-based Processing (PDP-05), February 2005. Francisco J. Villa, Manuel E. Acacio and José M. García. On the Evaluation of x86 Web Servers Using Simics: Limitations and Trade-Offs". Proc. of the 4 th International Conference on Computational Science, LNCS 3036, pp. 541-544, Krakow (Poland), June 2004. Manuel E. Acacio, José González, José M. García and José Duato. Owner Prediction for Accelerating Cache-to-Cache Transfer Misses in cc-numa Multiprocessors". Proc. of the SC2002 High Performance Networking and Computing, November 2002. Manuel E. Acacio, José González José M. García and José Duato. The Use of Prediction for Accelerating Upgrade Misses in cc-numa Multiprocessors". Proc. of the 2002 International Conference on Parallel Architectures and Compilation Techniques (PACT 2002), September 2002. Manuel E. Acacio, José González José M. García and José Duato. A Novel Approach to Reduce L2 Miss Latency in Shared-Memory Multiprocessors". Proc. of the 14 th Int l Parallel and Distributed Processing Symposium (IPDPS 2002), April 2002. Manuel E. Acacio, José González José M. García and José Duato. Reducing the Latency of L2 Misses in Shared-Memory Multiprocessors through On-Chip Directory Integration". Proc. of the 10 th Euromicro Workshop on Parallel, Distributed and Network-based Processing (PDP-02), January 2002. Manuel E. Acacio, José González José M. García and José Duato. A New Scalable Directory Architecture for Large-Scale Multiprocessors". Proc. of the 7 th Int l Symposium on High Performance Computer Architecture (HPCA-7), January 2001. M. Acacio, J.M. García, P.E. López de Teruel and O. Cánovas. "The MPI-Delphi Interface: A Visual Programming Environment for Clusters of Workstations". Proc. of the Int l Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA'99), CSREA Press, June 1999. P.E. López de Teruel, J.M. García, M.E. Acacio and O. Cánovas. "P-EDR: An Algorithm for Parallel Implementation of Parzen Density Estimation from Uncertain Observations". Proc. of the 11 th Int l Parallel and Distributed Processing Symposium (IPDPS 1999), April 1999. Page 7 of 9

M. Acacio, O. Cánovas, J.M. García and P.E. López de Teruel. "An Evaluation of Parallel Computing in PC Clusters". Proc. of the 4 th Int l ACPC Conference on Parallel Computing (ACPC'99), Lecture Notes in Computer Science 1557, February 1999. BOOKS José L. Abellán, Juan Fernández and Manuel E. Acacio. Efficient and Scalable Manycores: Optimized Synchronization and Cache Coherency. LAP Lambert Academic Publishing, March 2013. BOOK CHAPTERS Rubén Titos-Gil and Manuel E. Acacio. Hardware Approaches to Transactional Memory in Chip Multiprocessors. Accepted in Handbook on Data Centers, S. U. Khan and A. Y. Zomaya, Eds., Springer-Verlag, New York, USA. (Forthcoming) José L. Abellán, Juan Fernández and Manuel E. Acacio. Efficient Hardware-Supported Synchronization Mechanisms for Manycores. Accepted in Handbook on Data Centers, S. U. Khan and A. Y. Zomaya, Eds., Springer-Verlag, New York, USA. (Forthcoming) Alberto Ros, Manuel E. Acacio y José M. García. Cache Coherence Protocols for Many-Core CMPs. Parallel and Distributed Computing, In-Tech, January 2010. ISBN: 978-953-307-057-5. Page 8 of 9

PROFESSIONAL ACTIVITIES Reviewer for: ISCA 2001, SPAA 2001, CAC 2002 (Workshop held in conjunction with the IPDPS 2002), SC 2002, ICS 2003, PACT 2003, ISHPC 2003, IPDPS 2004, SBAC-PAD 2004, HPCA 2005, CGO 2005, IPDPS 2005, HPCA 2006, IPDPS 2006, ISCA 2006, Europar 2007, ICS 2007, PACT 2008, HOTI 2008, ISCA 2010, ISLPED 2010, HPCA 2011, ISLPED 2012, NPC 2012, Journal of Parallel and Distributed Computing, IEE Proceedings Computers and Digital Techniques, IEEE Transactions on Parallel and Distributed Systems, International Journal of Computers and Applications, IEEE Transactions on VLSI, IEEE Computer Architecture Letters, IEEE Micro, IET Circuits, Devices & Systems, EURASIP Journal on Embedded Systems, ACM TACO, Concurrency and computation: Practice and Experience, Canadian Journal of Electrical and Computer Engineering Committee Member of the IASDIS AC 2004, AC 2005, AC 2006, AC 2007, AC 2008, ICPP 2005, ICPADS 2006, IPM/ESA/RFID 2007, IPDPS 2009, ICESS 2009, EmbeddedCom 2009, ICESS 2010, ICESS 2012, HiPEAC 2013, ICESS 2013, HiPEAC 2014, ICESS 2014, OMHI 2014, DASC 2014, EmbeddedCom 2014, ICS 2014 Associate Editor of IEEE Transactions on Parallel and Distributed Systems Member of the HiPEAC European Network of Excellence (www.hipeac.org) Participant in the Euro-TM COST Action (www.eurotm.org) Page 9 of 9