Partitioning Data on Features or Samples in Communication-Efficient Distributed Optimization?
|
|
- Harold Black
- 8 years ago
- Views:
Transcription
1 Partitioning Data on Features or Saples in Counication-Efficient Distributed Optiization? Chenxin Ma Industrial and Systes Engineering Lehigh University, USA Martin Taáč Industrial and Systes Engineering Lehigh University, USA Abstract In this paper we study the effect of the way that the data is partitioned in distributed optiization. The original DiSCO algorith [Counication-Efficient Distributed Optiization of Self- Concordant Epirical Loss, Yuchen Zhang and Lin Xiao, 205] partitions the input data based on saples. We describe how the original algorith has to be odified to allow partitioning on features and show its efficiency both in theory and also in practice. Introduction As the size of the datasets becoes larger and larger, distributed optiization ethods for achine learning have becoe increasingly iportant [2, 5, 3]. Existing ehods often require a large aount of counication between coputing nodes [7, 7, 9, 8], which is typically several agnitudes slower than reading data fro their own eory [0]. Thus, distributed achine learning suffers fro the counication bottlenec on real world applications. In this paper we focus on the regularized epirical ris iniization proble. Suppose we have n data saples {x i, y i } n i=, where each x i R d (i.e. we have d features), y i R. We will denote by X := [x,..., x n ] R d n. The optiization proble is to iniize the regularized epirical loss (ERM) f(w) := n φ i (w, x i ) + λ n 2 w 2 2, () i= where the first part is the data fitting ter, φ : R d R d R is a loss function which typically depends on y i. Soe popular loss functions includes hinge loss φ i (w, x i ) = ax{0, y i w T x i }, square loss φ i (w, x i ) = (y i w T x i ) 2 or logistic loss φ i (w, x i ) = log( + exp( y i w T x i )). The second part of objective function () is l 2 regularizer (λ > 0) which helps to prevent over-fitting of the data. We assue that the loss function φ i is convex and self-concordant [9]: Assuption. For all i [n] := {, 2,..., n} the convex function φ is self-concordant with paraeter M i.e. the following inequality holds: u T (f (w)[u])u M(u T f (w)u) 3 2 (2) for any u R d and w do(f), where f (w)[u] := li t 0 t (f (w + tu) f (w)). There has been an enorous interest in large-scale achine learning probles and any parallel [4, ] or distributed algoriths have been proposed [, 6, 2, 4, 8]. The ain bottlenec in distributed coputing counication was handled by any researches differently. Soe wor considered ADMM type ethods [3, 6], another used bloc-coordinate type algoriths [8, 7, 7, 9], where they tried to solve the local sub-probles ore accurately (which should decrease the overall counications requireents when copared with ore basic approaches [5, 6]).
2 Algorith High-level DiSCO algorith : Input: paraeters ρ, µ 0, nuber of iterations K 2: Initializing w 0. 3: for = 0,,2,...,K do 4: Option : Given w, run DiSCO-S PCG Algorith 2, get v and δ 5: Option 2: Given w, run DiSCO-F PCG Algorith 3, get v and δ 6: Update w + = w +δ v 7: end for 8: Output: w K+ 2 Algorith We assue that we have achines (coputing nodes) available which can counicate between each other over the networ. We assue that the space needed to store the data atrix X exceeds the eory of every single node. Thus we have to split the data (atrix X) over the nodes. The natural question is: How to split the data into parts? There are any possible ways, but two obvious ones:. split the data atrix X by rows (i.e. create blocs by rows); Because rows of X corresponds to features, we will denote the algorith which is using this type of partitioning as DiSCO-F; 2. split the data atrix X by coluns; Let us note that coluns of X corresponds to saples we will denote the algorith which is using this type of partitioning as DiSCO-S; Notice that the DiSCO-S is exactly the sae as DiSCO proposed and analyzed in [9]. In each iteration of Algorith, wee need to copute an inexact Newton step v such that f (w )v f (w ) 2 ɛ, which is an approxiate solution to the Newton syste f (w )v = f(w ). The discussion about how to choose ɛ and K and a convergence guarantees for Algorith can be found in [9]. The ain goal of this wor is to analyze the algorithic odifications to DiSCO-S when the partitioning type is changed. It will turn out that partitioning on features (DiSCO-F) can lead to algorith which uses less counications (depending on the relations between d and n) (see Section 3). DiSCO-S Algorith. If the dataset is partitioned by saples, such that j th node will only store X j = [x j,,..., x j,nj ] R d nj, which is a part of X, then each achine can evaluate a local epirical loss function n j f j (w) := φ(w, x j,i ) + λ n j 2 w 2 2. (3) i= Algorith 2 Distributed DiSCO-S: PCG algorith data partitioned by saples : Input: w R d, and µ 0. counication (Broadcast w R d and reduceall f i (w ) R d ) 2: Initialization: Let P be coputed as (4). v 0 = 0, s 0 = P r 0, r 0 = f(w ), u 0 = s 0. 3: for t = 0,, 2,... do 4: Copute Hu t counication (Broadcast u t R d and reduceall f i(w )u t R d ) 5: Copute α t = rt,st u t,hu t 6: Update v (t+) = v t + α t u t, Hv (t+) = Hv t + α t Hu t, r t+ = r t α t Hu t. 7: Update s (t+) = P r (t+). 8: Copute β t = r (t+),s (t+) r t,s t 9: Update u (t+) = s (t+) + β t u t. 0: until: r (r+) 2 ɛ : end for 2: Return: v = v t+, δ = v T (t+) Hv t + α t v T (t+) Hu t 2
3 Because {X j } is a partition of X we have j= n j = n, our goal now becoes to iniize the function f(w) = h= f j(w). Let H denote the Hessian f (w ). For siplicity in this paper we consider only square loss and hence in this case f (w ) is constant (independent on w ). In Algorith 2, each achine will use its local data to copute the local gradient and local Hessian and then aggregate the together. We also have to choose one achine as the aster, which coputes all the vector operations of PCG loops (Preconditioned Conjugate Gradient), i.e., step 5-9 in Algorith 2. The preconditioning atrix for PCG is defined only on aster node and consists of the local Hessian approxiated by a subset of data available on aster node with size τ, i.e. P = τ τ φ (w, x,j ) + µi, (4) j= where µ is a sall regularization paraeter. Algorith 2 presents the distributed PCG athod for solving the preconditioning linear syste P Hv = P f(w ). (5) DiSCO-F Algorith. If the dataset is partitioned by features, then jth achine will store X j = [a [j],..., a[j] n ] R dj n, which contains all the saples, but only with a subset of features. Also, each achine will only store w [j] R dj and thus only be responsible for the coputation and updates of R dj vectors. By doing so, we only need one ReduceAll on a vector of length n, in addition to two ReduceAll on scalars nuber. Algorith 3 Distributed DiSCO-F: PCG algorith data partitioned by features : Input: w [i] Rdi for i =, 2,...,, and µ 0. 2: Initialization: Let P be coputed as (4). v [i] 0 = 0, s[i] 0 = (P ) [i] r [i] 0, r[i] 3: while r r+ 2 ɛ do 0 = f (w [i] ), u[i] 0 = s[i] 0. 4: Copute (Hu t ) [i]. counication (ReduceAll an R di vector) 5: Copute α t = 6: Update v [i] t+ = v[i] t 7: Update s [i] 8: Copute β t = 9: Update u [i] 0: end while i= r[i] t,s[i] t i= u[i] t,(hut)[i] t+ = (P ) [i] r [i] t+ = s[i] : Copute δ [i] 2: Integration: v = [v [] 3: Return: v, δ. counication (ReduceAll a nuber) + α t u [i] t, (Hv t+ ) [i] = (Hv t ) [i] + α t (Hu t ) [i], r [i] t+ = r[i] t α t (Hu t ) [i]. t+. i= r[i] t+,s[i] t+ i= r[i] t,s[i] t. counication (ReduceAll a nuber) t+ + β tu [i] t. = v [i] T t+ (Hvt ) [i] + α t v t+t [i] (Hut ) [i]. t+,..., v[] t+ ], δ = [δ [] t+,..., δ[] t+ ] counication (Reduce an vector) Rdi Coparison of Counication and Coputational Cost. In Table we copare the counication cost for the two approaches DiSCO-S/DiSCO-F. As it is obvious fro the table, DiSCO-F requires only one reduceall of a vector of length n, whereas the DiSCO-S needs one reduceall of a vector of length d and one broadcast of vector of size d. So roughly speaing, when n < d then DiSCO-F will need less counication. However, very interestingly, the advantage of DiSCO-F is the fact that it uses CPU on every node ore effectively. It also requires less total aount of wor to be perfored on each node, leading to ore balanced and efficient utilization of nodes. 3 Nuerical Experients We present experients on several standard large real-world datasets: news20.binary (d =, 355, 9; n = 9, 996; 0.3GB); dd200(test) (d = 29, 890, 095; n = 748, 40; 0.9GB); and epsilon (d = 2, 000; n = 3
4 Table : Coparison of coputation and counication between different ways of partition on data. partition by saples partition by features atrix-vector ultiplication (R d d R d ) (R d d R d ) aster bac solving linear syste (R d ) (R d ) su of vectors 4 (R d ) 4 (R d ) coputation inner product of vectors 4 (R d ) 4 (R d ) atrix-vector ultiplication (R d d R d ) (R d di R di ) nodes bac solving linear syste 0 (R di ) su of vectors 0 4 (R di ) inner product of vectors 0 4 (R di ) counication Broadcast one R d vector 0 ReduceAll one R d vector one R n vector, 2 R Elapsed Tie Elapsed Tie Elapsed Tie Figure : Coparison of DiSCO-S, DiSCO-F and on various datasets. 00, 000; 3.04GB). Each data was split into achines. We ipleent DiSCO-S, DiSCO-F and [9] algoriths for coparison in C++, and run the on the Aazon cloud, using 4 3.xlarge EC2 instances. Figure copares the evolution of f(w) as function of elapsed tie, nuber of counications and iterations. As it can be observed, the DiSCO-F needs alost the sae nuber of iterations as DiSCO-S, however, it needs roughly just half the counication, therefore it is uch faster (if we care about elapsed tie). 4
5 References [] Aleh Agarwal and John C Duchi. Distributed delayed stochastic optiization. In Advances in Neural Inforation Processing Systes, pages , 20. [2] Diitri P Bertseas and John N Tsitsilis. Parallel and distributed coputation: nuerical ethods. Prentice- Hall, Inc., 989. [3] Stephen Boyd, Neal Parih, Eric Chu, Borja Peleato, and Jonathan Ecstein. Distributed optiization and statistical learning via the alternating direction ethod of ultipliers. Foundations and Trends R in Machine Learning, 3(): 22, 20. [4] Joseph K Bradley, Aapo Kyrola, Danny Bicson, and Carlos Guestrin. Parallel coordinate descent for l- regularized loss iniization. arxiv preprint arxiv: , 20. [5] Ofer Deel, Ran Gilad-Bachrach, Ohad Shair, and Lin Xiao. Optial distributed online prediction using inibatches. The Journal of Machine Learning Research, 3():65 202, 202. [6] Wei Deng and Wotao Yin. On the global and linear convergence of the generalized alternating direction ethod of ultipliers. Journal of Scientific Coputing, pages 28, 202. [7] Martin Jaggi, Virginia Sith, Martin Taác, Jonathan Terhorst, Sanjay Krishnan, Thoas Hofann, and Michael I Jordan. Counication-efficient distributed dual coordinate ascent. In Advances in Neural Inforation Processing Systes, pages , 204. [8] Ching-Pei Lee and Dan Roth. Distributed box-constrained quadratic optiization for dual linear SVM. ICML, 205. [9] Chenxin Ma, Virginia Sith, Martin Jaggi, Michael I Jordan, Peter Richtári, and Martin Taáč. Adding vs. averaging in distributed prial-dual optiization. In ICML Proceedings of the 32th International Conference on Machine Learning, volue 37, pages JMLR, 205. [0] Jaub Marece, Peter Richtári, and Martin Taác. Distributed bloc coordinate descent for iniizing partially separable functions. Nuerical Analysis and Optiization 204, Springer Proceedings in Matheatics and Statistics, 204. [] Benjain Recht, Christopher Re, Stephen Wright, and Feng Niu. Hogwild: A loc-free approach to parallelizing stochastic gradient descent. In Advances in Neural Inforation Processing Systes, pages , 20. [2] Peter Richtári and Martin Taáč. Distributed coordinate descent ethod for learning with big data. arxiv preprint arxiv: , 203. [3] Ohad Shair and Nathan Srebro. Distributed stochastic optiization and learning. In Counication, Control, and Coputing (Allerton), nd Annual Allerton Conference on, pages IEEE, 204. [4] Ohad Shair, Nathan Srebro, and Tong Zhang. Counication efficient distributed optiization using an approxiate newton-type ethod. arxiv preprint arxiv: , 203. [5] Martin Taáč, Avleen Bijral, Peter Richtári, and Nathan Srebro. Mini-batch prial and dual ethods for SVMs. ICML, 203. [6] Martin Taáč, Peter Richtári, and Nathan Srebro. Distributed ini-batch SDCA. arxiv preprint arxiv: , 205. [7] Tianbao Yang. Trading coputation for counication: Distributed stochastic dual coordinate ascent. In Advances in Neural Inforation Processing Systes, pages , 203. [8] Tianbao Yang, Shenghuo Zhu, Rong Jin, and Yuanqing Lin. Analysis of distributed stochastic dual coordinate ascent. arxiv preprint arxiv:32.03, 203. [9] Yuchen Zhang and Lin Xiao. Counication-efficient distributed optiization of self-concordant epirical loss. arxiv preprint arxiv: ,
Federated Optimization: Distributed Optimization Beyond the Datacenter
Federated Optimization: Distributed Optimization Beyond the Datacenter Jakub Konečný School of Mathematics University of Edinburgh J.Konecny@sms.ed.ac.uk H. Brendan McMahan Google, Inc. Seattle, WA 98103
More informationApplying Multiple Neural Networks on Large Scale Data
0 International Conference on Inforation and Electronics Engineering IPCSIT vol6 (0) (0) IACSIT Press, Singapore Applying Multiple Neural Networks on Large Scale Data Kritsanatt Boonkiatpong and Sukree
More informationCooperative Caching for Adaptive Bit Rate Streaming in Content Delivery Networks
Cooperative Caching for Adaptive Bit Rate Streaing in Content Delivery Networs Phuong Luu Vo Departent of Coputer Science and Engineering, International University - VNUHCM, Vietna vtlphuong@hciu.edu.vn
More informationAn Innovate Dynamic Load Balancing Algorithm Based on Task
An Innovate Dynaic Load Balancing Algorith Based on Task Classification Hong-bin Wang,,a, Zhi-yi Fang, b, Guan-nan Qu,*,c, Xiao-dan Ren,d College of Coputer Science and Technology, Jilin University, Changchun
More informationOnline Bagging and Boosting
Abstract Bagging and boosting are two of the ost well-known enseble learning ethods due to their theoretical perforance guarantees and strong experiental results. However, these algoriths have been used
More informationarxiv:0805.1434v1 [math.pr] 9 May 2008
Degree-distribution stability of scale-free networs Zhenting Hou, Xiangxing Kong, Dinghua Shi,2, and Guanrong Chen 3 School of Matheatics, Central South University, Changsha 40083, China 2 Departent of
More informationMulti-Class Deep Boosting
Multi-Class Deep Boosting Vitaly Kuznetsov Courant Institute 25 Mercer Street New York, NY 002 vitaly@cis.nyu.edu Mehryar Mohri Courant Institute & Google Research 25 Mercer Street New York, NY 002 ohri@cis.nyu.edu
More informationExtended-Horizon Analysis of Pressure Sensitivities for Leak Detection in Water Distribution Networks: Application to the Barcelona Network
2013 European Control Conference (ECC) July 17-19, 2013, Zürich, Switzerland. Extended-Horizon Analysis of Pressure Sensitivities for Leak Detection in Water Distribution Networks: Application to the Barcelona
More information2. FINDING A SOLUTION
The 7 th Balan Conference on Operational Research BACOR 5 Constanta, May 5, Roania OPTIMAL TIME AND SPACE COMPLEXITY ALGORITHM FOR CONSTRUCTION OF ALL BINARY TREES FROM PRE-ORDER AND POST-ORDER TRAVERSALS
More informationTrading Regret for Efficiency: Online Convex Optimization with Long Term Constraints
Journal of Machine Learning Research 13 2012) 2503-2528 Subitted 8/11; Revised 3/12; Published 9/12 rading Regret for Efficiency: Online Convex Optiization with Long er Constraints Mehrdad Mahdavi Rong
More informationThe Research of Measuring Approach and Energy Efficiency for Hadoop Periodic Jobs
Send Orders for Reprints to reprints@benthascience.ae 206 The Open Fuels & Energy Science Journal, 2015, 8, 206-210 Open Access The Research of Measuring Approach and Energy Efficiency for Hadoop Periodic
More informationMachine Learning Applications in Grid Computing
Machine Learning Applications in Grid Coputing George Cybenko, Guofei Jiang and Daniel Bilar Thayer School of Engineering Dartouth College Hanover, NH 03755, USA gvc@dartouth.edu, guofei.jiang@dartouth.edu
More informationAn Optimal Task Allocation Model for System Cost Analysis in Heterogeneous Distributed Computing Systems: A Heuristic Approach
An Optial Tas Allocation Model for Syste Cost Analysis in Heterogeneous Distributed Coputing Systes: A Heuristic Approach P. K. Yadav Central Building Research Institute, Rooree- 247667, Uttarahand (INDIA)
More informationExploiting Hardware Heterogeneity within the Same Instance Type of Amazon EC2
Exploiting Hardware Heterogeneity within the Sae Instance Type of Aazon EC2 Zhonghong Ou, Hao Zhuang, Jukka K. Nurinen, Antti Ylä-Jääski, Pan Hui Aalto University, Finland; Deutsch Teleko Laboratories,
More informationResource Allocation in Wireless Networks with Multiple Relays
Resource Allocation in Wireless Networks with Multiple Relays Kağan Bakanoğlu, Stefano Toasin, Elza Erkip Departent of Electrical and Coputer Engineering, Polytechnic Institute of NYU, Brooklyn, NY, 0
More informationModeling Parallel Applications Performance on Heterogeneous Systems
Modeling Parallel Applications Perforance on Heterogeneous Systes Jaeela Al-Jaroodi, Nader Mohaed, Hong Jiang and David Swanson Departent of Coputer Science and Engineering University of Nebraska Lincoln
More informationASIC Design Project Management Supported by Multi Agent Simulation
ASIC Design Project Manageent Supported by Multi Agent Siulation Jana Blaschke, Christian Sebeke, Wolfgang Rosenstiel Abstract The coplexity of Application Specific Integrated Circuits (ASICs) is continuously
More informationOptimal Resource-Constraint Project Scheduling with Overlapping Modes
Optial Resource-Constraint Proect Scheduling with Overlapping Modes François Berthaut Lucas Grèze Robert Pellerin Nathalie Perrier Adnène Hai February 20 CIRRELT-20-09 Bureaux de Montréal : Bureaux de
More informationImage restoration for a rectangular poor-pixels detector
Iage restoration for a rectangular poor-pixels detector Pengcheng Wen 1, Xiangjun Wang 1, Hong Wei 2 1 State Key Laboratory of Precision Measuring Technology and Instruents, Tianjin University, China 2
More informationThe Virtual Spring Mass System
The Virtual Spring Mass Syste J. S. Freudenberg EECS 6 Ebedded Control Systes Huan Coputer Interaction A force feedbac syste, such as the haptic heel used in the EECS 6 lab, is capable of exhibiting a
More informationRECURSIVE DYNAMIC PROGRAMMING: HEURISTIC RULES, BOUNDING AND STATE SPACE REDUCTION. Henrik Kure
RECURSIVE DYNAMIC PROGRAMMING: HEURISTIC RULES, BOUNDING AND STATE SPACE REDUCTION Henrik Kure Dina, Danish Inforatics Network In the Agricultural Sciences Royal Veterinary and Agricultural University
More informationConsiderations on Distributed Load Balancing for Fully Heterogeneous Machines: Two Particular Cases
Considerations on Distributed Load Balancing for Fully Heterogeneous Machines: Two Particular Cases Nathanaël Cheriere Departent of Coputer Science ENS Rennes Rennes, France nathanael.cheriere@ens-rennes.fr
More informationEfficient Key Management for Secure Group Communications with Bursty Behavior
Efficient Key Manageent for Secure Group Counications with Bursty Behavior Xukai Zou, Byrav Raaurthy Departent of Coputer Science and Engineering University of Nebraska-Lincoln Lincoln, NE68588, USA Eail:
More informationCRM FACTORS ASSESSMENT USING ANALYTIC HIERARCHY PROCESS
641 CRM FACTORS ASSESSMENT USING ANALYTIC HIERARCHY PROCESS Marketa Zajarosova 1* *Ph.D. VSB - Technical University of Ostrava, THE CZECH REPUBLIC arketa.zajarosova@vsb.cz Abstract Custoer relationship
More informationCLOSED-LOOP SUPPLY CHAIN NETWORK OPTIMIZATION FOR HONG KONG CARTRIDGE RECYCLING INDUSTRY
CLOSED-LOOP SUPPLY CHAIN NETWORK OPTIMIZATION FOR HONG KONG CARTRIDGE RECYCLING INDUSTRY Y. T. Chen Departent of Industrial and Systes Engineering Hong Kong Polytechnic University, Hong Kong yongtong.chen@connect.polyu.hk
More informationCPU Animation. Introduction. CPU skinning. CPUSkin Scalar:
CPU Aniation Introduction The iportance of real-tie character aniation has greatly increased in odern gaes. Aniating eshes ia 'skinning' can be perfored on both a general purpose CPU and a ore specialized
More informationAnalyzing Spatiotemporal Characteristics of Education Network Traffic with Flexible Multiscale Entropy
Vol. 9, No. 5 (2016), pp.303-312 http://dx.doi.org/10.14257/ijgdc.2016.9.5.26 Analyzing Spatioteporal Characteristics of Education Network Traffic with Flexible Multiscale Entropy Chen Yang, Renjie Zhou
More informationReliability Constrained Packet-sizing for Linear Multi-hop Wireless Networks
Reliability Constrained acket-sizing for inear Multi-hop Wireless Networks Ning Wen, and Randall A. Berry Departent of Electrical Engineering and Coputer Science Northwestern University, Evanston, Illinois
More informationReconnect 04 Solving Integer Programs with Branch and Bound (and Branch and Cut)
Sandia is a ultiprogra laboratory operated by Sandia Corporation, a Lockheed Martin Copany, Reconnect 04 Solving Integer Progras with Branch and Bound (and Branch and Cut) Cynthia Phillips (Sandia National
More informationStochastic Online Scheduling on Parallel Machines
Stochastic Online Scheduling on Parallel Machines Nicole Megow 1, Marc Uetz 2, and Tark Vredeveld 3 1 Technische Universit at Berlin, Institut f ur Matheatik, Strasse des 17. Juni 136, 10623 Berlin, Gerany
More informationPREDICTION OF POSSIBLE CONGESTIONS IN SLA CREATION PROCESS
PREDICTIO OF POSSIBLE COGESTIOS I SLA CREATIO PROCESS Srećko Krile University of Dubrovnik Departent of Electrical Engineering and Coputing Cira Carica 4, 20000 Dubrovnik, Croatia Tel +385 20 445-739,
More informationProtecting Small Keys in Authentication Protocols for Wireless Sensor Networks
Protecting Sall Keys in Authentication Protocols for Wireless Sensor Networks Kalvinder Singh Australia Developent Laboratory, IBM and School of Inforation and Counication Technology, Griffith University
More informationUse of extrapolation to forecast the working capital in the mechanical engineering companies
ECONTECHMOD. AN INTERNATIONAL QUARTERLY JOURNAL 2014. Vol. 1. No. 1. 23 28 Use of extrapolation to forecast the working capital in the echanical engineering copanies A. Cherep, Y. Shvets Departent of finance
More informationThe Application of Bandwidth Optimization Technique in SLA Negotiation Process
The Application of Bandwidth Optiization Technique in SLA egotiation Process Srecko Krile University of Dubrovnik Departent of Electrical Engineering and Coputing Cira Carica 4, 20000 Dubrovnik, Croatia
More informationMarkovian inventory policy with application to the paper industry
Coputers and Cheical Engineering 26 (2002) 1399 1413 www.elsevier.co/locate/copcheeng Markovian inventory policy with application to the paper industry K. Karen Yin a, *, Hu Liu a,1, Neil E. Johnson b,2
More informationDynamic Placement for Clustered Web Applications
Dynaic laceent for Clustered Web Applications A. Karve, T. Kibrel, G. acifici, M. Spreitzer, M. Steinder, M. Sviridenko, and A. Tantawi IBM T.J. Watson Research Center {karve,kibrel,giovanni,spreitz,steinder,sviri,tantawi}@us.ib.co
More informationBig Data Analytics: Optimization and Randomization
Big Data Analytics: Optimization and Randomization Tianbao Yang, Qihang Lin, Rong Jin Tutorial@SIGKDD 2015 Sydney, Australia Department of Computer Science, The University of Iowa, IA, USA Department of
More informationA Scalable Application Placement Controller for Enterprise Data Centers
W WWW 7 / Track: Perforance and Scalability A Scalable Application Placeent Controller for Enterprise Data Centers Chunqiang Tang, Malgorzata Steinder, Michael Spreitzer, and Giovanni Pacifici IBM T.J.
More informationEnergy Efficient VM Scheduling for Cloud Data Centers: Exact allocation and migration algorithms
Energy Efficient VM Scheduling for Cloud Data Centers: Exact allocation and igration algoriths Chaia Ghribi, Makhlouf Hadji and Djaal Zeghlache Institut Mines-Téléco, Téléco SudParis UMR CNRS 5157 9, Rue
More informationReal Time Target Tracking with Binary Sensor Networks and Parallel Computing
Real Tie Target Tracking with Binary Sensor Networks and Parallel Coputing Hong Lin, John Rushing, Sara J. Graves, Steve Tanner, and Evans Criswell Abstract A parallel real tie data fusion and target tracking
More informationDesign of Model Reference Self Tuning Mechanism for PID like Fuzzy Controller
Research Article International Journal of Current Engineering and Technology EISSN 77 46, PISSN 347 56 4 INPRESSCO, All Rights Reserved Available at http://inpressco.co/category/ijcet Design of Model Reference
More informationData Streaming Algorithms for Estimating Entropy of Network Traffic
Data Streaing Algoriths for Estiating Entropy of Network Traffic Ashwin Lall University of Rochester Vyas Sekar Carnegie Mellon University Mitsunori Ogihara University of Rochester Jun (Ji) Xu Georgia
More informationModern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh
Modern Optimization Methods for Big Data Problems MATH11146 The University of Edinburgh Peter Richtárik Week 3 Randomized Coordinate Descent With Arbitrary Sampling January 27, 2016 1 / 30 The Problem
More informationSoftware Quality Characteristics Tested For Mobile Application Development
Thesis no: MGSE-2015-02 Software Quality Characteristics Tested For Mobile Application Developent Literature Review and Epirical Survey WALEED ANWAR Faculty of Coputing Blekinge Institute of Technology
More informationSearching strategy for multi-target discovery in wireless networks
Searching strategy for ulti-target discovery in wireless networks Zhao Cheng, Wendi B. Heinzelan Departent of Electrical and Coputer Engineering University of Rochester Rochester, NY 467 (585) 75-{878,
More informationModels and Algorithms for Stochastic Online Scheduling 1
Models and Algoriths for Stochastic Online Scheduling 1 Nicole Megow Technische Universität Berlin, Institut für Matheatik, Strasse des 17. Juni 136, 10623 Berlin, Gerany. eail: negow@ath.tu-berlin.de
More informationEvaluating Inventory Management Performance: a Preliminary Desk-Simulation Study Based on IOC Model
Evaluating Inventory Manageent Perforance: a Preliinary Desk-Siulation Study Based on IOC Model Flora Bernardel, Roberto Panizzolo, and Davide Martinazzo Abstract The focus of this study is on preliinary
More informationON SELF-ROUTING IN CLOS CONNECTION NETWORKS. BARRY G. DOUGLASS Electrical Engineering Department Texas A&M University College Station, TX 77843-3128
ON SELF-ROUTING IN CLOS CONNECTION NETWORKS BARRY G. DOUGLASS Electrical Engineering Departent Texas A&M University College Station, TX 778-8 A. YAVUZ ORUÇ Electrical Engineering Departent and Institute
More informationMethod of supply chain optimization in E-commerce
MPRA Munich Personal RePEc Archive Method of supply chain optiization in E-coerce Petr Suchánek and Robert Bucki Silesian University - School of Business Adinistration, The College of Inforatics and Manageent
More informationINTEGRATED ENVIRONMENT FOR STORING AND HANDLING INFORMATION IN TASKS OF INDUCTIVE MODELLING FOR BUSINESS INTELLIGENCE SYSTEMS
Artificial Intelligence Methods and Techniques for Business and Engineering Applications 210 INTEGRATED ENVIRONMENT FOR STORING AND HANDLING INFORMATION IN TASKS OF INDUCTIVE MODELLING FOR BUSINESS INTELLIGENCE
More informationImpact of Processing Costs on Service Chain Placement in Network Functions Virtualization
Ipact of Processing Costs on Service Chain Placeent in Network Functions Virtualization Marco Savi, Massio Tornatore, Giacoo Verticale Dipartiento di Elettronica, Inforazione e Bioingegneria, Politecnico
More informationSupport Vector Machine Soft Margin Classifiers: Error Analysis
Journal of Machine Learning Research? (2004)?-?? Subitted 9/03; Published??/04 Support Vector Machine Soft Margin Classifiers: Error Analysis Di-Rong Chen Departent of Applied Matheatics Beijing University
More informationMedia Adaptation Framework in Biofeedback System for Stroke Patient Rehabilitation
Media Adaptation Fraework in Biofeedback Syste for Stroke Patient Rehabilitation Yinpeng Chen, Weiwei Xu, Hari Sundara, Thanassis Rikakis, Sheng-Min Liu Arts, Media and Engineering Progra Arizona State
More informationPERFORMANCE METRICS FOR THE IT SERVICES PORTFOLIO
Bulletin of the Transilvania University of Braşov Series I: Engineering Sciences Vol. 4 (53) No. - 0 PERFORMANCE METRICS FOR THE IT SERVICES PORTFOLIO V. CAZACU I. SZÉKELY F. SANDU 3 T. BĂLAN Abstract:
More informationPerformance Evaluation of Machine Learning Techniques using Software Cost Drivers
Perforance Evaluation of Machine Learning Techniques using Software Cost Drivers Manas Gaur Departent of Coputer Engineering, Delhi Technological University Delhi, India ABSTRACT There is a treendous rise
More informationFuzzy Sets in HR Management
Acta Polytechnica Hungarica Vol. 8, No. 3, 2011 Fuzzy Sets in HR Manageent Blanka Zeková AXIOM SW, s.r.o., 760 01 Zlín, Czech Republic blanka.zekova@sezna.cz Jana Talašová Faculty of Science, Palacký Univerzity,
More information6. Time (or Space) Series Analysis
ATM 55 otes: Tie Series Analysis - Section 6a Page 8 6. Tie (or Space) Series Analysis In this chapter we will consider soe coon aspects of tie series analysis including autocorrelation, statistical prediction,
More informationData Set Generation for Rectangular Placement Problems
Data Set Generation for Rectangular Placeent Probles Christine L. Valenzuela (Muford) Pearl Y. Wang School of Coputer Science & Inforatics Departent of Coputer Science MS 4A5 Cardiff University George
More informationAn Integrated Approach for Monitoring Service Level Parameters of Software-Defined Networking
International Journal of Future Generation Counication and Networking Vol. 8, No. 6 (15), pp. 197-4 http://d.doi.org/1.1457/ijfgcn.15.8.6.19 An Integrated Approach for Monitoring Service Level Paraeters
More informationLoad Control for Overloaded MPLS/DiffServ Networks during SLA Negotiation
Int J Counications, Network and Syste Sciences, 29, 5, 422-432 doi:14236/ijcns292547 Published Online August 29 (http://wwwscirporg/journal/ijcns/) Load Control for Overloaded MPLS/DiffServ Networks during
More information( C) CLASS 10. TEMPERATURE AND ATOMS
CLASS 10. EMPERAURE AND AOMS 10.1. INRODUCION Boyle s understanding of the pressure-volue relationship for gases occurred in the late 1600 s. he relationships between volue and teperature, and between
More informationDistributed Machine Learning and Big Data
Distributed Machine Learning and Big Data Sourangshu Bhattacharya Dept. of Computer Science and Engineering, IIT Kharagpur. http://cse.iitkgp.ac.in/~sourangshu/ August 21, 2015 Sourangshu Bhattacharya
More informationLecture 8 February 4
ICS273A: Machine Learning Winter 2008 Lecture 8 February 4 Scribe: Carlos Agell (Student) Lecturer: Deva Ramanan 8.1 Neural Nets 8.1.1 Logistic Regression Recall the logistic function: g(x) = 1 1 + e θt
More informationComment on On Discriminative vs. Generative Classifiers: A Comparison of Logistic Regression and Naive Bayes
Coent on On Discriinative vs. Generative Classifiers: A Coparison of Logistic Regression and Naive Bayes Jing-Hao Xue (jinghao@stats.gla.ac.uk) and D. Michael Titterington (ike@stats.gla.ac.uk) Departent
More informationPartitioned Elias-Fano Indexes
Partitioned Elias-ano Indexes Giuseppe Ottaviano ISTI-CNR, Pisa giuseppe.ottaviano@isti.cnr.it Rossano Venturini Dept. of Coputer Science, University of Pisa rossano@di.unipi.it ABSTRACT The Elias-ano
More informationFactored Models for Probabilistic Modal Logic
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (2008 Factored Models for Probabilistic Modal Logic Afsaneh Shirazi and Eyal Air Coputer Science Departent, University of Illinois
More informationThis paper studies a rental firm that offers reusable products to price- and quality-of-service sensitive
MANUFACTURING & SERVICE OPERATIONS MANAGEMENT Vol., No. 3, Suer 28, pp. 429 447 issn 523-464 eissn 526-5498 8 3 429 infors doi.287/so.7.8 28 INFORMS INFORMS holds copyright to this article and distributed
More informationPackage termstrc. February 20, 2015
Type Package Title Zero-coupon Yield Curve Estiation Version 1.3.7 Date 2013-11-03 Author Robert Ferstl, Josef Hayden Package terstrc February 20, 2015 Maintainer Josef Hayden The
More informationPricing Asian Options using Monte Carlo Methods
U.U.D.M. Project Report 9:7 Pricing Asian Options using Monte Carlo Methods Hongbin Zhang Exaensarbete i ateatik, 3 hp Handledare och exainator: Johan Tysk Juni 9 Departent of Matheatics Uppsala University
More informationResearch Article Performance Evaluation of Human Resource Outsourcing in Food Processing Enterprises
Advance Journal of Food Science and Technology 9(2): 964-969, 205 ISSN: 2042-4868; e-issn: 2042-4876 205 Maxwell Scientific Publication Corp. Subitted: August 0, 205 Accepted: Septeber 3, 205 Published:
More informationCapacity of Multiple-Antenna Systems With Both Receiver and Transmitter Channel State Information
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 49, NO., OCTOBER 23 2697 Capacity of Multiple-Antenna Systes With Both Receiver and Transitter Channel State Inforation Sudharan K. Jayaweera, Student Meber,
More informationInformation Processing Letters
Inforation Processing Letters 111 2011) 178 183 Contents lists available at ScienceDirect Inforation Processing Letters www.elsevier.co/locate/ipl Offline file assignents for online load balancing Paul
More informationOnline Community Detection for Large Complex Networks
Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Online Counity Detection for Large Coplex Networks Wangsheng Zhang, Gang Pan, Zhaohui Wu, Shijian Li Departent
More informationDistributed Newton Methods for Regularized Logistic Regression
Distributed Newton Methods for Regularized Logistic Regression Yong Zhuang, Wei-Sheng Chin, Yu-Chin Juan, and Chih-Jen Lin Department of Computer Science National Taiwan University, Taipei, Taiwan {r1922139,d19446,r1922136,cjlin}@csie.ntu.edu.tw
More informationLeak detection in open water channels
Proceedings of the 17th World Congress The International Federation of Autoatic Control Seoul, Korea, July 6-11, 28 Leak detection in open water channels Erik Weyer Georges Bastin Departent of Electrical
More informationOnline Methods for Multi-Domain Learning and Adaptation
Online Methods for Multi-Doain Learning and Adaptation Mark Dredze and Koby Craer Departent of Coputer and Inforation Science University of Pennsylvania Philadelphia, PA 19104 USA {dredze,craer}@cis.upenn.edu
More informationOn Computing Nearest Neighbors with Applications to Decoding of Binary Linear Codes
On Coputing Nearest Neighbors with Applications to Decoding of Binary Linear Codes Alexander May and Ilya Ozerov Horst Görtz Institute for IT-Security Ruhr-University Bochu, Gerany Faculty of Matheatics
More informationStochastic Optimization for Big Data Analytics: Algorithms and Libraries
Stochastic Optimization for Big Data Analytics: Algorithms and Libraries Tianbao Yang SDM 2014, Philadelphia, Pennsylvania collaborators: Rong Jin, Shenghuo Zhu NEC Laboratories America, Michigan State
More informationModeling operational risk data reported above a time-varying threshold
Modeling operational risk data reported above a tie-varying threshold Pavel V. Shevchenko CSIRO Matheatical and Inforation Sciences, Sydney, Locked bag 7, North Ryde, NSW, 670, Australia. e-ail: Pavel.Shevchenko@csiro.au
More informationAccelerated Parallel Optimization Methods for Large Scale Machine Learning
Accelerated Parallel Optimization Methods for Large Scale Machine Learning Haipeng Luo Princeton University haipengl@cs.princeton.edu Patrick Haffner and Jean-François Paiement AT&T Labs - Research {haffner,jpaiement}@research.att.com
More informationPhysics 211: Lab Oscillations. Simple Harmonic Motion.
Physics 11: Lab Oscillations. Siple Haronic Motion. Reading Assignent: Chapter 15 Introduction: As we learned in class, physical systes will undergo an oscillatory otion, when displaced fro a stable equilibriu.
More informationNetwork delay-aware load balancing in selfish and cooperative distributed systems
Network delay-aware load balancing in selfish and cooperative distributed systes Piotr Skowron Faculty of Matheatics, Inforatics and Mechanics University of Warsaw Eail: p.skowron@iuw.edu.pl Krzysztof
More informationApproximately-Perfect Hashing: Improving Network Throughput through Efficient Off-chip Routing Table Lookup
Approxiately-Perfect ing: Iproving Network Throughput through Efficient Off-chip Routing Table Lookup Zhuo Huang, Jih-Kwon Peir, Shigang Chen Departent of Coputer & Inforation Science & Engineering, University
More informationPreference-based Search and Multi-criteria Optimization
Fro: AAAI-02 Proceedings. Copyright 2002, AAAI (www.aaai.org). All rights reserved. Preference-based Search and Multi-criteria Optiization Ulrich Junker ILOG 1681, route des Dolines F-06560 Valbonne ujunker@ilog.fr
More informationStable Learning in Coding Space for Multi-Class Decoding and Its Extension for Multi-Class Hypothesis Transfer Learning
Stable Learning in Coding Space for Multi-Class Decoding and Its Extension for Multi-Class Hypothesis Transfer Learning Bang Zhang, Yi Wang 2, Yang Wang, Fang Chen 2 National ICT Australia 2 School of
More informationManaging Complex Network Operation with Predictive Analytics
Managing Coplex Network Operation with Predictive Analytics Zhenyu Huang, Pak Chung Wong, Patrick Mackey, Yousu Chen, Jian Ma, Kevin Schneider, and Frank L. Greitzer Pacific Northwest National Laboratory
More informationEvaluating the Effectiveness of Task Overlapping as a Risk Response Strategy in Engineering Projects
Evaluating the Effectiveness of Task Overlapping as a Risk Response Strategy in Engineering Projects Lucas Grèze Robert Pellerin Nathalie Perrier Patrice Leclaire February 2011 CIRRELT-2011-11 Bureaux
More informationIEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, ACCEPTED FOR PUBLICATION 1. Secure Wireless Multicast for Delay-Sensitive Data via Network Coding
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, ACCEPTED FOR PUBLICATION 1 Secure Wireless Multicast for Delay-Sensitive Data via Network Coding Tuan T. Tran, Meber, IEEE, Hongxiang Li, Senior Meber, IEEE,
More informationGenerating Certification Authority Authenticated Public Keys in Ad Hoc Networks
SECURITY AND COMMUNICATION NETWORKS Published online in Wiley InterScience (www.interscience.wiley.co). Generating Certification Authority Authenticated Public Keys in Ad Hoc Networks G. Kounga 1, C. J.
More informationA Hybrid Grey-Game-MCDM Method for ERP Selecting Based on BSC. M. H. Kamfiroozi, 2 A. BonyadiNaeini
Int. J. Manag. Bus. Res., 3 (1), 13-20, Winter 2013 IAU A Hybrid Grey-Gae-MCDM Method for ERP Selecting Based on BSC 1 M. H. Kafiroozi, 2 A. BonyadiNaeini 1,2 Departent of Industrial Engineering, Iran
More informationHigh Performance Chinese/English Mixed OCR with Character Level Language Identification
2009 0th International Conference on Docuent Analysis and Recognition High Perforance Chinese/English Mixed OCR with Character Level Language Identification Kai Wang Institute of Machine Intelligence,
More informationChapter 5. Principles of Unsteady - State Heat Transfer
Suppleental Material for ransport Process and Separation Process Principles hapter 5 Principles of Unsteady - State Heat ransfer In this chapter, we will study cheical processes where heat transfer is
More informationMarkov Models and Their Use for Calculations of Important Traffic Parameters of Contact Center
Markov Models and Their Use for Calculations of Iportant Traffic Paraeters of Contact Center ERIK CHROMY, JAN DIEZKA, MATEJ KAVACKY Institute of Telecounications Slovak University of Technology Bratislava
More informationAirline Yield Management with Overbooking, Cancellations, and No-Shows JANAKIRAM SUBRAMANIAN
Airline Yield Manageent with Overbooking, Cancellations, and No-Shows JANAKIRAM SUBRAMANIAN Integral Developent Corporation, 301 University Avenue, Suite 200, Palo Alto, California 94301 SHALER STIDHAM
More informationA framework for performance monitoring, load balancing, adaptive timeouts and quality of service in digital libraries
Int J Digit Libr (2000) 3: 9 35 INTERNATIONAL JOURNAL ON Digital Libraries Springer-Verlag 2000 A fraework for perforance onitoring, load balancing, adaptive tieouts and quality of service in digital libraries
More informationBig Data Optimization: Randomized lock-free methods for minimizing partially separable convex functions
Big Data Optimization: Randomized lock-free methods for minimizing partially separable convex functions Peter Richtárik School of Mathematics The University of Edinburgh Joint work with Martin Takáč (Edinburgh)
More informationREQUIREMENTS FOR A COMPUTER SCIENCE CURRICULUM EMPHASIZING INFORMATION TECHNOLOGY SUBJECT AREA: CURRICULUM ISSUES
REQUIREMENTS FOR A COMPUTER SCIENCE CURRICULUM EMPHASIZING INFORMATION TECHNOLOGY SUBJECT AREA: CURRICULUM ISSUES Charles Reynolds Christopher Fox reynolds @cs.ju.edu fox@cs.ju.edu Departent of Coputer
More informationImplementation of Active Queue Management in a Combined Input and Output Queued Switch
pleentation of Active Queue Manageent in a obined nput and Output Queued Switch Bartek Wydrowski and Moshe Zukeran AR Special Research entre for Ultra-Broadband nforation Networks, EEE Departent, The University
More informationModified Latin Hypercube Sampling Monte Carlo (MLHSMC) Estimation for Average Quality Index
Analog Integrated Circuits and Signal Processing, vol. 9, no., April 999. Abstract Modified Latin Hypercube Sapling Monte Carlo (MLHSMC) Estiation for Average Quality Index Mansour Keraat and Richard Kielbasa
More informationEvaluating Software Quality of Vendors using Fuzzy Analytic Hierarchy Process
IMECS 2008 9-2 March 2008 Hong Kong Evaluating Software Quality of Vendors using Fuzzy Analytic Hierarchy Process Kevin K.F. Yuen* Henry C.W. au Abstract This paper proposes a fuzzy Analytic Hierarchy
More information