Machine Learning Algorithms




MINDD: Models in decision making & data @nalysis
Prof. Francesco Archetti, archetti@disco.unimib.it
Prof. Enza Messina, messina@disco.unimib.it

Main Activities

Research Areas:
- Machine Learning Algorithms
- Probabilistic and Relational Models
- Optimization Under Uncertainty

Application Domains:
- Multimedia
- Life Sciences
- Ambient Intelligence
- World Wide Web
- Risk Management
- Supply Chain

Machine Learning Algorithms

Design, analysis and implementation of algorithms for pattern analysis, classification, clustering and prediction.

Our methodological skills:
- Predictive and descriptive models
- Probabilistic Models
- Feature selection
- Time series analysis

Applications of interest:
- World Wide Web: document clustering, recommendation systems
- Life Sciences: docking prediction, pharmacogenomic prediction
- Supply Chain: forecasting models
- Ambient Intelligence: anomaly detection

World Wide Web (Machine Learning Algorithms)

Hierarchical Document Clustering: group documents on the same topic into the same cluster, producing a taxonomy.
[Figure: a user's query, documents A to I grouped into clusters, and the resulting taxonomy]

Recommendation systems:
- Hidden Markov Models for dynamic user behaviour using click streams
- Combined content and click stream analysis for profiling web users
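The hierarchical document clustering idea on this slide can be illustrated with a minimal sketch. It assumes each document is already represented as a small numeric vector (the four toy vectors below are invented for illustration) and uses a simplified recursive bisecting 2-means; it is not the group's own algorithm, and the function names `two_means` and `bisect` are hypothetical.

```python
from math import sqrt

def dist(a, b):
    """Euclidean distance between two equal-length vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def centroid(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]

def two_means(names, vectors, iters=20):
    """Split names into two groups with 2-means, seeded by the farthest pair."""
    _, a, b = max((dist(vectors[p], vectors[q]), p, q)
                  for p in names for q in names if p < q)
    c0, c1 = vectors[a], vectors[b]
    left, right = list(names), []
    for _ in range(iters):
        new_left = [n for n in names
                    if dist(vectors[n], c0) <= dist(vectors[n], c1)]
        new_right = [n for n in names if n not in new_left]
        # Stop on a degenerate split or when the assignment no longer changes.
        if not new_left or not new_right or new_left == left:
            break
        left, right = new_left, new_right
        c0 = centroid([vectors[n] for n in left])
        c1 = centroid([vectors[n] for n in right])
    return left, right

def bisect(names, vectors, leaf_size=2):
    """Recursively bisect a set of documents into a nested-list taxonomy."""
    names = sorted(names)
    if len(names) <= max(leaf_size, 1):
        return names
    left, right = two_means(names, vectors)
    if not right:  # all documents identical: keep them together
        return names
    return [bisect(left, vectors, leaf_size), bisect(right, vectors, leaf_size)]

# Toy term-weight vectors (illustrative values only).
docs = {"A": [1.0, 0.0], "B": [0.9, 0.1], "C": [0.0, 1.0], "D": [0.1, 0.9]}
taxonomy = bisect(list(docs), docs)
print(taxonomy)
```

With these toy vectors the documents split into two topic groups, which become the two branches of the taxonomy.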

Life Sciences (Machine Learning Algorithms)

Analysis of molecular docking: evaluation of target-drug chemical interactions based on chemical descriptors.
[Figure labels: molecular descriptors, docking energy]

Analysis of drug response: cell lines derived from a variety of cancer tissues.
[Figure labels: gene expression level, drug response]

Ambient Intelligence (Machine Learning Algorithms)

Probabilistic Models:
- Anomaly detection using Markov chain based models
- Tracking using Kalman Filter and Genetic Programming:
  - Counting vehicles
  - Tracking objects
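As a rough illustration of anomaly detection with a Markov chain model, the sketch below estimates transition probabilities from sequences of "normal" behaviour and flags a new sequence when its average log-likelihood per transition falls below a threshold. The state names, training data, smoothing constant and threshold are all assumptions made for the example, not details taken from the slides.

```python
from math import log

def fit_transitions(sequences, states, alpha=1.0):
    """Estimate first-order transition probabilities with additive smoothing."""
    counts = {s: {t: alpha for t in states} for s in states}
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            counts[a][b] += 1
    probs = {}
    for s in states:
        total = sum(counts[s].values())
        probs[s] = {t: counts[s][t] / total for t in states}
    return probs

def avg_log_likelihood(seq, probs):
    """Average log-probability of the transitions in seq (needs len(seq) >= 2)."""
    steps = list(zip(seq, seq[1:]))
    return sum(log(probs[a][b]) for a, b in steps) / len(steps)

def is_anomalous(seq, probs, threshold=-1.0):
    """Flag sequences whose transitions are unlikely under the learned chain."""
    return avg_log_likelihood(seq, probs) < threshold

# Hypothetical traffic states and training data.
states = ["enter", "drive", "stop", "exit"]
normal_runs = [["enter", "drive", "drive", "exit"]] * 20
model = fit_transitions(normal_runs, states)

normal_score = avg_log_likelihood(["enter", "drive", "drive", "exit"], model)
odd_score = avg_log_likelihood(["enter", "stop", "stop", "stop"], model)
print(normal_score, odd_score)
```

The threshold would normally be tuned on held-out normal data; the fixed value here only keeps the example short.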

Relational Models

Development of algorithms which take into account (complex) relationships among a collection of different objects.

Our methodological skills:
- Relational Clustering
- Relational Dynamic Bayesian Networks
- Social Network Analysis
- Probabilistic Relational Models

Applications of interest:
- Multimedia: video summarization, emotion recognition
- Ambient Intelligence: unattended goods detection
- World Wide Web: document clustering, document ranking
- Life Sciences: pharmacogenomic prediction

Multimedia (Relational Models)

Fusion of multimedia sources for:
- Video summarization
- Emotion recognition

Ambient Intelligence: Unattended Goods (Relational Models)

Relational Dynamic Bayesian Networks can track objects and perform data association in complex systems.
Sensing and tracking the relations between different objects can be more effective than working directly on the objects themselves.
[Figure: two time slices, t and t+1, with nodes L.color, L.shape, P.hairColor, P.shirtColor and distance(l,p)]

World Wide Web: Document representation (Relational Models)

What is the degree of relation between a document entity and an image-block entity?

Relational Clustering (Relational Models)

Life Sciences: relationship between gene expression profiles and drug activity patterns for cancer treatment.

World Wide Web: clustering of web search results:
- Network of documents instead of single web pages
- Degree of relationship among documents
- Noisy content cleaning

Optimization Under Uncertainty

Development of models and algorithms for decision making under uncertainty.

Our methodological skills:
- Stochastic Linear Programming
- Stochastic Integer Programming
- Scenario Generation
- Simulation

Applications of interest:
- Risk Management: scenario analysis, resource allocation models
- Pharmaceutical Marketing: call planning, marketing mix optimization
- Life Sciences: metabolic network analysis, gene regulatory networks

Risk Management (Optimization Under Uncertainty)

Dynamic time series modelling:
- Switching models
- Forecasting
- Scenario generation

Stochastic Programming models:
- Portfolio planning
- Asset and Liability Management
- Environmental Real Option Pricing
- Capacity planning
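To make the link between scenario generation and stochastic programming concrete, here is a small sketch under stated assumptions. Scenarios are sampled from a hypothetical two-regime switching model, and a toy two-stage capacity planning problem is solved by enumeration: capacity is bought first, and any demand shortfall is penalised afterwards. All parameters (growth rates, volatilities, costs) are invented for illustration and do not come from the group's models.

```python
import random

def generate_scenarios(n, horizon, rng):
    """Sample end-of-horizon demand levels from a two-regime switching model."""
    # Hypothetical regimes: (mean growth, volatility, probability of staying).
    regimes = {"calm": (0.01, 0.02, 0.9), "turbulent": (-0.02, 0.06, 0.7)}
    scenarios = []
    for _scenario in range(n):
        regime, level = "calm", 100.0
        for _step in range(horizon):
            mu, sigma, stay = regimes[regime]
            level *= 1.0 + rng.gauss(mu, sigma)
            if rng.random() > stay:
                regime = "turbulent" if regime == "calm" else "calm"
        scenarios.append(level)
    return scenarios

def expected_cost(capacity, scenarios, unit_cost, shortfall_penalty):
    """First-stage capacity cost plus the average second-stage shortfall penalty."""
    recourse = sum(max(d - capacity, 0.0) for d in scenarios) / len(scenarios)
    return unit_cost * capacity + shortfall_penalty * recourse

def best_capacity(scenarios, unit_cost=1.0, shortfall_penalty=4.0):
    """Pick the scenario value with the lowest expected cost.

    The cost is piecewise linear and convex in capacity, so when the penalty
    exceeds the unit cost a minimiser lies at one of the scenario values.
    """
    return min(scenarios,
               key=lambda x: expected_cost(x, scenarios, unit_cost, shortfall_penalty))

rng = random.Random(0)  # fixed seed so the run is repeatable
demand_scenarios = generate_scenarios(200, 12, rng)
plan = best_capacity(demand_scenarios)
print(plan, expected_cost(plan, demand_scenarios, 1.0, 4.0))
```

Enumeration works here only because the problem has a single decision variable; realistic models of this kind are usually handed to a linear or integer programming solver.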

Pharmaceutical Marketing (Optimization Under Uncertainty)

- Optimization models for call planning based on aggregate response functions
- System Dynamics for optimizing marketing mix
- Social networks for targeting

[Figure labels: Physicians, Response functions, Targeting, Relational models, Prescriptions, Call Planning, Marketing Mix]

Life Sciences (Optimization Under Uncertainty)

- Mathematical Programming Models for optimizing Biological Networks
- Timed Stochastic Petri Nets to simulate Biological Networks

[Figure: network with nodes P1 to P4, m1 to m3 and D1 to D4 around a cellular membrane]
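A timed stochastic Petri net can be simulated with a simple race between exponentially timed transitions. The sketch below is one possible minimal implementation, assuming a constant firing rate per enabled transition; the net itself (tokens moving from D1 to P1 and then to P2) is a hypothetical wiring that borrows node names from the figure, not the network shown on the slide.

```python
import random

def enabled(marking, inputs):
    """A transition is enabled when every input place holds enough tokens."""
    return all(marking.get(p, 0) >= n for p, n in inputs.items())

def simulate(marking, transitions, t_end, rng):
    """Simulate the net until t_end or until no transition is enabled."""
    marking = dict(marking)
    t = 0.0
    trace = [(t, dict(marking))]
    while True:
        active = [tr for tr in transitions if enabled(marking, tr["inputs"])]
        if not active:
            break
        total_rate = sum(tr["rate"] for tr in active)
        t += rng.expovariate(total_rate)  # waiting time to the next firing
        if t > t_end:
            break
        # Choose which transition fires, with probability proportional to rate.
        pick = rng.random() * total_rate
        for tr in active:
            pick -= tr["rate"]
            if pick <= 0:
                break
        for p, n in tr["inputs"].items():
            marking[p] -= n
        for p, n in tr["outputs"].items():
            marking[p] = marking.get(p, 0) + n
        trace.append((t, dict(marking)))
    return trace

# Hypothetical net: transport across the membrane, then conversion.
net = [
    {"name": "transport", "rate": 1.0, "inputs": {"D1": 1}, "outputs": {"P1": 1}},
    {"name": "convert", "rate": 0.5, "inputs": {"P1": 1}, "outputs": {"P2": 1}},
]
start = {"D1": 10, "P1": 0, "P2": 0}
trace = simulate(start, net, t_end=1000.0, rng=random.Random(1))
print(trace[-1])
```

Each entry of `trace` is a `(time, marking)` pair, so token counts over time can be plotted or averaged across repeated runs with different seeds.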

People
- Francesco Archetti
- Enza Messina
- Guglielmo Lulli
- Cristina Elena Manfredotti
- Elisabetta Fersini
- Ilaria Giordani

Past PhD Students
- Patrick Valente, now IT Manager at Barclays Capital, UK
- Nico Di Domenica, now Senior Risk Analyst at Royal Bank of Scotland, UK
- Valentina Bosetti, now Senior Researcher at FEEM, Italy
- Daniele Toscani, now Researcher at CMR, Italy

A cooperation network for research projects and student mobility

Universities and research centres:
- University of Toronto
- Brunel University, CARISMA Research Center
- Norwegian University of Science and Technology
- Aachen University
- Massachusetts Institute of Technology
- Hungarian Academy of Sciences
- Centre of Research and Technology Hellas

Companies:
- TXT e-solutions
- Siemens
- Project Automation
- Aegate Ltd
- OptiRisk
- Astra Zeneca
- DELOS
- Comerson

Projects
- JUMAS - Judicial Management by Digital Libraries Semantics, EU FP7
- INSYEME - Integrated System for Emergency Management, MUR
- TRADE - Tracking RFID-based Agents in Distributed Environments, Regione Lombardia
- An integrated system for 3D image processing (Fondo per l'Innovazione Tecnologica, Italian Ministry of Economic Development, in collaboration with Comerson s.r.l.)
- Call Plan, Astra Zeneca
- Bayesian Fusion of Stochastic Models, TXT
- E-RelationHub, TXT
- OSP - Optimization Service Provision, EU FP5

Journal Publications (1)
- F. Archetti, E. Fersini, E. Messina, Enhancing Web Page Classification using Visual Block Analysis. To appear in Information Processing and Management.
- F. Archetti, S. Lanzeni, E. Messina, L. Vanneschi, Genetic Programming for Computational Pharmacokinetics in Drug Discovery and Development. To appear in Genetic Programming and Evolvable Machines.
- E. Messina, D. Toscani, Hidden Markov Models for Scenario Generation. To appear in IMA Journal of Management Mathematics.
- F. Archetti, S. Lanzeni, E. Messina, "Graph Models and Mathematical Programming in Biochemical Networks Analysis and Metabolic Engineering Design". To appear in Computers & Mathematics with Applications.
- G. Lulli, M. Romauch, A Mathematical Program to Refine Gene Regulatory Networks, to appear in Discrete Applied Mathematics.
- G. Andreatta, G. Lulli, A Multi-period TSP with Stochastic Regular and Urgent Demands, European Journal of Operational Research, 185, 1, pp. 122-132, 2008.
- P. Dell'Olmo, A. Iovanella, G. Lulli, B. Scoppola, Exploiting incomplete information to manage multiprocessor, to appear in Computers and Operations Research, 35, 5, pp. 1589-1600, 2008.
- G. Lulli, A.R. Odoni, The European Air Traffic Flow Management Problem, Transportation Science, 41, 4, pp. 1-13, 2007. (Short version accepted for the 11th IFAC Symposium on Control in Transportation Systems, August 2006, Delft, The Netherlands.)
- G. Lulli, S. Sen, A Heuristic Algorithm for Stochastic Integer Program with Complete Recourse, European Journal of Operational Research, 171, 3, pp. 879-890, 2006.
- E. Messina, V. Bosetti, Integrating stochastic programming and decision tree techniques in land conversion problems, Annals of Operations Research, 142, pp. 243-258, 2006.
- M.O. Ball, G. Lulli, Ground Delay Programs: Optimizing over the Included Flight Set Based on Distance, Air Traffic Control Quarterly, 12, pp. 1-25, 2004.
- G. Lulli, S. Sen, A Branch-and-Price Algorithm for Multi-stage Stochastic Integer Programming with Application to Stochastic Batch-Sizing Problems, Management Science, 50, 6, pp. 786-796, 2004.
- P. Dell'Olmo, G. Lulli, Planning Activities in a Network of Logistic Platforms with Limited Capacity, Annals of Operations Research, 129, Issue 1-4, pp. 155-169, 2004.

Journal Publications (2)
- V. Bosetti, J.M. Conrad, E. Messina, The Value of Flexibility: Preservation, Remediation, or Development for Ginostra? Environmental and Resource Economics, 29, 2, pp. 219-229, 2004.
- P. Dell'Olmo, G. Lulli, A new hierarchical architecture for Air Traffic Management: Optimisation of airway's capacity in a free flight scenario, European Journal of Operational Research, 144, 1, pp. 179-193, 2003.
- P. Dell'Olmo, G. Lulli, A Dynamic Programming Approach for the Airport Capacity Allocation Problem, IMA Journal of Management Mathematics, 14, pp. 235-249, 2003.
- E. Messina, V. Bosetti, Uncertainty and Option Value in Land Allocation Problems, Annals of Operations Research, 124, pp. 165-182, 2003.
- V. Bosetti, E. Messina, P. Valente, "Optimisation Technologies and Environmental Applications", IMA Journal of Management Mathematics, 13, pp. 167-185, 2002.
- S.A. Mir Hassani, C. Lucas, E. Messina, G. Mitra, Computational Solution of Capacity Planning Models under Uncertainty, Parallel Computing, 26, pp. 511-538, 2000.
- E. Messina, G. Mitra, "Modelling and analysis of multistage stochastic programming problems: a software environment", European Journal of Operational Research, 101, pp. 343-359, 1997.
- P. Baricelli, C. Lucas, E. Messina, G. Mitra, A model for strategic planning under uncertainty, TOP: O.R. in Practice, 4, 2, pp. 361-384, 1996.
- A. Gaivoronski, E. Messina, A. Sciomachen, A statistical generalized programming algorithm for stochastic optimization problems, Annals of Operations Research, 58, pp. 297-321, 1995.
- A. Gaivoronski, E. Messina, A. Sciomachen, A stochastic optimization approach for robot scheduling, Annals of Operations Research, 56, pp. 109-133, 1995.
- F. Archetti, E. Messina, A. Sciomachen, "A graph theoretical approach to the performance analysis of highly concurrent systems", Journal of Combinatorial, Information and System Science, 19, 1-2, pp. 87-95, 1994.
- E. Messina, A. Sciomachen, "Evaluation of resource allocation policies in a production line using Petri nets", Robotics & Computer-Integrated Manufacturing, 10, 6, pp. 413-422, 1993.

International Conference Proceedings and Book Chapters (1)
- D. Bertsimas, G. Lulli, A. Odoni, The Air Traffic Flow Management Problem: An Integer Optimization Approach. To appear in Proceedings of 13th International Conference IPCO 2008, Bertinoro, to appear in LNCS.
- K.F. Doerner, W.J. Gutjahr, R.F. Hartl, G. Lulli, Stochastic Local Search Procedures for the Probabilistic Two-Day Vehicle Routing Problem, to appear in A. Fink and F. Rothlauf eds., Advances in Computational Intelligence in Transportation and Logistics, Springer Series on Studies in Computational Intelligence.
- E. Fersini, C. Manfredotti, E. Messina, F. Archetti, Relational Clustering for Gene Expression Profiles and Drug Activity Pattern Analysis. SysBioHealth Symposium (ISBN: 978-88-903154-0-4), 2007.
- I. Giordani, L. Vanneschi, E. Fersini, Modelling the Relationship between the Microarray Data of the NCI-60 Anticancer Dataset with Therapeutic Responses by Genetic Programming. SysBioHealth Symposium (ISBN: 978-88-903154-0-4), 2007.
- S. Lanzeni, E. Messina, F. Archetti, Towards metabolic networks phylogeny using Petri Net-based expansional analysis, BMC Systems Biology 2007, 1(Suppl 1).
- F. Archetti, S. Lanzeni, E. Messina, L. Vanneschi, "Genetic Programming and other Machine Learning approaches to predict Median Oral Lethal Dose (LD50) and Plasma Protein Binding levels (%PPB) of drugs", Lecture Notes in Computer Science: Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics, 5th European Conference, EvoBIO 2007.
- F. Archetti, C. Manfredotti, M. Matteucci, E. Messina and D.G. Sorrenti, Multiple Hypotesis Markov Chains For On-Line Anomaly Detection in Traffic Video Surveillance, Proceedings ICDP 2006: Imaging for Crime Detection and Prevention, 13-14 June 2006.
- F. Archetti, C.E. Manfredotti, E. Messina and D.G. Sorrenti, Foreground-to-ghost Discrimination in Single-difference Preprocessing, Lecture Notes in Computer Science: Advanced Concepts for Intelligent Vision Systems, ACIVS 06, 263-274, 2006.
- F. Archetti, S. Lanzeni, E. Messina, L. Vanneschi, Genetic Programming for Human Oral Availability of Drugs, Lecture Notes in Computer Science: Genetic and Evolutionary Computation (GECCO 06), 2006.
- F. Archetti, E. Messina, D. Toscani, L. Vanneschi, "Classifying and Counting Vehicles in Traffic Control Applications", Lecture Notes in Computer Science: Applications of Evolutionary Computing, 2006.

International Conference Proceedings and Book Chapters (2)
- F. Archetti, E. Fersini, P. Campanelli, E. Messina, "A Hierarchical Document Clustering Environment Based on the Induced Bisecting k-means", Lecture Notes in Computer Science: Flexible Query Answering Systems, 2006.
- F. Archetti, E. Messina, D. Toscani, "UP-DRES User Profiling for a Dynamic REcommendation System", Lecture Notes in Computer Science: Advances in Data Mining. Applications in Medicine, Web Mining, Marketing, Image and Signal Mining, 2006.
- G. Andreatta, G. Lulli, The 2-period Probabilistic TSP: a Markov Decision Model, Proceedings of the EWGT2006 Joint Conferences: The 11th Meeting of the EURO Working Group on Transportation "Advances in Traffic and Transportation Systems Analysis" and Extra EURO Conference "Handling Uncertainty in Transportation", Technical University of Bari, September 27-29, pp. 672-676 (ISBN: 88-901798-2-1), 2006.
- F. De Paoli, G. Lulli, A. Maurino, Design of Quality-based Composite Services. In Proceedings of 4th International Conference on Service Oriented Computing, ICSOC 2006, Chicago, USA, LNCS vol. 4294, pp. 153-164, 2006.
- G. Lulli, G. Andreatta, Congestion Pricing and Queue Theory, in Proceedings of the 1st International Conference on Research in Air Transportation, Žilina, ISBN 80-8070-196-2, 2004.
- P. Dell'Olmo, G. Lulli, Models and Algorithms for the Airport Capacity Allocation Problem, in T. Ciriani et al. ed., OR in Space and Air, Kluwer Academic Publisher, pp. 351-368, 2003.
- G. Lulli, S. Sen, Stochastic Batch-Sizing Problems: Models and Algorithms, in D.L. Woodruff ed., Stochastic Integer Programming and Network Interdiction Models, Kluwer Academic Publisher, pp. 85-104, 2002.
- P. Dell'Olmo, G. Lulli, A Mathematical Programming Approach to Optimize Airport Capacity, in: Proceedings of the Fifth International Conference on Mathematical Programming and Applications, Varenna, 2002.
- G. Lulli, S. Sen, Scenario Updating Method for Stochastic Mixed-integer Programming Problems, in: U. Leopold-Wildburger, F. Rendl and G. Wäscher (eds), The OR 2002 proceedings, Springer Verlag, Klagenfurt, 2002.
- L. Bianco, P. Dell'Olmo, S. Giordani, G. Lulli, Models and Algorithms for Multi-airport Traffic Flow Coordination, in: Galati and Zellweger (ed.), Proceedings of the International ATM 02 Workshop, Capri, 2002.
- V. Bosetti, C. Conrad and E. Messina, The value of flexibility, CRENoS Working Paper, Cagliari, 2001.

International Conference Proceedings and Book Chapters (3)
- F. Archetti, E. Messina, F. Stella, Modellazione e simulazione del traffico urbano: un approccio basato su agenti autonomi, Proceedings Convegno PFT2, Taormina, 1997.
- F. Archetti, E. Messina, B. Mishra, F. Stella, CATS: a Complex Adaptive Traffic Simulator, Proceedings of the IFAC/IFIP/IFORS Symposium on Transportation Systems, Chania, Greece, Vol. 3, pp. 1340-1344, 1997.
- F. Fantauzzi, A. Gaivoronski, E. Messina, Decomposition methods for network optimization problems in the presence of uncertainty, Lecture Notes on Economics and Mathematical Systems: Network Optimization, P.M. Pardalos, D.W. Hearn, W.W. Hager (Eds.), Gainesville, Florida, Vol. 450, pp. 234-248, 1996.
- A. Gaivoronski, E. Messina, Optimization of stationary behavior of general stochastic discrete event dynamic systems, Proceedings WODES96, International Workshop on Discrete Event Systems, IEEE, pp. 238-243, Edinburgh, U.K., 1996.
- C. Lucas, E. Messina, G. Mitra, Risk and return analysis of multi-period strategic planning problems, Lecture Notes in Economics and Mathematical Systems: Stochastic Modelling in Innovation Manufacturing, A.H. Christer, S. Osaki, L.C. Thomas (Eds.), Springer-Verlag, Cambridge, pp. 81-96, 1995.
- F. Archetti, E. Messina, Process flexibility through stochastic optimization: a computational approach, Optimization in Industry, A. Sciomachen editor, John Wiley, Vol. 3, pp. 109-125, 1995.
- E. Messina, M. Colombo, "Application of Linear Programming reduction procedures to manufacturing models", Intelligent Automation and Soft Computing: Trends in Research, Development and Applications, M. Jamshidi, J. Yuh, C.C. Nguyen and R. Lumia eds., V.2, pp. 35-38, 1994.
- A. Gaivoronski, E. Messina, Stochastic optimization algorithms for regenerative DEDS, Lecture Notes in Control and Information Sciences: System Modelling and Optimization, J. Henry and J-P. Yvon eds., Springer Verlag, Vol. 197, pp. 320-331, 1993.
- E. Messina, A. Sciomachen, Optimal sequencing via structural simulation, Proceedings of the 36th annual Conference Automation, Genova (Italy), Vol. 3, pp. 197-210, 1992.