Analytics for Software Project Management



Similar documents
IT services for analyses of various data samples

PMP 2013 Exam Prep. Course Overview

Single Level Drill Down Interactive Visualization Technique for Descriptive Data Mining Results

1. Systematic literature review

Enhanced Boosted Trees Technique for Customer Churn Prediction Model

Analysis and Comparison of Project Management Standards and Guides

RESEARCH OF DECISION SUPPORT SYSTEM (DSS) FOR GREENHOUSE BASED ON DATA MINING

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

A Statistical Text Mining Method for Patent Analysis

Prediction of Stock Performance Using Analytical Techniques

PMBOK 5. Chapters. 1. Introduction What is Project Management 2. Organizational Influences and Project Life Cycle 3. Project Management Processes

Business Challenges and Research Directions of Management Analytics in the Big Data Era

Data Mining for Customer Service Support. Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin

TECHNOLOGY ANALYSIS FOR INTERNET OF THINGS USING BIG DATA LEARNING

2.1. Data Mining for Biomedical and DNA data analysis

KNOWLEDGE DISCOVERY FOR SUPPLY CHAIN MANAGEMENT SYSTEMS: A SCHEMA COMPOSITION APPROACH

CHALLENGES AND WEAKNESSES OF AGILE METHOD IN ENTERPRISE ARCHITECTURE

Software Development Processes in Globally Distributed Environment

Gunther Ruhe Claes Wohlin. Editors. Software Project. Management in a Changing World. ^ Springer

How To Use Data Mining For Knowledge Management In Technology Enhanced Learning

Best Practices in Project Management

ISSN: A Review: Image Retrieval Using Web Multimedia Mining

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

IMPROVING BUSINESS PROCESS MODELING USING RECOMMENDATION METHOD

Project Risk Management

Data Mining in Web Search Engine Optimization and User Assisted Rank Results

Formal Methods for Preserving Privacy for Big Data Extraction Software

An Evaluation of Neural Networks Approaches used for Software Effort Estimation

SPATIAL DATA CLASSIFICATION AND DATA MINING

Class Imbalance Learning in Software Defect Prediction

TDWI strives to provide course books that are content-rich and that serve as useful reference documents after a class has ended.

72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD

A GUIDE TO THE PROJECT MANAGEMENT BODY OF KNOWLEDGE

Semantic Concept Based Retrieval of Software Bug Report with Feedback

Cleaned Data. Recommendations

Protocol for the Systematic Literature Review on Web Development Resource Estimation

Full-text Search in Intermediate Data Storage of FCART

Prediction of Heart Disease Using Naïve Bayes Algorithm

Data Mining Governance for Service Oriented Architecture

The Role of Information Technology Studies in Software Product Quality Improvement

APPLYING CASE BASED REASONING IN AGILE SOFTWARE DEVELOPMENT

Grid Density Clustering Algorithm

Healthcare Measurement Analysis Using Data mining Techniques

Towards applying Data Mining Techniques for Talent Mangement

A Case of Study on Hadoop Benchmark Behavior Modeling Using ALOJA-ML

Systems Features Analysis (SFA) and Analytic Hierarchy Process (AHP) in Systems Design and Development

Data Refinery with Big Data Aspects

Text Mining Approach for Big Data Analysis Using Clustering and Classification Methodologies

2 AIMS: an Agent-based Intelligent Tool for Informational Support

An Agent-Based Concept for Problem Management Systems to Enhance Reliability

Project Knowledge Management Based on Social Networks

TSRR: A Software Resource Repository for Trustworthiness Resource Management and Reuse

Research of Smart Distribution Network Big Data Model

A SYSTEMATIC LITERATURE REVIEW ON AGILE PROJECT MANAGEMENT

2.1. The Notion of Customer Relationship Management (CRM)

International Journal of World Research, Vol: I Issue XIII, December 2008, Print ISSN: X DATA MINING TECHNIQUES AND STOCK MARKET

Profile Based Personalized Web Search and Download Blocker

Subject Description Form

9 TH INTERNATIONAL ASECU CONFERENCE ON SYSTEMIC ECONOMIC CRISIS: CURRENT ISSUES AND PERSPECTIVES

ewise TM Project Knowledge Management Solution

Big Data Analytics OverOnline Transactional Data Set

Mining the Software Change Repository of a Legacy Telephony System

Interpreting the Management Process in IEEE/EIA with the Help of PMBOK

Towards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis

Certified Associate in Project Management (CAPM) Examination Content Outline

The Project Management Body of Knowledge, generally known as PMBOK, deals with nine fundamental subjects for the project management.

A Divided Regression Analysis for Big Data

Model of Learning Organizational Development of Primary School Network under the Office of Basic Education Commission

Clustering Marketing Datasets with Data Mining Techniques

Online Credit Card Application and Identity Crime Detection

A Secure Model for Medical Data Sharing

BUSINESS INTELLIGENCE AS SUPPORT TO KNOWLEDGE MANAGEMENT

Investigating Role of Service Knowledge Management System in Integration of ITIL V3 and EA

Communication Risks and Best Practices in Global Software Development during Requirements Change Management: A Systematic Literature Review Protocol

Exploitation of Server Log Files of User Behavior in Order to Inform Administrator

AN APPROACH TO WORD SENSE DISAMBIGUATION COMBINING MODIFIED LESK AND BAG-OF-WORDS

Project Management Guidelines

ProjectTales: Reusing Change Decisions and Rationales in Project Management

Data Mining Solutions for the Business Environment

Processing and data collection of program structures in open source repositories

VISUALIZATION APPROACH FOR SOFTWARE PROJECTS

Analysis of Internet Topologies

Transcription:

Analytics for Software Project Management Where Are We and Where Do We Go? Guenther Ruhe Joint paper with: Maleknaz Nayebi Roberta Cabral Mota Mujeeb Mufti 1

What do we mean by Analytics for software project management? Analytics ASPM SENG Project management Action 15 2

Types of Analytics (Kaisler et al. 2014) Descriptive: A set of techniques for reviewing and examining the data set(s) to understand the data and analyze business performance (what had happened?) Diagnostic: A set of techniques for determine what has happened and why (why did it happen?) Predictive: A set of techniques that analyze current and historical data to determine what is most likely to (not) happen (what will happen?) Prescriptive: A set of techniques for computationally developing and analyzing alternatives that can become courses of action either tactical or strategic that may discover the unexpected (what should happen?) Decisive: A set of techniques for visualizing information to facilitate human decision-making. Action 15 3

Project management Application of knowledge, skills, tools and techniques to project activities to meet the project requirements. Project management is accomplished through the application and integration of 47 logically grouped project management processes divided into five process groups: initiating, planning, executing, monitoring and controlling, and closing. (Duncan 2013). Action 15 4

Analytics for Software Project Management Where Are We and Where Do We Go? - SM Study - RQ s - Findings - Discussion 5

Systematic mapping study selection process 15,406 papers 7,306 papers 2,906 papers Searching databases Apply inclusion criteria Apply exclusion criteria Exclusion based on title & keywords Exclusion based on abstract Exclusion based on full-text 320 papers 193 papers 115 papers Action 15 6

Keywords used in electronic libraries { Analytical, Analysis, Analytics, Analyzing, Software Analytics, Data Science } AND { Software Management, Project Management, Software Development, Software Project Management }. Inspec, Science Direct, Scopus, IEEE, ACM Dig. Library Action 15 7

Where the papers were coming from? Canada USA China India Taiwan Action 15 8

Research questions RQ1 (Types of analytics): What types of analytics has been used across the different software project management knowledge areas defined in the Software PMBOK? RQ2 (Access to data): To what extent was data used from open repositories or made publicly available? RQ3 (Validation of results): To what degree was validation done and if so, what was the percentage using real world data? RQ4 (Reuseand replication): How much are the retrieved papers (i) cross-referencing each other and (ii) using mutual datasets? Action 15 9

Knowledge areas & analytical techniques Distribution of papers across knowledgeareasofspm & types of analytical techniques Descrip tive Diagnostic Predictive Prescriptive Total Stakeholder management 1 2 0 0 3 Procurement management 0 0 0 3 3 Risk management 6 2 12 4 24 Communication management 0 0 0 0 0 Human resource management 0 0 1 7 8 Quality management 0 3 5 2 10 Cost management 0 0 25 1 26 Time management 0 1 4 4 9 Scope management 1 2 1 7 11 Integration management 5 1 7 5 18 other 1 0 1 1 3 Total 14 11 56 34 0 20 30 Action 15 10

Pareto chart - publications across PMBOK knowledge areas 30 25 20 15 10 5 0 120 100 80 60 40 20 0 PMBOK knowldge areas Frequency Cumulative percentage Action 15 11

Availability of datasets 14 70 12 60 10 50 8 40 6 30 4 20 2 0 4 8 10 13 14 13 10 13 15 15 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 Open data set Closed dataset Open data/total # of used datasets 10 0 Action 15 12

Distribution of papers using validation with real vs. synthetic data sets Action 15 13

Cross-referencing 94 papers with no connection to each other 26 papers with connections Bannerman, P. L. (2008). Risk and risk management in software projects: A reassessment. Journal of Systems and Software, 81(12), 2118-2133. à most cross-references (5) Action 15 14

Classification of analytical questions Action 15 15 Buse, Zimmermann: Information needs for software development analytics. ICSE 2012: 987-996

Comparative analysis Comparing Rank 1 Rank 2 Rank 3 Rank 4 Importance in practice Difficulty in practice # of research papers Descriptive Diagnostic Prescriptive Predictive Predictive Prescriptive Descriptive Diagnostic Predictive Prescriptive Descriptive Diagnostic Action 15 16

Additional: Usage of hybrid techniques Action 15 17

Main findings - Discussion 93.9% of papers provide some form of validation. 37.3% made data openly accessible. Just 23% of the papers connected, replicated or reused previous models. Only 4% shared joined data Open: Evaluation of industrial usefulness of results Open: No trend from supporting developers towards also supporting managers Action 15 18

References [1]D. Dalcher, "Rethinking Success in Software Projects: Looking Beyond the Failure Factors," in Software Project Management in a Changing World, ed: Springer, 2014, pp. 27-49. [2] AE Hassan, "Software Analytics: GoingbeyondDevelopers,"IEEE Software, vol. 4, 2013. [3] R. Buse, T, Zimmermann, "Information Needs for Software Development Analytics, 34 th InternationalConference onsoftwareengineering (ICSE), 2012. [4] S. Kaisler, F. Armour, and J. A. Espinosa, "Introduction to big data: Challenges, opportunities, and realities minitrack," in 2014 47th HICSS International Conference on, 2014, pp. 728-728. [5] PMI, Software Extension to the PMBOK Guide, Fifth ed. Project Management Institute (PMI), USA: IEEE Computer Society, 2013. [6] T. Menzies, "Beyond data mining; towards idea engineering," in Proceedings of the 9th International Conferenceon Predictive Models insoftware Engineering, 2013, pp. 1-6. [7] T. Menzies, E. Kocaguneli, B. Turhan, L. Minku, and F. Peters, Sharing Data and Models in Software Engineering: Sharing Data and Models: Morgan Kaufmann, 2014. [8] G. Ruhe and C. Wohlin, Software Project Management in a Changing World: Springer, 2014. Action 15 19