Data Modeling 101 Donna Burbank and Steve Hoberman. Session Code ED03



Session Abstract
Learn the basics of data modeling. This session provides a practical working knowledge of data modeling concepts and best practices, and shows how to apply them with CA ERwin Data Modeler r8. It discusses conceptual, logical, and physical data models and the proper use case for each, and covers basic principles of relational data modeling such as entities, relationships, and keys. The session is based on Donna and Steve's recent book, Data Modeling Made Simple with CA ERwin Data Modeler r8. The first two people to register and attend will receive a FREE copy of the book.
September 9, 2011

Speaker Bios
Donna Burbank is a recognized industry expert and author, with more than 15 years of experience in data management, metadata management, and enterprise architecture. Donna is currently the senior director of product marketing for CA's data modeling solutions. She has worked with dozens of Fortune 500 companies worldwide in the U.S., Europe, Asia, and Africa and speaks regularly at industry conferences.
Steve Hoberman is the most requested data modeling instructor in the world. Steve taught his first data modeling class in 1992 and has educated more than 10,000 people about data modeling and business intelligence techniques since then, spanning every continent except Africa and Antarctica. Steve's Data Modeling Master Class is recognized as the most comprehensive data modeling course in the industry. More at www.stevehoberman.com

Speaker Bios
Donna and Steve have co-authored two books:
  Data Modeling for the Business
  Data Modeling Made Simple with CA ERwin Data Modeler r8, on which this presentation is based


Agenda
  What is a Data Model?
  Basic Logical Data Modeling Components
  Logical Data Modeling with CA ERwin Data Modeler Demo

What is a Data Model?

Models are everywhere
A model is a set of symbols and text used to make a complex concept easier to grasp.
Figure labels: Jack, Enterprise Architect; Mary, Enterprise Modeler; Bob, Enterprise Analyst.


Data Model Definition
A data model is a set of symbols and text used to make the actual data easier to grasp. It includes both data elements and business rules:
  Each Customer can own one or many Accounts.
  Each Account must be owned by one and only one Customer.
  Each Account can be a Savings, Brokerage, or Checking Account.
Diagram: Customer, Own, Account; Account subtypes: Savings Account, Brokerage Account, Checking Account.

Data Model Settings - Format
Conceptual (Proof sheet): High-level business solution. Scoping tool. Only basic and critical concepts.
Logical (Negative): Detailed business solution. Normalization. Dimensionality. Essence.
Physical (Instantiation): Detailed technical solution. Denormalization, indexing, views, partitioning. Star schema and snowflake. Incarnation.


Basic Logical Data Modeling Components

Entity
An entity is a collection of information about something that the business deems important and worthy of capture.
Who? What? When? Where? Why? How?

Data Element
A data element is a property of importance to the business whose values contribute to identifying, describing, or measuring instances of an entity.
Example entity: Employee
  Employee Identifier
  Employee Last Name
  Employee First Name
  Employee Hire Date
  Employee Signed Employment Contract
  Employee Drivers License Photo

A key helps you find entity instances
Student
  Student_Identifier (labels: candidate key, primary key, surrogate key)
  Student_Last_Name (IE1.1), Student_First_Name (IE1.2) (labels: non-unique index, composite key)
  Student_Social_Security_Number (AK1.1) (labels: candidate key, alternate key, natural key)
Class
  Class_Identifier (labels: candidate key, primary key, surrogate key)
Semester
  Semester_Identifier (labels: candidate key, primary key, surrogate key)
Student_Grade
  Class_Identifier (FK), Student_Identifier (FK), Semester_Identifier (FK) (labels: candidate key, primary key, surrogate key, foreign key, composite key)
  Final_Grade
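The key types on this slide can be sketched as a physical schema. The following is a minimal illustration, assuming SQLite (through Python's standard sqlite3 module) as the target database and assuming column types that the slide does not specify; the index name and sample rows are hypothetical.

```python
import sqlite3

# Hypothetical SQLite rendering of the Student / Class / Semester / Student_Grade slide.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled
conn.executescript("""
CREATE TABLE Student (
    Student_Identifier INTEGER PRIMARY KEY,              -- primary (surrogate) key
    Student_Last_Name TEXT NOT NULL,
    Student_First_Name TEXT NOT NULL,
    Student_Social_Security_Number TEXT NOT NULL UNIQUE  -- alternate (natural) key, AK1.1
);
-- IE1.1 / IE1.2: non-unique composite index on last name + first name
CREATE INDEX IE1_Student_Name ON Student (Student_Last_Name, Student_First_Name);

CREATE TABLE Class (Class_Identifier INTEGER PRIMARY KEY);
CREATE TABLE Semester (Semester_Identifier INTEGER PRIMARY KEY);

CREATE TABLE Student_Grade (
    Class_Identifier INTEGER NOT NULL REFERENCES Class (Class_Identifier),
    Student_Identifier INTEGER NOT NULL REFERENCES Student (Student_Identifier),
    Semester_Identifier INTEGER NOT NULL REFERENCES Semester (Semester_Identifier),
    Final_Grade TEXT,
    -- composite primary key built from three foreign keys
    PRIMARY KEY (Class_Identifier, Student_Identifier, Semester_Identifier)
);
""")

# Two students may share a name: the name index is non-unique.
conn.execute("INSERT INTO Student VALUES (1, 'Smith', 'Ann', '111-22-3333')")
conn.execute("INSERT INTO Student VALUES (2, 'Smith', 'Ann', '444-55-6666')")

def try_insert(sql: str) -> bool:
    """Return True if the statement succeeds, False if a constraint rejects it."""
    try:
        conn.execute(sql)
        return True
    except sqlite3.IntegrityError:
        return False

# The alternate key must stay unique, so a repeated number is rejected.
duplicate_ssn_ok = try_insert(
    "INSERT INTO Student VALUES (3, 'Jones', 'Bo', '111-22-3333')"
)
student_count = conn.execute("SELECT COUNT(*) FROM Student").fetchone()[0]
```

The primary key and the alternate key are both candidate keys; the choice of which one becomes primary is a design decision, and here the surrogate identifier is used as on the slide.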

Relationship
A rule is an instruction about how to behave in a specific situation. A static rule is represented on a model via a relationship.
Static Rules
  Structure: Each product can appear on one or many order lines. Each order line must contain one and only one product.
  Referential Integrity (RI): An order line cannot exist without a valid product. A student cannot exist without a valid student number.
Action Rules
  Freshman students can register for at most 18 credits a semester.
  Take 10% off an order if the order contains more than five products.
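One way to see the difference between the two kinds of rule: a static rule can be declared in the schema, while an action rule usually lives in application logic. The sketch below assumes SQLite for the static rule and plain Python functions for the action rules; the table, column, and function names are hypothetical and not taken from the presentation.

```python
import sqlite3

# Static rule (referential integrity): an order line cannot exist without a valid product.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled
conn.executescript("""
CREATE TABLE Product (Product_Id INTEGER PRIMARY KEY);
CREATE TABLE Order_Line (
    Order_Line_Id INTEGER PRIMARY KEY,
    Product_Id INTEGER NOT NULL REFERENCES Product (Product_Id)
);
""")
conn.execute("INSERT INTO Product VALUES (1)")
conn.execute("INSERT INTO Order_Line VALUES (100, 1)")  # valid product: accepted
try:
    conn.execute("INSERT INTO Order_Line VALUES (101, 999)")  # no such product
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True

# Action rules: behavior that a relationship line cannot express.
MAX_FRESHMAN_CREDITS = 18

def can_register(is_freshman: bool, current_credits: int, requested_credits: int) -> bool:
    """Freshman students can register for at most 18 credits a semester."""
    if not is_freshman:
        return True
    return current_credits + requested_credits <= MAX_FRESHMAN_CREDITS

def order_discount_rate(product_count: int) -> float:
    """Take 10% off an order if the order contains more than five products."""
    return 0.10 if product_count > 5 else 0.0
```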

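Identifying vs. Non-Identifying Relationships
Non-identifying: Customer (Customer Id, Customer Name) Own Account (Account Code, Customer Id (FK), Account Name). The foreign key Customer Id is a non-key attribute of Account.
Identifying: Customer (Customer Id, Customer Name) Own Account (Account Code, Customer Id (FK), Account Name). The foreign key Customer Id is part of the primary key of Account.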
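The two variants differ only in where the foreign key sits relative to the primary key. A minimal sketch, assuming SQLite and hypothetical table names (Account_NonIdentifying and Account_Identifying stand in for the two versions of Account on the slide):

```python
import sqlite3
from typing import List

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Customer (
    Customer_Id INTEGER PRIMARY KEY,
    Customer_Name TEXT
);
-- Non-identifying: Customer_Id is a plain foreign key outside the primary key.
CREATE TABLE Account_NonIdentifying (
    Account_Code TEXT PRIMARY KEY,
    Customer_Id INTEGER NOT NULL REFERENCES Customer (Customer_Id),
    Account_Name TEXT
);
-- Identifying: Customer_Id is a foreign key that is also part of the primary key.
CREATE TABLE Account_Identifying (
    Account_Code TEXT NOT NULL,
    Customer_Id INTEGER NOT NULL REFERENCES Customer (Customer_Id),
    Account_Name TEXT,
    PRIMARY KEY (Account_Code, Customer_Id)
);
""")

def primary_key_columns(table: str) -> List[str]:
    """List a table's primary key columns in key order, read from PRAGMA table_info."""
    rows = conn.execute(f"PRAGMA table_info({table})").fetchall()
    # Each row is (cid, name, type, notnull, dflt_value, pk); pk is 0 for non-key columns.
    key_rows = sorted((r for r in rows if r[5] > 0), key=lambda r: r[5])
    return [r[1] for r in key_rows]

non_identifying_key = primary_key_columns("Account_NonIdentifying")
identifying_key = primary_key_columns("Account_Identifying")
```

With the identifying form, an account cannot be looked up by Account_Code alone; the owning customer's key is needed as well.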

Supertypes/Subtypes
What is subtyping? Subtyping is grouping together the common data elements and relationships of entities, while keeping what's unique within each entity.
Other names for subtyping: Supertyping, Generalization, Inheritance.
Supertype: ACCOUNT
  ACCOUNT IDENTIFIER
  ACCOUNT NAME
  ACCOUNT STATUS CODE
Subtype: SAVINGS ACCOUNT
  ACCOUNT IDENTIFIER (FK)
  SAVINGS ACCOUNT MINIMUM BALANCE AMOUNT
Subtype: CHECKING ACCOUNT
  ACCOUNT IDENTIFIER (FK)
  CHECKING ACCOUNT FREE CHECKS PER MONTH QUANTITY
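Because the slide lists inheritance as another name for subtyping, the ACCOUNT example can be mirrored with class inheritance. This is only an analogy in Python dataclasses, with assumed field types and made-up sample values; it is not how CA ERwin stores subtypes.

```python
from dataclasses import dataclass
from decimal import Decimal

@dataclass
class Account:
    """Supertype: the data elements common to every account."""
    account_identifier: int
    account_name: str
    account_status_code: str

@dataclass
class SavingsAccount(Account):
    """Subtype: inherits the common elements and adds what is unique to savings."""
    savings_account_minimum_balance_amount: Decimal

@dataclass
class CheckingAccount(Account):
    """Subtype: inherits the common elements and adds what is unique to checking."""
    checking_account_free_checks_per_month_quantity: int

# Sample instances (values are illustrative only).
savings = SavingsAccount(1, "Rainy Day Fund", "OPEN", Decimal("500.00"))
checking = CheckingAccount(2, "Household", "OPEN", 10)
```

Both instances carry the three ACCOUNT elements, while each keeps its own additional element, which is the grouping the slide describes.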

Logical Modeling in CA ERwin Data Modeler: Baker Cakes Example

Baker Cakes, Inc.
Baker Cakes is a family-run business whose main stakeholder is Bob Baker, the owner/operator. Bob is in charge of making most decisions, from database design to icing color selection. In our example, we're building a data model for Baker Cakes, which is looking to build a new application to manage its data.

Step 1: Understanding Our Customers
The first business concepts (entities) we need to describe are the customers for Baker Cakes. Mr. Baker sells to both retail and wholesale customers. How do we represent this in a data model?

Showing Customer Types on a Data Model
In this case, we can use a supertype/subtype relationship to show the two types of customers.

Understanding Our Business
We ask Mr. Baker what the differences are between a Retail Customer and a Wholesale Customer. One key difference is that Wholesale Customers are managed by a Sales Rep. We'll need to show this on our data model by:
  Creating an entity for Sales Rep
  Creating a relationship between Sales Rep and Wholesale Customer

Business Rules
Our business rules for the relationship between Sales Rep and Wholesale Customer are as follows:
  Each Sales Rep must call upon one or more Wholesale Customers.
  Each Wholesale Customer must be called upon by one Sales Rep.
Callouts on the slide: Verb Phrase ("call upon", "be called upon by"), Cardinality ("one or more", "one"), and Mandatory (Not Null) ("must").
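The two rules can be checked against sample data to make the mandatory and cardinality parts concrete. The sketch below is a plain-Python illustration; the function name, the identifiers E1, E2, C1, C2, and the data layout (a mapping from each wholesale customer to its single sales rep) are assumptions, not part of the presentation.

```python
from typing import Dict, List, Optional, Set

def check_business_rules(
    sales_reps: Set[str],
    customer_rep: Dict[str, Optional[str]],
) -> List[str]:
    """Return one message per violation of the two Sales Rep / Wholesale Customer rules.

    customer_rep maps each wholesale customer to its sales rep (or None if unassigned).
    A mapping allows at most one rep per customer, so only the mandatory side is checked.
    """
    violations = []
    # Rule: each Wholesale Customer must be called upon by one Sales Rep (mandatory, not null).
    for customer, rep in sorted(customer_rep.items()):
        if rep is None or rep not in sales_reps:
            violations.append(f"{customer}: must be called upon by one Sales Rep")
    # Rule: each Sales Rep must call upon one or more Wholesale Customers.
    reps_with_customers = {rep for rep in customer_rep.values() if rep is not None}
    for rep in sorted(sales_reps - reps_with_customers):
        violations.append(f"{rep}: must call upon one or more Wholesale Customers")
    return violations

problems = check_business_rules({"E1", "E2"}, {"C1": "E1", "C2": None})
no_problems = check_business_rules({"E1"}, {"C1": "E1"})
```

The first rule is the one a NOT NULL foreign key can enforce directly in a database; the "one or more" side of the second rule generally needs a check like this one or application logic.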

Identifying or Non-Identifying?
Should we make this relationship an identifying or non-identifying relationship?
Key question: Do we need wholesale customer information to retrieve a given sales rep?
No. Mr. Baker tells us that Sales Reps are each identified by an Employee ID. Therefore, this is a non-identifying relationship (dashed line).

Showing Relationships on a Data Model
The relationship between Sales Rep and Wholesale Customer will appear as the following on our data model. Verb phrases generally read clockwise.
  Each Sales Rep must call upon one or more Wholesale Customers.
  Each Wholesale Customer must be called upon by one Sales Rep.

Defining Keys and Attributes
Now that we have our high-level entities and relationships defined, we need to add more detail about our Customers and Sales Reps. First, let's identify what information uniquely retrieves an instance of each (primary key). In this case, it's easy. Baker Cakes uses:
  A Customer ID to uniquely identify Customers (both retail and wholesale)
  An Employee ID to uniquely identify Sales Reps

Primary and Foreign Keys
Our model now looks like this. Primary Keys are listed above the line. Note that Foreign Keys (FK) were automatically created for us. (Remember Referential Integrity/RI!)

Attributes
In addition to the key/identifying attributes, there are other important attributes to define for Customers and Sales Reps. For instance, let's define an Employee First Name and Employee Last Name for Sales Reps, and a Company Name for Wholesale Customers, as shown below.
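Putting the Baker Cakes pieces together, the logical model described so far could map to a physical schema along the following lines. This is a sketch under stated assumptions: SQLite as the database, one table per supertype and subtype, assumed column types, and made-up sample rows. The presentation itself builds the model in CA ERwin Data Modeler rather than in hand-written SQL.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled
conn.executescript("""
CREATE TABLE Sales_Rep (
    Employee_Id INTEGER PRIMARY KEY,
    Employee_First_Name TEXT,
    Employee_Last_Name TEXT
);
-- Supertype: Customer ID identifies both retail and wholesale customers.
CREATE TABLE Customer (
    Customer_Id INTEGER PRIMARY KEY
);
-- Subtypes share the supertype's key.
CREATE TABLE Retail_Customer (
    Customer_Id INTEGER PRIMARY KEY REFERENCES Customer (Customer_Id)
);
CREATE TABLE Wholesale_Customer (
    Customer_Id INTEGER PRIMARY KEY REFERENCES Customer (Customer_Id),
    Company_Name TEXT,
    -- Non-identifying, mandatory relationship: the FK sits outside the key and is NOT NULL.
    Employee_Id INTEGER NOT NULL REFERENCES Sales_Rep (Employee_Id)
);
""")

# Illustrative sample data only.
conn.execute("INSERT INTO Sales_Rep VALUES (7, 'Pat', 'Lee')")
conn.execute("INSERT INTO Customer VALUES (1)")
conn.execute("INSERT INTO Wholesale_Customer VALUES (1, 'Corner Cafe', 7)")

row = conn.execute("""
    SELECT w.Company_Name, s.Employee_First_Name, s.Employee_Last_Name
    FROM Wholesale_Customer AS w
    JOIN Sales_Rep AS s ON s.Employee_Id = w.Employee_Id
""").fetchone()

# A wholesale customer without a sales rep violates the mandatory rule.
conn.execute("INSERT INTO Customer VALUES (2)")
try:
    conn.execute("INSERT INTO Wholesale_Customer VALUES (2, 'No Rep Bakery', NULL)")
    missing_rep_rejected = False
except sqlite3.IntegrityError:
    missing_rep_rejected = True
```

Other physical designs for the supertype/subtype are possible, such as a single Customer table with a type column; the table-per-entity layout is used here only because it matches the logical model one to one.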

Demo using CA ERwin Data Modeler
  Creating Entities
  Creating Relationships
  Creating Primary Keys
  Viewing how Foreign Keys are Inferred
  Defining Attributes

Try it yourself
Free Software: CA ERwin Data Modeler Community Edition
www.erwin.com

Summary
A data model is a blueprint for your data assets.
There are three levels of data models: conceptual, logical, and physical.
A logical data model (LDM) represents a detailed business solution. Logical model objects include:
  Entities, which define the who, what, where, when, and why
  Relationships, which define business rules around data
  Keys, which help identify instances of data
  Attributes, which provide detailed information about entities
CA ERwin Data Modeler helps automate the logical model design process.

Thank You
Donna Burbank: donna.burbank@ca.com, Twitter: @donnaburbank
Steve Hoberman: me@stevehoberman.com, www.stevehoberman.com
Copyright CA 2011. All rights reserved. All trademarks, trade names, service marks and logos referenced herein belong to their respective companies. No unauthorized use, copying or distribution permitted.

And the Book Winners are.
To view the book winners:
  Visit the Prize Center
  Follow @ERwinModeling on Twitter
If you didn't win, you can receive 20% off by ordering via www.technicspub.com using promotion code P20ERWIN.

Questions?

Legal notice
Copyright CA 2011. All rights reserved. All trademarks, trade names, service marks and logos referenced herein belong to their respective companies. No unauthorized use, copying or distribution permitted.
THIS PRESENTATION IS FOR YOUR INFORMATIONAL PURPOSES ONLY. CA assumes no responsibility for the accuracy or completeness of the information. TO THE EXTENT PERMITTED BY APPLICABLE LAW, CA PROVIDES THIS DOCUMENT "AS IS" WITHOUT WARRANTY OF ANY KIND, INCLUDING, WITHOUT LIMITATION, ANY IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, OR NONINFRINGEMENT. In no event will CA be liable for any loss or damage, direct or indirect, in connection with this presentation, including, without limitation, lost profits, lost investment, business interruption, goodwill, or lost data, even if CA is expressly advised of the possibility of such damages.
Certain information in this presentation may outline CA's general product direction. This presentation shall not serve to (i) affect the rights and/or obligations of CA or its licensees under any existing or future written license agreement or services agreement relating to any CA software product; or (ii) amend any product documentation or specifications for any CA software product. The development, release and timing of any features or functionality described in this presentation remain at CA's sole discretion.
Notwithstanding anything in this presentation to the contrary, upon the general availability of any future CA product release referenced in this presentation, CA may make such release available (i) for sale to new licensees of such product; and (ii) in the form of a regularly scheduled major product release. Such releases may be made available to current licensees of such product who are current subscribers to CA maintenance and support on a when and if-available basis.