Practical Text Mining in Insurance



Similar documents
In comparison, much less modeling has been done in Homeowners

CAPTURING THE VALUE OF UNSTRUCTURED DATA: INTRODUCTION TO TEXT MINING

Claim notification form (Form RTA1)

Claim notification form

Claim notification form

SPECIAL PROVISIONS ñ NEW YORK

Frequently Asked Questions Auto Insurance

Claim notification form (RTA1) Low value personal injury claims in road traffic accidents ( 1,000-25,000)

Fraud Detection In Insurance Claims. Bob Biermann April 15, 2013

How To Choose Your Auto Insurance In Ohio

DIVISION OF INSURANCE

Warning! Warning! A plain vanilla Business Auto Policy (BAP) does not serve as a substitute for a

Chapter 10. Chapter 10 Learning Objectives. Insurance and Risk Management: An Introduction. Property and Motor Vehicle Insurance

EXPERT THINKING FROM MILLIMAN. Predictive analytics, text mining, and drug-impaired driving in automobile accidents

A Consumer s Auto Insurance Guide

C o p yr i g ht 2015, S A S I nstitute Inc. A l l r i g hts r eser v ed. INTRODUCTION TO SAS TEXT MINER

MAKING A PERSONAL INJURIES CLAIM*

CONSUMER S GUIDE TO AUTO INSURANCE

(Ontario) Use this Certificate for policies first issued or renewed between September 1, 2010 and August 31, 2011

Andrew Tait General Manager Crawford & Company Singapore. Motor Insurance Counter Fraud Solutions

SASKATCHEWAN PRESTIGE AUTO INSURANCE

Chapter 10 Auto Insurance in the United States

1. What s the difference between Limited Tort and Full Tort Auto Insurance Policies?

Predictive Modeling for Workers Compensation Claims

DATA ELEMENTS FOR CERTIFICATE OF AUTOMOBILE INSURANCE

Predictive Claims Processing

Paper Downtime of a truck = Truck repair end date - Truck repair start date

PENNSYLVANIA SURCHARGE DISCLOSURE STATEMENT

Automobile Insurance

Property and Casualty Coverage

DASDRIVE ULTIMATE LEGAL PROTECTION KEY FACTS BROCHURE. Act quickly after an accident and call us now on

SPECIAL REPORT TRAIN INJURY CASE Protect Your Rights: 7 Mistakes That Can Derail Your Train Injury Case

Road to Reform: Reducing Motor Premiums by Reforming the Personal Injury Claims Process

Whitepaper. Add Advanced Analytics to Your Artillery to Combat Insurance Fraud

What is a definition of insurance?

Dear Valued Clients, Thank you for making your Insurance Dynamic! Kindest Regards, DYNAMIC INSURANCE BROKERS

Efficient Techniques for Improved Data Classification and POS Tagging by Monitoring Extraction, Pruning and Updating of Unknown Foreign Words

ZacharLawFirm.com (602) Page 1

What you. need to know. Auto Insurance. A consumer information publication

Dawn Sexton, Director. Claims Committee Update

PROCUREMENT AND THE INSURANCE INDUSTRY WHAT YOU NEED TO KNOW NOVEMBER 2015

a new generation of portable storage engineered for safety and security. n www. ModSpace.com

Preventing Slips, Trips, & Falls

RIGHT Lawyers. Stacy Rocheleau, Esq. Gary Thompson, Esq.

Auto Insurance Buyers Guide

Claims Reporting Procedure

Manage Your EMR: Why Ex Mods Are A Poor Safety Measurement And What You Can Do About It!

SASKATCHEWAN AUTO INSURANCE

AUTO INSURANCE OUR MISSION IS YOU. AUTOMOBILE INSURANCE.

ACCIDENT INVESTIGATION. Facilitator Guide

Incident Reporting Manual

MANITOBA PRESTIGE AUTO INSURANCE

Understanding your auto claim

Insurance Boot Camp. Understanding Coverages for Your School Division, Staff and Students

Business Auto Policy

A CONSUMER'S GUIDE TO AUTOMOBILE INSURANCE IN MARYLAND 1. Peter J. Basile, Shareholder Ferguson, Schetelich & Ballew, P.A.

Your Guide to Automobile Insurance and Accidents

Claims. All About. Deductibles. Frequently Asked Questions. Why we have deductibles How fault affects your deductible When your deductible applies

LOUISIANA DEPARTMENT OF INSURANCE. Consumer s Guide to. Auto. Auto Insurance. James J. Donelon, Commissioner of Insurance

Innovations and Value Creation in Predictive Modeling. David Cummings Vice President - Research

Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016

W. Heath Rushing Adsurgo LLC. Harness the Power of Text Analytics: Unstructured Data Analysis for Healthcare. Session H-1 JTCC: October 23, 2015

Big Data Analytics. An Introduction. Oliver Fuchsberger University of Paderborn 2014

Texas Homeowner Forms and Endorsements Description

INSURANCE CODE TITLE 10. PROPERTY AND CASUALTY INSURANCE SUBTITLE C. AUTOMOBILE INSURANCE CHAPTER 1952

SUMMARY OF PENNSYLVANIA AUTO INSURANCE LAW

Walking/Work Surfaces Fall Protection Program

Automobile Insurance 1

FIS-PUB 0077 (6/13) Number of copies printed: 10,000 / Legal authorization to print: PA 145 of 1979 / Printed on recycled paper

APPLYING FOR AUTO INSURANCE. What do I need to know to fill out an automobile insurance application?

PRIVATE MOTOR INSURANCE MARKET INVESTIGATION. Background to claims management process

AUTO INSURANCE CLAIMS & CLAIMANTS RIGHTS

MARKET CONDUCT EXAMINATION ATLANTA CASUALTY COMPANY ATLANTA SPECIALTY COMPANY 3169 HOLCOMB BRIDGE ROAD NORCROSS, GEORGIA 30348

BUSINESS INSURANCE FAQ

Automobile Insurance Grade Level 9-12

THE UNIVERSITY OF SHEFFIELD WORK AT HEIGHT POLICY

The Compulsory Automobile Liability Insurance in Taiwan

Accident Injuries and Your Car Insurance

3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools

NAIC Consumer Shopping Tool for Auto Insurance

GARAGEKEEPERS COVERAGE

Consumer s Quick Check Guide Condominium Unit-Owners Policy

belongings? It may surprise you to know that the average renter purchases approximately

This material was produced under grant number SH F-48 Occupational Safety and Health Administration, U.S. Department of Labor.

Ontario Application for Automobile Insurance

Consumer s Quick Check Guide Dwelling Property Policy

Transcription:

Practical Text Mining in Insurance Marty Ellingsworth and Karthik Balakrishnan ISO Innovative Analytics Predictive Modeling Seminar San Diego, 6 Oct 2008 1

Importance and Relevance of Text Accident: 170824130 - Employee Injured In Fall From Second-Floor Decking Inspection Open Date SIC Establishment Name 127366367 07/29/1996 1521 xxxxxxxxxxxxxxxxxxxxxxxx Employee #1 was atop of the second floor decking of a newly constructed home, connecting frame work for a wall. He fell 18 ft 6 in., sustaining injuries that required hospitalization. Employee #1 was not tied off, nor were any other means of fall protection in use. He had not been trained in working from an elevated work surface, the company did not have a written safety program, and regular inspections were not performed. Keywords: decking, fall, tie-off, untrained, work rules, fall protection, construction Inspection Age Sex Degree Nature Occupation 1 127366367 29 M Hospitalize d injuries Cut/Lacerati on Carpenters Source: U.S. Department of Labor Occupational Safety & Health Administration Accident Report Detail Accident Investigation Summaries (OSHA-170 form) which result from OSHA accident inspections. 2

Importance and Relevance of Text Accident: 170824130 - Employee Injured In Fall From Second-Floor Decking Inspection Open Date SIC Establishment Name 127366367 07/29/1996 1521 xxxxxxxxxxxxxxxxxxxxxxxx not Employee tied off, #1 nor was were atop any of other means second of floor fall protection decking in of use. a newly constructed home, connecting frame work for a wall. He fell 18 He ft 6 had in., not sustaining been trained injuries working that required from elevated hospitalization. work surface Employee #1 was not tied off, nor were any other means of fall the protection company in did use. not He have had a written not been safety trained program, in working and from an elevated work surface, the company did not have a written regular safety program, inspections and were regular not performed. inspections were not performed. Keywords: decking, fall, tie-off, untrained, work rules, fall protection, construction 1 Inspection 127366367 Age 29 Sex M Degree Hospitalize d injuries Nature Cut/Lacerati on Occupation Carpenters Source: U.S. Department of Labor Occupational Safety & Health Administration Accident Report Detail Accident Investigation Summaries (OSHA-170 form) which result from OSHA accident inspections. 3

Policy Processing Underwriting Notes and Diaries Make D&B Data ISO Data Application information Claim loss runs Hazard mappings Concentrations of Staff Premium Auditors Renewal processing Legal Staff others DESK UNDERWRITER Home Office Staff Field Office UW Staff Insured Risk Manager Agent or Broker Diary forward call Agency next week Business Rule large loss review System Reminder update renewal pricing Correspondence Tracking legal letter sent 4

Customer Management Contact Notes and Diaries Sell Voice of the Customer Customer Feedback Call Center Notes Agent Contacts Billing Systems Deductible Processing Premium Auditors Renewal processing 5 ACCOUNT MANAGER Company-wide Sales Staff Product Manager Insured Risk Manager Agent or Broker Diary forward call Mr Jones tonight Business Rule DOI Complaint handling System Reminder Visit with Client Correspondence Tracking legal letter sent

Claims Processing Progress Notes and Diaries Service Medical Management Staff Special Investigation Unit NICB Vendor Management Consulting Engineers Hearing Representative Structured Settlement Unit Recovery Staff Legal Staff 6 CLAIMS ADJUSTER Home Office Staff Field Office Claim Staff Insured Risk Manager Agent or Broker Diary forward call Dr Jones next week Business Rule large loss review System Reminder update case reserves Correspondence Tracking legal letter sent

Feedback to UW Forklift Hits Ladder Ladder in Doorway Forklift Couldn t Stop Or No Barrier Signs Forklift Brakes Defective Cooking Oil on Floor Forklift Going Too Fast Brake Maintenance Delayed Housekeeping Inadequate Speed Limits Not Enforced No Policy Lack of Personnel No Enforcement No Enforcement 7

8 Text Mining in Action

Play the SIU Triage Game IT APPEARS THAT THIS WAS A LOW IMPACT COLLISION WHERE THE INSURED S FOOT SLIPPED OFF THE BRAKE AND SHE ROLLED INTO THE REAR OF THE CLAIMANT. THIS IS CONSSTENT WITH THE FACT THAT THERE WAS NO PROPERTY DAMAGE CLAIM MADE TO THE CLAIMANT VEHICLE. UNDER THESE CIRCUMSTANCES, HOW THE CLAIMANT COULD HAVE SUSTAINED SUCH SEVERE SHOULDER INJURIES AS A RESTRAINED DRIVER APPEARS RATHER SUSPECT. NO PROP DMG FOR INS AND CLMT AS COLL IMPACT WAS LOW. CLMT CLAIMS INJ FROM AX AND TREATED WITH CP AND PT EXTENSIVELY. TX APPEARS EXAGGERATED INSURED WAS RUBBER-NECKING AND DID NOT REALIZE TRAFFIC HAT STOPPED. HE RAN INTO JOHN AT 50-60 MPH, CAUSING THE CLAIMANT FORD FESTIVA TO COMPLETELY BUCKLE IN. JOHN HAD SERIOUS WHIPLASH INJ AND WAS AMBULANCED TO A HOSP ALONG WITH THE INSURED. CLAIMANT WAS VISITED BY THREE SPECIALISTS, WHICH IS NOT EXCESSIVE FOR THIS TYPE OF INJURY. 9

Congratulations! How did you do it? NO PROP DMG FOR INS AND CLMT AS COLL IMPACT WAS LOW. CLMT CLAIMS INJ FROM AX AND TREATED WITH CP AND PT EXTENSIVELY. TX APPEARS EXAGGERATED 10 1. Read and parse sentences into words 2. Knew meanings of words and phrases, in context подозрительный 3. Made intelligent guesses on abbreviations and typos 4. Identified concepts and their relevance to Fraud/Suspicion detection 5. Flagged claims containing certain combinations of concepts

State of Text Mining Technology 1. Read and parse sentences into words and components Language-based parsers and tokenizers Stemming suspicious, suspiciously, suspicion, suspiciousness suspicion Stop-word removal to, a, an, of, etc. 2. Generate meanings of words and phrases, in context Dictionaries and thesauri Word disambiguation based on context Natural Language Processing (NLP) technology Part of speech tagging (e.g., nouns, verbs, etc.) 3. Make intelligent guesses on abbreviations and typos Valid word lists, abbreviation lists, etc. (domain dependent) 11

State of Text Mining Technology 4. Identify concepts and their relevance to Problem Domain-knowledge driven Inductive semi-automated learning based on labeled examples 5. Represent concepts in a form suitable for analysis Vectors of terms, concepts, etc. Typically numeric or flags 6. Build/discover interesting combinations of concepts Miscellaneous predictive, descriptive and analytical methodologies 12

Components of Text Mining Text documents Information Retrieval Indexing Access Drivers Storage Text docs Natural Language Processing Stemming Stop-word filters Disambiguation Part of speech tagging etc,.. Text docs Information Extraction Term extraction Concept extraction Named Entity extraction Etc Data representation Numeric al Feature Vectors Data Mining Classification Clustering Association Statistical Analysis Visual Analysis etc Retrieve and organize relevant documents Search Engines Process and tag documents to enhance the ease of Information Extraction Create structured data from unstructured text Results can be document characterization or hidden relationship extraction 13

14 Simple, Practical Text Mining

FIRE Engine Algorithm Fine-grained Information Retrieval and Evaluation 15 Balakrishnan et al. Enhancing Knowledge Discovery Using Text Mining American Marketing Association Advanced Research Techniques Forum, 2002

Unstructured Data Challenges Problem Fraud/Suspicion Detection 16

17 Target Labeling and Phrase Extraction

Context-Driven Phrase Extraction and Augmentation Using Seed Lists 18

19 Generalization or Phrase Pruning

20 Semanticization

21 Structurization

22

Subrogation Opportunity Identification What is Subrogation? Insured suffers a loss Insurance Company settles loss Another party responsible/liable for the loss (or part of the loss) Insurance Company subrogates against the Other Party/Carrier 23

Subrogation Concept OP Unidentified If ANY of the following phrases occur, set the concept OP_Unidentified = 1 Otherwise OP_Unidentified = 0 NO SUSPECTS UNK SUSPECTS SUSPECTS NOT SUSPECTS UNK UNKNOWN SUSPECT NO KNOWN SUSPECT UNIDENTIFIED SUSPECT NO IDENTIFIABLE SUSPECT I/D UNK NO I/D NO ID UNK PER UNKNOWN BROKE UNKNOWNS BROKE UNK STOLE TORTFEASOR UNKNOWN HIT AND RUN SOMEONE BROKE 24

Subrogation Concept OP Identified If ANY of the following phrases occur, set concept OP_Identified = 1 Otherwise, OP_Identified = 0 SUSPECTS APPREHENDED SUSPECTS KNOWN KNOWN SUSPECTS SUSPECTS ARREST SUSPECTS ID D ID ED SUSPECTS SUSPECTS NAMED SUSPECTS IDENTI ARRESTS SUSPECTS SUSPECTS CAUGHT ID D SUSPECTS SUSPECTS LOCATED IDENTIFIED SUSPECTS SUSPECTS CHARGED TF/CARRIER ID 25

Creating Subrogation Stories Seven key concepts Each concept is represented by a binary flag (1=present, 0=otherwise) Insured At Fault OP Unidenti fied Subro Ruled Out Pure Loss OP At Fault OP Identified Subro Exists Each vector state is a Subrogation Story. E.g., 1010000 = Insured At Fault and Adjuster Ruled out Subro 0000111 = OP At Fault, OP Identified, and Adjuster assessed Subro But never referred the claim to the Subro Recovery Unit! 26

Referral Using Subrogation Stories Determine Subrogation Story for a new claim If Story has HIGH historical Recognition/Hit rates, refer to Recovery Unit LOW HIGH SUBROG ATION STORY CLAIMS WITH THE STORY RECOGNITION RATE HIT RATE $ LOSS PAID $ RECOVERY 0001000 12,558 0.8 5.2 $81,809,336 $2,836 0000000 11,790 1.0 17.4 $83,504,471 -$6,438 0010000 12,740 1.6 4.9 $69,062,230 -$8,014 0011000 30,006 2.1 1.8 $167,154,325 $11,015 0111000 21,364 3.6 4.2 $220,820,511 $47,225 We did a good job of capturing the concept of Subro Ruled Out 0001011 1,422 93.8 35.1 $76,902,215 -$2,873,929 0000111 1,912 98.9 66.9 $73,035,092 -$8,118,833 0001111 1,425 98.9 53.2 $102,310,964 -$6,785,945 The concept of Subro Exists when Other Party is Identified, was also well captured 27

Text Mining in U/W HO Loss Cost Wind Fire Lightning Liability Theft / Vandalism Hail Other Water Decompose HO losses and model by Peril Will result in tighter models Water Weather Water Non-weather 28

Text Mining for Cause-Of-Loss Rich information buried in Unstructured data, such as Loss Descriptions or Adjuster Notes E.g., Extracting the Type of Loss from the Loss Description EAKING FR ICE MAKER IN BAR AFTER HEAVY DOWNPOUR, INSURED'S NOTICED WATER DAMAGE TO CEILING AND WALLS IN DEN FREEZE DAMAGE TO SWIMMING POOL WATER WEATHER RELATED WATER NON- WEATHER RELATED FREEZER DEFROSTED AND DID WATE 29

Questions? Contacts @ ISO Innovative Analytics Marty Ellingsworth President mellingsworth@iso.com Karthik Balakrishnan Vice President, Analytics kbalakrishnan@iso.com 30