Forensic Science Testing: The Forensic Filler-Control Method for Controlling Contextual Bias, Estimating Error Rates, and Calibrating Analysts Reports

Similar documents
Overview of the NAS Report on Forensic Science in the U.S.

Strengthening Forensic Science in the United States: A Path Forward

NATIONAL COMMISSION ON FORENSIC SCIENCE. Ensuring That Forensic Analysis is Based Upon Task-Relevant Information

How To Understand Forensic Science

RECOMMENDATION. Issues Covered During Hearings and Deliberations

Forensic Science : Course Syllabus Forensic Science : Secrets of the Dead

Course Forensic Science. Unit II History

Forensic Science Curriculum

Forensic Science: Crime Scene Basics. T. Trimpe

Dudesville: A crime scene under the microscope. QUT Extreme Science

How To Become A Forensic Scientist

What Can I Do With a Degree in ForSci?

CAREER: FORENSIC SCIENCE TECHNICIAN 1

The Nature of Evidence

Managing Contextual Bias in Forensic Science

Minneapolis Police Department Crime Lab ASCLD/LAB ISO International Accreditation

Introduction to Crime Scene Dynamics

Confirmation bias in biometric and forensic identification. Tim Valentine

FORENSIC SCIENTIST MICHIGAN CIVIL SERVICE COMMISSION JOB SPECIFICATION

CHAPTER 5 FORENSIC SCIENCE

Massachusetts Major City Chiefs. Best Practices in Eyewitness Identification and the Recording of Suspect Interviews

RUTHERFORD HIGH SCHOOL Rutherford, New Jersey COURSE OUTLINE FORENSIC SCIENCE

The College of Forensic Sciences at NAUSS: The pioneer of Forensics in the Arab world

DNA & CRIME VICTIMS: WHAT VICTIMS NEED TO KNOW

GENERAL ASSEMBLY OF NORTH CAROLINA SESSION 2011 SESSION LAW HOUSE BILL 27

Law, Public Safety, Corrections and Security Career Cluster Forensic Science and Criminal Investigations Course Number:

Groton Public Schools Curriculum Map

Executive Summary. Summary - 1

TRAVELING FORENSIC EDUCATION PROGRAM

INTRODUCTION AREAS OF SPECIALIZATION

MASTER OF PROFESSIONAL STUDIES IN FORENSIC SCIENCE INVESTIGATE THE POSSIBILITIES.

BAA Course Approval submission: Introduction to Forensic Science 1:

Conviction Integrity Unit Best Practices October 15, 2015

COMMUNITY UNIT SCHOOL DISTRICT 200. Course Description

ILLINOIS DEPARTMENT OF CENTRAL MANAGEMENT SERVICES CLASS SPECIFICATION FORENSIC SCIENTIST SERIES

Strengthening Forensic Science in the United States: A Path Forward

Willmar Public Schools Curriculum Map

When you open the newspaper, what types of stories are you most interested in reading? If you answered crime stories, you are not alone.

Law, Public Safety, Corrections and Security Career Cluster Criminal Investigations Course Number:

Book Note: In Doubt: The Psychology Of The Criminal Justice Process, by Dan Simon

Criminalists identify, compare, analyze, and interpret physical evidence.

How To Be A Forensic Investigator

Knowledge for a successful forensic chain provided by The Netherlands Forensic Institute. NFI Academy

Crime and Science 1/11/2015. What is physical evidence? What is forensic science? What is the goal of forensic science?

Virginia Department of Criminal Justice Services Model Policy on Lineups/Eyewitness Identification

The Why and How of Blind Sequential Lineup Reform

FORENSIC SCIENTIST MICHIGAN CIVIL SERVICE COMMISSION JOB SPECIFICATION

DIGITAL FORENSICS SPECIALIZATION IN BACHELOR OF SCIENCE IN COMPUTING SCIENCE PROGRAM

C. The Crime Scene Investigation Team

In Your Blood Forensic DNA Databases

How To Understand Forensics

Grade 8 Informational Mini-Assessment Forensic Science: Evidence, Clues, and Investigation excerpt

Hi h School Standard: 1

Goal to recognize, document and collect evidence at a crime scene

Summary of Qualifications: Education: Professional Experience: Kenneth R. Moses

FORENSIC SCIENCE COURSE DESCRIPTION

Forensic Science Studies 35-3

In 2009, the nation s 411 publicly funded crime

Forensic Science. Distance Learning Presentation For King Drew Magnet High School of Medicine and Science March 14, 2007

Model Policy on Eyewitness Identification

How do we build and refine models that describe and explain the natural and designed world?

CRIME SCENE REMINDER CARD. "Interview, Examine, Photograph, Sketch, Process"

Georgia Perimeter College Common Course Outline

A CRIME HAS BEEN COMMITTED

Forensic Science. Students will define and distinguish forensic science and criminalistics.

Rules and Procedures. Rule 312 August 23, Rule CRIME LABORATORY - EVIDENCE AND CRIME SCENES

Forensic. Sciences. Forensic Sciences. Specialties. Programs. Career Pathways

Forensic Science Course Syllabus (CHE100)

PASSAIC COUNTY TECHNICAL INSTITUTE WAYNE, NEW JERSEY FORENSIC SCIENCE I CURRICULUM JULY 2010

PD-102R-92 Forensic Firearms Analyst II I. A. PRIMARY PURPOSE OF ORGANIZATIONAL UNIT:

FORENSIC SERVICES DEPARTMENT

Evidence Technician s School

Programme Specification. MSc/PGDip Forensic and Legal Psychology

YOUR CONTACT DETAILS (ADDRESS, PHONE, etc.):

Solving the Problems That Plague the Forensic Science Community

How To Become A State Pol Forensic Tech Leader

CMJ INTRODUCTION TO FORENSIC SCIENCE Fall Syllabus :30 5:30 p.m.

Somers Public Schools Somers, Connecticut Science Curriculum

Introduction to Forensic Science. So what is Forensic Science? Major Contributors. Chapter 1: Intro to FS

Bio-Molecular Forensic Science Careers. Jack Ballantyne Department of Chemistry National Center for Forensic Science

MASTER COURSE OUTLINE

C R I M E S C E N E. Forensic Science CC Spring 2007 Prof. Nehru

Crime scene evidence who is involved? Police officer and SOCO (Scenes of Crime Officer) or CSI (Crime Scene Investigator)

1/26/15. Chapter 2 Crime Scene

There is an inherent flaw in the United States criminal justice system. The

CALIFORNIA COMMISSION ON THE FAIR ADMINISTRATION OF JUSTICE REPORT AND RECOMMENDATIONS REGARDING EYE WITNESS IDENTIFICATION PROCEDURES

98 Crime scene examination

Transcription:

Forensic Science Testing: The Forensic Filler-Control Method for Controlling Contextual Bias, Estimating Error Rates, and Calibrating Analysts Reports Gary L. Wells 1 Miko M. Wilford 1 Laura Smalarz 1 1 Psychology Department Iowa State University West 112 Lago Hall Ames, IA 50011 E-mail addresses: Gary Wells glwells@iastate.edu Miko Wilford mwilford@iastate.edu Laura Smalarz lsmalarz@iastate.edu Corresponding author: Gary L. Wells 515-201-0431 1

Abstract Forensic science testing has been severely criticized for lacking a scientific basis, not having known error rates, overstating the certainty of conclusions, and being infested with contextual information (National Academy of Sciences, 2009). Kassin et al s (2013) review of basic social psychological science on expectancy effects, confirmation bias, and related phenomena as well as recent studies showing the influence of contextual information on forensic scientists judgments makes a strong case for controlling contextual bias. But even if contextual bias were fully eliminated, the big questions of error rates and overstatement of conclusions will still plague forensic science tests. We develop a filler-control method that can quickly sort between reliable and unreliable forensic science as used in actual cases, quickly uncover frauds, and help calibrate forensic scientists claims of certainty regarding their tests. 2

Forensic Science Testing: The Forensic Filler-Control Method for Controlling Contextual Bias, Estimating Error Rates, and Calibrating Analysts Reports A large share of forensic science techniques involving the analysis of physical evidence have never been validated scientifically. The National Academy of Sciences (NAS, 2009) concluded that, with the exception of nuclear DNA analysis, no forensic method has been rigorously shown to consistently and with a high degree of certainty demonstrate a connection between evidence and a specific individual or source and have not developed evidence-based estimates of error rates. The NAS report also noted that forensic analysts 1 are subject to contextual bias, which occurs when the analyst is influenced by knowledge about the suspect s background or other case information. Kassin et al. (2013) reviewed basic social and cognitive psychology supporting the concern regarding contextual bias and makes a compelling case for the value of psychological science to demonstrate and understand contextual bias in forensic testing. Unfortunately, even if contextual bias were neutralized, it leaves untouched significant problems identified by the NAS report: Is the technique reliable? What is the error rate? Is the analyst competent to conduct the procedure? Are analysts overstating their conclusions? Forensic science testing and testimony has proliferated without answers to these questions. In effect, a large share of forensic science techniques, almost all of which were developed by people in law enforcement rather than scientists, end up coming down to what we call the inter-ocular test in which the analyst visually examines the evidence and determines whether it is a match, not a match, or is inconclusive. 1 We use the term analyst to refer to the forensic scientists, forensic examiners, forensic technicians, and forensic analysts who perform these forensic tests and write the reports. 3

Here, we develop the framework and rationale for a method of forensic testing that we call the filler-control method 2. We argue that this method, which could theoretically be imposed upon forensic testing systems, could simultaneously resolve the bulk of issues addressed in the NAS report: (1) estimate error rates for both the technique and the individual analyst, (2) help calibrate forensic analysts, (3) quickly jettison fraudulent analysts and junk science, and (4) protect against contextual influences. Importantly, the filler-control method can accomplish all four of these objectives using data generated by actual cases as they unfold. Although others have alluded to the idea that a filler-control method might guard against contextual influences in forensic testing (Kassin et al., 2013; Saks et al., 2003), we argue that proper use of the filler-control method as described here can alleviate all four of the abovementioned problems. The filler-control method is best suited to forensic science techniques that involve match judgments. The match domain is a large one that includes fingerprints, tool marks, ballistics, bite marks, hair analysis, fibers, handwriting, shoe prints, and any other test for which an analyst is asked to determine whether a trace sample from a crime scene matches something associated with a suspect. The Three Big Questions Before describing the filler-control method, it is useful to focus on the three big questions that confront the courts when they encounter forensic science match evidence: (1) Is the technique itself reliable? (2) Was this analyst competent to perform the technique? And, (3) was the technique performed properly in this particular case? The filler-control method provides 2 These ideas are based largely on the first author s American Psychology-Law Society Presidential address given at the American Psychological Association meeting in 2006, which was drew heavily from the logic of fillers for eyewitness identification from lineups (Wells, et al, 1993). 4

answers to these three big questions with surprising statistical efficiency using the actual cases in which the technique is being used. Unlike proficiency testing 3, which uses samples of unknown representativeness, reliability estimates from the filler-control method are based on the success of the technique as applied in actual cases. Moreover, past proficiency tests cannot determine whether the technique was conducted properly (or conducted at all) in any particular case. Consider, for example, West Virginia State Police chemist Fred Zain. Zain fabricated forensic test results in hundreds of cases from 1979 until 1989 (Chan, 1994; Tilstone et al. 2007). Claiming a degree in biology (his degree was actually in English), Zain fraudulently made up serology results that always coincided with the police and prosecution theory. In later years, he also fabricated DNA results, which is how he was ultimately caught (via post-conviction DNA re-testing). The filler-control method would have outed the fraudulent Zain much earlier. The Filler-Control Method A standard practice in forensic testing is to provide the analyst with two samples one from a suspect and one from the crime scene. Analysts are then expected to provide an answer to the question, Does this trace evidence match this sample from the suspect? 4 For example, does this photo of bite marks on the victim match this denture impression from the suspect? Or, does this photo of a shoe print from the crime scene match this shoe owned by the suspect? Or, do the striations from this crime scene bullet match those made by this suspect s gun? 3 Proficiency tests involve giving a lab certain samples to process. Often, these labs know at the time that it is a proficiency test rather than a real case. 4 We fully recognize that suggested guidelines for reporting exist for some techniques that involve multiple categories. For example, there is a suggested recommendation for footwear and tire tread analysis that involves six categories ranging from identification (definite conclusion of identity) to elimination (definite exclusion) and a seventh category of unsuitable (Scientific Working Group on Shoeprint and Tire Tread Evidence, 2006). Although our treatment here uses the match, mismatch, and indeterminate categories, a larger set of categories does not undermine the fundamental principles behind the filler-control method. 5

The filler-control method, in contrast, presents the analyst with a qualitatively different task. Instead of two samples, there are a minimum of three the crime scene sample, the suspect sample, and innocent filler sample(s). The suspect sample and the innocent filler samples are intermixed, and the analyst does not know which sample is from the suspect and which are from fillers. Analysts are then confronted with a very different question, Does the crime scene sample match any of these other samples, and if so, which one? The analyst must perform the test and write the report without any prior knowledge of which sample was from the suspect and which were from fillers. Consider, for example, a bite-mark test. Instead of providing the analyst with a matching task in which they are limited to photos of victim bite marks and one denture impression from the suspect, the analyst would receive an additional five denture impressions thereby expanding the task (the suspect s plus five filler sets of dentures matched for age, gender, and general oral health) 5. Or, if matching a bullet from the crime to a particular gun, the analyst would be given the suspect s gun along with five other (filler) guns of the same caliber and make. A necessary quality of the fillers is that they are known a-priori to not be the actual source. Critically, unlike the traditional method, the filler-control method is a test that the analyst can conspicuously fail. For example, the analyst could incorrectly conclude that one of the filler.357 Magnum pistols was the source of the bullet from the crime scene (e.g., the police chief s gun). Using the filler-control method, unreliable forensic techniques and analysts would be quickly discredited. In contrast, reliable analysts using reliable techniques would welcome this challenge and easily and repeatedly avoid fillers in their forensic examinations. Note that the filler-control method manages to exert control over contextual bias as well. Even if forensic analysts have been exposed to contextual information (e.g., knows the police 5 For purposes of this article we use five fillers. 6

obtained a confession, knows that an eyewitness identified the suspect, etc.), they still do not know which samples came from the suspect and which from fillers. Discovery of Error Rates, Unreliable Techniques, Incompetent Analysts, and Frauds No fraud, incompetent analyst or unreliable technique can withstand the odds of exposure inherent in the filler-control method. Although a lucky guess has a 1/6 th chance of avoiding a filler on a single occasion, there is only a 1/36 th chance of avoiding a filler 2 out of 2 times, a 1/216 th chance of avoiding a filler 3 out of 3 times, and only a 1/1,296 th chance of avoiding a filler 4 out of 4 times. This means that only 1 of nearly 1300 forensic analysts could avoid making a public error for affirmative conclusions in their first four cases if their test was totally invalid. Figure 1 displays the theoretical probability that a forensic test would produce at least one filler error as functions of the validity of the forensic test and the number of cases for which the analyst made an affirmative judgment 6. The highest filler-error line comes from a forensic test with zero validity (i.e., accuracy is no better than chance). Notice how quickly the zero-validity test approaches near certainty of yielding at least one error; there is a 99.5% chance that an analyst would have made at least one filler error by the time a forensic analyst had tested three cases. If the validity of the test were.20, a single test has a.667 probability of a filler error but this rises to a 98.8% chance of yielding at least one filler error by the time the test is used in four cases. In contrast, a 99% valid test produces a mere 1% chance of yielding a filler error in 10 cases. The best of the forensic tests rise to the top and the unreliable ones are quickly unmasked with the filler-control method. 6 These analyses are based only on affirmative match judgments (as opposed to indeterminate judgments). 7

Probability 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 Probability of at Least One Erroneous Filler Match as Functions of Accuracy of Forensic Test and Number of Cases After one case After two cases After three cases After four cases After ten cases Accuracy of Forensic Test 0 0.2 0.4 0.6 0.8 0.99 Figure 1 illustrates that the five-filler control test, although not perfectly capable of revealing known errors when used once, requires only a few uses to unmask low-validity. A Fred Zain, for example, would have almost certainly have shown failures after only three tests, thereby stopping him from erroneously claiming to have found affirmative matches in the hundreds more cases for which he remained a credible expert. One of the great advantages of the filler-control method is that scores can be partitioned so that error-rates can be calculated separately for techniques, labs, and analysts. As the NAS report noted, we know little about error rates in forensic testing. With the filler-control method, estimation of the error rate is approximated by the filler rate, but is not exactly the filler rate, due to chance. Corrected for chance, the best estimate of the error rate is: Equation 1. Estimated error rate =1- [[Nfillers/(Nfillers + 1)]-Filler rate] [(Nfillers+1)/(Nfillers)] 8

Hence, for example, assuming a five-filler test (Nfillers = 5), if the analyst selects fillers 20% of the time (filler rate =.20) and selects the suspect sample 80% of the time, then the error rate is 24%, not 20%, not 20%. For a 40% filler rate, the error rate estimate is 48%, and so on. Influence of the Filler-Control Method on Beliefs and Behaviors of Forensic Analysts Forensic analysts who are reliably using highly valid techniques should be elated at the idea of using the filler-control method. These analysts and their techniques will be validated by the filler control method. Forensic analysts who believe erroneously that their techniques are perfectly valid, in contrast, represent a particularly interesting group. The illusion of validity (e.g., Einhorn and Hogarth, 1978) and its sister phenomenon of overconfidence are pervasive in human judgment. Although the illusion of validity can have many causes, including wishful thinking and confirmation bias, the illusion can flourish only in the absence of error feedback. The filler-control method creates a new forensic testing environment in which feedback about errors is guaranteed. We believe that the majority of forensic analysts who use the filler-control method will discover that their technique has some validity, but also makes errors, which could surprise some of the analysts themselves. For these techniques, the filler-control method can help analysts to calibrate their reports and conclusions. For example, if an analyst using a particular technique has a record of six erroneous filler-match conclusions and 30 suspect-match conclusions, then the estimated error rate is 20% (see Equation 1) and the analyst therefore should not express confidence higher than 80% in any given match conclusion. Final Remarks Kassin et al. (2013) successfully articulate the problematic influence of contextual information in forensic testing. Unfortunately, eliminating contextual information fails to answer 9

whether the technique is reliable, the analyst competent, and whether the test was actually performed competently in this particular instance. The filler-control method manages to address all of these problems while additionally controlling for contextual bias. Many forensic analysts who are involved in match tests will resist the idea of the fillercontrol method on several grounds, primarily arguing that it is quixotic. Obviously there would be practical costs (money, time) to performing the additional tests. Moreover, full realization of the filler-control method s best features (such as developing a searchable public data base of error rates across analysts and techniques) requires an oversight body, reporting requirements for every test conducted, and refined protocols for selecting appropriate fillers. In more than one talk given by the lead author of this article, an analyst argued that because some crime scene samples are seriously degraded (e.g., smudged fingerprint, nearlyhealed bite mark), the analyst would be unable to clearly match the crime scene sample to the suspect sample. Hence, the score for the analyst and technique would be harmed for reasons other than the analyst s competence or the technique s reliability. But this is a sophistic argument because the indeterminate category was created precisely for such circumstances. Only affirmative match conclusions can count for or against the technique and analyst. An indeterminate result is a non-event for purposes of scoring reliability. The practical concerns are legitimate and deserve further consideration. Nevertheless, the bold idea of a filler-control method requirement in forensic match tests deserves a place at the table of how to control contextual bias, reduce fraud, generate meaningful estimates of error rates in actual cases, and give analysts the type of feedback that could allow them to accurately calibrate their judgments. 10

References Chan, Sau (1994). Scores of convictions reviewed as chemist faces perjury accusations. Associated Press, August 21, 1994. Einhorn, H. J., Hillel, J, & Hogarth, R.M. (1978). Confidence in judgment: Persistence of the illusion of validity. Psychological Review, 85, 395-416. Kassin, S., Dror, I., & Kukucka (2013). The forensic confirmation bias: Problems, perspectives, and proposed solutions. Journal of Applied Research in Memory and Cognition. XXXXX National Academy of Sciences (2009). Strengthening forensic science in the United States: A path forward. National Academies Press: Washington, DC. Tilstone, W. J., Savage, K. A., Clark, L. A. (2006). Forensic science: An encyclopedia of history, methods, and techniques. ABC-CLIO, Inc., Santa Barbara, CA. Saks, M. J., Risinger, D. M., Rosenthal, R., & Thompson, W. C. (2003). Context effects in forensic science: A review and application of the science of science to crime laboratory practice in the United States. Science & Justice, 43, 77-90. Scientific Working Group on Shoeprint and Tire Tread Evidence. (2006). Guide for minimum qualifications and training for a forensic footwear and/or tire tread examiner. Available at http://www.theiai.org/guidelines/swgtread/ Wells, G. L., Rydell, S. M., & Seelau, E. P. (1993). On the selection of distractors for eyewitness lineups. Journal of Applied Psychology, 78, 835-844. 11