Federal Judicial Center, 2003 2004 District Court Case-Weighting Study. Appendix V. Data-Cleaning Process



Similar documents
FREQUENTLY ASKED QUESTIONS FOR ELECTRONIC CASE FILING (ECF)

UNITED STATES DISTRICT COURT EASTERN DISTRICT OF MICHIGAN

Filing a Social Security Administration electronic Certified Administrative Record (ecar) in CM/ECF. a step-by-step guide

UNITED STATES BANKRUPTCY COURT DISTRICT OF CONNECTICUT

UNITED STATES BANKRUPTCY COURT SOUTHERN DISTRICT OF FLORIDA GUIDELINES FOR PREPARING, SUBMITTING AND SERVING ORDERS

Case 1:14-cr JEM Document 217 Entered on FLSD Docket 10/28/14 16:27:13 Page 1 of 9

IN THE SUPERIOR COURT FOR THE STATE OF ALASKA THIRD JUDICIAL DISTRICT AT ANCHORAGE

CIVIL Statistical Reporting Guide

CIRCUIT COURT. Uncontested Divorce Procedures Manual

USER INSTRUCTIONS WELCOME TO THE CLERK S OFFICE ELECTRONIC FILING SYSTEM

How To efile Criminal Pleadings/Documents:


UNITED STATES DISTRICT COURT EASTERN DISTRICT OF TENNESSEE AT KNOXVILLE

CM/ECF HELPFUL TIPS AND ANSWERS TO FREQUENTLY ASKED QUESTIONS

PERFORMANCE EVALUATION AUDIT CHECKLIST EXAMPLE. EIIP Volume VI

COURT OF COMMON PLEAS OF ALLEGHENY COUNTY

Motion to Compel in a Debt Collection Suit Instructions, Example and Sample Form

UNITED STATES DISTRICT COURT CENTRAL DISTRICT OF CALIFORNIA

FACT SHEET FOR JUDGE SAM SPARKS

Justice and Other Information Disclosure Bill 2008

7.010 PLEAS, NEGOTIATIONS, DISCOVERY AND TRIAL DATES IN CRIMINAL CASES

Case4:12-cv KAW Document2-1 Filed06/25/12 Page1 of 7 UNITED STATES DISTRICT COURT NORTHERN DISTRICT OF CALIFORNIA, OAKLAND DIVISION

Case: 1:12-cv Document #: 320 Filed: 04/16/14 Page 1 of 5 PageID #:2754 UNITED STATES DISTRICT COURT NORTHERN DISTRICT OF ILLINOIS

Rule 6 Adopted at a joint meeting of the District and County Court at Law Judges of Webb County on December 2, 2009

Unemployment Insurance Data Validation Operations Guide

ELECTRONIC FILING STANDARDS IMPLEMENTING E-FILING PROGRAM FOURTH JUDICIAL CIRCUIT MARION COUNTY GENERAL

REPORT BY THE CRIMINAL COURTS COMMITTEE AND CRIMINAL JUSTICE OPERATIONS COMMITTEE RECOMMENDING THE ADOPTION OF A BRADY CHECKLIST

Rule 82.1 Who May Appear as Counsel; Who May Appear Pro Se

SCATS. EDI File Verification. User Manual

COURT SCHEDULING ISSUES

PLEASE TYPE OR WRITE LEGIBLY

(702) 388-6s77 (Fax)

CASE NO. 279 CIVIL ACTION APPLICABLE TO ALL CASES CASE MANAGEMENT ORDER NO.5

Orientation for Contract Court Interpreters PRETRIAL HEARINGS

UNITED STATES BANKRUPTCY COURT MIDDLE DISTRICT OF FLORIDA ORLANDO DIVISION

CM/ECF FREQUENTLY ASKED QUESTIONS

SERVING NON-ENGLISH SPEAKERS IN THE VIRGINIA COURT SYSTEM PAYMENT OF COURT INTERPRETERS PAGE 8-1

Ecug!2<25.ex TDY!!!Fqewogpv!9!!!Hkngf! !!!Rcig!2!qh!6 IN THE UNITED STATES DISTRICT COURT FOR THE DISTRICT OF COLUMBIA

CHAPTER 6: CRIMINAL PROCEDURE MICHIGAN COURT RULES OF 1985

SIXTH JUDICIAL CIRCUIT OF FLORIDA

UNITED STATES DISTRICT COURT FOR THE SOUTHERN DISTRICT OF NEW YORK. V. No. 07 CR 0220 MOTION FOR APPOINTMENT OF COUNSEL UNDER THE CRIMINAL JUSTICE ACT

First Judicial District of Pennsylvania

Title 5: ADMINISTRATIVE PROCEDURES AND SERVICES

ELECTRONIC MEANS FOR FILING, SIGNING AND VERIFICATION OF DOCUMENTS UNITED STATES BANKRUPTCY COURT DISTRICT OF DELAWARE

CIRCUIT COURT. Uncontested Divorce Procedures Manual

UNITED STATES DISTRICT COURT DISTRICT OF NEW JERSEY CLERK S OFFICE ECF USER MANUAL

Assessment of Vaisala Veriteq vlog Validation System Compliance to 21 CFR Part 11 Requirements

) ) NOW THEREFORE, pursuant to the authority vested in me as Chief Judge of the Eleventh Judicial Circuit Court of Florida, it is

Marilyn O. Marshall Chapter 13 Standing Trustee Northern District of Illinois Eastern Division 224 S Michigan Ave., Suite 800

IN THE UNITED STATES COURT OF APPEALS FOR THE ELEVENTH CIRCUIT. No Non-Argument Calendar. D.C. Docket No. 6:10-cv JA-GJK.

SUPERIOR COURT OF THE [INSERT STATE/JURISDICTION] FAMILY DIVISION--DOMESTIC RELATIONS BRANCH

Office of History. Using Code ZH Document Management System

U.S. District Court Southern District of Indiana (Evansville) CIVIL DOCKET FOR CASE #: 3:11-cv-00???-XXX-YYY

UNITED STATES DISTRICT COURT EASTERN DISTRICT OF ARKANSAS INSTRUCTIONS FOR FILING COMPLAINT BY PRISONERS UNDER THE CIVIL RIGHTS ACT, 42 U.S.C.

Frequently Asked Questions

COMMISSION SURVEY ANALYSIS FOR CRIMINAL LAW SECTION N=7

Original FAQ Prepared July 30, 2013

Case 2:03-cr JES Document 60 Filed 02/19/08 Page 1 of 7 PageID 178 UNITED STATES DISTRICT COURT MIDDLE DISTRICT OF FLORIDA FORT MYERS DIVISION

CRIMINAL DOCKET CONTROL PROCEDURES

STATE OF OHIO ) CASE NO. CR ) Plaintiff, ) JUDGE JOHN P. O DONNELL ) vs. ) ) LONNIE CAGE ) JOURNAL ENTRY ) Defendant )

IN THE COURT OF COMMON PLEAS OF PHILADELPHIA COUNTY FIRST JUDICIAL DISTRICT OF PENNSYLVANIA CIVIL TRIAL DIVISION

IN THE COURT OF APPEALS OF THE STATE OF IDAHO. Docket No ) ) ) ) ) ) ) ) ) )

RULE 1. ASSIGNMENT OF CASES

IN THE UNITED STATES DISTRICT COURT FOR THE DISTRICT OF COLUMBIA

Statistics and Analysis. Quality Control: How to Analyze and Verify Financial Data

INSTRUCTIONS FOR THE

United States District Court

Freedom of Information Request Reference No: I note you seek access to the following information:

About Data File Exchange

Drafting the Joint Defense Agreement

Dup, Dedup, DUPOUT - New in PROC SORT Heidi Markovitz, Federal Reserve Board of Governors, Washington, DC

SECOND AMENDED ORDER DESIGNATING ALL CASES E-FILE AND SETTING FORTH CERTAIN REQUIREMENTS IN E-FILE CASES

IN THE UNITED STATES DISTRICT COURT FOR THE DISTRICT OF COLORADO

TRIAL COURT COLLECTIONS POLICY GUIDELINES TABLE OF CONTENTS

End User s Guide. Electronic Filing

2) The general search function: More than 500 records returned, please add more search criteria and search again! Help Case Prefix Submit Enter

CIVIL TRIAL RULES. of the COURTS OF ORANGE COUNTY, TEXAS. Table of Contents GENERAL MATTERS. Rule 1.10 Time Standards for the Disposition of Cases...

Quick Reference Card for Accessing the Attorney Portal (BMC/District Court/Housing Court /Superior Court/Juvenile Court/Probation)

CHANCERY MASTERS GUIDELINES FOR THE TRANSFER OF CLAIMS

3. ROLE OF THE COURT INTERPRETER

Fifth District Court of Appeals (Dallas)

SYSTEM SECURITY REQUIREMENTS

Payroll Time Clock Import - Quick Start Instructions

IMPORTANCE: Make sure the work is getting done.

IN THE UNITED STATES COURT OF APPEALS FOR THE ELEVENTH CIRCUIT. No Non-Argument Calendar. D.C. Docket No. 0:12-cv RSR.

Case: Document: Page: 1 01/24/ UNITED STATES COURT OF APPEALS FOR THE SECOND CIRCUIT SUMMARY ORDER

UNITED STATES DISTRICT COURT WESTERN DISTRICT OF NORTH CAROLINA CHARLOTTE DIVISION

Local Court Rules for the 27 th Judicial District

CALIFORNIA CODES GOVERNMENT CODE SECTION

PRACTICE DIRECTION NUMBER 11 OF 2012 SUPREME COURT OF QUEENSLAND SUPERVISED CASE LIST

Case 6:66-cv MV-WPL Document Filed 05/07/14 Page 1 of 9 IN THE UNITED STATES DISTRICT COURT FOR THE DISTRICT OF NEW MEXICO

The Appellate Mandate: What It Is and Why It Matters By Jennifer L. Swize

Transcription:

Appendix V Data-Cleaning Process Included items: 1. Data-Cleaning Process 2. Brief Description of Main Data-Cleaning Programs

Blank pages inserted to preserve pagination when printing double-sided copies.

Data-Cleaning Process Raw data records extracted from the district courts docketing databases were transmitted electronically to the FJC. We assessed the integrity and completeness of the transmitted data and converted the individual pieces into a reconstructed set of docketed event records ready for final processing. The data-cleaning process and programs are described below. The event-processing programs are described in Appendix W. Initial Processing of Raw Data We developed a set of SAS programs to load and process the raw data received from each court. Because of substantial structural differences in the data extracted from the ICMS and CM/ECF databases, different programs that were nonetheless functionally equivalent were written to handle the files from each system. During the initial processing of raw data, we maintained and processed the data from each court separately. We used this approach both for data-management reasons and to increase processing efficiency. We developed database-driven macro programs to manage the required multiple program executions. The SAS processing programs checked for a full range of data-integrity problems such as: (1) data-type errors in data fields (e.g., alpha characters in numeric fields); (2) unusual or out-of-range values; (3) adherence to the selection criteria (e.g., termination, or re-termination, date within calendar 2002); and (4) basic interrelationships among the case components (e.g., checking that party and event records matched to a case record). We reviewed processing logs and field frequency reports to identify problem areas that might require additional review or data cleaning (e.g., deletion of duplicate case records), and then attended to the data-processing problems that arose. Data cleaning focused primarily on exception reporting. The information received was assumed to be correct and complete unless an unexpected or impossible outcome was detected. For example, a case record that had no matching event records is not necessarily an error and such cases were included in the analysis. Event records that could not be matched to a valid case record, however, were excluded. Because these two situations should occur rarely, incidence levels exceeding a nominal threshold in a court triggered a more detailed investigation of the data. We additionally created reference data tables by pulling civil, criminal, and trial data records for each court from the FJC s Integrated Database. 1 We used 1. The Federal Judicial Center s Integrated Database contains basic descriptive and processing information for civil cases, criminal defendants, and trials and evidentiary hearings. The data are based on information received from the Administrative Office on case filings, terminations, and proceedings in the federal courts. Appendix V: Page 1

these tables to validate the population of cases received from the courts and as a source of additional case information. As part of the preliminary data processing step, we created unique identifiers for each case and record. Identifiers permitted us to match case information from different data sources, link related case components together, establish a docketing sequence, and identify duplicate records created during processing. We also created a series of case flags and constructed specific data fields that assisted in the characterization of cases (e.g., the number of civil parties, the number of codefendants). Flags also identified the set of good cases that is, cases that met the defined case-selection criteria (e.g., cv or cr docket type, single-defendant ICMS criminal case, etc.) that would be used in the final analyses. Appendix V: Page 2

Brief Description of Main Data-Cleaning Programs Processing Task Processed data extracted from ICMS courts. Loaded raw court data and computed basic frequencies. Created case ID codes for matching across data sets. Created unique sequence IDs for identifying duplicate records created during joins. Pulled comparison civil and criminal data from the IDB. Pulled relevant trial data from the IDB. Checked for basic status and error situations (e.g., checked that there were no duplicate records in the case data, that the termdates were within 2002 or, if not, that the reterm dates were). Compared civil and criminal lists to the IDB to verify we had received the cases we expected. Looked for internal consistencies (e.g., cases with no parties, parties with no cases, cases with no events, etc.). SAS Programs xtract_processing_icms version: 1.7 date: 17-Mar-2004 principal source data files: asccases, cases, dplink1, dplink2, events, js2, judge, party, reliefs, who principal output files used in further processing: caseflgs evntrlfjn Additional data-processing and analysis tasks also performed by this program are described in Appendix W. Used macro shell to process all CM/ECF courts. Retrieved parameters from dccws summary data set. Processed data extracted from CM/ECF courts. Loaded raw data and computed very basic frequencies. xtract_processing_cmecf version: 1.3 date: 10-Mar-2004 principal source data files: asccases, asclead, ascmember, ecfcaseflgs, cases, codes, dktntry, dktpart, dktperson, doctype, filer, js23, js56, judge, motion, party Appendix V: Page 3

Brief Description of Main Data-Cleaning Programs Processing Task Created case ID codes for matching across data sets. Created unique sequence IDs for identifying duplicate records created during joins. Pulled comparison civil and criminal data from the IDB. Pulled relevant trial data from the IDB. Checked for basic status and error situations (e.g., checked that there were no duplicate records in the case data, that the termdates were within 2002 or, if not, that the reterm dates were). Compared civil and criminal lists to the IDB to verify we had received the cases we expected. Looked for internal consistencies (e.g., cases with no parties, parties with no cases, cases with no events, etc.). Created and set a new flag in the caseflgs table to identify cases that would be included in the final analysis (i.e., good cases ). Identified good cases as those with a cr or cv docket_type, and that could be matched to a case characteristic record that provided sufficient information to compute a DCCWS case type. For ICMS criminal records, limited good cases to single-defendant cases. For CM/ECF criminal records, processed all defendants individually, but excluded master case records. SAS Programs principal output files used in further processing: all input files add_goodcase_flg (ICMS) version: 1.1 date: 21-Apr-2004 add_goodcase_flg_cmecf version: 1.0 date: 27-Apr-2004 principal source data files: caseflgs principal output files used in further processing: caseflgs Appendix V: Page 4