Assessment of Hemoglobin Level of Pregnant Women Before and After Iron Deficiency Treatment Using Nonparametric Statistics



Similar documents
How To Test For Significance On A Data Set

Statistical Process Control (SPC) Training Guide

Non-Parametric Tests (I)

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

1 Nonparametric Statistics

BA 275 Review Problems - Week 5 (10/23/06-10/27/06) CD Lessons: 48, 49, 50, 51, 52 Textbook: pp

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

Descriptive Statistics

ISyE 2028 Basic Statistical Methods - Fall 2015 Bonus Project: Big Data Analytics Final Report: Time spent on social media

Final Exam Practice Problem Answers

NCSS Statistical Software. One-Sample T-Test

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Paired 2 Sample t-test

Process Capability Analysis Using MINITAB (I)

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

PERINATAL NUTRITION. Nutrition during pregnancy and lactation. Nutrition during infancy.

Chapter 7 Section 7.1: Inference for the Mean of a Population

Paired T-Test. Chapter 208. Introduction. Technical Details. Research Questions

Difference tests (2): nonparametric

Skewed Data and Non-parametric Methods

THE PROCESS CAPABILITY ANALYSIS - A TOOL FOR PROCESS PERFORMANCE MEASURES AND METRICS - A CASE STUDY

Nonparametric Two-Sample Tests. Nonparametric Tests. Sign Test

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Vitamin A Deficiency: Counting the Cost in Women s Lives

STAT 350 Practice Final Exam Solution (Spring 2015)

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Standard Deviation Estimator

Two Related Samples t Test

THE KRUSKAL WALLLIS TEST

II. DISTRIBUTIONS distribution normal distribution. standard scores

Descriptive Statistics

DATA INTERPRETATION AND STATISTICS

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

Hypothesis Testing --- One Mean

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

Statistiek II. John Nerbonne. October 1, Dept of Information Science

UNIVERSITY OF NAIROBI

Lecture Notes Module 1

Independent t- Test (Comparing Two Means)

Lecture 1: Review and Exploratory Data Analysis (EDA)

Error Type, Power, Assumptions. Parametric Tests. Parametric vs. Nonparametric Tests

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Comparing Means in Two Populations

Module 2 Probability and Statistics

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Stats Review Chapters 9-10

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

AUSTRALIA AND NEW ZEALAND FACTSHEET

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Parametric and non-parametric statistical methods for the life sciences - Session I

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

Dongfeng Li. Autumn 2010

PROPERTIES OF THE SAMPLE CORRELATION OF THE BIVARIATE LOGNORMAL DISTRIBUTION

Rank-Based Non-Parametric Tests

11. Analysis of Case-control Studies Logistic Regression

NCSS Statistical Software

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

PRACTICE PROBLEMS FOR BIOSTATISTICS

HYPOTHESIS TESTING: POWER OF THE TEST

EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

Additional sources Compilation of sources:

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Introduction to Quantitative Methods

Statistics Review PSY379

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

CHAPTER 14 NONPARAMETRIC TESTS

2 Sample t-test (unequal sample sizes and unequal variances)

Means, standard deviations and. and standard errors

Normality Testing in Excel


2. Simple Linear Regression

NCSS Statistical Software

Capability Analysis Using Statgraphics Centurion

EFFECT OF DAILY VERSUS WEEKLY IRON FOLIC ACID SUPPLEMENTATION ON THE HAEMOGLOBIN LEVELS OF CHILDREN 6 TO 36 MONTHS OF URBAN SLUMS OF VADODARA

Projects Involving Statistics (& SPSS)

Chapter 13 Introduction to Linear Regression and Correlation Analysis

68 3,676, making progress

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

How to get accurate sample size and power with nquery Advisor R

Difference of Means and ANOVA Problems

Regression Analysis: A Complete Example

Permutation & Non-Parametric Tests

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

150 7,114, making progress

StatCrunch and Nonparametric Statistics

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Chapter 8 Paired observations

Introduction to Hypothesis Testing OPRE 6301

Testing for differences I exercises with SPSS

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

How Does My TI-84 Do That

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Transcription:

Assessment of Hemoglobin Level of Pregnant Women Before and After Iron Deficiency Treatment Using Nonparametric Statistics M. Abdollahian *, S. Ahmad *, S. Nuryani #, + D. Anggraini * School of Mathematical and Geospatial Sciences, RMIT University, Melbourne, Australia # Ulin Hospital (RSUD Ulin) Banjarmasin Indinesia + Department of Mathematics Lambung Mangkurat University, Banjarbaru, Indonesia Abstract Iron Supplementation is generally recommended during pregnancy to meet the iron requirement of both mother and fetus. When detected early in pregnancy, iron deficiency anemia (IDA) is associated with an increase in the risk of preterm delivery. Deficiency of Vitamin C also causes poor iron absorption leading to IDA. This paper deploys non-parametric statistics to analyse and compare the hemoglobin level of pregnant women before and after consumption of Vitamin C and Sulfas Ferroses. The objective of this study is to compare the hemoglobin level of pregnant women before and after the IDA treatment and assess the association between maternal hemoglobin (Hb) level and pregnancy outcome. Johnson transformation is used to estimate the proportions of women with hemoglobin level out side the required specification limits before and after the treatment. Wilcoxon Rank Sum Test is deployed to compare the hemoglobin level before and after the treatment. The result indicates a significant difference in the hemoglobin level before and after the treatment. Key words: Anemia, Hemoglobin, Johnson transformation, Performance analysis, Proportion of nonconformance, Wilcoxon Rank Sum Test. 1. Introduction Due to the increased iron requirements during pregnancy, pregnant women are recognized as the group most vulnerable to iron deficiency anemia (IDA). Anemia, as determined by low hemoglobin or hematocrit, is most common among women in their reproductive years. Until recently, it was assumed that anemia during pregnancy had few untoward sequelaes. In recent years it has been suggested that a proportional relationship between anemia and preterm delivery now exists [1]. Infants born to women with a low Hemoglobin level have subsequently resulted in a lower birth weight, height and Apgar scores. In Asia, the prevalence of anemia was estimated to be 44% in non-pregnant and 60% in pregnant women []. At least 50% of the anemia cases have been attributed to iron deficiency ([2]; [3]; [12]). Maternal iron deficiency anemia increases the risk of premature delivery and subsequent low birth weight, and may contribute to low iron status and poor health of infants ([1]; [11]; [6]). It was found that pregnant women with anemia are at a greater risk of perinatal mortality and morbidity ([5]; [9]; [8]). The results which have been gathered and analysed are due to a of trial experiment conducted in Indonesia relating to pregnant women. Specifically candidate mothers were observed over a nine month period and asked to consume 90 tablets of Vitamin C 0 mg and Sulfas Ferroses (SF) 350 mg after their third semester of pregnancy. In this paper we assess the performance of non-normal hemoglobin level data collected before and after treatment. The proportion of hemoglobin measurements falling below the WHO recommended level for either case is estimated using Johnson transformation. We will then use Wilcoxon Rank Sum Test to assess the effectiveness of the Vitamin C and Sulfas Ferroses (SF) treatment. 2. Subjects The study was conducted among 125 women enrolled in a maternity clinic in Banjarmasin Indonesia. We are using 125 pairs of data collected during February 2007- September 20. The pregnant women who deliver their baby in the clinic had been monitored during their ninemonth pregnancy and each consumed 90 tablets of Vitamin C 0 mg and Sulfas Ferroses (SF) 350 mg after their third semester of pregnancy. The hemoglobin level of the individual patient was measured before and after the treatment. One of the objectives in this research is to analyse the hemoglobin level of these women before and after treatment. We also assess the distribution of these data and take necessary steps to transform the data (before and after treatment) to estimate the proportion of women with hemoglobin level outside WHO recommended lower and upper specification limits {LSL, USL] of 11 and 14. In section 3.2 we explain the reason for transforming data and propose the transformation model to be adopted. At the start of the study the authors have used statistical package Minitab to fit the commonly used normal distribution function to each set of the hemoglobin data.

For both sets the p-value of the fit was less than 0.01 indicating the measurements of the hemoglobin level do not follow normal distribution. The histograms of the data are presented in Figures 1-2. The summary statistics including skewness, Upper and lower quartiles (Q3 and Q1), mean, standard deviation, minimum and maximum are given in table 1 below. Frequency 35 30 25 20 15 5 0 9.6 Hemoglobin level before treatment.2.8 11.4 Hemoglobin before treatment Figure 1: Distribution of the Hemoglobin level measurements before treatment. Frequency 50 40 30 20 0.5 11.0 Hemoglobin level after treatment 11.5 12.5 13.0 Hemoglobin after treatment Figure 2: Distribution of the Hemoglobin level measurements after treatment. Table 1-a: Descriptive Statistics for Hemoglobin level before and after treatment N Mean St. Dev Min Q1 Hb before 125.94 0.7 9.2.5 Hb after 125 12 0.63.5 11.5 Table 1-b: Descriptive Statistics for Hemoglobin level before and after treatment N Median Q3 Max Skewness Hb before 125 11 11.2 12.5 0.13 Hb after 125 12 12.5 14 -o.o7 3. Method 13.5 12.6 14.0 One of the main objectives of this paper is to estimate the proportion of women with the hemoglobin level outside the WHO lower and upper specification limits (LSL and USL) [13]. This requires the knowledge of the data distribution both prior to the treatment and after the treatment. Figures 1-2 show that none of the data set follows normal distribution. Furthermore, the summary statistics confirm a significant difference between the mean hemoglobin level before and after the treatment with positive skewness coefficient before the treatment and negative after the treatment. It is well documented that the low hemoglobin levels are a growing concern to the medical profession [3], [5]; therefore it is only logical that the aim should be to reduce the proportion of women with Hb level below the LSL. Since the data does not follow normal distribution we have used the Johnson transformation to estimate the proportion of non conforming data. This proportion is estimated using the performance analysis in the statistical package Minitab. We have employed the Wilcoxon Rank sum test which is a nonparametric alternative to paired test to assess the effectiveness of the treatment process undertaken by the clinic. 3.1 Johnson Transformation Many analyses require an assumption of normality. In cases when data is not normal, one can apply a function to make the data approximately normal and complete the required analysis. In this research we desire to perform a capability analysis on the non-normal hemoglobin data, therefore, we will first transform the data and then apply the performance analysis on the amended data. Depending on the nature of the data, there are many different functions such as square root, logarithm, power, reciprocal or arcsine, that one could apply to transform the data. The Johnson transformation function is selected from three types of functions in the Johnson system [4]. Because the functions cover a wide variety of distributions by changing the parameters, the statistical package usually finds an acceptable transformation. The Johnson transformation function is complicated, but is well suited for finding an appropriate transformation for our purposes. 3.2 Performance Analysis of Non-normal Data Process capability analysis is used to ensure that the outcomes of a process are capable to fulfil certain requirements or specifications. Application of process capability analysis is an essential part of the overall quality improvement process. The concept of process capability was introduced by Juran et al.[7]. The two most popular indices used in performance analysis are Cp, Cpk and defined as follows: USL LSL C p = (1) 6σ

USL µ µ LSL = min, 3σ 3σ C pk C min( C, C ) pk = (2) pu pl µ and σ are the population mean and standard deviation respectively. In the capability calculations, we are mainly interested in three points within the process distribution: the upper tail, the point of central tendency and the lower tail. In terms of quantiles, these points for the normal distribution correspond to X = µ 3σ, X = µ X = µ 3σ 0.00135 0.99865 + 0.50 In case of normally distributed data, it is easy to estimate quantile points. But for the non-normal data, it is quite cumbersome to estimate them. In fact when the population is not normal, these quantiles do not necessarily corresponds toµ + 3σ,µ, µ 3σ, respectively. To deal with non-normality; one approach is to transform the non-normal output data to approximately normal data using mathematical functions. In this paper we have used the Johnson transformation to transform the data and used the statistical package Minitab to estimate the capability indices and the proportion of data falling out side the specification limits. 3.3 Non-parametric Wilcoxon Rank Sum Test The Wilcoxon test is a nonparametric analog of the sample t-test because it does not require the data to come from a normally distributed population, as the t-test does. It is a nonparametric hypothesis test for the median of a single population. The procedure uses the null hypothesis that the population median (η) is equal to a hypothesized value (H0: η= η 0 ), and tests it against an alternative hypothesis, which can be either left-tailed η< η 0 ), righttailed (η>η 0 ), or two-tailed (η η 0 ). In this paper we apply the Wilcoxon test to the population of the hemoglobin difference of the study group. Since the differences would also be non-normal. The differences are defined as: Difference = Hb level after treatment Hb level before treatment; If the treatment is significantly increasing the hemoglobin level, then we expect the median of the population of differences to be greater than zero. However if the treatment had no significant effect then the population of differences would have a median of zero, i.e., we test the hypothesis: 4. Case Study and Discussion As stated earlier this study has used the hemoglobin measurements of 125 women who were monitored in a maternity clinic in Banjarmasin between January 2005 and December 20. Each patient was given 90 tablets of Vitamin C 0 mg and Sulfas Ferroses (SF) 350 mg after their third semester of pregnancy. The hemoglobin level of individual patient was measured before and after the treatment. The authors have examined the distribution of each data set (before and after treatment) and presented the results in Figures 1-2. The results clearly demonstrate the data does not follow normal distribution. The Johnson transformation was employed to convert the skewed data to approximately normal data. The capability analysis was performed on the data to estimate the proportion of patients whose globins level fall out side the WHO recommended specification limits defined by LSL=11 and USL=14. The WHO anemia classifications based on the hemoglobin level are: Hb: 11 gr% : normal Hb: 9- gr% : mild anemia Hb: 9- gr% : mild anemia Hb: 7 8 gr%: medium anemia Hb: < 7 gr% : severe anemia The results of the performance analysis are presented in Figures 3-4. The outputs indicate that if patients had not received the treatment then, approximately 416000 per million would have had anemia (PPM < LSL = 416000). Figure 4 shows the significant reduction of this proportion after the treatment, i.e. PPM < LSL = 32000 (under observed performance). Process Data LSL 11 Target * USL 14 Sample Mean.9392 Sample N 125 Mean.9392 StDev 0.699988 O bserved Performance PPM < LSL 416000.00 PPM > USL 0.00 PPM Total 416000.00 Process Capability of Hemoglobin before treatment Calculations Based on Normal Distribution Model 9.6.4 LSL 11.2 12.8 13.6 USL O v erall C apability Pp 0.71 PPL -0.03 PPU 1.46 Ppk -0.03 Exp. O v erall Performance PPM < LSL 534608.06 PPM > USL 6.14 PPM Total 534614.20 Figure 3: Performance analysis of Hemoglobin measurements before the treatment. H0 : η= 0.0 vs Ha : η > 0.0

Process Capability of Hemoglobin after treatment Calculations Based on Normal Distribution Model Box plot of the Hemoglobin level before and after treatment Process Data LSL 11 Target * USL 14 Sample Mean 008 Sample N 125 Mean 008 StDev 0.633156 O bserved Performance PPM < LSL 32000.00 PPM > USL 0.00 PPM Total 32000.00 LSL USL O v erall C apability Pp 0.79 PPL 0.53 PPU 1.05 Ppk 0.53 Exp. O v erall Performance PPM < LSL 56978.67 PPM > USL 795.60 PPM Total 57774.27 Data 14 13 12 11.5 11.0 11.5 12.5 13.0 13.5 14.0 Figure 4: Performance analysis of Hemoglobin measurements after the treatment We have also employed the non-parametric Wilcoxon Rank Sum test to investigate the effectiveness of the treatment in increasing the hemoglobin level of the women in the study group. Figure 5 presents the box plot comparison of the two data sets. The results of the Wilcoxon Rank Sum test are presented in Table 2 and show that the median of the differences is 1.05 and the 95% confidence interval for the median difference is [ 1.05 1.]. The results also indicate that the null hypotheses. H0: median of differences = 0.0 should significantly be rejected in favour of the alternative; Ha: median of differences > 0.0 with the p-value of 0.0. Therefore, we can conclude that the treatment has significantly increased the median of the hemoglobin level by 1.05 units. Furthermore, on average we would expect the median to increase between 1.05 to 1.. Table 2: Out put for Wilcoxon Signed Rank Test where difference = Hemoglobin after treatment - Hemoglobin before treatment. H0: median of differences = 0.0 versus Ha: median of differences > 0.0 Wilcoxon N N for Test Statistic Diff after-before 15625 14587 1319492 N P Estimated Median Diff after-before 15625 0.000 1.05000 9 Hemoglobin before treatment Hemoglobin after treatment Figure 5: Box plot of Hemoglobin level measurements before and after the treatment. 5. Conclusion Anemia is an increasingly prevalent factor in the Maternal Mortality Rate (MMR). It has been well established in the research literature that consumption of Vitamin C and Sulfas Ferroses during the pregnancy term reduces the incidence of hemoglobin level <11 which leads to anemia. The paper has investigated the distribution of the hemoglobin measurements before and after the treatment for 125 patients who have been monitored in the clinic. Johnson transformation is used to transfer the non-normal data. Performance analysis is employed to estimate the proportion of anemia patients before and after the treatments. The results show a significant reduction of 416000/00000 to 32000/00000 in the proportion after the treatment. The Wilcoxon Signed Rank test is used to assess the effectiveness of the treatment in increasing the hemoglobin level of pregnant women and reducing the maternal mortality rate. The results have showed an increase of 1.05 to 1. in the median of the hemoglobin level after the treatment. Therefore, we can conclude that the treatment has been extremely effective in reducing the anemia rate in pregnant women. Acknowledgement The authors would like to thank the medical practitioners and staff in the maternity clinic in Banjarmasin Indonesia for their effort in collecting this data and providing them to us for this research. 95% confidence interval for the median of the differences = [1.05 1.]

References [1] Allen LH (2000). Anemia and iron deficiency: effects on pregnancy outcome. Am J Clin Nutr 71, S1280 S1284. [2] Allen LH (2001). Biological mechanisms that might underlie iron's effects on fetal growth and preterm birth. J Nutr 131, S581 S589 [3] DeMaeyer E, Adiels-Tegman M (1985). The prevalence of anaemia in the world. World Health Stat Quart 38, 302 316. [4] Johnson NL (1949) System of frequency curves generated by methods of translation. Biometrika 36:149 176 [5]. Haas JD, Brownlie 4th T (2001). Iron deficiency and reduced work capacity: a critical review of the research to determine a causal relationship. J Nutr 131, S676 S688. [6] Kaiser LL, Allen LH (2002). Position of the American dietetic association: nutrition and lifestyle for a healthy pregnancy outcome. J Am Diet Assoc 2, 1479 1490. [7] Juran, J 1974, Juran's Quality Control Handbook, 3rd edn, McGraw-Hill, New York. [8] Ramakrishnan U, Yip R (2002). Experiences and challenges in industrialized countries: control of iron deficiency in industrialized countries. J Nutr 132, S820 S824 [9] Rasmussen KM (2001). Is there a causal relationship between iron deficiency or iron deficiency anemia and weight at birth, length of gestation and perinatal mortality? J Nutr 131, S590 S603. [] Rush D (2000). Nutrition and maternal mortality in the developing world. Am J Clin Nutr 72 (Suppl), S212 S240. [11] Scholl TO, Reilly T (2000). Anemia, iron and pregnancy outcome. J Nutr 130, S443 S447 [12] Singh K, Fong YF, Arulkumaran S (1998). Anaemia in pregnancy a cross-sectional study in Singapore. Eur J Clin Nutr 52, 65 70. [13] World Health Organization The world health report 2002: reducing risks, promoting healthy life. Geneva: World Health Organization; 2002