Standard scores and normal distributions

17


Standard scores and normal distributions




Introduction


In this chapter we will discuss standard distributions. Standard scores represent the position of a score or measurement in relation to an overall set of scores. Standard distributions are also useful for comparing scores from different sets of measurements. Standard scores are used in both clinical practice and research in the health sciences. In clinical practice the score of a patient is often compared with a known distribution to interpret the score. Measurements such as blood pressure or cholesterol levels are compared with a distribution to interpret the patient’s result.


The aims of this chapter are to:




Standard scores (z scores)


Consider this example: infant A walked unaided at the age of 40 weeks, while infant B is 65 weeks old but still cannot walk. What sense can we make of these measurements? Could infant B need further clinical investigation in case he has some neurological abnormality? The fact that infant B is unable to walk at the age of 65 weeks is not very informative in the absence of additional information about how this compares with norms for other children. However, say that it is known that the distribution of walking ages is such that µ = 50 weeks and σ = 5. Assuming that the frequency distribution is normal, the frequency polygon representing the population would look something like that shown in Figure 17.1.



In this instance, infant B’s score is clearly above the mean. In fact, by inspection, we can see the infant’s score at this point of time was three standard deviations above (+3) the mean (65 = 50 + (3 × 5)). In contrast, infant A began walking earlier than the mean, his score of 40 being two standard deviations below (−2) the mean. In general, any ‘raw’ score in a frequency distribution can be described in terms of its distance from the mean. The process of transforming a score into a measurement based on its distance from the mean in standard deviations is called standardizing the score. Such ‘transformed’ scores are called z scores or standard scores.


A z score represents how many standard deviations a given raw score is above or below the mean. The equation for transforming specific raw scores into z scores is given as:


image


For the above equation, x is the raw score, image or µ is the mean of the distribution from which the score was drawn and s or σ is the standard deviation of the distribution. That is, when we know the mean and standard deviation of a distribution, we can transform any raw score into a z score. Conversely, when the z score is known, we can use the above equations to calculate the corresponding raw scores.


In the above example, the z scores corresponding to the infants’ raw scores are:


image


These calculations support our previous observations that A’s score was two standard deviations below the mean and B’s score was three standard deviations above the mean. In other words, A walked very early and B was a very late starter. The particular value of standardizing scores for understanding clinical or research evidence will be discussed in the context of the concepts of normal and standard normal distributions.



Normal distributions


Many variables measured in the biological, behavioural and clinical sciences are approximately ‘normally’ distributed, i.e. they have a characteristic ‘bell-shaped’ curve. What is meant by a normal distribution is illustrated by the normal curve (see Fig. 17.2), which is a frequency polygon representing the theoretical distribution of population scores. We assume here that the variable x has been measured on an interval or ratio scale and that it is a continuous variable such as weight, height or blood pressure.



The normal curve has the following characteristics:



We need not worry about the actual formula for the normal curve. Rather, the point is that, given that the functional relationship between f and x is known, integral calculus can be used to calculate areas under the curve for any value of x. All normal curves have the same general mathematical form; whether we are graphing IQ or weight, the same bell shape will appear. The only differences between the curves are the mean value and the amount of variation. This is why the mean and the standard deviation provide us with important information about any particular normal distribution. Note that it is unlikely that any real data are exactly normally distributed. Rather, the normal distribution is a mathematical model that is useful for representing real distributions.



The standard normal curve


If we transform the raw scores of a variable into z scores and then plot the frequency polygon for the distribution, we will have a standard curve. If the original distribution was normal, then the frequency polygon will be a standard normal curve. Standard normal curves are identical regardless of the nature of the original variables.


By transforming raw scores into z scores we are adjusting for differences in means and standard deviations, which are the only things that distinguish between non-standardized normal curves.


The standard normal curve has the following additional properties:



1. The mean is always 0 (zero). For the previous example, the z score corresponding to µ = 50 (as in the ‘infants’ walking age’ example) is:


image


2. The mean = median = mode, as the curve is symmetrical.


3. The standard deviation of z scores is always 1 (one). For instance, the z score for 55 (which is one standard deviation above the mean) is:


image


4. It is assumed that the total area under the curve adds up to 1.00. Since the normal curve is symmetrical, 0.5 of the area falls above z = 0 and 0.5 falls below z = 0. This is another way of saying that 50% of the total cases fall below the mean, and 50% of the cases fall above the mean (which is equal to the median).


5. More generally, we can use appropriate statistical tables to estimate the area under the standard normal curve for any given z scores. These areas are available in table form (see Appendix A) so that for any value of z we can read off the corresponding area.


6. The area under the curve between any two points is directly proportional to the percentage of cases falling above, below or between those two points. We can use the standard normal curve to calculate the percentage of scores falling between any specified two scores.

Stay updated, free articles. Join our Telegram channel

Apr 12, 2017 | Posted by in MEDICAL ASSISSTANT | Comments Off on Standard scores and normal distributions

Full access? Get Clinical Tree

Get Clinical Tree app for offline access