The normal distribution is the most important distribution in business and economics.
Recall: the manufacturing plant that produces pieces of metal with two holes. The distance between the holes varies from one piece to another.
Construct a histogram in which the area of the bar representing each class interval is equal to the proportion of all pieces within this class interval. (i.e. the height of the bar equals the proportion of all pieces within the interval divided by the width of the interval).
The area under this histogram must equal 1. This is called a density scale.
Now say we increase the number to 1,000,000 sampled, rather than 1,000. We would now be able to divide the class intervals more finely.
Finally, taken to the limit, the histogram would become a smooth curve.
The probability that the distance in hole centers lies within a particular range is equal in value to the area under the smooth curve in this range.
|
|
|
|
NOTE: in Calculus terms
The Normal Distribution
The PDF of a Normal Random Variable is given by:
The normal distribution is a family of distributions that depend on
and
.
Common Characteristics of normal distributions
Symmetrical and bell-shaped
Probability that a value will lie within k standard deviations of the mean ±1 = 68.3% ±2 = 95.4% ±3 = 99.7%
Location and shape determined entirely by
and
.
The normal distribution is important because
It is a reasonably good approximation to many populations (especially in nature).
The probability distribution of the sample mean should be close to the normal distribution.
The normal distribution can be used in many instances to approximate the binomial distribution.
The Standard Normal Curve
If one expresses any normal random variable as a deviation from its mean, and measures these deviations in units of its standard deviation, the resulting random variable is called a Standard Normal Random Variable. Its probability distribution is called the Standard Normal Curve.
The Standard Normal Random Variable
NOTE: If any normal variable is expressed in standard units, its probability distribution is given by the Standard Normal Curve.
|
Example 1
J. Crew is considering a new line of clothes for men between 72 and 76 inches
in height. They want to estimate the size of this potential market. To accomplish
this, they need to know the probability distribution of the heights of adult
males. Given that the heights of adult males can be approximated by a normal
distribution with
solution Z = (X - 70) / 2 What is the value of Z corresponding to a height of 74 inches? Z = (74 - 70) / 2 = 4 / 2 = 2. |
of a standard normal random variable is equal to 0.
of a standard normal random variable is equal to 1.
The Probability that a Normal Random Variable lies between two points (in our example: J. Crew --> height of male between 72 and 76 inches) can be calculated in two steps:
| 1. | Find the points on the Standard Normal Distribution corresponding to these two points. | |
| (72 - 70) / 2 = 1 | (76 - 70) / 2 = 3 } | |
| 2. |
Determine the area under the Standard Normal Curve between the two points.
{ the area under the standard normal between 1 & 3 }
|
|
Using the Table of the Standard Normal Distribution (Appendix Table II)
| 1. |
Area between zero and a positive value.
The table gives the area under the standard normal curve from zero to a positive value, so just read it off the table. (row 1.1 column 0.0 --> 0.3643) |
| 2. |
Area between zero and a negative value.
symmetrical so do the same as above. |
| 3. |
Area to the left of some positive value.
look in the table and add 0.5 (because the probability to the left of 0 is 0.5). ( 0.500 + 0.3643 = 0.8643 ) |
| 4. |
Area to the right of some negative value.
same as #3. ( 0.500 + 0.4778 = 0.9778 ) |
| 5. |
Area to the right of some positive value.
area to the right of 0 = 0.5 so we subtract. ( 0.500 + 0.4505 = 0.0495 ) |
| 6. |
Area to the right of some negative value.
same as #5. ( 0.500 + 0.3531 = 0.1469 ) |
| 7. |
Area between two positive values.
difference between Area(1.82) and Area(1.10). ( 0.4656 + 0.3643 = 0.1013 ) |
| 8. |
Area between two negative values.
same as #7. ( Area(-1.60) - Area(-0.92) = 0.4452 + 0.3212 = 0.1240 ) |
| 9. |
Area between negative and positive values.
sum the two probabilities in the tables. ( 0.2257 + 0.1879 = 0.4136 ) |
|
Example 2
A Tire Company produces tires whose diameters are normally distributed with
solution
|
|
Example 3
An analyst for a competing tire company tells reporters that 90% of the tires have a diameter of 36.0020 inches or less and that 10% have a diameter of more than 36.0020 inches. The analyst is incorrect. What figure should be substituted for 36.0020 to make his statement true? solution
|
|
Example 4
The probability that a machine will be down for repairs next week is 1/2. A computer company has 100 such machines. Assuming statistical independence, what is the probability that at least 60 machines will be down. solution # of machines down has a binomial distribution
Because of the continuity correction P(x >= 60) can be approximated by
the probability that the value of a normal variable with
Z = (59.50 - 50) / 5 = 1.9 A2 shows P(0 < Z < 1.9) = 0.4713 so area to the right of 1.9 is 0.5000 - 0.4713 = 0.0287. This is the (approximate) probability that at least 60 machines will be down for repair. |