๐ง๐ผโ๐ฌ Z Test
One Sample and Two Sample Tests
Standard Normal Deviate Tests or Large Sample Tests
- If the sample size
n > 30
then it is considered as large sample and if the sample sizen < 30
then it is considered as small sample.
SND Test or One Sample (Z-test)
- Given by
R.A. Fisher
.
Case-I: Population standard deviation (ฯ) is known Assumptions:
Assumptions:
- Population is normally distributed
- The sample is drawn at random
Conditions:
- Population standard deviation
ฯ is known
- Size of the sample is large (say n > 30)
Procedure:
- Let x1, x2, โฆโฆโฆ xn be a random sample size of n from a normal population with mean ฮผ and variance ฯ2.
- Let x be the sample mean of sample of size โnโ
- Null hypothesis (H0): population mean (ยต) is equal to a specified value ฮผ0
- i.e. H0 : ยต = ยต0
- Under H0, the test statistic is
- i.e. the above statistic follows Normal Distribution with mean โ0โ and varaince โ1โ. If the calculated value of Z < table value of Z at 5% level of significance, H0 is accepted and hence we conclude that there is no significant difference between the population mean and the one specified in H0 as ยต0.
Case-II: If ฯ is not known
Assumptions:
- Population is normally distributed
- Sample is drawn from the population should be random
- We should know the population mean Conditions:
- Population standard deviation
ฯ is not known
- Size of the sample is large (say n > 30)
- Null hypothesis (H0): ยต = ยต0 under H0, the test statistic
- If the calculated value of Z < table value of Z at 5% level of significance, H0 is accepted and hence we conclude that there is no significant difference between the population mean and the one specified in H0 otherwise we do not accept H0.
- The table value of Z at 5% level of significance =
1.96
and table value of Z at 1% level of significance =2.58
.
Two sample Z-Test or Test of significant for difference of means
Case-I: When ฯ is known
Assumptions:
- Populations are distributed normally
- Samples are drawn independently and at random Conditions:
- Populations standard deviation
ฯ is known
- Size of samples are large
Procedure:
- Let x1 be the mean of a random sample of size n1 from a population with mean ยต1 and variance ฯ22
- Let x2 be the mean of a random sample of size n2 from another population with mean ยต2 and variance ฯ22
- Null hypothesis H0: ฯ1 = ฯ2
- Alternative Hypothesis H1: ฯ1 โ ฯ2
- i.e. The null hypothesis states that the population means of the two samples are identical. Under the null hypothesis the test statistic becomes
- i.e. the above statistic follows Normal Distribution with mean โ0โ and variance โ1โ. If ฯ12 = ฯ22 = ฯ2 (say) i.e. both samples have the same standard deviation then the test statistic becomes
- If the calculated value of |Z| < table value of Z at 5% level of significance, H0 is accepted.
- Otherwise rejected. If H0 is accepted means, there is no significant difference between two population means of the two samples are identical.
Example: The Average panicle length of 60 paddy plants in field No. 1 is 18.5 cms and that of 70 paddy plants in field No. 2 is 20.3 cms. With common S.D. 1.15 cms. Test whether there is significant difference between two paddy fields w.r.t panicle length.
Solution:
- Null hypothesis: H0: There is no significant difference between the means of two paddy fields w.r.t. panicle length.
- H0: ฮผ1 = ฮผ2
- Under H0, the test statistic becomes
- Substitute the given values in equation (1), we get
- Calculated value of Z = 8.89
- Calculated Value of Z > table value of Z at 5% LOS (1.96), H0 is rejected. This means, there is highly significant difference between two paddy fields w.r.t. panicle length.
Case-II: When ฯ is not known
Assumptions:
- Populations are normally distributed
- Samples are drawn independently and at random Conditions:
- Population standard deviation ฯ is not known
- Size of samples are large
Null hypothesis H0: ฮผ1 = ฮผ2
- Under the null hypothesis the test statistic becomes
- If the calculated value of |Z| < table value of Z at 5% level of significance, H0 is accepted otherwise rejected.
Example
- A breeder claims that the number of filled grains per panicle is more in a new variety of paddy ACM.5 compared to that of an old variety ADT.36. To verify his claim a random sample of 50 plants of ACM.5 and 60 plants of ADT.36 were selected from the experimental fields.
- The following results were obtained:
Sol:
- Null hypothesis H0: ฮผ1 = ฮผ2 (i.e. the average number of filled grains per panicle is the same for both ACM.5 and ADT.36)
- Under H0, the test statistic becomes
- Substitute the given values in equation (1), we get
- Calculated value of Z > Table value of Z at 5% LOS (1.96), H0 is rejected. We conclude that the number of filled grains per panicle is significantly greater in ACM.5 than in ADT.36