Course Project

1. Assume you have a population of 100 students, and you have collected data about four variables as follows:

Variable 1: “Gender” using the function “=RANDBETWEEN(1,2)” where the value “1” means male and the value “2”means female.

Variable 2: “Major” using the function “=RANDBETWEEN(1,3)” where the value “1” means “Finance”, the value “2”means “Accounting”, and the value “3”means “Marketing”.

Variable 3: “Course Grade” using the function “=TRUNC(NORM.INV(RAND(),μ,σ),0)”. The value of normal distribution mean is any value between 50 and 75, and the value of the standard deviation is any value between 5 and 8. Make sure that you do not have any value greater than 100 or less than 0.

Variable 4: “Credit Registered” using the function “=TRUNC(NORM.INV(RAND(),μ,σ),0)”. The value of normal distribution mean is any value between 12 and 15, and the value of the standard deviation is any value between 1.5 and 3. Make sure that you do not have any value greater than 24 or less than 3.

Once the data set is created, do the following:

A. Construct relative frequency distributions for variables “Gender” and “Major”.

B. Use any appropriate graph to display information in A.

C. Construct grouped data frequency distributions for variables “Course Grade” and “Credit Registered”.

D. Present the variable “Credit Registered” using histogram for each group of students in each major.

E. Use histograms to display information in C.

F. Collect a sample of size 16 for variables “Course Grade” and “Credit Registered” and show that the distribution of the sample mean is very close to results computed form the population.

G. Based on a sample of 25, compute the probability that the sample mean for “Course Grade” is between 65 and 80.