A distribution of sample means has a mean of 68, with a standard distribution of 12.

If you believe that content available by means of the Website (as defined in our Terms of Service) infringes one or more of your copyrights, please notify us by providing a written notice (“Infringement Notice”) containing the information described below to the designated agent listed below. If Varsity Tutors takes action in response to an Infringement Notice, it will make a good faith attempt to contact the party that made such content available by means of the most recent email address, if any, provided by such party to Varsity Tutors.

Your Infringement Notice may be forwarded to the party that made the content available or to third parties such as ChillingEffects.org.

Please be advised that you will be liable for damages (including costs and attorneys’ fees) if you materially misrepresent that a product or activity is infringing your copyrights. Thus, if you are not sure content located on or linked-to by the Website infringes your copyright, you should consider first contacting an attorney.

Please follow these steps to file a notice:

You must include the following:

A physical or electronic signature of the copyright owner or a person authorized to act on their behalf; An identification of the copyright claimed to have been infringed; A description of the nature and exact location of the content that you claim to infringe your copyright, in \ sufficient detail to permit Varsity Tutors to find and positively identify that content; for example we require a link to the specific question (not just the name of the question) that contains the content and a description of which specific portion of the question – an image, a link, the text, etc – your complaint refers to; Your name, address, telephone number and email address; and A statement by you: (a) that you believe in good faith that the use of the content that you claim to infringe your copyright is not authorized by law, or by the copyright owner or such owner’s agent; (b) that all of the information contained in your Infringement Notice is accurate, and (c) under penalty of perjury, that you are either the copyright owner or a person authorized to act on their behalf.

Consider the population below with population mean = μ and standard deviation = σ. Next, we take many samples of size n, calculate the mean for each one of them, and create a distribution of the sample means. This distribution is called the Sampling Distribution of Means. Technically, a sampling distribution of a statistic is the distribution of values of the statistic in all possible samples of the same size from the same population.

A distribution of sample means has a mean of 68, with a standard distribution of 12.

The Population Distribution

A distribution of sample means has a mean of 68, with a standard distribution of 12.

The population is distributed as N(μ, σ)

The Distribution of Sample Means

A distribution of sample means has a mean of 68, with a standard distribution of 12.

This is also know as a Sampling Distribution of Means. It has a mean of μ and a standard deviation of

A distribution of sample means has a mean of 68, with a standard distribution of 12.
. Thus,

If a population is N(μ, σ) then the distribution of sample means of size n is N(μ, σ/sqrt(n)).

Standard Error of the Mean

A distribution of sample means has a mean of 68, with a standard distribution of 12.

The standard error of the mean is the standard deviation of the sampling distribution of the sample means.

The Central Limit Theorem

  • If a population is N(μ, σ) then the distribution of sample means of n independent observations is N(μ,
    A distribution of sample means has a mean of 68, with a standard distribution of 12.
    ).
  • When the population is normally distributed the sampling distribution of the sample means is also normally distributed.
  • For large n the sampling distribution is approximately normal even when the population itself is not normally distributed.

    Some Example Problems

    A test of computer anxiety has a population mean of 65 and variance of 144.

    1. A random sample of size 36 is drawn from the population with a sample mean of 68. What is the probability of selecting a sample with a smaller mean by chance alone?

    If the standard deviation, σ, is known, we can transformto an approximately standard normal variable, Z:

     

    A distribution of sample means has a mean of 68, with a standard distribution of 12.

    Example:

    From the previous example, μ=20, and σ=5. Suppose we draw a sample of size n=16 from this population and want to know how likely we are to see a sample average greater than 22, that is P(> 22)?

    A distribution of sample means has a mean of 68, with a standard distribution of 12.

    So the probability that the sample mean is greater than 22 is between 0.005 and 0.025 (or between 0.5% and 2.5%)

    Learning Outcomes

    • Recognize central limit theorem problems

    The central limit theorem for sample means says that if you keep drawing larger and larger samples (such as rolling one, two, five, and finally, ten dice) and calculating their means, the sample means form their own normal distribution (the sampling distribution). The normal distribution has the same mean as the original distribution and a variance that equals the original variance divided by, the sample size. The variable n is the number of values that are averaged together, not the number of times the experiment is done.

    Central Limit Theorem

    Suppose X is a random variable with a distribution that may be known or unknown (it can be any distribution). Using a subscript that matches the random variable, suppose:

    1. μX = the mean of X
    2. σX = the standard deviation of X

    If you draw random samples of size n, then as n increases, the random variable [latex]\displaystyle\overline{{X}}[/latex].

    To put it more formally, if you draw random samples of size n, the distribution of the random variable [latex]\displaystyle\overline{{X}}[/latex], which consists of sample means, is called the sampling distribution of the mean. The sampling distribution of the mean approaches a normal distribution as n, the sample size, increases.

    The random variable [latex]\displaystyle\overline{{X}}[/latex] in one sample.

    [latex]\displaystyle\frac{{\overline{X}-{\mu}_{x}}}{{\frac{{\sigma{x}}}{{\sqrt{n}}}}}[/latex]

    [latex]\displaystyle{\mu}_{x}[/latex] is the average of both X and [latex]\displaystyle\overline{X}[/latex]
    [latex]\displaystyle{\sigma}\overline{x} = \frac{{\overline{X}-{\mu}_{x}}}{{\frac{{\sigma{x}}}{{\sqrt{n}}}}}[/latex] = standard deviation of [latex]\displaystyle\overline{{X}}[/latex] and is called the standard error of the mean.
    To find probabilities for means on the calculator, follow these steps:

    • 2nd DISTR
    • 2:normalcdf
    • normalcdf  (Lower value of the area, upper value of the area, mean,[latex]\displaystyle\sqrt{\frac{{\text{standard deviation}}}{{\text{sample size}}}}[/latex]
    • where: mean is the mean of the original distribution standard deviation is the standard deviation of the original distribution sample size =n

    Example

    An unknown distribution has a mean of 90 and a standard deviation of 15. Samples of sizen = 25 are drawn randomly from the population.

    1. Find the probability that the sample mean is between 85 and 92.
    2. Find the value that is two standard deviations above the expected value, 90, of the sample mean.

    A distribution of sample means has a mean of 68, with a standard distribution of 12.

    Solution:

    normalcdf: (lower value, upper value, mean, standard error of the mean)
    The parameter list is abbreviated (lower value, upper value, μ,[latex]\displaystyle\frac{{\sigma}}{{\sqrt{n}}}[/latex]

    normalcdf: (85,92,90, [latex]\displaystyle\frac{{15}}{{\sqrt{25}}}[/latex] = 0.6997

    To find the value that is two standard deviations above the expected value 90, use the formula:

    • value = [latex]\displaystyle{\mu}_{x}[/latex] + (# of STDEVs)[latex]\displaystyle\left(\frac{{{\sigma}_{x}}}{{\sqrt{n}}}\right)[/latex]
    • value = 90 + 2 ([latex]\displaystyle\frac{{15}}{{\sqrt{25}}}[/latex] )= 96

    The value that is two standard deviations above the expected value is 96. The standard error of the mean is [latex]\displaystyle\frac{{\sigma}}{{\sqrt{n}}}[/latex] = [latex]\displaystyle\frac{{15}}{{\sqrt{25}}}[/latex] = 3.

    Recall that the standard error of the mean is a description of how far (on average) that the sample mean will be from the population mean in repeated simple random samples of size n.

    Try it

    An unknown distribution has a mean of 45 and a standard deviation of eight. Samples of size n = 30 are drawn randomly from the population. Find the probability that the sample mean is between 42 and 50.

    P(42 < [latex]\displaystyle\overline{x}[/latex] < 50) = 42, 50, 45, [latex]\displaystyle\frac{{8}}{{\sqrt{30}}}[/latex] = 0.9797

    Example

    The length of time, in hours, it takes an “over 40” group of people to play one soccer match is normally distributed with a mean of two hours and a standard deviation of 0.5 hours. A sample of size n = 50 is drawn randomly from the population. Find the probability that the sample mean is between 1.8 hours and 2.3 hours.

    The length of time, in hours, it takes an “over 40” group of people to play one soccer match is normally distributed with a mean of two hours and a standard deviation of 0.5 hours. A sample of size n = 50 is drawn randomly from the population. Find the probability that the sample mean is between 1.8 hours and 2.3 hours.

    Let [latex]\displaystyle\overline{X}[/latex] = the mean time, in hours, it takes to play one soccer match.

    If [latex]{\mu}_{x}[/latex]= _________, [latex]{\sigma}_{x}[/latex] = __________, and n = ___________, then X~ N(______, ______) by the central limit theorem for means.

    [latex]{\mu}_{x}[/latex] = 2
    [latex]{\sigma}_{x}[/latex]  =  0.5
    n  =  50 and
    X~N  (2, [latex]\frac{{0.5}}{{\sqrt{50}}}[/latex])

    Find P(1.8 < [latex]\displaystyle\overline{x}[/latex] < 2.3)

    Solution:

    P(1.8 < [latex]\displaystyle\overline{x}[/latex] < 2.3)

    P(1.8 < [latex]\displaystyle\overline{x}[/latex] < 2.3) =  0.9977

    normalcdf : (1.8, 2.3, 2, [latex]\displaystyle\frac{{0.5}}{{\sqrt{50}}}[/latex]) = 0.9977

    The probability that the mean time is between 1.8 hours and 2.3 hours is 0.9977.

    Try it

    The length of time taken on the SAT for a group of students is normally distributed with a mean of 2.5 hours and a standard deviation of 0.25 hours. A sample size of n = 60 is drawn randomly from the population. Find the probability that the sample mean is between two hours and three hours.

    P(2 < [latex]\overline{x}[/latex] < 3) = normalcdf( 2, 3, 2.5, [latex]\frac{0.25}{\sqrt{60}}[/latex] = 1

    To find percentiles for means on the calculator, follow these steps.

    • 2nd DIStR
    • 3:invNorm
    • k = invNorm (area to the left of k, mean[latex]\displaystyle\sqrt{\frac{{\text{standard deviation}}}{{\text{sample size}}}}[/latex]), where: k = the kth percentile mean is the mean of the original distribution standard deviation is the standard deviation of the original distribution sample size = n

    Example

    In a recent study reported Oct. 29, 2012 on the Flurry Blog, the mean age of tablet users is 34 years. Suppose the standard deviation is 15 years. Take a sample of size n = 100.

    1. What are the mean and standard deviation for the sample mean ages of tablet users?
    2. What does the distribution look like?
    3. Find the probability that the sample mean age is more than 30 years (the reported mean age of tablet users in this particular study).
    4. Find the 95th percentile for the sample mean age (to one decimal place).

    Solution:

    1. Since the sample mean tends to target the population mean, we have μX = μ = 34. The sample standard deviation is given by [latex]\displaystyle\frac{{\sigma}}{{\sqrt{n}}}=\frac{{15}}{{\sqrt{100}}}=\frac{{15}}{{10}}={1.5}[/latex]
    2. The central limit theorem states that for large sample sizes(n), the sampling distribution will be approximately normal.
    3. The probability that the sample mean age is more than 30 is given by P(Χ > 30) = normalcdf(30,E99,34,1.5) = 0.9962
    4. Let k = the 95th percentile. k = invNorm(0.95, 34, [latex]\displaystyle\frac{{15}}{{\sqrt{100}}}[/latex]) = 36.5

    Try it

    In an article on Flurry Blog, a gaming marketing gap for men between the ages of 30 and 40 is identified. You are researching a startup game targeted at the 35-year-old demographic. Your idea is to develop a strategy game that can be played by men from their late 20s through their late 30s. Based on the article’s data, industry research shows that the average strategy player is 28 years old with a standard deviation of 4.8 years. You take a sample of 100 randomly selected gamers. If your target market is 29- to 35-year-olds, should you continue with your development strategy?

    You need to determine the probability for men whose mean age is between 29 and 35 years of age wanting to play a strategy game. Find P(29 < [latex]\displaystyle\overline{x}[/latex] < 35) = normalcdf = 0.0186

    (29, 35, 28, [latex]\displaystyle\frac{{4.8}}{{\sqrt{100}}}[/latex] = 0.0186

    You can conclude there is approximately a 2% chance that your game will be played by men whose mean age is between 29 and 35.

    Example

    The mean number of minutes for app engagement by a tablet user is 8.2 minutes. Suppose the standard deviation is one minute. Take a sample of 60.

    1. What are the mean and standard deviation for the sample mean number of app engagement by a tablet user?
    2. What is the standard error of the mean?
    3. Find the 90th percentile for the sample mean time for app engagement for a tablet user. Interpret this value in a complete sentence.
    4. Find the probability that the sample mean is between eight minutes and 8.5 minutes.

    Solution:

    1. [latex]\displaystyle{\mu}_{\overline{x}}={\mu}=8.2[/latex], [latex]\displaystyle{\sigma}_{\overline{x}}=\frac{{\sigma}}{{\sqrt{n}}}=\frac{{1}}{{\sqrt{60}}} = 0.13[/latex]This allows us to calculate the probability of sample means of a particular distance from the mean, in repeated samples of size 60.
    2. Let k = the 90th percentile, k = invNorm(0.9, 8.2,  [latex]\displaystyle\frac{{1}}{{\sqrt{60}}}[/latex]) = 8.37. This values indicates that 90 percent of the average app engagement time for table users is less than 8.37 minutes.
    3. P(8 < [latex]\displaystyle\overline{x}[/latex] < 8.5) = normalcdf(8, 8.5, 8.2[latex]\displaystyle\frac{{1}}{{\sqrt{60}}}[/latex]) = 0.9293

    Try it

    Cans of a cola beverage claim to contain 16 ounces. The amounts in a sample are measured and the statistics are n = 34,[latex]\displaystyle\overline{x}[/latex] = 16.01 ounces. If the cans are filled so that μ = 16.00 ounces (as labeled) and σ= 0.143 ounces, find the probability that a sample of 34 cans will have an average amount greater than 16.01 ounces. Do the results suggest that cans are filled with an amount greater than 16 ounces?

    We have P([latex]\displaystyle\overline{x}[/latex] > 16.01) = normalcdf(16.01, E99, 16, [latex]\displaystyle\frac{{0.143}}{{\sqrt{34}}}[/latex]= 0.3417

    Since there is a 34.17% probability that the average sample weight is greater than 16.01 ounces, we should be skeptical of the company’s claimed volume. If I am a consumer, I should be glad that I am probably receiving free cola. If I am the manufacturer, I need to determine if my bottling processes are outside of acceptable limits.

     

    References

    Baran, Daya. “20 Percent of Americans Have Never Used Email.”WebGuild, 2010. Available online at http://www.webguild.org/20080519/20-percent-of-americans-have-never-used-email (accessed May 17, 2013).

    Data from The Flurry Blog, 2013. Available online at http://blog.flurry.com (accessed May 17, 2013).

    Data from the United States Department of Agriculture.

    Concept Review

    In a population whose distribution may be known or unknown, if the size ( n) of samples is sufficiently large, the distribution of the sample means will be approximately normal. The mean of the sample means will equal the population mean. The standard deviation of the distribution of the sample means, called the standard error of the mean, is equal to the population standard deviation divided by the square root of the sample size (n).

    Formula Review

    The Central Limit Theorem for Sample Means:[latex]\displaystyle\overline{X}{\sim}{N}({\mu}_{x},\frac{{{\sigma}_{x}}}{{\sqrt{n}}})[/latex]

    When the mean of a distribution of scores is 50 and the standard deviation is 10?

    Normal distribution with a mean of 50 and standard deviation of 10. 68% of the area is within one standard deviation (10) of the mean (50).

    When the mean of a number is 18 what is the mean of the sampling distribution?

    So when mean of a number is going to be equals to 18, then mean of a sampling distribution? It remains the same, which is 18 only.

    What is the standard error of the mean if the sample size is 25 with standard deviation?

    Thus, for a sample of N = 25 and population standard deviation of s = 100, the standard error of the mean is 100/5 or 20.

    What is the mean of the distribution of sample means?

    The mean of the distribution of sample means is called the Expected Value of M and is always equal to the population mean μ.