课 Progress
0% Complete

Conditions for geometric models

What is the geometric distribution?

  • The geometric distribution models the number of trials needed to reach the first success

    • For example, how many times will you have to roll a dice until it lands on a ‘6’ for the first time

  • The notation for the geometric distribution is Geo open parentheses p close parentheses 

    • For a random variable X that has the geometric distribution you can write X tilde Geo open parentheses p close parentheses space

    • X is the number of trials it takes to reach the first success

      • For example, X equals 8 means the first success occurred on the 8th trial

    • p is the fixed probability of success in any one trial

What are the conditions for using a geometric model?

  • A geometric model can be used for an experiment that satisfies the following conditions:

    • The experiment consists of an indefinite number of successive trials

    • The outcome of each trial is independent of the outcomes of all other trials

    • There are exactly two possible outcomes for each trial (success and failure)

    • The probability of success in any one trial (p) is constant 

  •  Note that these conditions are very similar to the conditions for the binomial distribution

    • But for a binomial distribution the number of trials (bold italic n) is fixed

      • And you count the number of successes

    • While for a negative binomial distribution the experiment continues until the first success is achieved

      • And you count the number of trials it takes to reach that first success

When might the conditions not be satisfied?

  • If asked to criticise a geometric model, you may be able to question whether the trials are really independent

    • For example, someone may be repeating an activity until they achieve a success

      • The trials may not be independent because the person gets better from practising the activity

      • This also means the probability of success, p, is not constant

    • In order to proceed using the model, you would have to assume that the trials are independent

Examiner Tips and Tricks

  • Replace the word “trials” with the context (e.g. “flips of a coin”) when commenting on conditions and assumptions

Geometric probabilities

What are the probabilities for the geometric distribution?

  • If X space tilde space Geo open parentheses p close parentheses then X has the probability function:

    • Error converting from MathML to accessible text.

    • the random variable X is the number of trials needed to get the first success

    • p is the constant probability of success in one trial

    • straight P open parentheses X equals x close parentheses is the probability that the first success will occur on the x th trial

  • Note that that is the product of

    • the probability of first getting bold italic x bold minus bold 1 failures,  ,

    • and the probability of getting a success in the bold italic x bold th trialp

  • Also note that there is no greatest possible x

    • It could require any number of trials to reach the first success

    • However straight P open parentheses X equals x close parentheses gets closer and closer to zero as x gets larger

  • Your calculator may allow you to calculate Geometric probabilities directly

    • i.e., without having to use the above formula

What are the properties of the geometric distribution?

  • Note that straight P open parentheses X equals 1 close parentheses comma space straight P open parentheses X equals 2 close parentheses comma space straight P open parentheses X equals 3 close parentheses comma space straight P open parentheses X equals 4 close parentheses comma space... equals p comma space p open parentheses 1 minus p close parentheses comma space p open parentheses 1 minus p close parentheses squared comma space p open parentheses 1 minus p close parentheses cubed comma space... space

    • This means that the geometric probabilities form a geometric sequence

      • The first term is p

      • The common ratio is open parentheses 1 minus p close parentheses

    • This is where the geometric distribution gets its name!

  • Assuming that 0 less than p less than 1, then it is also true that 0 less than open parentheses 1 minus p close parentheses less than 1

    • This means that  straight P open parentheses X equals 1 close parentheses greater than straight P open parentheses X equals 2 close parentheses greater than straight P open parentheses X equals 3 close parentheses greater than straight P open parentheses X equals 4 close parentheses greater than...

      • i.e., the probabilities form a decreasing sequence

      • and straight P open parentheses X equals 1 close parentheses is the largest probability in the sequence

      • Therefore X equals 1 is the mode of the distribution

3-2-3-cie-fig0-geo-dist-graphs
  • The geometric distribution has no ‘memory’

    • It doesn’t matter what has happened previously, or how many ‘failures’ in a row there have been

    • The probability of getting a ‘success’ in any trial is always p

    • This means that the number of additional trials needed for the first success is not dependent on the number of trials that have already occurred

      • e.g. if 5 (failed) trials have already occurred, the probability of the first success happening after 7 trials is simply the probability of success happening after 2 trials in the first place, i.e. p open parentheses 1 minus p close parentheses

What are the cumulative probabilities for the geometric distribution?

  • If X space tilde space Geo open parentheses p close parentheses then X has the cumulative geometric distribution:

    • the random variable X is the number of trials needed to get the first success

    • p is the constant probability of success in one trial

    • straight P open parentheses X less or equal than x close parentheses is the probability that the first success will occur on or before the x th trial

  • Your calculator may allow you to calculate Geometric probabilities directly

    • i.e., without having to use the above formula

  • The formula can be proved as follows

    • If the first success occurs on or before the x th trial, that means that the first x trials have not all been failures

      • The probability of getting x failures in a row is Error converting from MathML to accessible text.

      • So the probability of that not happening is 1 minus open parentheses 1 minus p close parentheses to the power of x

  • Alternatively, it can be proved algebraically

    • The geometric probabilities form a geometric sequence with first term p and common ratio 1 minus p

    • Putting that into the geometric series formula S subscript n equals fraction numerator a open parentheses 1 minus r to the power of n close parentheses over denominator 1 minus r end fraction gives

  • Because limit as x rightwards arrow infinity of open parentheses 1 minus p close parentheses to the power of x equals 0 (assuming 0 less than p less than 1), it follows that

sum from r equals 1 to infinity of straight P open parentheses X equals r close parentheses equals 1 minus 0 equals 1

  • So the sum of all probabilities is equal to 1

    • This is a requirement of any probability distribution

Examiner Tips and Tricks

  • If you forget the formulae in the exam, you can often still do questions using basic probability concepts and geometric series

Worked Example

Joshua is an inspector in a factory.  His job is to randomly sample widgets produced by a particular machine, until he finds a widget that has a defect.  If he finds a widget with a defect, then the machine must be stopped until a repair procedure has been completed.  Given that the probability of a widget being defective is 0.002, find the probability that:

a) the 10th widget that Joshua inspects is the first one that is defective

geo-probs-we-a

b) the 250th widget that Joshua inspects is the first one that is defective

geo-probs-we-b

c) the 250th widget that Joshua inspects is the first one that is defective, given that the first 240 were not defective

geo-probs-we-c

d) Joshua will inspect 250 or fewer widgets before finding the first one that is defective

geo-probs-we-d

e) Joshua will need to inspect more than 250 widgets before finding the first one that is defective.

geo-probs-we-e

f) State an assumption you have used in calculating the above probabilities.

geo-probs-we-f

Geometric mean & variance

What are the mean and variance of the geometric distribution?

  • If X space tilde space Geo open parentheses p close parentheses, then

    • The mean of X is  straight E open parentheses X close parentheses equals mu italic equals 1 over p

  • The variance of X is  Var open parentheses X close parentheses equals sigma to the power of italic 2 italic equals fraction numerator italic 1 italic minus p over denominator p to the power of italic 2 end fraction

  • You need to be able to use these formulae to answer questions about the geometric distribution

Examiner Tips and Tricks

  • If a question gives you the value of the mean or variance, form an equation in p and solve it

Worked Example

Palamedes is rolling a biased dice for which the probability of the dice landing on a ‘6’ is p.  The random variable X represents the number of times he needs to roll the dice until a ‘6’ appears for the first time.  Given that the standard deviation of X is 2 square root of 3, find:

a) the value of p.

geo-mean-var-we-a

b) the mean of X

geo-mean-var-we-b

c) straight P open parentheses X less or equal than 3 close parentheses.

geo-mean-var-we-c

Responses

您的邮箱地址不会被公开。 必填项已用 * 标注