STATISTICS 230, FALL 2005


dice.jpg
Probability


Figure

Probability:

(Random House) a strong likelihood or chance of something. The relative possibility an event will occur ...the ratio of the number of actual occurrences to the total number of possible occurrences.

Three Types of Probability

1. Classical:

(equally probable outcomes) Let S=sample space (set of all possible distinct outcomes). Then the probability of an event =MATH

2. Relative Frequency Definition

The probability of an event in an experiment is the proportion (or fraction) of times the event occurs in a very long (theoretically infinite) series of (independent) repetitions of experiment. (e.g. probability of heads=0.4992)

3. Subjective Probability

The probability of an event is a "best guess" by a person making the statement of the chances that the event will happen. (e.g. 30% chance of rain)




Definitions1 and 2 are consistent with one another if we are careful in constructing our model.


Mathematical Probability

Random Experiment:

must be repeatable (at least in theory).

Has several possible distinct outcomes MATH

A single repetition of the experiment is a ``trial''.

The probability of an event A, denoted P(A), is a number between 0 and 1.

Examples:

(a) roll a fair die

(b) toss a fair coin until the first head appears

(c) Select a female student in Stat 230 and measure her height.

Sample Space S:

The set of all possible (indivisible) outcomes MATH of a random experiment.




Above examples:

(a) S={1,2,3,4,5,6}

(b) S={H,TH,TTH,TTTH,...}

(c) S={x $\in $R;100 $\leq $ x$\leq $ 300 cm.}




Discrete Sample space:

S has a finite or a countable number of points in it. (for example (a) and (b) above).

e.g. we write S={MATH} in this case. These "smallest" possible sets in S, $A_{i}=\{a_{i}\}$ are simple events.

Probability Model:

Consists of a sample space S and a probability distribution (to be defined) defined on S$.$




Event

Any subset of the sample space (in the case that S is discrete).

Let S={MATH} be a discrete sample space. Then probabilities $p_{i}=P(A_{i})$ are numbers attached to the simple events $A_{i}$ for $i=1,2,3,\dots $ such that the following two conditions hold:

  1. MATH

  2. MATH

The set of values MATH is called a probability distribution on S.




When the experiment is performed, some simple event in S must occur.

Definition: The probability P(A) of an general event A is the sum of the probabilities for all the simple events that make up A.

MATH


An event A made up of two or more simple events is called a compound event.

Properties of a Probability measure:

$0\leq P(A)\leq 1$ for any event A

MATH since $A\cup \overline{A}$=S$.$




Example: Toss a fair coin until the first head appears. S={H,TH,TTH,TTTH,...}. Then $p_{1}=1/2,$ $p_{2}=1/4,$ $p_{3}=1/8,...$ etc. The probability of A= "an even number of tosses is required" MATHProbability an odd number of tosses is required is MATH.

Example:

Toss a coin twice. Find the probability of getting 1 head.

Solution 1:

Let S={HH,HT,TH,TT} and assume the simple events each have probability 1/4. Then HT means head on the first toss and tails on the second. Since 1 head occurs for simple events HT and TH, A={HT,TH} we get P(1 head) =2/4=1/2.

Solution 2:

Let S = { 0 heads, 1 head, 2 heads } and assume the simple events each have probability 1/3. Then P(1 head) = 1/3.

Example:

Roll a red die and a green die. Find the probability the total is 6.

Solution:

Let (x,y) represent getting x on the red die and y on the green die.
MATHAll points in S have probability 1/36.

A={(1,5),(2,4),(3,3),(4,2),(5,1)}

P(A)=5/36.




What if the dice are identical?

MATH MATH
MATH
...............
MATH

S has 21 points. P(A)=3/21?




Example.

What is the probability that a 5-card poker hand contains four of a kind? (four twos or four threes, ...etc)


2.7 Machine Recognition of Handwritten Digits. An optical scanner determining which of the digits 0,1,...,9 an individual has written in a square box. The system may of course be wrong

Chapter 3: Probability -- Counting Techniques

If sample space S={MATH} and each simple event has probability 1/n (i.e. is "equally likely"), then a compound event A consisting of r simple events, has probability MATH

Example: Roll 3 fair dice. There are 6$\times $6$\times $6=216 possible outcomes, all equally likely. What is the probability of A= "three dice are different numbers"? Number of triples e.g. (1,2,3) with different numbers is 6$\times $5$\times $4 so

MATH

  1. The Addition Rule:

    Suppose we can either do job 1 in p ways or job 2 in q ways. Then we can do either job 1 or job 2, (not both) in p+q ways.

  2. The Multiplication Rule:

    Suppose we can do job 1 in p ways and an unrelated job 2 in q ways. Then we can do both job 1 and job 2 in p$\times $q ways.

Permutations

n distinct objects are to be "drawn" or ordered from left to right (Order matters objects are drawn without replacement)

  1. The number of ways to arrange n distinct objects in a row is n(n-1)(n-2)$\cdots $(2)(1)=n!

  2. The number of ways to arrange r objects selected from n distinct objects is
    n(n-1)(n-2)...(n-r+1)=n$^{(r)}$

Combinations

The number of ways to choose r objects from n (a set, i.e. order doesn't matter) is denoted by MATH. For n and r both non-negative integers with $n\geq r,$ MATH

Example:

We form a 4 digit number by randomly selecting and arranging 4 digits from {1, 2, 3,…7} without replacement. Find the probability the number formed is (c) an even number over 3000.

S={1234,1235,...} has $7^{(4)}$ points.

Order filled 2 3 4 1
Number of choices 5 5 4 1 (ends in 2)

(5$\times $5$\times $4$\times $1) or

Order filled 2 3 4 1
Number of choices 4 5 4 2 (ends in 4 or 6)

MATH

Total=MATH

Beware of Double Counting!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

Example: (p.26)

Find the probability a bridge hand (13 cards picked at random from a standard deck) has

  1. 3 aces

  2. at least 1 ace

A1: P(hand has 1 ace or more) = MATH (is this correct?)

A2: $1-P[$no ace]MATH

  1. Suppose $4$ passengers get on an elevator at the basement floor. There are $9$ floors above (numbered $1,2,3,$..., $9$) where passengers may get off.

    1. Find the probability

      1. no passenger gets off at floor 1

      2. A="passengers all get off at different floors"

      3. B="all passengers get off at the same floor"

      4. C="no-one gets of at the first two floors"

      5. D="all passengers get off at floors 1 or nine"

      6. G="exactly one person gets of at each of floors 3 and 4"

    2. What assumption(s) underlies your answer to (a)? Comment briefly on how likely it is that the assumption(s) is valid.

e.g. S={1111,1112,...,9999} has $9^{4}$ points. Of these $9^{(4)}$ are such all passengers get off on different floors.


  1. The Birthday Problem. Suppose there are $r$ persons in a room. Ignoring February 29 and assuming that every person is equally likely to have been born on any of the 365 other days in a year, find the probability that no two persons in the room have the same birthday. Find the numerical value of this probability forMATH. A: MATH=0.411,0.891,0.994 http://www.ship.edu/%7Edeensl/mathdl/stats/Birthday.html





Chapter 4: Probability Rules and Conditional Probability

An event A "occurs" if one of the simple events in A occur.

e.g. throw a die, S={1,2,3,4,5,6} A="Die is even"={2,4,6}.


Combinations of events A,B.

Union: A$\cup $B means A OR B (or possibly both) occurs.
c4.f1a.ps

Intersection: A$\cap $B (usually written as AB in probability) is shaded.
AB means A and B both occur.


c4.f2a.ps

Complement: $\bar{A}$ =points in S which are not in A
$\bar{A}$ means "A does not occur"


c4.f3a.ps

USING VENN DIAGRAMS:

EXAMPLE SECTION 4.1: Students finishing 2A Math:

22% have a math average $\geq $ 80%,

24% have a STAT 230 mark $\geq $ 80%,

20% have an overall average $\geq $ 80%,

14% have both a math average and STAT 230 $\geq $ 80%,

13% have both an overall average and STAT 230 $\geq $ 80%,

10% have all 3 of these averages $\geq $ 80%,

67% have none of these 3 averages $\geq $ 80%.

Find the probability a randomly chosen math student finishing 2A has math and overall averages both $\geq $ 80% and STAT 230 $<$ 80%.

Events of interest.

A = {Math average$\geq $80%}
B = {Overall average$\geq $80%}
C = {Stat230 $\geq $80%}

Find __________________

Definition:

Events $A_{1},A_{2},....$ are mutually exclusive (disjoint) if, for all $i\neq j,$ MATH (the empty event)

(at most one of these can ``happen'')

Definition:

A probability measure is a set function P($.)$ defined on subsets A of S (i.e. assigning a real number value P(A) to subsets A$)$ such that:

  1. P(S)=1

  2. $P(\varphi )=0,$ $\varphi =$empty event.

  3. If $A_{1},A_{2},....$ are mutually exclusive events, ( a finite number or an infinite sequence) MATH

Other rules governing probability models:

  1. For any event A, 0$\leq $P(A)$\leq $1

  2. If A and B are two events with A$\subseteq $B, then P(A)$\leq $P(B).

  3. $P(\overline{A})=1$-P(A)

  4. For any two events A$,$BMATH

  5. For any three events A$,$B$,C$MATH




Example: Roll a die 3 times. Find the probability of getting at least one 6.

MATHThen $P(A\cup B\cup C)$=P(A)+P(B)+P(C)=MATH(it this right?)

Example: A box contains 10 defective chips and 15 good ones. Twelve chips are drawn at random (without replacement). Find an expression for the probability that at most 5 defective chips are drawn.







Let $A_{x}$="Exactly x defective chips are drawn".




Want P(A)MATH whereMATHThen P(A)MATH




Intersection of Events and Independence

Events A and B are independent if MATH

Indiana Jones: Raiders of the Lost Ark


Figure


Figure


Figure

What is P(AW)?? P(A)=0.3, P(W)=0.4.

What is P(A\W) and when is it the same as P(A)? MATH

Definition: Events A and B are independent if MATH

or if P(A\B)=P(A) where MATHprovided denominator is greater than 0.




QUIZ UP TO & INCLUDING SECTION 4.3

Definition. Three or more events A$_{1},$A$_{2},...$A$_{n}$ are mutually independent if for every choice of $k, $ MATH MATH

Example: Toss a die twice. Let A = {first toss is a 3} and B = {the total is 7}. Are A and B independent? Change B to B$=$ {total is 8}




Why not call events independent if every pair of events is independent?

Coin Example

Two fair coins are tossed. Let A$=$ first coin is heads, B = second coin is heads, $C$= we obtain exactly one head. Then A is independent of B and A is independent of $C.$ Are A$,$B$,C$ mutually independent?




Properties.

A$,$B independent implies

    1. A,MATH are independent,

    2. MATH,B are independent

    3. MATH,MATH are independent

Conditional Probability.

We are interested in the probability of the event $~$A$~$ but we are given some relevant information, namely that another event $~$B$~$ occurred.

Revise the probabilities assigned to points of S in view of this new information. If the information does not effect the relative probability of points in B then the new probabilities of points outside of B should be set to 0 and those within B rescaled to add to 1.

Definition: Conditional Probability:

For an event B with P(B)>0, define the conditional probability MATHThis is another probability measure on the same sample space S. Note that P(B\B)=1, and P(MATH\B$)$=0.




Multiplication Rule: P(AB)=P(B)P(A\B)

  1. Let A and B be events defined on the same sample space, with P(A)=0.3, P(B)=0.4 and P(A\B)=0.5. Given that event B does not occur, what is the probability of event A?

  2. Three students Jane, Sue and Tom write an exam. Jane has a 90% chance of passing the exam, Sue has a 70% chance of passing and Tom has an 80% chance of passing. If exactly one of the students failed the exam, what is the probability it was Sue?

    MATH

Note that we are given the event MATH

Multiplication Rules (The sequel)

Let A,B,C,D be events in a sample space such that A,AB,ABC have positive probability. Then
MATH

Partition Rule

Let $A_{1},\dots ,A_{k}$ be a partition of the sample space S into disjoint (mutually exclusive) events such that MATHS. Let B be an arbitrary event in S. Then MATH


  1. Many methods of spam detection are based on features that appear more frequently in spam than in regular email. Conditional probability methods are then used to decide whether an email is spam or not.
    Define the following events associated with a random email message.
    B = "Message is spam"
    MATH = "Message is not spam
    A = "Message contains the word Viagra"

    If we know the values of the probabilities P(B), P(A\B) and P(A\$\overline{B}$), then we can find the probabilities P(B\A) and P(MATH\A).

Review: "at least one of the events A,B,C,D"=MATH

"all of the events A,B,C,D"=ABCD

"none of the events A,B,C,D"=MATH

How do we calculate probabilities of the above?




Choosing appropriate Sample spaces: e.g. 3.8, 3.10, 3.11


Example (p. 48):
In an insurance portfolio 10% of the policy holders are in Class $A_{1}$ (high risk), 40% are in Class $A_{2}$ (medium risk), and 50% are in Class $A_{3}$ (low risk). The probability a Class $A_{1}$ policy has a claim in a given year is .10; similar probabilities for Classes $A_{2}$ and $A_{3}$ are .05 and .02. Find the probability that if a claim is made, it is for a Class $A_{1}$ policy.

Tree Diagrams

Tree- each path represents a sequence of events.

On a branch write the conditional probability of that event given preceding events

The probability at a node of the tree=product of probabilities on the branches leading to the node

= probability of the intersection of the events leading to it





tree.bmp