Review 

Random Phenomena 

We call a phenomenon "random" if individual outcomes are uncertain but there is, more the less, a regular distribution of relative frequencies in a large numbers or repetitions. By 'regular' we mean that the distribution is stable or unchanging. 

Random phenomena are not chaotic (because they are not regular) 

Probability = long run relative frequency 

- infrequent cases (personal probability) 

Ultimately we want to study the origin of lists of numbers — those that come from probability models, with a view to understand the process by which data in the list were generated. 

Mathematics of Probability 

  1. Experiment/random phenomena 
  2. Sample space - set 5 of all possible outcomes e.g. flip a coin 
  3. Towards a consistent theory of assigning probabilities

Event = a set of outcomes of a random phenomena

(Individual outcomes in S are called 'outcomes' or elementary events.) 

 

Assigning Probabilities to Events 

Rule 1 Any probability P(A) is a number between 0 and 1

0 # P(A) #

Rule 2 The collection S of all possible outcomes (elementary events) has probability 1. P(S) = 1 

e.g. Assign probability p to flip of a coin (e.g. P = .5 if the coin is fair).

P(H) = P(T) = ½. P (H or T) = P(S) = 1 

 

Probabilities in a Finite Sample Space

 

If A is any event consisting of (disjoint) outcomes (= elementary events) then Prob. of A= sum of probabilities of individual outcomes 

if elementary events = outcomes are equally probable then: 

 

e.g. roll die 

Venn Diagrams {event, disjoint events) 

Rule 3 If events A and B are disjoint the:

P(A c B) = P(A or B) = P(B or A) = P(A) + P(B) 

Complement of Event

A and Ac are disjoint sets which partition the sample space S into 2 parts. 

Rule 4 Complement 

For any event A, the probability that A does not occur is:

P(Ac) = 1 - P(A) 

Dependent and Independence 

{A, Ac} are not independent - if one happens the other can’t 

Independence: Events A and B are independent if knowing whether A occurs does not change the probability that B occurs. 

e.g. 2 flips of a fair coin - flip a coin/check hair colour 

Rule 5 (independent events: multiplication) 

If events A and B are independent then:

P(A Ù B) = P(A and B) = P(A) × P(B) 

That is the probability that BOTH will occur = product of probability that each will occur. 

Rule 3 (extended)

P(A È B È C) = P(A) + P(B) + P(C) 

If A and B and C are all disjoint. 

General Rule 

P(A or B) = P(A È B) = P(A) + P(B) - P(A Ç B) 

A and B don’t need to be disjoint. 

 

Contingency Table 

Conditional Probability P(A|B)…What does this mean? 

Joint Probability P(A Ç B)… A and B not necessarily independent

e.g. … police force  

Total Probability:  

let A1, A2, …Am be a collection of disjoint events so that A1 È A2 È A3 ÈÈ Am = S 

Then P(B) = P(B Ç A1) + P(B Ç A2) + … + P(B Ç Am

Conditional Probability 

Venn diagram shows this is just a restriction of S. 

General Multiplication Rule for Intersections 

P(A Ù B) = Prob. (A and B) = P(A) P(B|A)

= P(B) P(A|B) 

Reverse Conditional Probability (Bayes Problem) 

Usually given P(A), P(B), P(C|A), P(C|B) 

Asked to find P(A|C) or P(B|C) 

Steps: get P(C) = P(C|A) × P(A) + P(C|B) × P(B)

P(A Ù C) + P(B Ù C)

 

Random Variables 

A random variable is a variable whose value is the numerical outcome of a random phenomena. 

Discrete Random Variables 

X:

 

 

 

 

outcome

x1…xn

 

 

probability

p1…pn

¬ probability dist. å pi = 1 pi ³ 0

 

 Continuos Random Variable 

List of numbers = realization of random variables (maybe all with the same distribution) 

MEAN 

m x = å pixi (derive from Law of Large Numbers)

= long run average 

Law of Averages - how long does it take? Law of Small Numbers 

m a+bx = a + bm x

m x+y = m x + m y  

Variances of Random Variables 

if X and Y are independent

 

Random variables related to counts and proportion.

Binomial Setting

Binomial B(n,p)

 

å probability of any one way

ä

# of ways

 

 

Use of tables 

 

Approximation of Binomial Probabilities by normal 

n large enough

p close enough to .5 

 

Can also work for events such as P(x k) or P(X > k) 

Proportions 

X is B(n,p)

for

 

Population 

Samples 

Sample Statistics — Sample mean

 

Central Limit Theorem distribution of  

is an estimator, is a point estimate 

is unbiased for  

Illustrated of CLT for various population