CS228: Probabilistic Methods in AI
Winter 2008
Weekly Quiz


Overall Score Statistics:
Mean Median Mode LowestHighest
90.82% 90% 100% (27) 50% 100%

Score Frequency Table
Frequency of Scores
ScoreNumber of Students
100%27
90%23
80%4
70%4
60%2
50%1

Detailed Question/Answer Statistics:
Question 1: In the clique tree below which of the following starting message - passing orders is valid?



Answers: 1. C1->C2, C2->C3, C3->C4, C3->C5 2. C1->C2, C2->C3, C5->C3, C3->C4 3. C4->C3, C3->C5, C3->C2 4. C4->C3, C3->C5, C2->C3
TypeAnswerResponsesPercent
Correct:C1->C2, C2->C3, C5->C3, C3->C461100%
Distractor:C1->C2, C2->C3, C3->C4, C3->C500%
Distractor:C4->C3, C3->C5, C3->C200%
Distractor:C4->C3, C3->C5, C2->C300%
Question 2: In the clique tree above, what is the form of the message from clique 3 to clique 4 where is the initial potential of clique i.
Answers: 1. 2. 3. 4.
TypeAnswerResponsesPercent
Correct:<img src="http://www.stanford.edu/class/cs228/Images/Quiz3/new_qn/img5.png" >5997%
Distractor:<img src="http://www.stanford.edu/class/cs228/Images/Quiz3/new_qn/img4.png" >12%
Distractor:<img src="http://www.stanford.edu/class/cs228/Images/Quiz3/new_qn/img6.png" >00%
Distractor:<img src="http://www.stanford.edu/class/cs228/Images/Quiz3/new_qn/img7.png" >12%
Question 3: Consider the pairwise MRF, H, shown below with potentials over {A,B}, {B,C}, {A,D}, {B,E}, {C,F}, {D,E} and {E,F}.

Which of the following are not valid cluster graphs for H?


Answers: 1. (1) only 2. (2) only 3. (3) only 4. (2) and (3)

TypeAnswerResponsesPercent
Correct:(2) only5285%
Distractor:(1) only35%
Distractor:(3) only12%
Distractor:(2) and (3)58%
Question 4: Which of the following is true regarding the differences between belief propagation over a clique tree and a cluster graph?
Answers: 1. using a cluster graph is always preferable 2. using a clique tree and a cluster graph both result in exact answers, but using a cluster graph may be faster 3. clique tree BP is exact but may be intractable, while cluster graph BP is approximate but is often tractable even when clique tree is not 4. using a clique tree is preferable because it is exact, but one may not exist for all graphical models
TypeAnswerResponsesPercent
Correct:clique tree BP is exact but may be intractable, while cluster graph BP is approximate but is often tractable even when clique tree is not5489%
Distractor:using a cluster graph is always preferable00%
Distractor:using a clique tree and a cluster graph both result in exact answers, but using a cluster graph may be faster12%
Distractor:using a clique tree is preferable because it is exact, but one may not exist for all graphical models58%
Question 5: Which of the following ideas can not be used to help handle the problem of non-convergence of Generalized Belief Propagation?
Answers: 1. Stopping BP after a fixed amount of time regardless of complete convergence 2. Removing edges from the cluster graph until convergence is achieved 3. Intelligent message scheduling heuristics 4. Dampening changes to messages in successive iterations%%
TypeAnswerResponsesPercent
Correct:Removing edges from the cluster graph until convergence is achieved5793%
Distractor:Stopping BP after a fixed amount of time regardless of complete convergence12%
Distractor:Intelligent message scheduling heuristics23%
Distractor:Dampening changes to messages in successive iterations%%00%
Question 6: Consider the process of rejection sampling to generate samples from the posterior distribution P(X | e). If we want to keep M samples, what is the expected number of samples that would need to be drawn from P(X)?
Answers: 1. M * P(X | e) 2. M * P(e) 3. M * (1 - P(e)) 4. M / P(e)
TypeAnswerResponsesPercent
Correct:M / P(e)5590%
Distractor:M * P(X &KPHHASH124; e)00%
Distractor:M * P(e)35%
Distractor:M * (1 - P(e))35%
Question 7: If we are given 123 independent samples {X[1], ..., X[123]} from a Bernoulli distribution with p = .15, from which we compute an estimate what bounds can we give for the probability that we will be within 0.1 of the correct value of p? That is, what is the smallest d such that
Answers: 1. 0.1709 2. 0.5846 3. 4.143e-7 4. 1.0%%
TypeAnswerResponsesPercent
Correct:0.17095387%
Distractor:0.5846610%
Distractor:4.143e-712%
Distractor:1.0%%00%
Question 8: Consider using likelihood weighting sampling in the network below to estimate the conditional probability P(A|C=0). Assume we have just sampled a particle with A = 0. What will be the weight of the particle?



Answers: 1. 0.02 2. 0.1 3. 0.5 4. 0.2
TypeAnswerResponsesPercent
Correct:0.15184%
Distractor:0.0235%
Distractor:0.512%
Distractor:0.2610%
Question 9: Consider using unnormalized importance sampling to estimate the expected value of some function relative to P(X,Y) defined by a Bayesian network over binary-valued variables, X and Y.

Suppose we sample from a proposal distribution Q(X,Y) = Q(X)Q(Y) with Q(x0) = Q(y0) = 0.6. We draw sample <x1, y0>. What correction weight should be applied to the sample?
Answers: 1. (0.6 * 0.5) / (0.4 * 0.6) 2. (0.4 * 0.6) / (0.6 * 0.5) 3. (0.4 * 0.7) / (0.4 * 0.6) 4. (0.4 * 0.6) / (0.4 * 0.7)

TypeAnswerResponsesPercent
Correct:(0.4 * 0.7) / (0.4 * 0.6)5489%
Distractor:(0.6 * 0.5) / (0.4 * 0.6)00%
Distractor:(0.4 * 0.6) / (0.6 * 0.5)12%
Distractor:(0.4 * 0.6) / (0.4 * 0.7)610%
Question 10: If we are doing importance sampling in the following Bayesian Network, which operations are necessary to create a mutilated network for evidence C=c?

.

i) remove edge C -> D, C->E
ii) remove edges A->C, B->C
iii) alter the CPDs of D and E
iv) remove edge B->E
v) alter the cpd of C
Answers: 1. i) ii) and v) 2. i) and iii) 3. ii) and v) 4. iii) and iv) 5. all of them

TypeAnswerResponsesPercent
Correct:ii) and v)5895%
Distractor:i) ii) and v)23%
Distractor:i) and iii)12%
Distractor:iii) and iv)00%
Distractor:all of them00%




CS228 Class Page
Powered by QuizTest v3.0.31