CS228: Probabilistic Methods in AI
Winter 2008
Weekly Quiz


Overall Score Statistics:
Mean Median Mode LowestHighest
85.25% 90% 100% (17) 40% 100%

Score Frequency Table
Frequency of Scores
ScoreNumber of Students
100%17
90%16
80%13
70%10
60%1
50%1
40%1

Detailed Question/Answer Statistics:
Question 1: Which independencies hold in the following 2-TBN(Note, it may be helpful to draw the unfolded DBN for several slices):

Answers: 1. Weathert is independent of Velocityt given Observations1...t 2. Failuret is independent of Velocityt given Observations1...t 3. Weathert is independent of Velocityt given Weathert-1 and Observations1...t 4. Weathert is independent of Locationt given Velocityt and Observations1...t 5. None of the above
TypeAnswerResponsesPercent
Correct:Weather<sup>t</sup> is independent of Velocity<sup>t</sup> given Weather<sup>t-1</sup> and Observations<sup>1...t</sup>4881%
Distractor:Weather<sup>t</sup> is independent of Velocity<sup>t</sup> given Observations<sup>1...t</sup>00%
Distractor:Failure<sup>t</sup> is independent of Velocity<sup>t</sup> given Observations<sup>1...t</sup>12%
Distractor:Weather<sup>t</sup> is independent of Location<sup>t</sup> given Velocity<sup>t</sup> and Observations<sup>1...t</sup>00%
Distractor:None of the above1017%
Question 2: Which of the following template clique-trees allows for exact filtering in given 2-TBN?



Answers: 1. A 2. B 3. C 4. D 5. E 6. None of the above
TypeAnswerResponsesPercent
Correct:A4068%
Distractor:B12%
Distractor:C58%
Distractor:D35%
Distractor:E47%
Distractor:None of the above610%
Question 3: What makes inference in DBNs difficult?
Answers: 1. (a) As t grows large, we generally lose independence properties of the form (X(t) &perp Y(t) | Z(t)). 2. (b) Standard clique tree inference will not work in a DBN 3. (c) In many networks, maintaining a belief state over the variables requires representing a full joint 4. (b) and (c) 5. (a) and (c)
TypeAnswerResponsesPercent
Correct:(a) and (c)5085%
Distractor:(a) As <i>t</i> grows large, we generally lose independence properties of the form <i>(X<sup>(t)</sup> &perp Y<sup>(t)</sup> &KPHHASH124; Z<sup>(t)</sup>)</i>.23%
Distractor:(b) Standard clique tree inference will not work in a DBN00%
Distractor:(c) In many networks, maintaining a belief state over the variables requires representing a full joint610%
Distractor:(b) and (c)12%
Question 4: What is the key difference between particle filtering and likelihood weighting for DBNs?
Answers: 1. In particle filtering, the particles are reweighed at each time step. 2. In particle filtering, the particles are resampled at each time step. 3. The particle filter performs exact inference at each time step. 4. Particle filtering starts with more particles.
TypeAnswerResponsesPercent
Correct:In particle filtering, the particles are resampled at each time step.4983%
Distractor:In particle filtering, the particles are reweighed at each time step.915%
Distractor:The particle filter performs exact inference at each time step.00%
Distractor:Particle filtering starts with more particles.00%
Question 5: In general, if we initialize a bootstrap particle filter with a set of particles, where all particles are the same, i.e. x(0)[m] = x, for all m=1,...,M, then as the algorithm proceeds...
Answers: 1. the belief state and all particles will remain unchanged, x(t)[m] = x. 2. the belief state will change, but all particles will be equal, x(t)[m] = x(t)[n]. 3. the belief state will change and the particles will become different, x(t)[m] != x(t)[n].
TypeAnswerResponsesPercent
Correct:the belief state will change and the particles will become different, <i>x<sup>(t)</sup>[m] != x<sup>(t)</sup>[n]</i>.5085%
Distractor:the belief state and all particles will remain unchanged, <i>x<sup>(t)</sup>[m] = x</i>.12%
Distractor:the belief state will change, but all particles will be equal, <i>x<sup>(t)</sup>[m] = x<sup>(t)</sup>[n]</i>.814%
Question 6: Recall that the conditional entropy of X given Y is HP(X | Y) = -EP[log P(X | Y )]. Which of the following is true as we condition on more variables?
Answers: 1. HP(X | Y, Z) <= HP(X | Y) 2. HP(X | Y, Z) >= HP(X | Y) 3. HP(X | Y, Z) = HP(X, Y) 4. Cannot be determined in general.
TypeAnswerResponsesPercent
Correct:H<sub>P</sub>(X &#124; Y, Z) <= H<sub>P</sub>(X &#124; Y)5492%
Distractor:H<sub>P</sub>(X &KPHHASH124; Y, Z) >= H<sub>P</sub>(X &KPHHASH124; Y)35%
Distractor:H<sub>P</sub>(X &KPHHASH124; Y, Z) = H<sub>P</sub>(X, Y)00%
Distractor:Cannot be determined in general.23%
Question 7: D is the relative entropy.
I(X ; Y) is equal to:
Answers: 1. (a) D( P(X,Y) || P(X | Y )) 2. (b) D( P(X,Y) || P(X) P(Y) ) 3. (c) I(Y;X) 4. (a) and (c) 5. (b) and (c)
TypeAnswerResponsesPercent
Correct:(b) and (c)4169%
Distractor:(a) D( P(X,Y) &KPHHASH124;&KPHHASH124; P(X &KPHHASH124; Y ))12%
Distractor:(b) D( P(X,Y) &KPHHASH124;&KPHHASH124; P(X) P(Y) )47%
Distractor:(c) I(Y;X)712%
Distractor:(a) and (c)610%
Question 8: When does Bayesian prediction converge to the MLE estimate, using a Dirichlet prior? Recall that M is the total number of observed samples, and M' is the equivalent sample size, where , for hyperparameters .
I. When M -> infinity
II. When M' -> 0
Answers: 1. I 2. II 3. I and II 4. None of the above
TypeAnswerResponsesPercent
Correct:I and II5492%
Distractor:I35%
Distractor:II23%
Distractor:None of the above00%
Question 9: Consider an experiment where we toss a thumbtack multiple times (independently). We model the probability distribution as

If we observe 20 heads and 30 tails, what is the MLE of theta?
Answers: 1. 20 2. 30 3. 0.4 4. 0.6

TypeAnswerResponsesPercent
Correct:0.459100%
Distractor:2000%
Distractor:3000%
Distractor:0.600%
Question 10: In the question above assume we now add a Dirichlet(25, 25) prior. What is the posterior estimate for theta?
Answers: 1. 45 2. 55 3. 0.45 4. 0.55 5.
TypeAnswerResponsesPercent
Correct:0.455898%
Distractor:4512%
Distractor:5500%
Distractor:0.5500%
Distractor:00%
CS228 Class Page
Powered by QuizTest v3.0.31