The “probability to win” is hard to estimate…

Real-time computation (or estimation) of the “probability to win” is difficult. We’ve seem that in soccer games, in elections… but actually, as a professor, I see that frequently when I grade my students.

Consider a classical multiple choice exam. After each question, imagine that you try to compute the probability that the student will pass. Consider here the case where we have 50 questions. Students pass when they have 25 correct answers, or more. Just for simulations, I will assume that students just flip a coin at each question… I have $n$ $n$ students, and 50 questions |123|set.seed(1)n=10M=matrix(sample(0:1,size=n*50,replace=TRUE),50,n)|

Let $X_{i,j}$ $X_{i,j}$ denote the score of student $i$ $i$ at question $j$ $j$ . Let $S_{i,j}$ $S_{i,j}$ denote the cumulated score, i.e. $S_{i,j}=X_{i,1}+\cdots+X_{i,j}$ $S_{i,j}=X_{i,1}+\cdots+X_{i,j}$ . At step $j$ $j$ , I can get some sort of prediction of the final score, using $\hat{T}_{i,j}=50\times S_{i,j}/j$ $\hat{T}_{i,j}=50\times S_{i,j}/j$ . Here is the code |12|SM=apply(M,2,cumsum)NB=SM*50/(1:50)|

We can actually plot it |1234|plot(NB[,1],type=”s”,ylim=c(0,50))abline(h=25,col=”blue”)for(i in 2:n) lines(NB[,i],type=”s”,col=”light blue”)lines(NB[,3],type=”s”,col=”red”)|

But that’s simply the prediction of the final score, at each step. That’s not the computation of the probability to pass !

Let’s try to see how we can do it… If after $j$ $j$ questions, the students has 25 correct answer, the probability should be 1 – i.e. if $S_{i,j}\geq 25$ $S_{i,j}\geq 25$ – since he cannot fail. Another simple case is the following : if after $j$ $j$ questions, the number of points he can get with all correct answers until the end is not sufficient, he will fail. That means if S_{i,j}+(50-i+1)< 25[/latex] the probability should be 0. Otherwise, to compute the probability to sucess, it is quite straightforward. It is the probability to obtain at least [latex]25-S_{i,j}[/latex] correct answers, out of [latex]50-j[/latex] questions, when the probability of success is actually [latex]S_{i,j}/j[/latex]. We recognize the survival probability of a binomial distribution. The code is then simply

1234567

PB=NB*NAfor(i in 1:50){ for(j in 1:n){ if(SM[i,j]>=25) PB[i,j]=1 if(SM[i,j]+(50-i+1)<25) PB[i,j]=0 if((SM[i,j]<25)&(SM[i,j]+(50-i+1)>=25)) PB[i,j]=1-pbinom(25-SM[i,j],size=(50-i),prob=SM[i,j]/i) }}

So if we plot it, we get |1234|plot(PB[,1],type=”s”,ylim=c(0,1))abline(h=25,col=”red”)for(i in 2:n) lines(PB[,i],type=”s”,col=”light blue”)lines(PB[,3],type=”s”,col=”red”)|

which is much more volatile than the previous curves we obtained ! So yes, computing the “probability to win” is a complicated exercice ! Don’t blame those who try to find it hard to do !

Of course, things are slightly different if my students don’t flip a coin… this is what we obtain if half of the students are good (2/3 probability to get a question correct) and half is not good (1/3 chance),

If we look at the probability to pass, we usually do not have to wait until the end (the 50 questions) to know who passed and who failed

PS : I guess a less volatile solution can be obtained with a Bayesian approach… if I find some spare time this week, I will try to code it…

Related