Who wants to be a Millionaire?

Computer Science Level 5

Consider the following version of the who wants to be a millionaire game: You are given a list $\mathcal{L}$ , consisting of $N$ questions. You can answer the questions in any order you like. The probability that you answer the $i$ th question in $\mathcal{L}$ correctly is $p_i$ and you win $v_i$ dollars for answering it correctly , $i=1,2,\ldots, N$ . However, if you give a wrong answer to a question, the game ends immediately and you leave the game with the money you have already won (if any). Naturally, your objective is to answer the questions in an optimal order $\mathcal{S}$ so as to maximize your expected winning. To be clear, $\mathcal{S}$ is a permutation of natural numbers from $1$ to $N$ which corresponds to your order of answering questions from the list $\mathcal{L}$ , e.g., if $\mathcal{S}(1)=1000$ , it means that you answer the $1000$ th question first from $\mathcal{L}$ and so on.

Here is the data for $N=10000$ questions, where the row numbers (on the left) correspond to question numbers in the list $\mathcal{L}$ , the first column corresponds to the probabilities $p_i$ 's and the second column corresponds to the prizes $v_i$ 's.

Find $\mathcal{S}(2015)$ in the optimal sequence.

The answer is 3132.

2 solutions

Abhishek Sinha
Oct 8, 2015

We will be solving the problem using an interchange argument . Consider any sequence of answering the questions $\mathcal{S}$ . Let $X_{(i)}$ denote the reward obtained from the $i$ th question in the sequence $\mathcal{S}$ . Hence, the total expected reward $\mathbb{E}R$ obtained by answering the questions in the order of $\mathcal{S}$ is given by : $\mathbb{E}R=\mathbb{E}\sum_{i=1}^{N}X_{(i)}=\sum_{i=1}^{N}\mathbb{E}X_{(i)}$ Where we have used Linearity of Expectations in the last equality. Since we obtain a reward associated with the question $(i)$ iff we answer all first $i$ questions correctly, it follows that $\mathbb{E}X_{(i)}=(\prod_{j=1}^{i}p_{(j)} )v_{(i)}$ . Now consider another sequence $\mathcal{S}'$ , which is the same as $\mathcal{S}$ , except that we interchange the order of answering $i$ th and $i+1$ th questions, for some given $i$ . Hence, from the above equation, it follows that the total expected reward $\mathbb{E}R'$ for answering the questions in the order of $\mathcal{S}'$ is given by $\mathbb{E}R'=\mathbb{E}R +\big( \prod_{j=1}^{i-1}p_j\big)\big(p_{(i+1)}v_{(i+1)}+p_{(i)}p_{(i+1)}v_{(i)}-p_{(i)}v_{(i)}-p_{(i)}p_{(i+1)}v_{(i+1)}\big)$ If $\mathcal{S}$ is the optimal sequence, we must have $\mathbb{E}R\geq \mathbb{E}R'$ . This implies $p_{(i)}v_{(i)}+p_{(i)}p_{(i+1)}v_{(i+1)}\geq p_{(i+1)}v_{(i+1)}+p_{(i)}p_{(i+1)}v_{(i)}$ Rearranging, $p_{(i)}\big(1-p_{(i+1)}\big)v_{(i)}\geq p_{(i+1)}\big(1-p_{(i)}\big)v_{(i+1)}$ i.e., $\frac{p_{(i)}}{1-p_{(i)}}v_{(i)} \geq \frac{p_{(i+1)}}{1-p_{(i+1)}}v_{(i+1)}$ This implies that the optimal sequence is obtained by sorting the questions in the list $\mathcal{L}$ (in decreasing order) according to their scores $s_i=\frac{p_iv_i}{1-p_i}$ . Once we perform this sorting in the given data, we find that the $2015$ th position in the optimal list is occupied by the $3132$ nd question.

Abdelhamid Saadi
Oct 10, 2015

Based on the same observation as Abhishek Sinha , this is a program in python 3.5

f = open('data')
txtlst = f.readlines()
f.close()
data = []
for k in range(len(txtlst)):
    [a, b] = [eval(x) for x in txtlst[k].split()]
    data.append([k + 1, a, b , a*b/(1 - a)])

sdata = sorted(data, key=lambda x: x[3], reverse=True)
print(sdata[2014][0])

Who wants to be a Millionaire?

The answer is 3132.

2 solutions

0 pending reports