One of my all-time favorite problems:
I have two boxes. Each has some positive amount of money in it, but I will give you no information about the possible dollar amounts other than the fact that one box has exactly twice the amount of money in it as the other. You randomly select one of the two boxes, open it, and find $100 inside. I now give you the option of keeping the $100 or switching boxes with me and keeping whatever's inside the other box. Which should you choose?
If you'd like a related problem that might help you think about this differently, click below.
(show)
However, knowing I would be suspicious of your willingness to trade, perhaps you are engaging in reverse psychology. But, again, you don't have to offer, so this would be pointless, unless you just like playing head games and don't mind losing the $100. Answer -- don't take the trade.
If you are blinded, then perhaps it is always better to take the exchange. I mean, $100 is not that much, and there is no upper bound on the contents of the boxes. There is a lower bound ($0), so I have more room to move up than room to move down. Giving the limitations on the money supply, the upper bound is not quite infinite, and perhaps you are not that rich, but surely you have a lot more than $100 to offer, and I can afford to risk the $100.
Take the other box.
When you randomly select, your expected return is 1.5x, where x is the amount of money in lesser box. (The total amount of money is x+2x; you get, on average, 50% of that, or 1.5x.)
Once you see the money, you don't know if you've received x or 2x, but it doesn't matter, because the frequency of getting either is .5. So you'll get, on average, .5x or x, or .75x.
If you switch, though, you may get 2x or you may get .5x. On average, it's 1.25x. So, you gain .5x by switching.
After the $100 is revealed, you know that x=100 or x=50. You also know that your expected return is 200+50= $250, divided by two, or $125 if you switch and $100 if you don't.
First scenario: You get $100. Switching means that you have a 50% probability of gaining $100 more (changing a $100 box for a $200 box) and a 50% probability of losing $50 of what you have (exchaning a $100 box for a $50 box.
Second scenario: You don't have any knowledge, but you're still asked to switch, with a 50% chance of getting 2x your current amount of money, and a 50% chance of getting only .5x your money.
In other words, in each case the potential loss is only half the potential gain. Considered from a strictly mathematical standpoint, you should switch, since the possible gain is double the possible loss.
But ultimately, in both scenarios, the solution is the same.: Keep what you have, because in both cases, the fact that the other guy is asking you to switch boxes means he got box with the lesser amount, he knows it, and he's trying to fool you.
Or maybe I just think human beings are naturally cheating, lying bastards.
On the flip side, it makes no sense to always switch as a matter of policy. You've got a 50% chance of getting the "right" box on the first pick, so you can't have a better than 50% chance of getting the "right" box on the second pick. So the EV calculations can't be the right way to think about this.
A sure thing of $100 seems better to me than a 50% chance of only $50. Guess I'm "risk-averse."
You make the first switch, because EV=$125, right? But then the other guy asks you to switch again. But you don't have any information different that before the first switch! So a the second switch has an EV=1.25*($125)!
And so on...
So there's got to be something wrong with the EV calculation.
You stick with the $100 box. This is simply because 1/2 of $100 is $50, while twice $100 is $200. Since United States Currency has a $100 bill and a $50 bill, it is more likely that you're going to get a $50 bill. This does not hold if you open the box and find 5 20s. If you find 5 20s you should switch boxes, because the person likely went to the ATM, and only has 20s, and thus cannot have put $50 in a box.
Let's call the larger amount "Big" and the smaller amount "Small".
Bayes' Rule:
Prob(Big|$100) = Prob($100|Big)*Prob(Big) / [ Prob($100|Big)*Prob(Big) + Prob($100|Small)*Prob(Small) ]
Now, we know that the prior probabilities Prob(Big) and Prob(Small) are each 0.5. But we don't know the conditional probabilities Prob($100|Big) and Prob($100|Small). I think this is where Aaron Bergman's point comes in.
Anyway, this is making my head hurt.
I might have this whole analysis wrong - somebody please tell me.
This is different from the Monty Hall problem, where you get new information when he opens the door you didn't pick.
For the second question, the answer is, do whatever you feel like -- you have no more information about the contents with the box in your hand than you did before, so it doesn't matter.
Of course, this is ignoring the potential psychological aspect, which is why this is theory, eh?
This dilemma was memorably played out in the Princess Bride when the villain asked the hero to choose which glass of wine to drink when of the two glasses had been poisoned. The hero's only safe bet: secretly poison both glasses and spend the previous five years slowly building up an immunity to poison. I like that answer better than the EV analysis.
There are two probabilistics steps in the problem. First, you chose how much to put in the boxes.
When I open the first and find X dollars, all I know is that there is a 50% chance that the expected value of randomly picking a box is 0.75X and a 50% chance that the expected value is 1.5X.
Any amount of switching doesn't change those expected values or the randomness of whether my eventual selection is the high box or the low. (For instance, if you opened the box I didn't choose and it had $100 in it, I would have the exact same information and the decision to switch would be just as inconsequential.)
The only time you should take the choice for sure is if your box had only one penny in it when you opened it because that is the smallest amount of positive value possible and the other box would have to have two pennies.
The princess bride scenario was a true "game". The initiator knows which has the iocaine powder, they both have everything to lose (and supposedly much to gain, although she was not that hot), and the initiator essentially has no active role except the requirement to drink at the end. Of course they can play head games with each other...
This is much simpler. The initiator has the option to make the offer to trade with the chooser. If he wants to win, and knows the contents of the boxes, then he will only offer if it is of benefit to him. If he is blinded, it is again a very different game from the Princess Bride scenario...
"Uniform distribution" just means there's an equal chance of every possible outcome - like rolling a fair die, where you have an 1/6th chance of rolling anything from one through six. A "uniform distribution" to choose any random number of dollars is impossible - there's no such thing as a 1/infinity chance of something happening. It's important here, because the "50/50" chance of getting $200 vs. $50 is only true if the split is equally likely after you get your information - but it's not, because it wasn't equally likely before you got the information. This is the like the screwy behavior you get with the Monty Hall problem.
How about this? If you get an odd number of dollars (or cents, if he allows stray change in the box), then you know you have to switch. If you get an even number, it can go either way. That certainly should affect your decision in the second problem, and before you get the $100 it affects the "distribution" in the first problem as well.
I only meant it was like the Monty Hall problem in the sense that the "uniform distribution" is a bad way to think about it. Should have been more explicit about that.
In the blinded case as Aaron Bergman pointed out there's too little information in the problem to make a choice. The choice of distribution is surprisingly deep for those who haven't taken a proper
sadisticsstatistics course.Huh? The original problem specifies that we are NOT told the distribution. Thus, the original problem asks what we should do, given that we DON'T know the distribution. It cannot possibly be a correct answer to say that what we should do depends upon information that the problem says we don't get.
Consider -- What if there problem had stated: "I have 2 boxes, and they are (with equal probability) either $50 and $100, or $100 and $200, but I'm not going to tell you which. You take a box and it has $100 in it. Should you switch?"
The answer to this question is NOT "It depends upon whether the other box has $50 or $200." That is part of the information that, by the call of the question, we are not given. For the same reason, the answer to the original question is not "The answer depends on the particular distribution you choose." The answer might reasonably be: "The question does not give us sufficient information to provide a conclusive answer," but the answer to the question actually asked (in which it is given that we are not told the distribution) is not "the answer depends upon the distribution." The correct answer may be (and in my opinion is) "There is not enough information to conclude with confidence which choice leads to a higher expectation value, for reasons that would take a while to explain" but that is different from "The answer depends on the particular distribution you choose."
Indeed, the claim "The answer depends on the particular distribution you choose" is LESS right than "The answer depends upon what is in the other box." If you are allowed to say that the answer is "based on" information you do not have, then surely the fact of what is in the other box is a more important fact (leads to correct choice 100% of the time) than the fact of what the original distribution is (leads to correct choice 50% of the time).
Bottom line -- if the solution is allowed to be expressed in terms of "The answer depends on [some fact that is withheld by the nature of the question]," then why is "The answer depends on the particular distribution you choose" a better answer than "The answer depends upon what is in the other box."?
[NOTE -- This is one of my all-time favorite puzzles as well. I can't let the "answer" be a one- or two-liner.]
You guys are hopeless.
The correct answer is I tell you if you don't show me how much money is in the second box I'll hit you so hard in the mouth it will wake up your dentist. I then decide with certainty.
Come on.
Now, a clever man would put the smaller amount into his own box, because he would know that only a great fool would reach for what he was given. I am not a great fool, so I can clearly not choose the box in front of him. But Kevin must have known we are not great fools, he would have counted on it, so I can clearly not choose the box in front of me . . .
Dan
That said, your point about the stakes being different in the two "games" is spot on. Which leads me to think that Slipperman's comment is the one that ultimately ought to be dispositive here - if someone gives you a box with $100, take it.
There may be a genuine mathematical way to reason this out, but I'm not sure it's been put persuasively here yet. It's out of my bailiwick, so I defer to the better brains in the e-room, or as the Dread Pirate Roberts might've said, "Aaas Yooouuuu Wiiiiish."
Bull shit. Replace $100 with $100,000,000. Most of the population will hold the $100,000,000 if it was their box, and they would switch to get the $100,000,000 if the other person's box was opened. Yeah the information you have is the same, but the decision to switch can change depending on what's at stake.
The assumption that you should trade depends on constant marginal utility of money (almost always false) and on no risk-aversion (rare).
Cullen,
Is it true that by the time they started the game, neither knew? Perhaps my memory is not so good. But why then the analysis of whether Roberts/Wesley would be the type to put the iocaine in his own glass or in his opponents (see Dan Schmutter's reference)?
The EV analysis even works in the blinded case. It doesn't matter what is in your box, and whether you know what is in your box. Or can you show a difference in the EV calculation between the two cases?
This is really a Monty Hall problem, with the difference being there's no additional information. If there was a third box (two boxes of $N, one of $2N), the MH logic would say that you should switch after the third box (one of the two you did not already have in your hands) was revealsed to be $N.
Without the additional information of the third box, there's no basis for deciding to switch.
(The Wikipedia article claims that the best strategy for an N-door Monty situation gets you a (mathematically proven) (N-1)/N chance of winning; for two doors that's 1/2.)
Actually I'm starting to think you are right and I am wrong. The blinded case is still "x", and the EV of the trade is 1.25x. So no trades beyond the first. Perhaps that is true. I'll think about it some more...
But my original answer is still undoubtedly correct: in the non-blinded case (at least not blinded with respect to the box offerer), the offerer has absolutely nothing to gain by letting you exchange if he knows he has the box with the greater value (2x). Therefore you should not exchange.
If everyone is blinded (or even just the offerer/initiator), then maybe the change is worthwhile...
You just showed why it is NOT a Monty Hall problem!! There is NO ADDITIONAL INFORMATION!!! Completely different. Let's Make a Deal -- lets not mention the poor fellow again...
Expected value makes sense in a business environment where you are doing more or less the same kind of thing over and over with the goal of amassing money. However, when you are doing something only once, I would think that an individual's own current wealth, risk aversion, etc., would do a lot more work than the expected value.
Over multiple iterations, this double-or-half game is incomplete as a model for human behavior also because it assumes a generous game provider. It is somewhat akin to a false dichotomy, in that it sets up an apparent logical conundrum that could not exist in nature. There is nothing in this game for the person providing the money under the boxes. Once we assume that the person providing the game wants to make money on it (after all, what point is expected value theory if we discount the most fundamental economic assumption), then that person has to either charge a fee or change the payout structure. Let's say they don't change the game, but charge a flat fee for each iteration ($100). With the knowledge that all people desire to make a profit over time, if the provider is in the business of doing this game, then we will be able to deduce a range of likely dollar amounts under the boxes. For instance, if we find $10,000 under the first box, we know that this outcome could only occur rarely if the flat fee is $100, and we will ignore the statement that "half of the time, the other box contains 1/2 the amoutn and half of the time it has 2X the amount" because even if strictly true as a mean average, it discounts the additional information that the dollar amount provides in the instant transaction.
It feels like I am still missing something, but that is my attempt to explain away the apparent logic-loop.
TheShadow is right. Knowing what's in the box you got does not provide you with any true information - other than what 2(x) and x/2 are.
The only real information you are provided is that you pick a box "randomly." That means you don't consider all the psycho-babble princess-bride nonesense.
Since you took the box randomly (which for purposes of the problem we assume can be done - let's say a coin toss), you've already made your best play and your worst. Keep what you've got or trade. It doesn't matter in any event.
For probability wizzes, this is essentially the point raised by Martingales. you can wiki it for more info.
It's an important example of the importance of priors. The "trick" is that people, given the statement of no knowledge of the distribution, assume that the distribution must be uniform.
So maybe I will go back, and say this is like Monty Hall after all -- he is getting advertising revenue to compensate his game!
You are being asked to make a choice which is more likely 50 and 100 or 100 and 200. That judgement depends on the wealth of the person asking the question and the amount of money available. I'd switch. Kevin seems kind of loose with the money.
OK, Symmetry, so what are the probabilities? 75/25? 60/40? We know it has to be one or the other.
Silly frequentist. ;)
As I've made clear, put this PhD probabilist down with Mr. Bergman.
You don't have a distribution here. You have 2 boxes. Not 100 boxes, not 100 possibilities within each box. This problem is bounded by the statement that one is twice the other. I repeat, there is no distribution to assume being normal or not normal. You're not smarter because you say that people assume a normal distribution. While this may be true, it does not apply here. If X is the larger value, than X/2 is the smaller. No distribution. Get it?
The "distribution" plays out like Mike Keenan said:
You are being asked to make a choice which is more likely 50 and 100 or 100 and 200.
And you can't assume that the two scenarios are 50/50. So that is why you can't do the EV...
You are correct if we assume no finite limit to the amount of money that can be put in a box. But in the "real world" there obviously has to be such a limit. Doesn't this negate your argument since a "natural" probability distribution for the natural numbers 1,2,3, ..., N where N is finite would be P(x)=1/N; 1<=x<=N?
I'm much more comfortable with the Bayesian argument!
http://mathworld.wolfram.com/ProbabilityMeasure.html
http://mathworld.wolfram.com/Measure.html
Theorem: There does not exist a uniform probability measure over the measurable space of natural numbers.
Proof: Assume there exists a uniform probability measure over the set of natural numbers N.
A uniform probability measure has the property that m(X) = k for all X such that X is a member of N. By the definition of a measure, k is a non-negative real number.
If k > 0, then there exists an integer j such that jk > 1. Define Y to be the set of the first j integers. Then, m(Y) = jk (union of disjoint subsets). m(N) >= m(Y) > 1. Since m(N) = 1, we conclude that k cannot be greater than zero.
If k = 0, then m(N) = 0 (union of disjoint subsets). Since m(N) = 1, we conclude that k cannot be zero.
Therefore, we have a contradiction, and we conclude that no such distribution exists.
It's the same problem as asking whether the sentence "this sentence is false" is false. You're given a logically impossible premise, so there is no answer.
When you find out that there is $100 in one of the boxes, you know slightly more about the situation than you did before (namely, that the other box contains either $50 or $200.) That's only relevant if you're thinking in terms of your own diminishing marginal value of money. If you're thinking economically, substitute appropriately discounted values and do the math with those instead of with the dollar figures. (And heck, if you're thinking economically, it's only one more step to work in things like the value of your pride in guessing correctly or shame in being wrong, reputational effects of being seen as a risk-taker or as conservative, and the like. Have fun with that.)
Other posters are correct to note that you should switch boxes if the likelihood is 50% that your benefactor put $100 and $200 in the boxes rather than $50 and $100. Indeed, you should switch if you believe the likelihood that the other box contains $200 is greater than one-third, since the expected monetary value of switching when the chance is exactly 1/3 is equal to $100.
But you have <i>no idea</i> what this probability is, and the problem doesn't tell you any information that helps to determine it. Perhaps there's some information in the real world that would help - you know that your benefactor is a cheapskate, or you read something in his body language as you consider your decision - but that's not in the problem.
Nice summary, not too redundant, but still the question persists -- if you don't know the probality of 50/100 vs 100/200, what can you do? Can you do any calculation at all? If not, and if you are not answering it in terms of (very simple) game theory, then the question is really not worth answering -- thanks alot, Kevan! You prob/stat whizzes need to let me know what to do in an uncertain world!
Here's why the EV analysis doesn't work:
Suppose you pick the first box. You know that there's money in it, but not how much. Then if the EV analysis were true, the expected value of the second box would be 1.25 the value of the first box REGARDLESS OF WHETHER YOU OPEN IT. But the same would be true if you chose the second box.
If the game is dishonest, you should keep the $100, because it's more likely you chose the right box. If you chose the wrong box, why would he ask you to switch.
Box X and Box Y
Only two boxes I note.
"Each has some positive amount of money in it, but I will give you no information about the possible dollar amounts other than the fact that one box has exactly twice the amount of money in it as the other."
X and Y values are positive integers
X=2Y or X=1/2Y
"You randomly select one of the two boxes, open it, and find $100 inside."
X=100
Solve for Y
Y=200 or Y=50
"I now give you the option of keeping the $100 or switching boxes with me and keeping whatever's inside the other box. Which should you choose?"
Expected Value of Y:
1. number of possible values for Y - 2
2. probability for each value is thus %50
3. Solve for expected value:
E(Y)= 1/2*200 + 1/2*50
E(Y)= 125
Now - the value of Y is not ever going to be $125 - however this Expected value thingy is telling me to take the second box.
You are choosing between two outcomes. One outcome yields 1/3rd of the total money in the boxes, the other 2/3rds. The expected value of choosing box A is 1/2 the total money -- the same as box B.
2) This is not the Monty Hall case. With MH you first choose 1 of 3 doors assumed equally likey to have the best outcome. Let's choose A. They then show that one door you didn't choose was a loser--let's say door B. But you already knew that either B or C was a loser, since there can be only one winner. (Indeed the winner could be A.) There is no new relevant information offered.
Do you want to change your choice to the 3rd door or keep your first choice? Yes you should change your choice from A to C. But you are not changing your view based on new information. Rather you have been given a different opportunity than before. Instead of choosing 1 of 3 doors, you are allowed to choose between door A and the better of door B or C. You get the outcome of 2 doors instead of one. Monty Hall is not about new information. Wikipedia gives the right answer for the wrong reason.
If the first box you opened had an odd number in it (say, $5.01), then you know it can't be the box that contains the 2x higher amount (since an odd number can't be 2x anything.) Since the box you opened has an even number, keep it. It has a higher probability of being the "2x" value, since it is even.
The second example does not give this information, since you haven't opened the box.
Why not? Probabilities are just a lack of information. The "reality" is that the likelihood of 50 and 100 is either 1 or 0. But that doesn't help. And if there is no additional information, then both scenarios are 50% likely.
Both boxes cannot hold odd amounts (to the penny or the dollar) because one box holds "exactly" twice as much as the first). This reduces the number of possibilities so that knowing one box hold $100 does affect your choice.
Before the first box is opened, the possibilities are [box 1/box 2]:
odd/even, and box 2 holds twice the amount in box 1
even/odd, and box 1 holds twice the amount in box 2
even/even, and box 1 holds twice the amount in box 2
even/even, and box 2 holds twice the amount in box 1
When box 1 is revealed to hold an even amount, we are left with these three possibilities [box 1/box 2]:
even/odd, and box 1 holds twice the amount in box 2
even/even, and box 1 holds twice the amount in box 2
even/even, and box 2 holds twice the amount in box 1
It is twice as likely that box 1 holds the greater amount. Keep the $100.
I don't have a PhD in statistics, but I'm fairly comfortable saying that "Probabilities are just a lack of information" is just not true. Known probabilities are actually a wealth of information. Uncertainty is a completely different art...
Can someone with a PhD in statistics explain further?
The paradox, of course, is that switching in the second scenario is ridiculous, since you haven't learned anything more. You could keep switching forever! And yet, if you open it, the weird thing is that you're going to want to switch... 100% of the time. So then, why did you even need to open it? But how can that be?
Well, by my reasoning, you want to trade if you know the dollar value of your box, but otherwise it doesn't matter.
I'm basing this on the idea that if the numbers are entirely unknowable and can be anything, the potential gain or loss are both equal: infinity. But if you know the value of the box, then the potential gain or loss is fixed, and the gain is twice that of the loss. This means that fixing the value does actually change the facts.
To illustrate, let's say you know the boxes aren't going to be worth more than a million dollars. In that case, whether you switch obviously depends on what your number is. High number = stay, low number = switch. Knowing the number, then, makes all the difference.
If you don't know, however, you may in fact have a box worth a million dollars. In that case, you're automatically going to lose $500,000. Which seems to illustrate something important: If you have anything from 500k to 1mil, you're going to lose a LOT of money. But if you have from 1 cent to 499k, you could actually still lose half your money, while the gain even if you double is only modest.
All of this is basically why, if you don't know the values, you don't know whether you should switch. getting rid of the limit, you could win a million dollars, you could lose 2 million, you could win 4 million, or you could lose 8 million. If you know the value, however, then there are only two options, and the one is more good than the other one is bad.
But I want to go back to the point that, if you really have *no information* about the likelihood of two choices, then each choice is 50% likely. We're not talking about the spin of quarks here, so a priori the odds that each box contains a certain dollar amount are either 1 or 0. The probability we assign just reflects our level of uncertainty about what the boxes contain. But like CB, if someone can give a theoretical explanation of why I'm wrong, I'd love to see it.
So we bound the distribution and call it uniform. Or we specify that it is 50/50. We eliminate odd-even trickery. We blind the other guy so his motivations don't enter into it. We take whatever action is necessary to deny you the pathetic pseudo-erudite cop-out of "gee, we can't know."
I still think the EV analysis has to be wrong because you have a 50% chance of picking the "right" box on the first try, and switching--whether you peek inside or not--can't change that.
Can someone who has been wasting pixels on proofs and jargon explain either 1) why the EV analysis is wrong, or 2) why I'm wrong?
www.j-bradford-delong.net/movable_type/archives/001395.html
My previous proof is wrong.
Independent of the initial box chosen, a switch would be made!
Yes, it does have to be one or the other, which tells us that the probabilities add up to 1. However, we don't know what they are specifically without information not given in the problem.
Actually, I should ammend that. The problem says "I will give you no information about the possible dollar amounts" rather than "You have no information about the possible dollar amounts." If it was the former then we'd be well and truly stuck, but since its the later we can bring in information like the fact that theres no way to put 10 trillion dollars in a box, how much money people and this person specifically tend to use to prove points, etc. From these you can try to work out probabilities based on the state of your knowledge about the universe.
Random:You don't have a distribution here. You have 2 boxes.
I disagree. As long as those two boxes could, based on the current state of our knowledge, contain more than one possible value we have a probability distribution across those values.
Regardless of "distributions", if a switch is called for even when you don't know what you have, then the original choice is moot. If offered the choice again, you would switch again. This can't be helpful.
So if the expected value analysis doesn't hold up, then what does this tell us about the probabilities to start with.....
>I think part of what's feeding the battling intuitions here is that, in fact, 100/200 is *not* equally likely as 50/100, because in general in life higher amounts are less frequent than lower amounts.<
Well, if there is a limited amount of money in the world, I think that's necessarily true. What that would mean (at the very least) would be that if you had a box for more than half of that total value, you are going to lose a FORTUNE if you switch. Meanwhile, your only possibility of winning would be if you had less than half of the world's wealth in your box. And even then, you could still lose. Thus, in the real world, your potential losings may actually be greater than your potential winnings (that is, if there's a top limit, and you don't know what's in your box).
I think this explains pretty well why if you don't know the values it's not necessarily true that doublings are more good than halvings are bad. Because when you're at the outer reaches of the potential money, you always get halved.
That's theoretical, though, and merely explains why the math isn't as simple as it appears. Nevertheless, if my box had 5 dollars, or 100, or a million, or 100 million, I'd always switch (if he has 100 million, isn't it just as possible he has 200 million?)
The problem I see with traditional EV is that there's nothing to suggest that the person running the puzzle is as likely to put $200 in a box as he is to put $50 in a box.
Suppose I specify that they are equally probable. EV still can't be right. What's the mathematical reason for that?
Going in, you have a 50% chance of choosing the box with the higher value, and 50% chance of picking the lower value. I presume that if you played this game 1000 times, 50% of the first picks chose the higher value box and 50% chose the lower value box. If that is so, then switching 100% of the time will result in ending up with the higher value box 50% of the time and the lower value box 50% of the time. If that is right, it shouldn't matter whether you switch or not. What am I missing here?
If the chooser keeps the original box there is a 50% chance that she is getting x, and a 50% chance that she is getting 1/2 x. The expected value of what she holds is .75x. (It doesn't matter whether she gets to look inside the box or not, because that tells her nothing about whether she has x or 1/2 x). If the chooser switches, the situation is the same, a 50% chance of x and a 50% chance of 1/2x, again equaling .75 x. So switching gets you nowhere; the expected payoff is the same either way.
Nice point, very elegant. You are not as simple as you would like to admit...
Actually, if you specify that then suddenly the EV does become valid. Before you opened the box you didn't know that this was one of those Xs for which P(2X)=P(.5X), but once you've opened it you do knowand can apply hte EV accordinly. Since this can't be true for all X, that means that switching is probably a bad idea for every other number.
Discussion of Expected Values is now concluded.
something about the distribution of money amounts, regardless of what is told to you. Then whether you should switch depends on what you believe.
http://brainyplanet.com/index.php/Envelope%20Solution
Unfortunately this seems to be on a slow server that often returns blank or truncated pages.
The two boxes have a smaller amount 'X' and a larger amount '2X' in them. Assuming your choice is random, your expected value on choosing (and though it doesn't effect the analysis opening) is (X+2X)/2=1.5X
If you have chosen the box with the smaller amount you get X. If you have chosen the box with the larger amount you get 2X.
Now the switching comes in. If you initially had chosen the box with the smaller amount you will now receive 2X. If you initially had chosen the box with the larger amount you will now receive X. The expected value post-switch is 1.5X which is the same as the pre-switch scenario.
Therefore, on an expected value analysis you have no reason to switch.
On a marginal value analysis you may have a good reason to switch depending on different values of X. (For a $1 revealed amount I always switch because $2 in hand is worth more to me than the 50 cent loss even though my EV is the same in both cases. For a $1 million revealed amount I might not switch because the $500,000 'loss' would make a big difference to me. For a $100 million revealed amount I would probablly switch because the $50 million loss wouldn't feel as much to me as the gain I could get by getting $100 million. But in all cases the EV of switching or not switching remains the same.)
Suppose a dog has 4 puppies and each puppy is a 50/50 independent coin flip of being a boy or girl. Is it most
likely that the litter will be 2 boys and 2 girls?
Her answer would be, "No, it is most likely that the litter will be 3 of one sex, 1 of the other."
This is both correct and incorrect. If the event space is (all four same sex, 3 of one 1 of the other, 2 and 2) then she is correct since the probabilities of these events are, respectively, (1/8, 4/8, 3/8) and 4/8 is the highest of this list.
On the other hand, if the event space is (4 male, 3 male 1 female, 2 and 2, 1 male 3 female, 4 female) then the probabilities are (1/16, 4/16, 6/16, 4/16, 1/16) and thus
the most likely is 2 and 2.
Of course, she never gives you the event space, which, like the distribution in this problem, is necessary to answer the question.
Given that the first statement was essentially "You can't decide" disguised in language which says "I'm smarter than you," it can hardly be a shock that the discussion went on.
All the yammering about uniform distributions and natural numbers and whatnot is basically people who know too much missing the point. Sort of like what would happen if you asked a law professor "Can I legally do X?" and got back a 180 page law review article discussion deontological vs. welfarist thories of justice.
My (pre-law) training was in physics, and I can assure you that if Choset puts up a puzzle about the uncertainty principle, my respone will not include the words "commutation relations" however relevant they may be.
HowieKevan poses us is,Deal... or no deal?
It wasn't disguised in language to me at all. It was the most parsimonious language possible, and completely accurate.
Sort of like what would happen if you asked a law professor "Can I legally do X?" and got back a 180 page law review article discussion deontological vs. welfarist thories of justice.
Except it was hardly 180 pages. More like asking a law professor some question and he responds succinctly by using the precise Latin legalese for the situation, which unfortunately does not always illuminate without a longer response. Ask a math question, get a math answer.
Naturally, of course, to spell out the answer in more detail requires using more words. But there you go.
What an odd comment...
1) If you bound the distribution, the paradox goes away.
If we know that the most that can be in a box is $100, and we get a box that has $60 in it, we don't switch. If we get a box that has $30 in it, we do switch.
The paradox is that we *always* switch, no matter what's in the first box. That makes no sense because opening the first box seems to give us no information. In this case, however, opening the first box does give us information, as expected.
2) You're correct that the EV analysis is wrong. The reason, however, is that the given assumption in the problem is impossible.
You open the first box, and there is $100 in it. The boxes are either $100/$50 (scenario A) or $100/$200 (scenario B).
Before opening the box, these two scenarios are both equal to zero. Otherwise, the total probability of all the possible scenarios would be infinite, and that's impossible.
After opening the first box, the probability of these two scenarios appears to be equal, and this conclusion leads some people to believe that the probabilities are both 50%.
In reality, the probabilities are undefined. Bayes' theorem gives us the probability of scenario A:
Pr{A|$100 in box 1} = Pr{$100 in box 1|A} x Pr{A} / (Pr{A} + Pr{B}) = 0/0
Therefore, Pr{A|$100 in box 1} = 0/0. This value is *undefined*, not 50%. If the numerator and denominator were both very small numbers approaching zero, then 0+/0+ could equal 50%, but 0/0 does not.
This result comes about because the problem statement itself is a contradiction. It is the reason that EV analysis fails and that your intuition is correct.
Sorry if you don't like "proofs" and "jargon," but they're the only way you know you're right rather than just arguing intuitively about questions that have a correct answer.
So at first, I have no idea whatsoever how much money I'm going to get. It could be one dollar or it could be a trillion dollars. Ok, probably not a trillion (but again, if the first box is half a trillion, why not?).
Then I open my box. Blam! It's a million dollars. I'm thinking, "Sweet! That's huge! I'm getting at least 500k here." So now I have the question of whether to switch.
So how do I possibly not switch? Not knowing anything else, I don't see how it can be anything other than 50/50 for 500k or 2 mil.
Maybe the problem is that it wouldn't actually be 100 dollars. If there were truly no limits, the number would be something with like a million zeros on it, right? If the number were truly picked randomly.
If there are no limits, though, I still say you have to switch, if you're given a specific number.
On the other hand, contrary to BrainyPlanet there is a well-defined version of the question which exhibits the same paradoxical behavior. Suppose that there is a 1/2^n chance that the envelopes contain the values $3^n and $3^(n+1). This is a good probability distribution since the sume of 1/2^n is 1.
Suppose you open one envelope and find $2^k. You can easily calculate that the expected value for switching is (11/9)2^k. This is clearly larger than 2^k, so you should always switch no matter what you find in the envelope!
Ah, but then what happens if you don't open the envelope? Here the answer is a lot trickier and there's room for debate. But what I would say is that you now compute the expected value for staying the same, and the expected value for switching. Both of these calculations turn out to be infinite! So there's no way to say that switching is better since both switching and staying the same have infinite expected value.
Everything I know about this problem I learned from a slightly drunken discussion with Kenny of antimeta who is a graduate student studying philosophy of probability. So any good observations in my comment should be attributed to him. Any mistakes should be contributed to either me, or the wine that I was drinking at the time.
For the reasons discussed by some prior commenters, if I open Box 1 and it contains $100, I will think to myself "if I change to Box 2 I will either gain $100 or lose $50. Being risk-neutral, the expected gain outweighs the expected loss. Therefore, I will change to Box 2."
BUT ... nothing in that reasoning depends on the specific value of $100. I would have reasoned "this box contains X, changing to the other box will lead to either a gain of X or a loss of 1/2 X, and therefore I will change," no matter what X was.
Nor does anything in this reasoning depend on it being YOU who chooses the first box. Suppose the situation were that I (the player) gets to choose one of the two boxes, open it, note the amount inside, then keep that box or change to the other one.
Based on the above reasoning, my optimal strategy would always be "pick one box, open it, and then, whatever is in it, reject it in favor of the other box." But if I'd opened the other box to begin with, by identical reasoning, I would have changed that box for the first one.
In any event, that seems to be the reasoning I would go through ... except it doesn't seem to make a whole lot of sense.
I actually find it fairly condescending when someone assumes that I won't understand them, and I expect that a lot of people feel the same way.
If you don't like rigorous math, skip over my posts, and don't take them so personally.
Yes, that is a better analogy. But if the answer is "you can't decide without more information" then there is no need to dress it up in Latin or anything else.
Nor does it make sense to stand on what lawyers would call "nice" distictions like the distribution when the average layman is just going to assume 50/50 and go from there. The interesting question is what happens in the 50/50 case, and why are our intuitions so contradictory?
I'm not opposed to parsimonious languge in and of itself, having used it in many discussions with physicists and lawyers, but it should hardly surprise anyone that non-specialists don't accept the parsimonious explanation only accessible to specialists. To return to the uncertainty principle example, I would hardly expect this sentence to end all discussion here at VC: "The paradox is resolved because you cannot have simultaneous eigenstates of non-commuting observables."
If the box the person picks (Box #1) holds $100, one cannot assume that there's a 50% of Box #2 holding $50, and a 50% chance of Box #2 holding $200. Just because there are only two choices for monetary amounts in Box #2 does not mean each choice is equally likely (nothing in the original question says that).
The point of all the math types here is the following: there is no way to completely speicify the process by which amounts are chosen for the boxes are such that the answer to the question "what is the probability that the other box is twice the amount I have now?" is always 50/50. For instance, if the process is to draw from a uniform distribution between $0 and $100, put that amount in one box and put twice that amount in the other box, then if you open a box and it has over $100 in it, the probability that the other box has $200 in it is zero, not 50%.
Just because you weren't told the procedure by which amounts were put in the boxes (other than one has twice the other) doesn't imply you have no beliefs about the matter. You must have some belief about where these amounts came from. And whatever these beliefs are, the probability of the other box having twice the amount of the box you opened is not 50%.
Think about it this way...
I give you the following information:
* a is either equal to zero or one.
* a is equal to zero.
* a is equal to one.
What is the probability that a = 0? It's not 50%. It just doesn't exist.
It is not correct to say "you can't decide without more information". Like I state in my previous post, you can't help but have beliefs regarding the process which determines how much money is put in the lower amount box. The answer is not "you can't decide" but you whether you should switch depends on what these beliefs are.
* a is equal to zero.
* a is equal to one.
What?
As for the reasoning behind my answers, I'll wait a bit, since the comments keep flowing.
I say the following "I have two boxes. In one box I put a certain amount of money. I then flipped a coin to decide whether the other box would contain double the amount of the first box or half the amount of the first box."
I hand you one box and ask you if you would like to switch. Isn't there a 50% probability that the second box contains half and a 50% probability that the second box contains double?
Why doesn't this result in the paradox?
Dan
I'm not taking it personally. As it happens, I don't mind math (having done graduate physics work before bailing out for law school) but I can't read your notation so I do skip most of it.
That said, I was and am interested in this problem because it exposes an obvious flaw in how we think about probability which is worth trying to understand (I'm still groping around a bit; Sebastian and Box Keeper seem to have gotten it right but I can't say exactly why).
But all the highly technical objections of "we don't know the distribution" strike me as analgous answering the question of "how long will the ball remain airborne?" with "Unless we know if we're on the moon or not, we can't tell what the acceleration of gravity is." Technically correct, not particularly helpful. Make a simple assumption that speficies the problem and explain.
Now replace my dollar values with variables. You've switched, and Kevan shuffles the 2 boxes (including the one you originally had) 3-card-monte-like so that you don't know which is which. You had x, and now you have either x/2 or 2x. Should you switch again? The EV of switching again is .25(x+x/2) + .25(x+2x) = 9x/8, compared to the 5x/4 you now have. (Alternatively, you are given the option to switch back to the box you originally had. Should you? No.) So you should switch once, then stay pat.
I don't see how this problem is any different than the one above, and it doesn't require assuming some sort of uniform probability distribution among all integers, just among 2 or 3 boxes.
You did not address the issue of whether the initiator was required to offer the choice, or whether he is trying to "win". In a game scenario, if he offers to exchange when he doesn't have to, then he is holding the "losing hand".
Of course, this comes down to the question not being specific along these lines.
Vizzini: I'm afraid so. I can't compete with you physically, and you're no match for my brains.
Man in black: You're that smart?
Vizzini: Let me put it this way: Have you ever heard of Plato, Aristotle, Socrates?
Man in black: Yes.
Vizzini: Morons!
Man in black: Really! In that case, I challenge you to a battle of wits.
Vizzini: For the princess? [Man in black nods] To the death? [Man in black nods again] I accept!
Man in black: Good, then pour the wine. [Vizzini pours the wine] Inhale this but do not touch.
Vizzini: [taking a vial from the man in black] I smell nothing.
Man in black: What you do not smell is Iocaine powder. It is odorless, tasteless, and dissolves instantly in liquid and is among the more deadly poisons known to man.
Vizzini: [shrugs with laughter] Hmmm.
Man in black: [turning his back, and adding the poison to one of the goblets] Alright, where is the poison? The battle of wits has begun. It ends when you decide and we both drink - and find out who is right, and who is dead.
Vizzini: But it's so simple. All I have to do is divine it from what I know of you. Are you the sort of man who would put the poison into his own goblet or his enemy's? Now, a clever man would put the poison into his own goblet because he would know that only a great fool would reach for what he was given. I am not a great fool so I can clearly not choose the wine in front of you...But you must have known I was not a great fool; you would have counted on it, so I can clearly not choose the wine in front of me.
Man in black: You've made your decision then?
Vizzini: [happily] Not remotely! Because Iocaine comes from Australia. As everyone knows, Australia is entirely peopled with criminals. And criminals are used to having people not trust them, as you are not trusted by me. So, I can clearly not choose the wine in front of you.
Man in black: Truly, you have a dizzying intellect.
Vizzini: Wait 'till I get going!! ...where was I?
Man in black: Australia.
Vizzini: Yes! Australia! And you must have suspected I would have known the powder's origin, so I can clearly not choose the wine in front of me.
Man in black: You're just stalling now.
Vizzini: You'd like to think that, wouldn't you! You've beaten my giant, which means you're exceptionally strong...so you could have put the poison in your own goblet trusting on your strength to save you, so I can clearly not choose the wine in front of you. But, you've also bested my Spaniard, which means you must have studied...and in studying you must have learned that man is mortal so you would have put the poison as far from yourself as possible, so I can clearly not choose the wine in front of me!
Man in black: You're trying to trick me into giving away something. It won't work.
Vizzini: It has worked! You've given everything away! I know where the poison is!
Man in black: Then make your choice.
Vizzini: I will, and I choose...[pointing behind the man in black] What in the world can that be?
Man in black: [turning around, while Vizzini switches goblets] What?! Where?! I don't see anything.
Vizzini: Oh, well, I...I could have sworn I saw something. No matter. [Vizzini laughs]
Man in black: What's so funny?
Vizzini: I...I'll tell you in a minute. First, lets drink, me from my glass and you from yours.
[They both drink]
Man in black: You guessed wrong.
Vizzini: You only think I guessed wrong! That's what's so funny! I switched glasses when your back was turned! Ha ha, you fool!! You fell victim to one of the classic blunders. The most famous is never get involved in a land war in Asia; and only slightly less well known is this: Never go in against a Sicilian, when death is on the line!
[Vizzini continues to laugh hysterically. Suddenly, he stops and falls right over. The Man in black removes the blindfold from the princess.]
Buttercup: Who are you?
Man in black: I'm no one to be trifled with. That is all you'll ever need know.
Buttercup: And to think, all that time it was your cup that was poisoned.
Man in black: They were both poisoned. I spent the last few years building up immunity to iocaine powder.
To clarify, the problem could read: Has Kevan put 50 and 100 into the boxes or has Kevan put 100 and 200 into the boxes.
But in my version of the problem they are. I've said that I determined which scenario to apply by a coin flip.
Dan
other than the fact that one box has exactly twice the amount of money in it as the other. You randomly select one of the two boxes, open it, and find $100 inside. I now give you the option of keeping the $100 or switching boxes with me and keeping whatever's inside the other box. Which should you choose?".The puzzle remains the same.
How does the puzzle remain the same? In the original you know something about the contents of the second box. In your version you know literally nothing.
Dan
1) I must offer you the switch either way. I'm not playing strategically at all.
2) Despite the unrealistic nature of this assumption, I am indeed assuming there is a limitless amount of money in the universe. If the box I gave you contains every dollar the US Mint has ever printed, the other box might have twice as much.
Mike Keenan: You are precisely right that the two scenarios (50 vs. 100 or 100 vs. 200) are not equally probable. But I believe that far from making my reasoning wrong, that is precisely the insight that makes my answer correct.
Of course I could be wrong.
I don't think you switch once and hold under my version. Each time you have a box, there is a 50% chance the other box contains double and a 50% chance the other box contains half, because it was determined with a coin flip. Why isn't the expected value analysis that leads to infinite switching valid here?
Dan
I like to. . . think outside the box.
If the guy gives you that option only after you've chosen, though, and it is his money, it seems to me like you're in the Princess Bride situation.
The possibility of additional switching is not part of the problem and can't be reasonably analyzed unless you restate the problem with the necessary info. If the third box is created only after you have chosen the first two, it inevitably allows for experimenter manipulation and the logical answer would be to stop at two. If there are two "third boxes" each placed before you depending on which second box you got, I think it's back to the EV problem.
But Kevan, that claim does not seem to be correct under my version.
Dan
For the first pick: p (high) =.5
p (low) =.5
For the second pick p (high given high) =0
p (low given high) =.5
p (high given low) =.5
p (low given high) =0
So the odds of getting the high box on a swtich are the same as getting on your first pick .5. The amount discovered in the box is irrelevant.
That's correct. Under your scenario, the probabilities are, in fact, equal. Certainly under your scenario, the answer to Problem 2 is still that it doesn't matter whether you switch. I need to think some more about the answer to Problem 1 though. And if the answer to Problem 1 (in your scenario) is to switch, then I need to wrap my head around why there's a difference.
similary low given low = .5 * 1 (p of getting high first time, times p of getting high on switch given that you picked low first time)= .5
high given low = .5 * 1 = .5
low gien low = .5 * 0
Anyways, Noah Snyder's version is quite cool. I hadn't realized that you also needed a finite expectation value to avoid the paradox.
For Dan, the problem is the "certain amount of money" he puts in the first box. All possibilities can't be equally likely. Let's say, for example, that he can't put more than a trillion dollars into the initial box, but that's it's uniformly distributed below that. If you get a value over a trillion dollars in the box you open, you should never switch. What to do for values below a trillion is left as an exercise for the reader :)
On Bruce's version, you should switch on a pure expectation value basis. No paradox because you know what's in all the boxes.
Should we do sleeping beauty next?
The double/half part of the puzzle does not give you any useful information to keep the original box or switch it either.
Aaron -
I don't see how that applies. Under the closed box version all you should care about is the probability distribution of the second box because all that matters is the relationship between the first box and the second box. So since you know the second box is relatd to the first box by a coin flip, the amount that could be in the first box is irrelevant.
If I pick a box, I know that when the amounts were chosen the relationship of the two boxes was determined by a 50/50 event. So I know that the expected value of the switch is 25% more than the value of not switching. And that's true each time I have a box in my hand. Thus the paradox arises, and you can't get out of it by saying you can't have an even probability distribution because you do by definition. When I selected the first amount I did so knowing that I would choose the second amount by coin flip, so the first amount had to be one which allowed a coin flip to determine the second amount.
Dan
Dan
Dan says he puts X in Box A and flips a coin to put either X/2 or 2X in Box B. He then hands you a Box. Care to switch Boxes?
If you knew you had Box A, you would switch because your expectation goes from X to 1.25X. If you knew you had Box B, you would not switch because your expectation goes from 1.25X to X.
But, you don't know which Box you have - so you have no reason to switch.
Dan earlier said: That's true, but if you hold Box A you are doubling or halving from a constant amount (X). If you hold Box B, you are halving from a higher amount (2X) than you are doubling from (X/2). So, Dan's conclusion that you always increase your expectation doesn't hold.
In my version it does because you can validly calculate the expect values. There is a 50% probability of either outcome (unlike the version which does not specify the coin flip)
Dan
Simple question: One box contains $X, the other contains $2X. You pick one, and it contains $100. Should you switch?
First math-people answer: "Well, assuming an infinite amount of money in the universe, you can't tell."
Fine, we limit the amount of money in the problem. Second math-people answer: "if the limit is under $200, definitely don't switch because then you know you have $2X."
OK, screw limits. We specifiy a distribution: 50/50, decided by flipping a coin. Should you switch?
Third math-people answer: "I leave that as an exercise to the reader."
It's nice that you've solved the trivial cases for us. That's why we have experts: to dodge the question!
It's clear that there's some problem with straightforward EV. Mev may have explained it, and if so I'm sorry that I don't know enough Baysian probability to make sense of his explanation. But to me what's really interesting here is to understand when one can, and cannot, use EV in other, less artificial problems. Given how common EV is in the real world, I'd love to know that but so far I'm pretty much stumped.
But the point is you don't know which box you have. I think my version is exactly the classic version with the only difference being that you have a defined probability distribution which is even for all outcomes. Why doesn't that launch the paradox?'
Dan
1. We agree that the likelihood of 50 or 200 in the second box is the same in Dan's scenario, due to the coin toss. That is 50/50. And thus, switching does not matter.
2. Assume, for the sake of argument, that in Problem 1, there is some difference between the likelihood of 50 and 100 or 100 and 200. How would you know this as a rational player? You couldn't know. Hence you have no basis to decide which scenario is more likely, even if in objective fact one might be. Thus, you will not benefit yourself by switching.
As an aside, has someone created a simulator of this game and run it through iterations to test these hypotheses with some empirical data?
What is your expectation if you don't switch? There's a fifty/fifty chance you chose the initial box, so you get
(1/2)E + (1/2)((E/2 + 2E/2)) = 5E/4
Now, what if you decide to switch? You get the exact same expression. So there's no point in switching. Once you open the box, my earlier comment applies.
In my experience, when business people applied EV in the weighted average fashion (not the only way mind you, just the easiest one to explain), it was used as an enhanced way to predict value.
In other words, a businessman says, well there's a 30% chance I make $50, 40% chance I make $75, and a 30% chance I make $100, so I should overall expect this - and then he computes a weighted average. Of course this is all guessing in the first place.
But, it was seen as preferrable to just sticking your finger up in the air and saying "this will make $80."
I'm not a statistician - so I'm not sure how they use it. It's clear that a lot of people hear probably learned EV in b classes in undergrad or law school, based on their assumption that you can straightaway apply to probabilty problems. Nope.
-1 to math geeks for giving probabilities and expected values, identifying paradoxes and arguing bitterly about them when the question is simply "which should you choose?"
+1 to the Robert Lyman for the first "doesn't matter" answer
The Solution. Analogously, in the Two Envelope Paradox, the expected value of X in the "2X" part of the formula (where Envelope A is the envelope with less) is less than the expected value in the "1/2 X" part of the formula (where Envelope A is the envelope with more). You would expect less in Envelope A if you knew that it was the envelope with less than you would if you knew it was the envelope with more. Allowing X to have different expectations in different parts of the formula in this way is like comparing apples and oranges. The "X" in the "2X" just isn't the same as the "X" in the "1/2 X" part.
The better calculation in the Two Envelope Paradox involves setting X to the amount in the envelope with less and calculating the expected value of Envelope B as (.5)X + (.5)2X = 3/2 X -- and the expected value Envelope A likewise as (.5)X + (.5)2X = 3/2 X. In these calculations the expectation of X in the first term of each equation is identical to its expectation in the second term: The expected amount of money in the envelope with less does not change depending on whether Envelope A is the envelope with more or Envelope B is.
I didn't claim the paradox wasn't launched with your problem definition. I merely corrected your claim that you should perpetually trade boxes because your expectation always goes up. Going from Box A to Box B improves your expectation. Going from Box B to Box A reduces your expectation by a like amount.
Back to the possible paradox. Using your problem defintion together with Kevan's Problem 2 (pick a box - not sure which box it is - no peeking inside - switch?), it is clear switching makes no difference because of the last sentence in the above paragraph.
But what if we take your problem definition and Kevan's Problem 1 (pick a box - not sure which box it is - open it up and it has Z inside it - switch?). I need to think about that one some more.
To see why, imagine that there are two players simultaneously looking into the two boxes, under each of the two possible scenarios that follow from the presence of a $100 bill in one box. Each player might be inclined to believe (erroneously) that the chance to switch would amount to a double-or-half gamble, but of course that's not possible.
If the actual amounts are $50 and $100, the person looking at the $50 will think "it could be $100, or it could be $25" (incorrect EV of $62.5) of while the person looking at the $100 will think "it could be $200 or it could be $50." (incorrect EV of $125). These EV calculations total much more than what is in the two boxes combined. The reason they are collectively skewed upward is because they contain a figure (here $200) that is not contained in any box, and that upward skew is not fully compensated for by the fact that there is also a downside figure that does not appear in any box (here, $25). The fact that only the two middle figures are possibilities, and not the outliers, is why neither player is actually justified in viewing the gamble as "double or half."
It works the same way if the actual amounts are $100 and $200. Then the person looking at the $100 will think "it could be $50 or it could be $200" while the person looking at the $200 will think "it could be $100 or it could be $400." Again, these ruminations are flawed, because the total set of figures under contemplation contains an upside figure not contained in any box (here, $400) that is not mathematically made up for by the presence of the low figure not contained in any box (here, $50).
After seeing $100 in the first box, I decide at random whether or not to switch. 100/101 of the time I keep the $100, 1/101 I switch. (More generally, if I see $n in the box, I switch with a probability 1/(n+1)).
This actually gives better results on average than always keeping or always switching, regardless of whatever distribution has been used to select the amounts in the boxes.
For example, if 50% of the time the boxes held $50 and $100 and 50% they held $100 and $200, then:
1/4 of the time I'd see $50, and switch 1/51 of the time: EV $51.019
1/4 of the time I'd see $100 (other box has $200), and switch 1/101 of the time: EV $101.01
1/4 of the time I'd see $100 (other box has $50), and switch 1/101 of the time: EV $99.501
1/4 of the time I'd see $200 and switch 1/201 of the time: EV $199.505.
My overall EV is $112.76, which is higher than the EV of always switching or always keeping (both $112.5).
Likewise, if the smaller dollar value is generated by a roll of an unbiased die, the EV of switching or keeping is $5.25, but the EV of switching 1/(n+1) of the time is $5.71.
Sadly, I can't claim credit for this--it was mentioned on a Crooked Timber post on the same subject, IIRC.
Noah's example troubles me--but one thing to notice is that the average amount of money in each envelope is _infinite_ (3/2+9/4+27/8+... grows without bound). Distributions with infinite averages tend to yield...unusual results anyway.
I think that's incorrect. If I go from Box A to Box B I improve my expected value. But I don't know which box I have when I am holding Box B. Box B is either twice Box A or half of Box A. Similarly Box A is either twice Box B or half Box B. If I go from Box B to Box A, then my expected value is .5(2B)+.5(B/2)which equals B + .25 B which is 1.25 times B, so I should switch back. That leads to perpetual switching.
Dan
I don't see how my version is materially different than the classic version. The only difference is that I have specified the probability distribution of the second box. It's exactly the same problem otherwise. However, specifying the probability distribution eliminates what seems to be the main objection to the problem.
Dan
The math types have answered your questions over and over. You just aren't understanding their answers. If you specify a distribution over how Kevin picks the lower amount and assume he puts twice that amount in the other box, and you have a 50/50 chance of initially choosing the lower box, whether you switch when you open the first box depends on what's in that box and the actual distribution. The uniform with a bound example was one example where you can give a simple rule of how it depends. Without the actual distribution, the only correct answer is "it depends on the distribution and what you saw in the first box. Give me the distribution and I will tell you how."
Aaron -
How does that hold in my coin flip version? It does seem that the value you observe when you open the box is irrelevant.
Dan
Fine. My distribution for the first choice is uniform, bounded at $1 million.
The first box contains $100, as Kevan specified.
Switch or not? What is the "simple rule"?
I say, "it doesn't matter." Am I wrong?
It is not true that your expectation value changes when you switch. The problem is when you start out and say that, without looking, assume that the value in the box you have is B. There is a probability distribution for that value that you have to take into account for the rest of the calculation.
The paradox with the original formulation is that, when you open the box, no matter what the value, you should switch which would imply that you should switch even before opening the box. This contradicts the symmetry of the situation.
The resolution, as I said above, is that there does not exist a distribution of probabilities with finite mean that leads to the situation where you should always switch. Switching is irrelevant before you open the box and, after the box is opened, the decision depends on the probability distribution for the values in the box.
if 100 is the larger value then ev = .5*50 + .5*100= .75
without knowing which ev to use, you cannot use the expected value to make your decision, i think the problem with your ev calculations are that from the $100 box you have picked, it is not possible to pick either 50 or 200, only one of them is actually possible, you just don't know which one. this seems to be skewing your ev, i think this is one point box keeper was hitting on.
That is wrong. The distribution between the two boxes is known. It is x and 2x. What is not known is whether or not you have opened the X-box (heh) or the 2X-box.
When you initally select the box, before you open it your expected value for a random selection between the two boxes is [(50% times X)+(50% times 2X]/2. Therefore your expected value (EV) after the initial selection is 1.5X.
If you choose to switch you are switching from the bigger value to the smaller value half of the time. The other half of the time you are switching from the smaller value to the bigger value. You end up with the bigger one half of the time and the smaller one half of the time. That is [(50% times 2X)+(50% times X)/2. Therefore your expected value after the switch is 1.5X.
Since this is the same as your expected value before the switch, there is no (expected value) reason to switch.
I don't really understand why this additional information (x=$100) should change our choice from the unknown value situation, if it does. If it doesn't, I have absolutely no clue why you should pick a box, learn nothing, and then switch once and stop. Of course, that's the solution you get anytime you assign a fixed value X to the first box: Box two has EV 1.25X, and you don't switch back because box 1 has defined (but unknown) value X.
Therefore, you need not open the box to know that you will switch. You should switch even before you open the box. Thus, the paradox arises.
Dan
The difference between the original version and the coin flip goes to what makes the original puzzle puzzling. People make the mistake of treating the choice whether to switch in the standard puzzle as if it were identical to a double-or-half gamble (that is, as if it were actually true that there's a 50% chance of getting half of, and a 50% chance of doubling, whatever you spy in the box). But they are wrong to do so, as I explained above. That is, they are wrong to do so unless someone has come along and helpfully specified that the actual value in the second box is either exactly half what you have or exactly double what you have, and that there's a 50% chance of each outcome. If someone does specify that, there's no puzzle at all -- everyone should just take that gamble. With it unspecified (as it is in the original puzzle), the boxes are identical in EV terms.
"Sebastian and Box Keeper seem to have gotten it right but I can't say exactly why)."
the various arguments above to the effect that "you don't know any more after opening the first box than before" present the logical, or shall we say, "intuitive" answer. the arguments given by S and BK are abbreviated versions of the following formal calculation. note the importance of defining random variables explicitly and that it's a conditional expectation that's being calculated:
Consider boxes A &B. one contains an amount of money D, the other 2D, where D is finite. define random variables X and Y which are the amounts of money in boxes A and B respectively. Choose a box, say A, and open it to find an amount of dollars, necessarily either D or 2D. what is the conditional expectation of the amount of money in box B (Y) given the amount in box A (X).
Ok, so in additional to the coin flip I also specify that the initial amount was selected at random, so all initial amounts are equally likely. Why does the paradox not arise then?
Dan
That's not true. Here's an example. Assume that you know the guy has only 2000 dollars with which to play the game. If you open the box and see 1200 dollars, you know that there has to be 600 dollars in the other box, so you should not swiitch.
Because that's an impossible specification. You cannot select a random initial amount with equal probabilities for all natural numbers (or real numbers if you prefer.)
(Gotta go now...)
All that means is that if you don't know how much he has to play with then you always switch. All I have to do is specify that you don't know how much money he has.
Dan
Having chosen a box with x amount ($100), we STILL DON'T KNOW whether the other is x/2 or 2x. In your scenario, this means we don't know whether or box 'x' was your first box (random amount) or the second (determined by coin toss).
My initial critique of the expected value analysis still holds. For the "pure" version of the problem (no limit on money, no risk aversion, etc), it doesn't matter what is in the box, the relative value of the boxes, or whether you look or not. If it is worth switching once, then it should be worth switching back. Therefore it is not worth switching. If you play this many times, the switch or not-switch strategies will be equal.
Box Keeper (at 3:04 PM) has the most straightforward explanation of the probability tree that shows this.
I don't have to include all natural numbers, I just don't have to tell him what the maximum is. All I tell him is that each number had an equal probability of being chosen.
Dan
For heaven's sake, how about the case of a bounded problem where the amount is nowhere near the bound???? That's the only interesting case.
You guys are so far off, answering some other question. The real answer doesn't actually rely on any of your back and forth. It has been explained by at least 5 or 6 guys why it doesn't matter. Seriously!
Would you just declare Box Keeper the winner already?! (As I did 4 hours ago.)
In your formulation, Box A has the constant value X. Bix B does not have a constant value. Half the time Box B will have 2X and half the time it will have X/2. When you double in switching back to Box A, you always go from X/2 to X. When you halve in switching back to Box A, you always go from 2X to X. Your expectation in switching from Box B to Box A drops from 1.25 down to 1
The problem: We put an amount X in Box A. We flip a coin and half the time put 2X in Box B and half the time put X/2 in Box B. We pick a Box (don't know which one) at random, open it up and see there are Z dollars. However, Z doesn't represent any particular amount but rather the complete set of possible values of Z in the likelihood that they will occur. Should we switch?
Note that we can't answer the problem for any particular value of Z because we need to know the distribution of X. Trivial example: for Z=100 we would never switch if X always equaled 50, but always switch if X always equaled 200. But, if we integrate over all possible values of Z, we can solve the problem.
Example: We always put 50 in Box A. Z=50 with Prob=1/2 (we chose Box A). Z=100 with Prob=1/4 (we chose Box B and it had 2X). Z=25 with Prob=1/4 (we chose Box B and it had X/2). When we see 50 and switch, our expected value goes up by 12.5. When we see 100 and switch, our expected value drops 50. When we see 25 and switch, our expected value increases by 25. Overall change in expected value by switching = 1/2*12.5 - 1/4*50 + 1/4*25 = 0.
Answer: it doesn't matter when we integrate over all the possible values of Z. The paradox appears only when we consider a specific value of Z (and goes away when we integrate over all Z).
In the absence of prior information, it is reasonable to assume a 50-50 chance, and one should switch. If one switches, one shouldn't switch back, because in that case one will be holding either double or half $100, and with a 50-50 chance the Expected Value of what one holds would be $125. So the game is consistent, and there wouldn't be any switching ad infinitum as some have suggested.
But should the odds really be 50-50 for each option? We may know something, or believe we know something, about these kinds of games which renders the chances different from 50-50. We we know something, we should use our prior knowledge. For example, if one believes game hosts will be more likely to offer switches when the contestant will lose, one should substitute what one believes the chances of this occurring are. If one believes the host offers a swich when the contestant holds the higher value two thirds of the time (or more), the expected value of a switch becomes .667*$50 + .333*$200 = $100 (or lower), so one should keep the $100.
However, I think Robert Lyman has an understandable grievence with the mathematicians so far, why can't we give some usable guidance. Well, with the caveat that there are all sorts of bizare distributions which will invalidate these guidelines, here goes.
We must have some a priori assumptions about the distribution -- at the very least it is bounded by all the money in the world. (at the very least there would be significant problems with inflation and/or security at very high values which would invalidate the linearity of the value of a dollar) There are two assumptions which lead to easy decision rules.
a) If the amount in your envelope can not be halved, then it must be the smaller amount so you should switch. Given many likely specific scenarios (e.g. the envelopes contain legal tender)this will apply.
b) If the amount is more than half the maximum you believe possible, then it must be the larger, so don't switch.
c) Under any 'economic' distribution (i.e. excluding those where the probability increases with the amount over a large region) if the amount is significantly below the maximum, then the probability of it being the smaller is no more than, and likely less than, one half. Thus you should be at least as likely to switch as in the pure half or double 50/50 case. For instance, in the case of a uniform, continuous bounded distribution, for X < 1/2 the maximum the probability that X is the smaller amount is 2/3. Thus the EV of switching is 1.5X (1/3 * .5 X + 2/3 * 2X)
So for the given case, as long as I belive that there is a good chance that Kevin would put at least $200 in an envelop, then I should switch.
And the specific-value-of-100 versus all-possible-values-of-X distinction is at the heart of the paradox.
From Box Keeper: "The fact that only the two middle figures are possibilities, and not the outliers, is why neither player is actually justified in viewing the gamble as 'double or half.'"
This seems to me to be a plain-English version of the fact that in ctw's equation, the {Y=D|X=D} and {Y=2D|X=2D} terms equal 0, leaving you with just the two "middle" terms.
What seems to be going on is that Kevan's puzzle seems like a double or half game but is not. I'm trying to figure out why that is. I think the difference is that in a double or half game, like my 3-box variation, you know what you have and you know its relation to the other two boxes. In Kevan's Puzzle 1, you know what you have but you don't know how it relates to the amount in the other box. And — critically — that relationship *changes* depending on what's in the other box. If the other box is lower, then you are playing a puzzle in which the total was only $150, and the EV of the other box is $50. If the other box is higher, you're playing a different game, one in which the total value is $300 and the EV of the other box is $200. In a double-or-half game you know the total value of the system (even in Dan's coin-flip example) and thus the relationship of the box you hold to the other box. So it's more like quark spin than I first thought.
The part I don't get yet is, how come you can't say, given your lack of information, that there is a 50% probability you're in Game 1, and 50% probability you're in Game 2? Which gets you back to square 1.
The 'paradox' arises because everyone wants to express their gain in terms of the first box's amount, which could be high or low. If you instead use the more sensible definition of x as the lower box, period, you can easily calculate the gains and show they have equal expectation values. Doing otherwise muddles things by introducing more scenarios ($50 and $100, or $100 and $200?) which themselves do not have equal expectations, though they have equal likelihoods.
My apologies for not reading the comments closely enough to find commenters that got the correct answer earlier.
Right; but why does it change the answer when you define "x" a different way? If I say x+3=y+5, I don't get a different equation when I say y+3=x+5.
Actually you do. y=x-2 in your first example and y=x+2 in your second example no matter which way the quark spins.
Now, with the definition x is the box you first picked, and you want to answer in terms of that, there are both scenarios, that one is x and the other 2x, and one is 1/2 x and the other x, which complicates the problem. The boxes are not completely specified with this definition. But it can still be done.
To get an answer in terms of the second definition, compute the expectation value for the x/2, x case: 3/4x. Now do the x, 2x case: 3/2 x. The total expectation value is then 9/8 x.
The same answer arises from the first definition as well. Calculate 3/2 y (y now refers to what I called x in the first definition) for y = x/2 and y = x, which covers both scenarios (don't include y = 2x, since y is by definition the lower of the two boxes). The mean of 3/2*x/2 and 3/2*x is 9/8 x, the same result.
So if you get a box with $100, your expectation value is $112.50.
Otherwise my choice would depend on how I estimated his motivations. If I thought he wanted me to win more I would repick, if I thought he wanted me to win less then I would stay put with my first choice.
Of course if it was not $100 but any odd number then I would repick. Notice that this subsumes the issue of picking $1 and knowing the other box has $2. It also subsumes the case where both boxes contain zero dollars. After all 0 and 2x0 is a possibility.
I might explain why I came up with this later but think about this. Your initial choices are x and 2x. After you pick you either got x and your alternative choice is really only 2x with a perceived choice between 1/2 x and 2x. Likewise if you pick 2x your only real other choice is x with a perceived choice of x and 4x.
Of course if you follow the reasoning that you should always pick again because the upside is twice the downside this will lead you into error exactly half the time. This is because there really are two choices and not four. The choices 4x and 1/2 x are imaginary but this reasoning is treating them as real.
Notice also that in either case the real difference you are looking at by switching your choice is either +x or -x. It is not between double on the upside vs. half on the downside. If you have chosen x then switching gets you +x to 2x and if you initially chose 2x then switching gets you -x to x.
I went in circles like most people trying to do things in terms of the first box you pick, until I realized it's simple if you just do things in terms of the lower box, whatever that may be. Maybe it's $50, maybe it's $100, but the point is, there is a lower box, and either choice on average gets you 3/2 that amount. This way of formulating things, unfortunately, does not tell you what that amount actually is in terms of the box you know, but it does answer the puzzle.
1) If we construct the problem in the following way:
Kevan chooses random number X, where X is between $0 and $1,000, and places X dollars in box A. He then places 2X dollars in box B. He then randomly switches box A and box B 50% of the time.
If you open a box, and there is more than $500 in there, then you do not switch. If there is less than $500 in there, then you do switch. All the Bayesian arguments apply.
There's no paradox because you need to open the box to know whether or not to switch.
2) In the original formulation of the problem, the paradox seems to arise because we can conclude without opening the first box that it's 50/50 whether the other box contains half or double the first box.
There are two possibilities here. Either the problem can be expressed mathematically, or it's internally inconsistent (i.e. "this sentence is false"). Let's assume it can be expressed using math, since no one likes the second conclusion.
We want a strategy, one that makes sense to us intuitively, that tells us, given a value $X in the first box, should I switch or not?
In order to determine our strategy, we need to assign a probability that we find $X in the box for all possible values of X. (Note that these probabilities cannot all be equal for reasons stated in earlier posts.)
Based on this strategy, we can do EV analysis to decide whether or not to switch for all values of $X.
Note that the probability that the other box contains 2x is not necessarily equal to the probability that the other box contains 0.5x.
Deciding that you always switch or always don't switch corresponds to an impossible distribution and cannot be a valid strategy.
A separate question might be what's the actual value of your strategy. EV analysis can be used again, but only if the contents of the boxes is again a random variable.
In this case, you can evaluate the expected value of a given strategy, and the maximum possible expected value of any strategy will be the one that uses as its distribution the actual distribution used by the individual filling the boxes.
In conclusion, you figure out what the most likely distribution of box contents will be, and you use that distribution to form your strategy.
In the true absence of any knowledge, it's impossible to calculate the EV of any particular strategy, and EV analysis therefore tells us nothing. We therefore don't have any reason to prefer any strategy over any other strategy.
We can simulate randomly picking a strategy by flipping a coin after opening the first box and switching if it lands heads. Anyone happy with that answer?
My point was that EV analysis requires random variables. Random variables must have distributions. If all values of x are equally likely, then x is not a random number, and EV analysis doesn't apply.
Trying to use EV analysis on situations that aren't random according to a strict definition of random is like trying to divide by zero.
The rules that define what random means are designed to prevent you from reaching absurd conclusions, just like the rules that define when you can divide a number by another prevent you from proving that 0 = 1.
Have a good night, everyone.
You do know that you have a 50% chance of getting the smaller amount but you already knew that before your choice. No new information was gained by your choice.
You were never told the distribution of the amounts. You were not told it was random. For all you know the boxes always contain $100 dollars and $50.
Suppose we rephrase the game like this: I have two two boxes. The first box holds money, the second box holds a ball. The ball is either white or black. The game rules are as follows: you first receive the box with the money in it (*shake shake* yep, it has money). You are then offered the option to open the ball box. If the ball is white, I take half the money from the money box and keep it. If the ball is black, I double the money in the money box, and you get it all.
Lets talk about distributions: Pretend I'm a rich mathematician and have lots of sets of these boxes. I get to pick which set we play with, and I'm an inscrutable mathematician, so there's no way any sort of Princess Bride reverse psychology or economic theorizing works on me. For all you know, I'm picking randomly. Lets say I've got a trillion boxes. I'll let it slip that at least one of the money boxes has $100 in it, and of the ball boxes exactly half are white and half are black. You're free to think up any distribution of money in the other money boxes.
Here's my claim: this problem is *exactly equivalent* to the one in the original post.
Now, lets talk about that distribution again: it *does not matter* what distribution of the money in the money boxes looks like, in either the sighted or blind case. The rules of the ball boxes remain the same either way. Suppose all but one of my money boxes are $1 and the last is $100. I happen to hand you the $100. Um, yay for you, but that doesn't tell you anything about the color of the ball. Suppose all but one are a gazillion dollars and the last one is $100. I happen to hand you the $100. Well, sucks to be you, but that still doesn't tell you anything about the color of the ball.
Seen in this light, the paradoxes vanish quickly.
Well, no, but in the absence of information as to the distribution, this is the only assumption that makes the problem interesting. I mean, the question's pointless otherwise. In these sorts of problems, that's a safe assumption to make. I mean, when someone uses a coin flip analogy to refer to a 50/50 situation you're not that guy who says, "That's only valid if the coin isn't improperly weighted, and only if it's tossed with many rotations," are you? If someone asks, "How many presidents were named George?", you don't respond, "That question is flawed because there are many other nations, and corporate leaders, with the title president, and you have to count them too."
All I can say is, if someone put Scarlett Johannson behind Door Number 1 and asked me whether I'll switch, I'm not going to spend 10 minutes asking questions like, "She doesn't have VD does she? You have to tell me if she does. A giant block isn't going to fall on me just as I walk through the door, is it?" I'm just going through Door Number 1.
Anyway, I'm done with this. Fun puzzle! Beats all those 'list of names' puzzles.
To make things easier to explain (with no real effect on the math) assume that the lower box has a uniform one in 500,000 chance of being any even number between $2 and $1 million. (The assumption that it is even is so you can't make any inferences from the fact that you observe an even or odd amount).
There are now 1 million possible joint events where a joint event is the amount in the lower box and whether you choose the high or low box. Each has a 1 in a million probability.
If you open a box and see $2, you know you have the low box and you should definitely switch.
If you open a box and see any amount between $4 and $1 million there are two equally probable joint events which could have caused this. 1) The low box has this amount and you drew the low box. 2) The low box has half this amount and you drew the high box. Since each event has equal probability (1 in 1 million) you now believe you have the low box with probability 1/2. If you are risk neutral you should switch since 1/2 (x/2) + 1/2( 2 x) = 1.25 x (where x is the amount you observed in the box).
If you open a box and see an amount over $1 million, you should not switch since the only way that can happen is if you have the higher box.
That answers Robert Lyman's question. If you observe $100, you should switch.
Suppose we rephrase the game like this: I have two two boxes. The first box holds money, the second box holds a ball. The ball is either white or black. The game rules are as follows: you first receive the box with the money in it (*shake shake* yep, it has money). You are then offered the option to open the ball box. If the ball is white, I take half the money from the money box and keep it. If the ball is black, I double the money in the money box, and you get it all.
(skipped a little)
Here's my claim: this problem is *exactly equivalent* to the one in the original post.
Patrick,
This claim is incorrect. The problem you posed is constructed to make it always the case that the amount of money you observe has no value in helping you decide whether opening the other box will double or halve your money. But that can never be the case with the original problem. In the original problem, it is impossible to come up with an example which specifies how the amount in the lower box is determined where the amount you observe never gives you information to help you decide whether opening the other box will double or halve your money.
For heaven's sake, how about the case of a bounded problem where the amount is nowhere near the bound???? That's the only interesting case.
The bound affects the decision whether the amount is near the bound or not. People are giving examples where the amount is near the bound because those examples are easier to understand, not because the amount has to be near the bound for there to be an effect.
This has nothing at all to do with whether we put an upper bound on the amount of money that can be in the lower box. All that is required is that whatever process determines this, the expected amount put in the lower box is finite.
In the original problem, it is impossible to come up with an example which specifies how the amount in the lower box is determined where the amount you observe never gives you information to help you decide whether opening the other box will double or halve your money.
>>
I respectfully disagree with this reasoning, via constructive proof: I, the inscrutable mathematician, inform you that the amount of money in the lower box is either $50 or $100, and that I have decided between these two alternatives on the basis of a single flip of a fair coin. In this case, seeing that a box contains $100 gives you no useful information as to whether it is the low box ($100 vs. $200) or the high box ($100 vs $50).
Now, I tend to think the phrase "never gives any information" is outside the scope of the original problem: you have randomly picked one of my two boxes and know only that it contains $100. I only have to demonstrate that you gain no additional information from the fact that "one box contains $100 and you are ignorant of my strategy for stuffing the boxes", rather than having to demonstrate you gain no information from "I can examine potentially examine either of the two boxes and know your strategy a-priori". That would be a radically different (and, depending on the strategy, likely pretty boring) problem.
Mulling it over, I think what you say makes sense. At least, I haven't found a flaw. They key point is knowing that half the high boxes are over $1 million in my hypo, and all the low boxes are below it. So if you didn't draw a box over $1 million, the probability that you got the low box is greater than 1/2 because there are a million low boxes below $1 million but only 500,000 high boxes down there. Do the other math whizzes concur?
But returning to the problem as stated, with no info on bounds or any such thing: I stand by my first post. Peeking or not, switching or not, you had a 50/50 chance of getting it right and switching doesn't change that.
This is correct but does not contradict what I said. Here if you see $50 you know you have the low box. If you see $100 you have a 50% chance of it being the low box. If you see $200, you know you have the high box. But what I wrote was "it is impossible to come up with an example which specifies how the amount in the lower box is determined where the amount you observe never gives you information to help you decide whether opening the other box will double or halve your money." In your example, the amount you observe does sometime give you information, specifically, if it is $50 or $200.
Reconsidering chrisnm's solution, I still think it correct. But my opinion that all the math talk and brackets and natural numbers whatnot are completely irrelevant nonsense and comparable to the application of Rawlsian political theory to the legal question "Is this will valid?" has been reinforced.
Kevan's problem can be restated as follows:
Here are two boxes. They have unequal amounts of money in them. Box A has $100. Would you prefer Box B?
There is no need to resort to mathematical jargon or detailed proofs to say: who knows?
I really wish I had thought of that hours ago.
So there must be valuable information in seeing what's in the box you open, at least for this example. The key is that there is always valuable info in seeing what's in the box you open.
altho you are correct that much of the "math talk" has been nonsense or irrelevant (distributions over natural numbers, measure theory, martingales, et al), "brackets" (ie, symbols) are part of the language of probability theory. your not knowing that language doesn't make it nonsensical any more than not knowing any other language makes it nonsensical. given that your question was answered at every level from the layman's "intuitive" to the professional's "formal", I conclude that your boorish response was born of frustration at your own ignorance.
So there must be valuable information in seeing what's in the box you open, at least for this example. The key is that there is always valuable info in seeing what's in the box you open.
>>
No, that is not true. It requires both that you know the inscrutable mathematician's strategy for stuffing the boxes *and*, for some strategies, that you happen to open a particular one of the two boxes. Even granting that the inscrutable mathematician outlines his strategy, seeing $100 is not useful for some strategies. Given that there is at least one strategy for which "the box contains $100" is not useful information, if the inscrutable mathematician is truly inscrutable you can do no better than this result in the general case. More generally, for any value of the box in front of you there is at least one box stocking strategy for which that value provides you with no useful information.
Its similar to the old Descartes thing about a demon trying to trick you. You say that, if you're infinitely clever and also get lucky, you'll get useful information. The proper constraint to evaluate the problem under is that the *adversary* is infinitely clever and, for the non-blind case, you're constrained to be shown $100.
>>
Here are two boxes. They have unequal amounts of money in them. Box A has $100. Would you prefer Box B?
>>
This is a knee-slappingly amusing way to rethink the problem, and thinking of it this way leads directly to the correct conclusion. I rather like it. I do not share your distate of mathematics -- part of the joy of the field is applying its many tools in sequence trying to find the angle which collapses the problem down into something more managable. Although in this particular case that wasn't something very exotic, probability distributions, expected value, and notation which you may not be familiar are all tools without which some problems are intractable.
This was exactly my restatement some posts ago.
Information has a mathematical definition, and it involves probability. If taking an action has any chance of changing a future decision, you have received mathematical information, even though you may conclude that you've received no information in some cases.
In terms of information theory, here's an equivalent. I send you a piece of paper with a number on it, zero or one. Half the time, it gets erased, and you get a bunch of scribbles. The other times, you get the bit.
Although you don't know anything more when you receive the scribbles, the act of sending you the paper did send you information. It just didn't send you an entire bit. It sent you less than a bit.
If I sent you the bit over and over again, you'd get it eventually. Similarly, in your situation, if you played the game over and over again, you'd eventually find something out by opening the first box.
According to the mathematical definition of information, that's all you need to say that opening the box gives you information.
Math-phobes,
Sorry you think that math is nonsense. Don't mistake precise terminology for jargon. If you don't understand a term, look it up.
Robert Lyman,
I'm glad you've convinced yourself that the answer is that "you don't know" without using math, but that answer isn't satisfactory for me and probably others.
If we can explain mathematically why "you don't know" is the answer, why shouldn't we? Especially when our intuitiion is misleading for other problems. This thread is filled with examples of misleading intuition.
Knowing that one box is 2x the other is not enough information to calculate the EV of switching. The EV calculation requires seven things:
* value of switching if the other box is 2x = x
* value of switching if the other box is 0.5x = -0.5x
* value of keeping if the other box is 2x = 0
* value of keeping if the other box is 0.5x = 0
* probability that you opened the bigger box given that there is $x in it = 50%
* probability that there is $x/$2x in the box = ??%
* probability that there is $x/$0.5x in the box = ??%
Without all seven pieces of information, you can't evaluate Bayes' formula, and you can't calculate EV.
In zarevitz's problem, you're missing the first four pieces of information, but it doesn't matter because you either have all seven or you don't.
If there is a probability that there is $x in the box, however, then a strategy of always switching has the same EV (knowing that probability distribution, which we don't) as the strategy of never switching.
Again, it doesn't matter, said the five hundredth way.
* probability that there is $x/$2x in the two boxes prior to opening the first box
* probability that there is $x/$0.5x in the two boxes prior to opening the first box
1) If you knew the distribution of the money in the lower-valued box (random variable X), and you saw a specific amount of money in the box you picked (random variable Z), then you could calculate whether it is better to switch or not. In his example, you switch if Z equals $50 or $100 and don't switch if Z equals $200.
2) More importantly, when integrating over all possibilities of Z, chrismn found it didn't matter whether you switched or not - the expectation was the same. He would have found the same result for any assumed distribution of X.
The paradox results because Problem 1 gave us a specific value of Z (Z=100), while Problem 2 is equivalent to integrating over all Z. In Problem 1, we can't know the answer without knowing X's distribution. In Problem 2, we know switching makes no difference no matter what X's distribution is.
Unless my (and zarevitz's, which I missed) restatement of the problem is incorrect then the langage of probability is as irrelevant as a deep knowledge of Russian grammar, and brackets as irrelevant as the Cyrillic alphabet. It's not that you can't solve this problem in Russian, it's that there's no need to, despite the insistence that "math problems" require "math answers."
Now, in fairness, it was hyperbole to call it "nonsense." Clearly, written Russian is not "nonsense," and neither are Mev's posts to one who understands his notation. But it would be annoying to encounter a post written in Russian and then be told that, while it does contain the full solution to the puzzle, it can't be translated accurately. I suppose there are times when that might be true (a post about Russian poetry?) but this wasn't one of them. So I was annoyed. And perhaps "boorish," although I think that word a little strong. For that, I apologize.
Also, in fairness, Symmetry (and perhaps others, I don't recall) did answer my main question (Which was "Why does EV fail here?") fairly early on and fairly straighforwardly, although it was not until much later that I fully understood his answer (which was: because you can't say for sure that the chances of $50 and $200 are equal, so you can't take the weighted average). I'll take responsibility for that failure: my bad. And thanks to Symmetry for cutting through the fog, even though I didn't realize you were doing it.
I have no objection to math in and of itself--I frequently deploy it (in simpler form) in arguments about, for instance, energy policy. There are certainly times when it is, as Patrick says, absolutely essential. This is not one of those cases, as was clear to me from my first reading of the problem.
Mev, I went back just now and hastily re-read all of your posts.
This may be the 500th way you've said "it doesn't matter" but it is the first time you've used the words "It doesn't matter."
game I would like to play with you. It goes like this:
1. Dealer (me) takes two boxes and randomly puts X dollars
in one box and 2X dollars in the other box.
2. Picker (you) opens one of the two boxes and takes out
P dollars. ($100 in the original problem.)
3. (Key) Picker now pays Dealer 1.25P dollars for the
opportunity to play the game. ($125 in the original
problem.)
4. Picker may now keep the money found in the first box
or open the second box and keep the money found in
that one.
We can play this game all day long and switching off
between Dealer and Picker. Want to play?
Assuming that P isn't a constant, but rather is a random variable (changing values as the game is played), then I'd like to be the Dealer.
1. I don't see what uniform distribution has to do with it. In the first scenario, you have a finite number of possibilities, and the assumption, one has to think, is that they are equally likely. How is that impossible? Seems possible to me.
Then, in the second scenario, the distribution seems completely irrelevant; the real problem is that the second pick is entirely an illusion. The fact is that the second scenario only allows one pick; Regis simply asks if that's your final answer. Changing, thus, is obviously meaningless. Distributions is a subject for another day.
2. Another way of stating the paradox with the first scenario, though, is that it still seems absurd that Box2 would somehow be more profitable than Box1. The only thing that makes it Box2 is the fact that you chose it second. This can't make it worth more. Thus, it seems, switching really can't matter.
3. How about this: The problem is with the math that proves doubling is somehow more good than halving is bad. Isn't that false? If you end up with 50 dollars, you can say you halved your money. But you could also say you'd have doubled your money by keeping the box. I can't turn this into an equation, but my suspicion is that the 1.25 figure isn't the whole story. E.g., when you're down to 50 dollars, that 100 dollars looks a lot better than it did; now you have to double your money just to break even!
I can't do the math myself, but I think we're understating the catastrophe of halving your money, and this is what makes it look like we should switch.
What we really need to know is how Kevan picked the numbers for the boxes.
Now, uniform distributions etc. are one possibility. Going to the ATM and getting a stack of $20 bills (as suggested by someone above) is another.
There is nothing that makes the more complicated "mathematical" methods more "right" than the psychological methods. There is nothing that makes the demand for more information in the "math" case (Tell me the distribtion!) more fair or reasonable than the demand in other cases (Tell me how much money Kevan had total!), as, again, someone pointed out way up thread.
Hence, my irritation with the mathematicians. Your methods have the appearance of rigor and are inaccessable even to someone like me, who has done lots of math but not much probability, yet they offer us no extra insight over the guy who points out that ATMs don't distribute $50 bills.
I agree that philosophically, it's obvious that there's no reason to prefer switching or not switching. That's not the paradox.
If you asked someone with no mathematical background, they'd tell you it's crazy to open a box and always pick the other one, and they'd be correct.
The paradox is that this problem *looks* mathematical, but the intuitive way of applying mathematics (using expected value) leads us to an absurd conclusion.
The part of the problem that I find interesting is understanding why the same mathematical tools that are normally so helpful fail us in this particular case, and my posts have addressed this topic.
Kevan gave us two problems, an explicit one (what do we do?) and an implicit one (why doesn't the math work?). Simply asserting that the answer to the first problem is that "it doesn't matter" doesn't explain why, and it doesn't give any insight into the second problem.
I've tried to explore the reasons that you can't express this situation as an expected value problem, and these reasons are interesting and require precise mathematical language.
The problem here is that some of us want to understand why the math doesn't work. It's more interesting to us than the stated problem, the stated problem is only not obvious when you start trying to apply mathematics incorrectly to find the answer.
Does anything change when you look inside the box? In the real world, sometimes it does, but these reasons are ultimately distractions from solving what I think is the interesting puzzle here. We have all those issues about the motivations of the box-filler, bounds on possible dollar amounts, even and odd amounts, ATM practices, etc., -- all the things that might make it now more or less likely that you have the low amount rather than the high amount based on the fact you saw $100 rather than $17.31 or one billion when you looked in the box. But put those issues aside and assume that there's nothing about the amount itself that clues you into whether you have the low or high amount. You are still in a world where the expected value of the two boxes is equal. If you give up your box in favor of the other box, you are trading items of equal value in expected value terms.
Here's where the puzzle misleads: Having seen $100 in one box, it becomes possible for the first time to determine two possible expected values for the two boxes. 75% of the higher amount (and hence the expected value of both boxes) is either 150 each or 75 each. But does that mean that the second box's expected value can be derived by simply averaging those two amounts? The answer has to be no, for the same reason that the value of the box you are holding is not the average of 150 and 75.
Remember that you only got those two possible expected values for the two boxes after you looked at the money in the box you are holding. You engineered those amounts yourself based solely on the amount in your hands, but yet if you average them, you get a number bigger than what you hold in your hands. This will always be true for any amount you see in the box -- if you figure out possible expected values based on that amount and average them, it is always higher than what is in the box. But of course it can't be the case that every box is a loser.
The averaging produces an upward skew due to the effects of doubling. I cannot explain it in mathematical terms but here's the intuition: The assessment of every game you try to play is distorted by a phantom game that is not actually on the table. Because doubled amounts do more to averages than halved amounts, the "false doubling" phantom game generates distortions that systematically outweigh those of the "false halving" phantom game.
Trying to use EV analysis on situations that aren't random according to a strict definition of random is like trying to divide by zero."
This is a point Mev and other math-types have said repeatedly, right from the very first comment, so I take it it's true. But my difficulty with this is, randomness is not (often) an inherent feature of the universe. It's just our lack of knowledge as to causes. So if you don't know the probability of whether you're in a 50/100 game or a 100/200 game after opening the first box, why isn't it correct to say it's 50-50? Sure, it's not *really* 50-50; it's either 1 or 0. Mev says that's like assessing the probability that circles are square or true statements are false. But I'm missing the internal contradiction. I understand that there's a proof above that you can't have a uniform distribution of probabilities over all integers or all real numbers. But we're not dealing with the ex ante situation of all integers, we're dealing with 2 sets of 2 integers each. Why is the probability that your integer is part of one set as opposed to the second, and only other, set indeterminate?
If anyone wants to take a last go at clearing that up for me, I'd appreciate it. Otherwise, thanks all, interesting problem.
"Which should you choose?"
The operative word is "should," is it not?
For that reason, I see this not as a mathematical question, but as a philisophical one.
If your only motivation is to increase the amount of money you had (before the money box offer came your way) by some indeterminate amount, then there is no compelling reason why you "should" switch, since you don't care whether you increase your money by $50, $100, or $200; only that you increase your money.
In other words, you've got nothing to lose, but have assigned no relative value to gaining $50, $100 or $200, so why switch?
However, if you have a predetermined minimum amount that you want to acquire through the money box offer, that amount will guide your decision to switch. In this case, you may need to start doing some of that complex math seen above in this thread, to determine the probability of meeting your preset goal.
The money box scenario involves no stakes -- the game is played for free. One only thinks there are stakes because one thinks they may "lose" part of their initial gain. But $50, $100, and $200 are all greater than the $0 you started the game with, and you paid nothing to play, so it is all net profit regardless.
Personally, I am not a gambler. If someone gave me $100 I would say "Thanks, see ya!" and the game would be over. Having not set a predetermined minimum gain for myself, I am satisfied with the $100 and see no need to try to double it at the perceived "risk" of halving it. I could also quickly see, with the simplest of arithmetic, that given a presumably equal chance of doubling or halving my $100 (all math theory aside), I am already holding the 2nd best of the three (theoretical) alternatives, which is good enough for me.
Returning to the motivational aspect of the question, that is, "should" I switch:
If I halve my $100, I still come out ahead of where I was to begin with, but is that important to me? How important? How important is it to me that I double the $100? My own personal circumstances and my own valuation of money guides that answer.
So, the answer is that it is completely subjective.
There may be a mathematical way to determine the liklihood of the various outcomes (or not, depending upon which arguments in this thread you agree with), but in the end the math is irrelevant when placed into the real-world context of the initial question's "should" I switch.
Should I switch? Should you switch?
There's probably two different, equally correct answers to those questions, and they have little or nothing to do with math or probability, and lots to do with how you and I respectively value money, view gambling, or which bills we need to pay this week.
But it doesn't. If you use expected value in the wrong way you get the wrong conclusion. The subtlety in the problem is that expected value calculation has to be made between the initial box selection and and choice to switch boxes.
The expected value upon initial selection is either $150 or $75 depending on whether or not the $100 box is the big-money box or the small-money box.
Post-switching the expected value is the same because half the time you will be switching from the big-money box to the small-money box and the other half of the time you will be switching from the small-money box to the big money box. This has an expected value of either $150 or $75 depending on whether or not the $100 box is the big-money box or the small-money box.
This holds true for all values of the box you open.
The interesting thing about the problem from my point of view is that there is probably some general principle about misapplying EV lurking around in the problem which could be used to explain EV better. But I'm not sure how I would formulate it.
For instance, if the numbers in a set continue to double, the numbers in the set form a non-linear graph. In fact, they plot out asymptotically. Surely that curve has an effect on both the probability function and the intuitive value of doubling vs. halving??
I really don't know. That's why I prefer the philosphical approach! :)
I'm one of those people, as my posts show. I asked repeatedly why EV doesn't work. But it turns out you're wrong: the answer doesn't require precise language or formal proofs. The answer is: we don't know the probabilities of each of the two possible outcomes because we know nothing about how he picked the amounts in the boxes (coin flip? Whatever was in his wallet at the time? Oil prices? Nuclear decay?). Thus, we can't apply the EV forumula because we're missing two key numbers. Ta-da! Done, with only sloppy layman's language (some people got there way, way before me, of course.)
Our tools, in other words, don't actually fail us at all, they're just missing a key ingredient. You don't need to be an expert on electrical engineering to figure out that your TV doesn't work if it isn't plugged in. And it would be pretty weird--and perhaps a tad frustrating--to encounter someone citing Maxwell's equations (and insisting on their absolute indispensibility as the "language of electricity") when the answer is "plug it in."
Again, I didn't mean to be "boorish," so I'm sorry if that's how it comes across.
That's not correct. Expected value, by definition, is a real number (and specifically not a random real number). It can't be "either $150 or $75".
The reason you're resorting to language like "either $150 or $75" is because the mathematical notion expected value is undefined. The expected value formula requires seven variables to be known:
* value of switching if the other box is 2x = x
* value of switching if the other box is 0.5x = -0.5x
* value of keeping if the other box is 2x = 0
* value of keeping if the other box is 0.5x = 0
* probability that you opened the bigger box given that there is $x in it = 50%
* probability that there is $x/$2x in the two boxes prior to opening the first box = ??%
* probability that there is $x/$0.5x in the two boxes prior to opening the first box = ??%
Without all seven pieces of information, you're not calculating the expected value. You're calculating something else. I'd compare this entire thread to the following discussion:
Kevan: I bet you that someone here will say that 2 = 1.
Someone: 2 = 1!
Everyone: That's silly.
Someone:
1) a = 1
2) a = b
3) a^2 = ab
4) a^2 - b^2 = ab - b^2
5) (a-b)(a+b) = (a-b)b
6) a+b = b
7) 2 = 1
Some People: That proof looks good to me. I guess 2 = 1. Weird.
Robert Lyman: That's silly. It's not true that 2 = 1, and I don't need math to know that.
Mev: That proof is wrong because you can't cancel the term (a-b) in line 5 because it's equal to zero.
Some More People: Math sucks. You're not providing any insight with your jargon.
*sigh*
Stop being so irritated by everything.
The confusion people seem to have is that if we don't know anything about how Kevan picked the contents of the boxes, then we want to think that the probability that we have the bigger box is 50%.
It's not. The probability *doesn't exist*, and that's not an obvious conclusion. The reason that it doesn't exist is that a probability has a mathematical definition that has certain requirements, and there is no way to use the term "probability" in its pure, mathematical sense to describe this problem.
You can argue until you're blue in the face with someone who believes that the probability is 50%, but the only way to resolve the disagreement is to look up the definition of what probabilities and probability distributions are.
If your explanation is so obvious, why doesn't everyone agree with you? They don't agree with you because it's not obvious.
So, let other people describe it in a different way that some people who don't like your argument might like better. The *only* reason that your explanation is correct is that there is solid mathematics backing it up.
Now you have well and truly lost and confused me. I am offered two boxes. One of them has more money than the other. I pick one. The odds that I have the bigger box isn't 50%? It's actually undefined? Is that a typo?
I can believe that, in Kevan's problem, the odds that the other box contains $200 vs $50 is not 50/50 (and is in fact undefined), because that depends on his selection method, which might involve random distributions or might involve mugging Eugene Volokh and using the money he gets from selling his wedding ring.
But I'm pretty sure if I just grab a box, that's 50/50, bigger/smaller. Right? If not, then I give up.
value of this game?" Before opening the first box, there
is obviously no way to compute the value of the game. After
opening the box with $100, you can compute the value of the game.
If you think that it doesn't matter if you switch to the
other box, you are in effect saying "The value of this game
is $100" because switching to the other box will result
in no net change in value. If, like me, you think that the
value of the game after opening the first box is $125, you
must take the money in the other box.
The key point is that opening the first box DOES give you
information. It tells you what the value of the game is.
To calculate expected value, you need four probabilities:
* probability that you'll find $100 if the boxes contain $100/200 (50%)
* probability that you'll find $100 if the boxes contain $50/100 (50%)
* probability that the boxes contain $100/200 (Pa = undefined)
* probability that the boxes contain $50/100 (Pb = undefined)
Now, EV of switching = +$100 * (50% * Pa) / (50% * Pa + 50% * Pb) - $50 * (50% * Pb) / (50% * Pa + 50% * Pb)
People now think, well... Pa = Pb, so I'll just cancel Pa and Pb from the equation, so I get EV = +$100 * 50% - $50 * 50%, and EV = $25.
The 50%s in the new equation *look like* the probability that you picked the bigger box, but they're not. In fact, you *can't* cancel Pa and Pb because they don't exist.
My first post proved that it's impossible for Pa = Pb. In other words, there cannot be a uniform distribution.
I'm no Mev, but let me take a crack at the "indeterminate probability" problem:
What is the probability that my first daughter will be named Alice?
How could you begin to calculate that? If I gave you some information about my selection method (say, I intend to name her after one of her grandmothers, but I'll flip a coin as to which) you could figure it out (in this case, zero, since neither grandmother is named Alice). Or, you might adopt some reasonable assumption (I am the "average person" and I'll pick her name with the same probabilities as are reflected in all the girl babies born in the world, or in the US, or in some particular state).
But I think without any further information or assumptions, the probability that my first daughter will be "Alice" must be undefined.
Mev can tell you if that's right or not.
Let's change the game slightly. Kevan gives you two boxes. One is empty and one has 100 dollars. He tells you to open the box with 100 dollars and then he tells you that he is going to put 50 or 200 into the other box and then give you the choice to switch. Now, you can use all of your expected value analysis and you should switch.
But, that is a different game. You know which box you are starting with. In the first game, you don't know and that makes all the difference.
To change the topoic slightly, here is a similar paradox. I tell you that at least one of my two children is a boy. What are the odds that both are a boy? Now, I tell you that the older one is a boy. The odds have changed because of the change in information.
it was, so I'll see your apology and raise you one.
when I first encountered the monty hall problem, I used "verbal reasoning" instead of the formal analysis (of which I am quite capable despite decades of removal from any serious application) and got the wrong answer. you drew your conclusions from the numerous "hand-waving" posts, I drew mine, viz, the formal analysis is always best if you want to get the right answer. it forces you to accurately model the problem, helps organize your steps, and the results are explicit so critics can challenge the model, assumptions, formulas, calculations, etc.
for example, "you can't say for sure that the chances of $50 and $200 are equal, so you can't take the weighted average)." I'm not entirely sure what this means, but I think it is, if you'll pardon the expression, "nonsense". as I tried to point out earlier, the reason the early EV analysis failed was because it computed the wrong quantity, the right one being a conditional EV rather than an unconditional EV (sorry for the jargon, but even without knowing the exact meaning I think one can get gist of the concept from my earlier comment). the actual numbers ($50, $200) are irrelevant and an EV, conditional or not, is a weighted average, which I "computed" with no trouble whatsoever in the comment.
IMO, spending a relatively modest amount of time trying to understand my formal (loosely speaking, of course)formulation would have been much more profitable than participating in another batch of hand-waving exchanges. the majority of the issues raised in the comments are addressed in the four lines it took to write out the (very simple) equation. the remaining ones were mostly about initial distributions on D, a red herring (unless I'm missing a deep subtlety buried in piles of obfuscation).
It is 50/50. If you pick a box under the rules, you do in fact have a 50/50 option before you. Given the hundred, you know that the other box is either 50 or 200. You have no information beyond that as to which. If anything is 50/50, that is 50/50. There are no infinite options here. It's two options, with an absolutely equal chance of it being either.
This IS the meaning of probability. When I flip a coin, we say it's 50/50. In truth, it's not really; how it lands depends on how hard I flip it, the wind, gravity and whatever else. A perfect computer could figure it out. But it hasn't, and we don't know. That makes it 50/50. I don't think there's any difference with the situation here.
The difference between the scenarios is that in the first, I'm given a 50/50 chance of either doubling or halving my 100 dollars. In the second scenario, I'm not given any choice at all; it's illusory. The confusion arises, though, over the idea that a 50/50 chance of doubling or halving should somehow translate into equal dollar values on a case by case basis. It doesn't.
What if the scenario said that one box is worth a million times as much as the other. You open the box, and it's a hundred dollars. Wouldn't swapping boxes be a no-brainer?
Clearly you would, because the risk of going from 100 to an infintesimal fraction of a penny isn't actually a big loss, whereas the potential gain is extraordinary. Who wouldn't change boxes if they got that hundred dollars? It would be insane not to. This has to relate to the limit on halving, as opposed to the exponential growth of doubling to infinity.
So what do you say? Would a rational person double in the above scenario? What if they said the one box has 1.1 times more money than the other? Perhaps the rationality of swapping increases all the way up that spectrum.
If you say they're ridiculous and restrict yourself to English names used in the past centery, it's an easy problem (once you know how many names there are).
If you really allow any name (just like if you really allow any amount, like a $10^100), then the probability is undefined. The probability of any one name must be zero, by symmetry, but how can an event with probability zero happen?
It can't, and the reason it seems to occur is because not all outcomes are equally likely in reality, and some outcomes have non-zero probability (making them more likely than others).
No boxes, no unknown quantities, just doubling and halving a known quantity:
On day one, I give you two cents.
On day two, there is a 50% chance that I will give you 4 cents, or cut in half the money in your pot.
On day three, there is a 50% chance that I will give you 8 cents, or cut in half the money in your pot.
On day four, there is a 50% chance that I will give you 16 cents, or cut in half the money in your pot.
It continues thus, and the game must end after 30 days, but you can choose to stop it at any time.
You could potentially end up with $0.02^30, or $21,474,836.48 or you could end up with one cent.
The probability of these two outcomes is the same (ignoring the fact that you can't halve a penny):
P(30 consecutive doublings) = P(30 consecutive halvings).
On what day should you stop playing the game?
Scenario Two:
Same game, but instead of exchanging the amount in your pot each day for double or half, it is cumulative, so that at the end of 30 days, you could potentially end up with $0.02^30 + $0.02^29 + $0.02^28 + $0.02^27 + $0.02^26 + $0.02^25... = about a zillion bucks.
How does this affect your decision of when to quit?
I re-read your first comment, and I'll take your word for it that it solves the whole problem in 4 lines. I surely understand the need for formal analysis, but there is also a need for sound non-formal understanding. Have you ever encountered an earnest undergrad who does a series of careful calculations and then comes up with a completely ridiculous answer because of a minor error? If not, have a look at the early EV calculations, which are impecable as to their arithmatic but deficient in their underlying understanding.
I've not only encountered such people, I've been one on enough occasions.
Here's what I meant when I said "you can't say for sure that the chances of $50 and $200 are equal, so you can't take the weighted average":
Expected value of second box= Pa*(200) + Pb*(50).
Where Pa and Pb are probabilities.
You don't know what Pa and Pb are. They total 1, but that's all you know. So you can't use the EV formula. Period. Mev says Pa =! Pb, but that's surely not true Kevan tells you put $100 in the first box and then flipped a coin.
Clealy, my sentence was inartfully phrased. I should have said "you don't know what the probability of each outcome is, so you can't apply EV."
Now, given some information (like the distribution, or Kevan's method, whatever it might be), you could calculate those probabilities. Without them, however, your EV equation is correct, but impotent.
Your TV is not plugged in.
Also, I agree entirely that mathematical arguments (correctly) don't carry any weight with non-mathematicians because they can't verify the argument. In reference to an earlier post, I couldn't verify an iron-clad argument in Russian either.
I'm a fan of non-mathematical arguments... It's just I personally need the math to feel better in this particular case.
Think of it as a security blanket for an electrical engineer who keeps his TV plugged in at night so that Maxwell's equations don't start crawling out of the wall.
Everybody who knows that by "in general," Mev means "always and everywhere, without any possible exception," rathar than "most of the time, usually, but with some exceptions," please raise your hand.
(surveys hands)
Hmmm...looks like everyone who stuck around this long is a math geek, so no problem.
But that is probably the #1 example of how math people's language confuses others, since the two meanings are practically opposites.
Kevan gives me a box. It contains a goat worth $1. He points at another box and says "This other box contains either 50 cents or a million dollars, would you like to switch?" And if I say yes I get a box that contains 2 quarters and a lottery ticket. And I congratulate myself for buying the lottery ticket with an EV of $1,000,000 / 4,000,000, or 25 cents, for 50 cents (one goat less 50 cents change) instead of the dollar I usually waste on it?
(And my kids say "But DAAAD! We'd really like a goat as a pet!")
I know the million-dollar lottery ticket is worth 25 cents (or whatever the odds tell me), and therefore the ticket plus 50 cents is worth 75 cents, and under those rules I should keep the one-dollar goat.
Now he hands me a box with $50. He says there is a chance p (p in 0 to 1 inclusive, being a probability) that this other box has $50 in it, and a chance 1-p that this other box has $200 in it. I quickly conclude that for any value p, p * 50 + (1-p) * 200 = 250 - 200p >= 50, and I switch. I can't lose and I might win.
Finally he hands me a box with $100 and offers to trade it for the same kind of other box. I think "For some values of p, 250 - 200p < 100, and for other values > 100. If p < 0.75, the EV of the other box is > 100, and if p > 0.75, EV < 100. Do I have any information about the value of p?
I think "I got here by flipping a coin, q(heads) = 0.5", and I wrack my brain trying to find the connection between p and q.
"50Pa+200Pb" isn't meaningful (there's only D and 2D) and you (well, I) most certainly do know what the probabilities are for the meaningful computations I showed. after the choice of D has been made (however that's done - choose your favorite distribution that fits the logical constraints), there is only one source of randomness left: D is either in box A or box B. since the player doesn't have any way of knowing, he has to assume it's equally likely that it's in either. thus, the a priori probabilities it's in A or B are equal, viz, 0.5. since the other box must have 2D, the probabilites for that event are also 0.5. since opening the chosen box conveys no information about what D is, the a posteriori probabilities are the same. QED.
it's now to the point where all I can do is repeat myself. you profess to not know probability theory; I profess that I do based on having studied it in grad school, written a PhD thesis using it, and worked for 30 years in a profession in which it is a core competancy. granted, none of that proves anything - it just might alter your "trust" probabilities. but trust me, don't trust me - makes no nevermind to me.
over and out.
1) The boxes are stuffed before you get to choose. By symmetry it is irrelevant what you do. Note that in this case the probability of getting more money by switching is not 50% so the expectation argument fails.
2) The second box is reseeded after you choose your first box. Then one can guarantee that there is a 50% chance of increasing your wealth in which case the expectation argument goes through.
The resolution to the paradox of why you get two different answers is simply that the two assumptions in cases 1 and 2 are incompatible. The only distribution that gives both a 50% chance of increasing your wealth after the first choice, and can be assigned before the first box is chosen corresponds to all boxes always having $0. As the problem is stated situation 1 attains, implying that switching boxes is irrelevant.
The mathematical answer that there is no distribution over the natural numbers is a nonsequitor. While true it does not answer the problem. The problem is well posed and can be physically be implemented. Just because the problem will not succumb to your favorite mathematical tool does not mean it is insoluble.
Variation 1
The planet Marsiter is made up entirely of only 2 kinds of rocks: red rocks and blue rocks. Blue rocks are worth $1,000. Red rocks are worth $1. In my hand, I have a closed bag containing one rock, selected randomly from all the rocks on Marsiter. I don’t know what color rock is in the bag. I offer to sell you the bag for $400. Do you accept my offer?
Variation 2
Same as above, except I now tell you that 99% of the rocks on Marsiter are red. Do you accept my offer?
Kevan’s problem is like Variation 1. The people doing expected value calculations falsely assumed Kevan’s problem is like Variation 2.
We repeat this a few times, and I keep switching, and I've got $25 times the number of games played.
After a while we get tired of boxes, and he says "Flip the coin, heads I give you $50 (net I gave him $50), tails I give you $200 (net $100 to me)" and this doesn't change the way it works. And then he says "I don't have to hold your $100 each time, just flip the coin, heads you give me $50, tails I give you $100." And I'm still making $25 per round.
The next day, he's got two rows of boxes. The each box in the first row has $100 in it. Each box in the second row has either $50 or $200. I'm to pick up a box from the first row, observe the $100, put it down, and keep the contents of the corresponding box in the second row.
I say "That's stupid, just put either 'You owe me $50' or 'Here is $100' in the second boxes, and don't even bother with the first boxes" so he sets it up that way.
There are 10 boxes in the second row, so I give him $250 to play the game on every box, and I go down the line, and each time I owe him $50, and he tells me "I never told you there was a non-zero chance of $200 in the second row!"
Why is the original puzzle closer to the second day than the first day?
Seeming Paradox:
A) If you switch, you have a 50% chance of $50 and a 50% chance of $200. This averages to $125 (1.25 x $100).
B) Therefore, you should should switch.
C) Since the logic in statement A) had nothing to do with my actually seeing $100 (as opposed to $1 or $1000 dollars), I should therefore switch regardless of what I see.
D) That means I should switch even if I haven't opened the first box yet.
E) That also means I should switch back again if offered to so and so on and so on.
Where is the fault in this sequence?
The problem is that it is impossible to come up with any method whatsover for determining how much is put into the lower box where both statements A) and C) are true. For some methods of determining how much is put in the lower box, when you see $100 you should switch precisely because doing so raises your expected payoff to $125 precisely because, conditional on your having seen $100 in the box you opened, you have a 50% of the other box having $50 in it and a 50% chance of the other box having $200 in it. That is, it is easy to posit a method for determining how much is put in the lower box where statement A) is true.
But in any such example statement C) is false. For any method of determining how much is put into the lower box, if you want to switch when seeing $100, there will be other amounts you could have seen which would make it so you shouldn't switch. That is, that you want to switch does indeed depend on your seeing $100.
Thus the line of reasoning is false. No paradox.
Are you saying "If you know the other box contains either $50 or $200, and you're given the choice of switching or keeping a box that contains
$100$500, of course you would keep the $500?Yeahbut, since I know the rules of the game that I am playing, all I have to know is that there is some amount of money in my box, call it N.
(And yes I understand the symmetry argument, I'm just trying to understand why the EV analysis doesn't work. I thought I understood it when I stopped reading these comments yesterday afternoon, but then I couldn't reconstruct it this morning.)
Before you make any pick at all, you have a 50% chance of getting X, and a 50% change of getting 2x, for an expected value of 1.25x. (Where X is the lower amount.) On your first pick, the only new information you have is that EITHER X or 2X = $100. That doesn't change your expected value in terms of X.
(God help me, I hope I haven't just confused matters more and set off another hundred posts.)
Suppose a dog has 4 puppies and each puppy is a 50/50 independent coin flip of being a boy or girl. Is it most
likely that the litter will be 2 boys and 2 girls?
Her answer would be, "No, it is most likely that the litter will be 3 of one sex, 1 of the other."
This is both correct and incorrect. If the event space is (all four same sex, 3 of one 1 of the other, 2 and 2) then she is correct since the probabilities of these events are, respectively, (1/8, 4/8, 3/8) and 4/8 is the highest of this list.
On the other hand, if the event space is (4 male, 3 male 1 female, 2 and 2, 1 male 3 female, 4 female) then the probabilities are (1/16, 4/16, 6/16, 4/16, 1/16) and thus
the most likely is 2 and 2.
Of course it is not likely to be 2 boys and 2 girls, since young male puppies and young female puppies aren't called by those names. But to restate it, "Most likely" isn't meaningful without knowing "among what" (that is, what you call correctly "event space".) Related favorite epigram: "The vast majority of people have more than the average number of legs" -- where "average" means "arithmetic mean", which would be something like 1.99, or, if we define legs so that the answer is integral (a partial amputation is either a full leg or no leg, by some definition) "Nobody has the arithmetic mean number of legs".
And in fact some people think Mendel cooked his books, because the chances of his peas coming out exactly the way genetics predict is much smaller than the chances of his peas coming out in any other possible distribution, even if you can say with certainty that for N peas it is 65% likely that the number of white flowers will be between N * 0.5 - s and N * 0.5 + s.
But that's because I was searching for "boy" to review Mike Keenan's
To change the topoic slightly, here is a similar paradox. I tell you that at least one of my two children is a boy. What are the odds that both are a boy? Now, I tell you that the older one is a boy. The odds have changed because of the change in information.
In the second choice, the possibilities are (always listed older,younger): boy,girl and boy,boy and always assuming the chances for any given child is 1/2 the answer is 1/2.
In the first choice, out of the entire space of two-child sets, namely (b,b),(b,g),(g,b) and (g,g), the probability of each being 1/4, [and the probability of one-of-each is p(b,g) + p(g,b) or 1/2], we know, having paid attention to Bayesian stuff more than the first time we heard this, that the equally likely possibilities are (b,b),(b,g) and (g,b) and in only 1/3 of those is it b,b. But it still feels wierd.
There is a similar partitioning-of-the-possible-outcomes paradox involving 3 coins, so that the answer to "what are the chances that they are all the same" appears different if you flip 2 and look and then flip the 3rd versus flipping all 3 together. (I don't remember the wording.)
So EV = 1.25X, and I don't know what X is, even though the only possible values of X are 50 and 100?
Repeating, hoping to believe it, I don't know what X is...
What I am saying is this: We know before we open the box that we had a 50% chance of choosing the lower box. Once we
open the box and see what's in it, however, that becomes irrelevant. What matters now is the probability that we have the low box conditional on having seen $100 . Somehow we all want to say that having seen the $100 is irrelevant so we should still believe 50/50. And it is not hard to come up with examples where having seen $100 you still believe you hold the low box with 50% probability. All you need is the process which determines how much is in the low box puts $50 in the low with the same probability as it puts $100 in the low box. But the very first comment in
this now very long thread points out that you can't do this for all amounts. You can have a process determing how much is put in the lower box which gives the same probability to $50 and $100 and the same probability to lots of other (x/2,x) combinations, but you can't have a process which puts the same probability on all (x/2,x) combinations. It's impossible. And so there have to be some x's where if you observe that x it is not 50/50 that you have the low box since the probability that x/2 was put in the low box is not equal to the probability that x was put in the low box.
What do you mean by "process"?
I have two children. I have written the sex of the older on a piece of paper in my pocket. What is the chance that they are both boys?
1/4 to both your questions.
Some method for determining how much is put in the lower box. (The upper box gets twice that amount by assumption).
An example would be "I flip a coin. If it is heads, I put $50 in the low box. If it is tails, I put $100."
Another would be "I get a computer to give me a random number distributed uniformly between zero and 1. I multiply that number by $1000 and put that amount in the lower box."
My point is that somehow the amount that gets put in the lower box gets determined and it is impossible to come up with a method for how it is determined where the amount you see when you open your box is always irrelevant.
Let's face it. We both need paradox discussion therapy. :-)
AppSocRes,
!! ALERT -- SEVERE MATH JARGON AHEAD -- ALERT !!
A Dirichlet process is a process that generates a random probability distribution, and the distribution is randomly generated from all possible distributions according to a separate probability distribution.
I warned you. It gets worse.
You can use this model to evaluate the expected value of switching in the absence of a known distribution for what Kevan's putting in the boxes.
The problem is that you need a base distribution for how Kevan fills the boxes to get started, and you have no reason to pick any one distribution over any other (mathematically, though not perhaps philosophically). The first time, therefore, you still can't evaluate the EV equation for switching without making some assumption about the distribution Kevan is using.
The second time, though, you can improve your understanding of the random distribution that Kevan is using because you know it's less likely that he's using a distribution that make it unlikely for you to have seen what you saw in the first round.
You can continue this process, and hopefully, you'll converge on Kevan's actual distribution by making your Dirichlet process more and more likely to generate a distribution that would have produced what you've seen so far.
The ultra-simple non-mathematician formula is:
EV = Pa*(A) + Pb*(B)
Here's the key problem: how do you know that the chance of $200 and the chance of $50 is 50/50? If Kevan tells you he put $100 in box A and then flipped a coin, then you know. But what if he puts $100 in box A, then rolls a die, putting $200 in box B only when it comes up 1? Then the EV is different. What if he puts $50 in a box and then flips a coin? What if he just checked his wallet, saw that he had $332, and decided round numbers looked best?
Pa and Pb are unknown and can't become known without knowing how Kevan decided what to do. And all of the above possiblities, and many others, are consistent with Kevans actual problem statement.
Or as I put it above (far above): Here are two boxes. They have unequal amounts of money in them. Box A has $100. Would you prefer Box B?
The fact that the degree of unequal-ness is constrained to be half or double doesn't actually help you out any.
At the time of the choice to switch you presently have either the x box or the 2x box. If you choose to switch, 50% of the time you will switch from x to 2x. 50% of the time you will swith from 2x to x. So after the switch you have a 50% of having the x box and a 50% chance of having the 2x box. This has an EV of 1.5x.
The EV is the same for switching and not switching. Therefore there is no reason to switch and no reason not to switch. The two choices have an equal EV.
Robert,
You're right, but when we're talking about practical grounds for decision (since you have to decide whether to pick or not), isn't it rational to assume that an unknown decision method which selected among two possible options gives a .5 probability to each?
First, I'll define a couple of scenarios more precisely, so we do in fact have 50-50 underlying probabilities:
I. Kevan gets $3x. He puts 1x in box 1. He flips a coin, and depending on the result, he puts either 0.5x or 2x in box 2. He gives you box 1, with the option to swap it for box 2.
In this case, the EV of box 2 is indeed 0.5*(0.5x)+0.5*(2x) = $1.25x. You should swap.
(There are exceptions where nonmathematical considerations override this. If I find $1,000,000 in the box, that's enough to retire on right now and spend the rest of my life commenting on blogs with none of those pesky interruptions by my employer. Half of it isn't enough. Doubling that wouldn't give me enough additional enjoyment to be worth the risk.)
II. Kevan sets up the boxes as above. He flips a coin, and depending on the result, he gives you either box 1 or box 2.
Let's call the box he gives you box A, the other one box B, and the lower of the values of the two boxes 'y'. You know box A has $100, but you have no idea whether this is y or 2y. Then the expected value of box A is 1.5y, and so is the expected value of box B. Play this game a million times, and whether you hold or swap it will come out the same.
But what about the EV calculation? The random events in this scenario happened before you arrived. Either x=y or x=2y, and box A is either y or 2y, and these two events were independent. The results of choosing to swap or not are not independent of the previous events, so you have to evaluate conditional probabilities (which are either 0 or 1) for four equally likely possible cases crossed with your two possible choices.
1) x = y, and box A is box 1: Holding gives you x, swapping gives you 2x.
2) x = 2y and box A is box 1: Holding gives you x, swapping gives you y = 0.5x.
3) x = y and box A is box 2: Holding gives you 2x, swapping gives you x.
4) x = 2y and box A is box 2: Holding gives you y = 0.5x, swapping gives you x.
EV for holding is (1+1+0.5+2)x/4 = 1.25x
EV for swapping is (2+0.5+1+1)x/4 = 1.25x
Which means you did get that 25% increase in expected value - but it came from Kevan's coin flips, not from you making a choice afterwards.
Maybe, but you still don't know that Kevan put $100 in the first box, you just know that he put $100 in one of the boxes.
In fact, you don't know that Kevan made a "decision" between half and double at all. He might have, as I said, merely consulted his wallet and tried to use up its entire contents, in which case his notion of a "choice" between half and double is a deliberate head fake in the statement of the problem.
I agree that the conditional EVs of switching and not switching based on knowing that the smaller box contains $x are both $1.5x.
In fact, it's $1.5x for any value of x you choose, so the EV of the two strategies is always equal for any distribution Kevan may select. This analysis matches your intuition.
The problem comes in when you open a box and think that the EV of switching is now greater than the EV of not switching, and you know you would have come to this decision no matter what you saw in the box.
That conclusion is clearly wrong, and the reason is that the conditional EV of switching based on the contents of the first box and no addition information is undefined, as is the conditional EV of not switching.
Note that the difference is that you can calculate the condition EV based on knowing the smaller of the two boxes but not the conditional EV based on knowing the contents of a random box.
Kevan, it's your show. I'd like to hear your answer.
Robert: but on the information presented to you, you have one of two possible options, and no information about the process which went into determining them. All I'm saying is that for purposes of practical reason, it makes sense to treat that as random.
Right. I'm now going to make like Bush should and get out of this quagmire. :-)
Make whatever assumptions you want. Your result (switch) is perfectly correct for your assumptions (that it's 50/50, 200/50).
Just know that Kevan didn't say that, and as you no doubt know, not saying something (like "imminent threat") can matter as much as what you do say.
My point was that you have to be given a non-random distribution in order to come to the conclusion that you should switch. Since it was not specified, you don't know it and cannot assume the distribution is going to make the values $200 and $50 equally likely after your first choice.
The only thing that made any major difference was picking again on odd numbers and when you had picked one cent.
So empirically changing your mind doesn't help.
Folks need to think before reflexively applying an EV. This is not a probability problem - for two reasons. 1) The set is discrete; you have x and 2x. Knowing that one of those two is $100 does not create an additional variable - 1/2x. 2) The problem is not based on repitition. Repitition creates a distribution. If it were repeated, then we could talk about "normal distributions" and relative probabilities (i.e. P(a) and P(b).)
So think of it in simple terms (ignore what you think you know for a minute). I tell you I have a set with two elements: x and 2x. I tell you that one of these elements is equal to 100. Then I ask is 100 the larger of the two? You respond, "I have no frigging clue!" And right you would be to do so - because by telling you that either x or 2x is equal to 100, I have told you nothing.
Hence, switching is irrelevant. There is an information gap.
Steve Holt!
And now I'm really out of this, I swear!
Therefore, we have gained information when we see any specific amount of money in our box - and the assumption that we have picked the correct box with Prob=50% no longer holds.
Setting aside the repetitive nature of these comments, we certainly apply probability and EV to single events. All events are unique, and all events are members of some class of events.
If the problem were "Kevan has two boxes, one containing a cherry pit and one containing $1,000,000 -- he offers you the one-time chance to pick one and keep the contents for a certain price, do you take him up on it?" then (setting aside aversions to gambling and filthy lucre, it's a metaphor for a choice you have to make, like do you accept the proposal of marriage) the answer depends on comparing the price to the expected value. It's OK to consider the non-linearity as markm, 7/13 7:27pm.)
When I get to my desk I'll work through the event tree. Meanwhile I'm thinking "Kevan has two identical boxes: one contains the word 'winner' and one contains the word 'loser'. Also, one of those two boxes contains $100. You pick a box, what are the chances (as in, would you bet a nickel for a chance to win a dime) that your box (or the other box, it doesn't matter, and that's why I know my intuition is wrong) contains the word 'winner'?)
OK, you pick a box, and learn that it contains $100. Now what are the chances that your box contains the word 'winner'?
Now we change the prizes, the 'loser' box gets you a certain prize and the 'winner' box gets you a prize worth twice that much. You don't get to keep the $100 if your box has it, it's just for show. You know nothing about the prize, but as an abstract thinker you know the game is worth 1.5 prizes, and you refuse to pay until you have a better idea what 1 prize is worth.
Now what if you learn that there are only two types of prizes, either 'loser' is $50 and 'winner' is $100 (Type I, also known as 'x=50'), or 'loser' is $100 and 'winner' is $200 (Type II, also known as 'x=100'). So you know the game is worth either $75 (if it's Type I) or $150 (if it's Type II).
I think I understand (but I haven't taken my philosophical shower yet, during which I can simultaneously become clean and confused and late for work) -- You don't know whether the game is worth $75 or $150. And knowing that one of the two prizes is $100, knowing that 50% of the prizes are $100 doesn't help you distinguish whether it's a Type I game or a Type II game.
But I still want to do EV (and I don't mean Eugene Volokh.)
Common folks, this is a blawg. Even as the puzzle part of the blawg, it's not about the math, it's about the making of convincing and persuasive arguments. And a lot of us aren't getting convinced.
information gap." Yes, there is an information gap, but
it does not follow that switching is irrelevant. If you
are to participate in this puzzle, you MUST choose. You
must choose between (A) keeping the $100 or (B) taking
the money in the other box. In order to choose between
(A) and (B) you must put values on these options. We
know (A) is worth $100, but what is the value of (B)? We
can calculate the value of (B) if we know the relative
probability of the boxes being 50-100 or 100-200. If the
probability of the boxes being 50-100 is greater than
2/3, then we should keep the $100. If the probability of
the boxes being 50-100 is less than 2/3, we should switch
to the other box. In order to choose between (A) and (B),
we MUST estimate or guess what that probability is. Many
people have assumed a 50%-50% probability and this seems
like a reasonable guess, absent any other information. If
you say that switching is irrelvant, you are in effect
saying the probability of 50-100 boxes is EXACTLY 2/3
because that is the only situation where switching or not
does not matter. I contend there is absolute no basis
for claiming the original probably is 2/3, therefore
claiming "switching is irrelevant" is wrong.
You reply that some of those aren't "real" possibilies, and you know that for sure, because you've seen the $100? I reply that one of the possibilities you treat as "real" is just as far outside the bounds of reality. Everyone, including you, would be wrong to conclude that there are just "two possibile ways" that the boxes might be filled and that each of the two is therefore equally likely.
As I said yesterday, anyone opening a box will always be able to construct two possible games. One represents the true game she is playing, and the other is just a phantom game that she is absolutely not playing. There's no way to tell which is which, but it is obviously wrong to treat them as "equally likely" and average them. Because of the mathematics of doubling, the high-side phantom games will systematically distort more than the low-side phantom games, yielding wrongheaded EV calculations.
The EV of the two boxes is equivalent, always, and it is never worth switching (unless you have some inside information that is outside of the specified problem).
There's no way to tell which is which, but it is obviously
wrong to treat them as "equally likely" and average them.
Because of the mathematics of doubling, the high-side phantom
games will systematically distort more than the low-side phantom
games, yielding wrongheaded EV calculations.
I can agree with this. However, since $100 a relatively small
amount that I am sure Kevan can afford, I think it is reasonable
to say they are "approximately equally likely". In order to
not switch, the probability of 50-100 vs. 100-200 would need
to exceed 67%. I do not think that is a reasonable conclusion,
therefore switching is best.
Take first the unseen box contents game.
Say there's either $2 &$4, or $4 &$8 in the boxes, and you don't see the contents of the first box.
There are four cases when you're about to stab at the box you want
$2&$4 stab $2 : $2&$4 stab $4
$4&$8 stab $4 : $4&$8 stab $8
You get to choose horizontally, and the moneybags gets to choose vertically.
After the moneybags sets it up, your expected gain in the $2&$4 case is zero for switching (you either gain $2 or you lose $2).
Your expected gain in the $4&$8 case is likewise zero for switching (you either gain $4 or lose $4).
Averaging over the two games, you get zero for switching, on the average.
This accords perfectly with intuition.
The fallacy is thinking your choice affects whether it's
a $2&$4 game or a $4&$8 game.
It doesn't. The moneybags set it up and left, one way or the other.
The same analysis applies for any number of lines. You get zero for switching, no matter what line you're on.
==
In the case of the seen contents, make the table with more lines, starting with $1&$2 and ending with (say)$512&$1024
Moneybags sets it up and leaves, choosing one of the lines.
Say you observe $64 in your box. There are two possibilities, and you either lose $32 or gain $64 by switching.
In this case you switch, but you do not switch for $1024!
That ``but'' provision goes with the positive switch benefit.
The optimal strategy in the ``box seen'' case is to switch
EXCEPT for the biggest prize.
It remains unchanged as the biggest prize is made bigger and bigger.
The information you get from seeing what's in the box is whether or not it's the biggest prize. You switch if it isn't, and stand pat if it is.
Now suppose you don't know what the biggest prize is.
Then there's nothing you can do with the information, and it becomes the unseen-box game, where it does not pay to switch (or hurt, either). The gain from switching is offset by the certain loss from switching out of the biggest prize, is the particular mechanism causing the change in the game.
So in either game, with unknown top prize, the expected value result is in fact you do as well standing pat as
by switching.
This is intuitively what you think as well.
I think the roughly 300 comments without (plus at least one, I immodestly claim, with) [the real] math suggests how hard it is to do what you want. as I noted earlier, symbols are the language of math, and the solution to the problem once you know the language is essentially one short sentence. given how smart most of the commenters are, I'll bet that the time spent hand waving if applied to trying to understand the minimal symbology in that one sentence would have revealed some potential probability theorists.
The paradox works itself by changing the contents of the boxes as you average.
You can do that game too, but it's more subtle.
not sure your comment was a response to mine. if so, it doesn't address my question. yes, the expected gain is zero if the probabilities that D is in box A and box B are both 0.5, which they are. but numerous commenters have argued that they aren't, and I'm just curious how they reach that conclusion.
Now it's 50-50 that you lose $32 or gain $64 by switching,
except it's 100-0 if $64 is the biggest possible prize, and
0-100 if $64 is the smallest possible prize, which is how
the ``distribution'' problem gets into it.
You can't enumerate the game if there's no biggest prize (``this is a piece of a ladder that goes on to infinity'' does not mean you're imagining the situation you seem to be. Actual ladders have tops). One intuitive sign of trouble is that no amount you can imagine is likely to turn up : all the probability favors some bigger amount.
The ``gain from switching'' comes from information about the endpoints and information about what's in the box you chose.
Lacking either, you gain nothing by switching.
Rule for avoiding probability paradoxes : enumerate the equally probable cases and count the favorable ones and the unfavorable ones.
What's an equally probable case depends on the information you're presented with and know about how the game is set up.
In the case where you don't know the top prize or don't know what's in the box you at first choose, you have no way of avoiding the biggest-prize loss from switching that offsets the D/2 : 2D gain elsewhere.
The paradox comes from getting you to think that the D/2 : D case is the general one, where there must always be a cutoff case that kills the reasoning in fact.
> The paradox comes from getting you to think that the D/2 : D case is the general one, where there must always be a cutoff case that kills the reasoning in fact.
D/2 : 2D case, it should be.
The probability that you have picked the higher-valued box is certainly 50% before you open its contents. I think everybody agrees with that claim.
Now, I open the contents and I find D. Is the probability still 50% for each value of D? No.
The probability would be 50% for each D if the there were an equal probability of seeing each value of D. But going back to the very first post, "there's no uniform probability distribution on the natural numbers, so you can't assume that all monetary values are equally likely."
So, for some D the expectation goes up. For others, it goes down. The expectation stays the same for the weighted average across all D.
Trivial example. Half the time Kevan puts $100 in one box and $50 in the other. Half the time he puts $200 in one box and $100 in the other.
For D=50 or 100, your expectation goes up by switching. For D=200, it goes down. When weighted across all D, the expectation stays the same.
ctw, as Mev said a while back, there's really two puzzles here, (1) whether you should switch and (2) why the EV calculation goes wrong. I think almost everyone agrees it's silly to switch, the puzzle I'm really interested in is the second, and saying that we should give up on the variable that gives rise to the second puzzle and solve the first puzzle instead doesn't help us solve the second puzzle.
IF you draw high ($100), you will construct two possibilities for the other box: $50 and $200. You will know you are either in a 100-50 or 100-200 game.
IF you draw low ($50), you will also construct two possibilities for the other box: $25 and $100. You will know you are either in a 100-50 or 25-50 game.
So, no matter what, you will know you are playing one of two games after you look in the box. However, depending on which box you happen to draw, the two games you think you might be playing will differ. If you draw the high box, you will conjure a pair of games that, when averaged, skews high compared to actual game; if you draw the low box, you will conjure a pair of games that, when averaged, skews low (though the skew is not as much in the low direction, so that averaging the possible other-box contents still "tells" you to switch).
The fallacy is in thinking that the box you draw (and hence the pair of games you happen to conjure up) can change the expected value of the whole game.
I think this is relevant because the mistake many posters seem to make in the boxes problem is analogous to, although less obvious, than the mistake rank amateurs to probability make in analyzing the boy problem: they assume there are three cases (2 boys, 1 of each, 3 girls) when there are really 4 equally likely cases (boy-boy, boy-girl, girl-boy, girl-girl). Knowing that 1 child is a boy eliminates the girl-girl case, so the chance that the other child is a boy is 1/3. Knowing that the oldest child is a boy eliminates two cases, so the chance the second child is a boy is 1/2.
This is related to the distribution of initial digits of random quantities. Take every quantity published in the newspaper (say trading volumes by stock in the WSJ) and count up the frequency of the first digits. The first digit will be 1 around 30% of the time, and 9 surprisingly infrequently (I think about 5%; I forget). Supposedly some tax cheats have been caught this way -- the numbers on their forms didn't start with 1 often enough, presumably b/c they were chosen randomly from the wrong distribution.
A friend of mine has observed that touristy/crafty shops selling ceramic digits that people might put on their houses tend to have a perpetual shortage of 1's.
The distribution of the money placed in the box does indeed affect whether you should switch. If the probability distribution happens to have a density function d(x) on a domain containing 50 and 200, then the probability that the other box is the bigger one is d(200)/(d(200)+d(50)) (application of Bayes' theorem). This means that for the most "obvious" choice of distribution, the uniform distribution supported on some domain (0 to 500, say), the other box is equally likely to contain $50 as $200, because d(x) = 1 everywhere in the support. But if the money is distributed, say, with a standard normal, the probability you have the lesser box is different, which might affect how you should choose (whether you should switch in this case is left as an exercise to the reader).
You can arguably say that given no information on the distribution, you would have to assume the other box has 200 with a prob of 50%. But this choice is biased and maybe even arbitrary. Let's play the game again, where I give you two boxes, I tell you no info on any relationship between the quantities in each, you open one and find $100. Do you switch? If only you knew the mean! Declaring the mean to be 1000 and basing your decision to switch on this assumption is definitely flawed, and similarly in the original problem, because you base your decision on some assumption about the distribution.
Very good description of what I was trying to get at. Exactly right, you are not picking the game you are playing.
As for those who think iterating the game makes a difference, well it doesn't, as I have shown empirically with a computer program.