The concept of superrationality (or renormalized rationality) was coined by Douglas Hofstadter, in his article series and book " Metamagical Themas".[1] Superrationality is a type of rational decision making which is different than the usual game-theoretic one, since a superrational player playing against a superrational opponent in a prisoner's dilemma will cooperate while a game-theoretically rational player will defect. Superrationality is not a mainstream model within game theory.
The idea of superrationality is that two logical thinkers analyzing the same problem will think of the same correct answer. For example, if two persons are both good at arithmetic, and both have been given the same complicated sum to do, it can be predicted that both will get the same answer before the sum is known. In arithmetic, knowing that the two answers are going to be the same doesn't change the value of the sum, but in game theory, knowing that the answer will be the same might change the answer itself.
The prisoner's dilemma is usually framed in terms of jail sentences for criminals, but it can be stated equally well with cash prizes instead. Two players are each given the choice to cooperate (C) or to defect (D). The players choose without knowing what the other is going to do. If both cooperate, each will get $100. If they both defect, they each get $1. If one cooperates and the other defects, then the defecting player gets $101, while the cooperating player gets nothing.
The four outcomes and the payoff to each player are listed below
Player B cooperates | Player B defects | |
---|---|---|
Player A cooperates | Both get $100 |
Player A: $0 Player B: $101 |
Player A defects |
Player A: $101 Player B: $0 |
Both get $1 |
One valid way for the players to reason is as follows:
The conclusion is that the rational thing to do is to defect. This type of reasoning defines game-theoretic rationality, and two game-theoretic rational players playing this game both defect and receive a dollar each.
Superrationality is an alternative method of reasoning. First, it is assumed that the answer to a symmetric problem will be the same for all the superrational players. Thus the sameness is taken into account before knowing what the strategy will be. The strategy is found by maximizing the payoff to each player, assuming that they all use the same strategy. Since the superrational player knows that the other superrational player will do the same thing, whatever that might be, there are only two choices for two superrational players. Both will cooperate or both will defect depending on the value of the superrational answer. Thus the two superrational players will both cooperate, since this answer maximizes their payoff. Two superrational players playing this game will each walk away with $100.
Note that a superrational player playing against a game-theoretic rational player will defect, since the strategy only assumes that the superrational players will agree. A superrational player playing against a player of uncertain superrationality will sometimes defect and sometimes cooperate.
In general, if the two players are only superrational with probability p, and with probability 1-p they are game-theoretic rational and therefore defect, the result is as follows. Assuming that the superrational strategy is to cooperate, the expected payoff is 100p. Assuming that the superrational strategy is to defect, the expected payoff is 1. So as long as p>.01 the superrational strategy will be to cooperate.
This means that when the temptation to defect is small enough, the superrational strategy is to cooperate even in the presence of a large fraction of non-superrational opponents.
Although standard game theory assumes common knowledge of rationality, it does so in a different way. The game theoretic analysis maximizes payoffs by allowing each player to change strategies independently of the others, even though in the end, it assumes that the answer in a symmetric game will be the same for all. This is the definition of a game theoretic Nash equilibrium, which defines a stable strategy as one where no player can improve the payoffs by unilaterally changing course. The superrational equilibrium is one which maximizes payoffs where all the players' strategies are forced to be the same before the maximization step.
Some argue that superrationality implies a kind of magical thinking in which each player supposes that his decision to cooperate will cause the other player to cooperate, despite the fact that there is no communication. Hofstadter points out that the concept of "choice" doesn't apply when the player's goal is to figure something out, and that the decision does not cause the other player to cooperate, but rather same logic leads to same answer independent of communication or cause and effect. This debate is over whether it is reasonable for human beings to act in a superrational manner, not over what superrationality means.
Consider, for example, the ideal case in which a person plays the Prisoner's Dilemma with an exact clone of himself, created immediately prior to the game. In this case, both players can be 100% certain not only that the other will ultimately choose the same move that he will, but that at each moment, the mental processes of the other player are precisely the same as his own—not through any causative interaction but owing to the perfect correlation between the two players' minds. It would seem unreasonable for a player to choose to defect in such a circumstance knowing that he would receive a higher payoff in the event that he cooperated. In the more realistic situation in which two superrational players possess a lesser degree of correlation (perhaps only that they are both superrational), reasoning in this way may or may not be justified.
There is no agreed upon extension of the concept of superrationality to asymmetric games.
For simplicity, the foregoing account of superrationality ignored mixed strategies: the possibility that the best choice could be to flip a coin, or more generally to choose different outcomes with some probability. In the Prisoner's Dilemma, it is superrational to cooperate with probability 1 even when mixed strategies are admitted, because the average payoff when one player cooperates and the other defects is less than when both cooperate. But in certain extreme cases, the superrational strategy is mixed.
For example, if the payoffs in are as follows:
So that defecting is a huge reward, the superrational strategy maximizes the expected payoff to you assuming that the other player does the same thing. This is achieved by defecting with probability 1/2.
In similar situations with more players, using a randomising device can be essential. One example discussed by Hofstadter is the platonia dilemma: an eccentric trillionaire contacts 20 people, and tells them that if one and only one of them sends him a telegram (assumed to cost nothing) by noon the next day, that person will receive a billion dollars. If he receives more than one telegram, or none at all, no one will get any money, and cooperation between players is forbidden. In this situation, the superrational thing to do (if it is known that all 20 are superrational) is to send a telegram with probability p=1/20, which maximizes the probability that exactly one telegram is received.
Notice though that this is not the solution in a conventional game-theoretical analysis. Twenty game-theoretically rational players would each send in a telegram and therefore receive nothing. This is because sending the telegram is the dominant strategy; if an individual player sends a telegram he has a chance of receiving money, but if he sends no telegram he cannot get anything.
In a group of people with similar wishes and incomes, superrationality may explain the existence of:
In the first two cases, superrationality may be seen as an antidote to or opposite of the Bystander Effect, or more generally diffusion of responsibility.