*A priori* probabilities are those that can be known solely through reasoning. The principle of equal *a priori* probabilities holds that, absent information to the contrary, every possible event can be taken to be equally likely. The principle is especially important in equilibrium statistical mechanics, where it is used to calculate averages over the energy surface for a classical Hamiltonian system without prior knowledge of the exact trajectory in phase space. It is well known that for ergodic systems the principle is valid. What is less widely known is that the principle can work even for non-ergodic systems. Indeed, an alternative approach to justifying the use of the principle is via the application of the maximum entropy principle. (My personal understanding of the subject was much influenced by Jaynes.)

In this post I wish to draw attention to an issue that many students find somewhat puzzling or even counterintuitive, namely, that the principle of equal *a priori* probabilities works even when probabilities are *a priori* unequal! That’s right, *unequal*!

Consider a roll of the dice. Let us choose the common 6-faced die. If the die is fair, in other words unbiased, then the probability of rolling, say, a is exactly . And indeed, this is exactly the same result as if we use the principle of equal *a priori* probabilities: there are 6 faces and if the probabilities are equal then each face must have a probability of 1/6 exactly.

Now consider “loaded dice.” Let us assume that you know *a priori* that the die is not fair. Let us assume that there is a 90% chance of landing one of the given numbers . So the probability of landing the other 5 numbers add up to 10%. Let us assume that each of these 5 other numbers has a 2% chance. Clearly, the probabilities for are unequal. Let us call the privileged number with the 90% chance to be . Then, for ,

Now let us say that you wish to calculate , but without knowning the value of . Obviously, if you knew then either or else , depending on whether or not . But let us assume that you do not know the value of . Then what?

In this case, we must consider all possible values of and do the calculation over the whole sample space. The calculation is easily done by splitting the contribution to the expected of value into 2 parts, where the 1st part is the case and the second part the case :

Evaluating, we get

Notice that you get the same answer as before! If you do not know , the probability to land a 3 is the same for loaded dice as it is for fair dice.

This simple example illustrates the power of the principle of equal *a priori* probabilities. Remarkably, it is an excellent heuristic even when the underlying probabilities are not in fact equal!