RPR Derivation
A team's RPR, or Relative Power Rating (denoted
We focus on computing point outcomes simply because game/set/half outcomes are dependent on the order of points, whether the game was run with sets or stones, the length of each set, etc., all of which are hard to control for and can add significant noise.
Our data comes from a set of
For each row, we observe an estimate of the probability that team
It is of note that this is equivalent to the other way around - we can swap
If we fill this out for every game, we get a large system of equations. We can write this in matrix form:
with
This matrix
We compute this using a sparse Cholesky decomposition of
There is one issue remaining, however: because RPR is inherently relative, the absolute zero point is a free variable. For the purposes of numerical stability, therefore, we add an additional row of ones to the bottom of
Obviously this is an approximation, and a very crude one at that. It is only one dimension and thus can only measure one dimension of a team's performance; specialties which may lead to drastically different results than RPR predicts will get averaged out. RPR also claims some matchups should have a higher than 100% win rate, which is obviously false. I am not a statistician and therefore do not care; RPR is meant to be a simple way to find vaguely correct power ratings given large amounts of unstructured data.