The Savage Representation
The Savage Representation
As is understood since Savage, proper scoring rules are in one-to-one correspondence with strictly convex functions via the so-called Savage representation
As is understood since Savage, proper scoring rules are in one-to-one correspondence with strictly convex functions via the so-called Savage representation
Seeing that Savage worked with Von Neumann seems to explain quite a few things: the game-theoretic ideas and convex duality arguments and ideas seem very connected to Von Neumann's works in that field
Seeing that Savage worked with Von Neumann seems to explain quite a few things: the game-theoretic ideas and convex duality arguments and ideas seem very connected to Von Neumann's works in that field
$s(\vec \pi,i)=G(\vec \pi)+\lang\delta_i,\nabla G(\vec \pi)\rang$
s(π,i)=G(π)+δi,G(π)s(\vec \pi,i)=G(\vec \pi)+\lang\delta_i,\nabla G(\vec \pi)\rang
This is assuming that $G$ is smooth, which we do
This is assuming that GG is smooth, which we do
The proof of this is simple and beautiful
The proof of this is simple and beautiful
We take the Savage function $S$, represented the expected return assuming $i$ is sampled according to the "reality" $\vec p$
We take the Savage function SS, represented the expected return assuming ii is sampled according to the "reality" p\vec p
$S(\vec\pi,\vec p)=G(\vec \pi) + \lang\vec p,\nabla G(\pi)\rang$
S(π,p)=G(π)+p,G(π)S(\vec\pi,\vec p)=G(\vec \pi) + \lang\vec p,\nabla G(\pi)\rang
The point is then that this is an affine function of the reality $\pi$ (like any expectation is), and we can understand the optimization problem via the angle that since $G$ is convex, as a function of $\vec p$, we have
The point is then that this is an affine function of the reality π\pi (like any expectation is), and we can understand the optimization problem via the angle that since GG is convex, as a function of p\vec p, we have
$S(\vec \pi,\vec p)\leq G(\vec p)$ for all $\vec p$
S(π,p)G(p)S(\vec \pi,\vec p)\leq G(\vec p) for all p\vec p
This follows since $\vec p \mapsto S(\vec \pi,\vec p) $ is affine and tangent to $G$ at $\vec p=\vec \pi$
This follows since pS(π,p)\vec p \mapsto S(\vec \pi,\vec p) is affine and tangent to GG at p=π\vec p=\vec \pi
Since $G$ is convex, we have that it is the sup of all tangents to it, so $\sup_{\vec \pi} S(\vec\pi,\vec p)=G(\vec p)$, which is exactly to say that the scoring rule is proper
Since GG is convex, we have that it is the sup of all tangents to it, so supπS(π,p)=G(p)\sup_{\vec \pi} S(\vec\pi,\vec p)=G(\vec p), which is exactly to say that the scoring rule is proper
How are things if $G$ is piecewise affine?
How are things if GG is piecewise affine?
The converse: any proper scoring rule is Savage
The converse: any proper scoring rule is Savage
It simply hinges on understanding that $G(\vec p)$ is the sup over all $S(\vec \pi,\vec p)$, and since $S(\vec \pi,\vec p)$ is affine in $\vec p$, we find that its value can be deduced from $G$ at the tangent point via the Savage representation
It simply hinges on understanding that G(p)G(\vec p) is the sup over all S(π,p)S(\vec \pi,\vec p), and since S(π,p)S(\vec \pi,\vec p) is affine in p\vec p, we find that its value can be deduced from GG at the tangent point via the Savage representation
And the concrete scoring rule is obtained by "collapsing $\vec p$" to a "deterministic reality $i$"
And the concrete scoring rule is obtained by "collapsing p\vec p" to a "deterministic reality ii"
This is probably an interesting thing to look at!
This is probably an interesting thing to look at!
.
ideas-and-notes
about
tricritical-ising
cellular-automata-and-alife
ising-and-e8
xent
chiral-spin-field
computational-equilibrium
misc-ideas
arrows-of-time
de-finetti
local-vs-global-univ
interestingness
quines-and-self-replicators
proper-scoring-rules