Recent
Tilting at Windmills V
Carry depth as the structural measure of mining hardness: SHA-256d puts 386 adder layers between the header and the output, and the strongest local advantage we can measure falls off a cliff within a single round.
The Literal Listener
A pragmatic speaker, in Chris Potts' sense, reasons about a literal listener. Train one, and it will learn what the listener likes. That's not the same thing as learning what the listener knows.
Ultimately the Survivors Do Not Prevail
Training multi-agent reinforcement learning in a zombie game taught us something that wasn't really about zombies: the environment is a more powerful programming language than the reward function.