Import AI 460: Reward hacking society; RSI data from Anthropic; and RL-based quadcopter racing
by Jack Clark
Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv, cappuccinos, and feedback from readers. If you’d like to support this, please subscribe.
Society can be reward-hacked, just like cyber environments:
…Imagine an army of credit card point optimizers gaming the sys...
