What 50,000 Runs of a 5-Line Eval Taught Us
June 19, 2026 by VS Code Eval Team, @code
Over the last six months, we have run the same tiny eval more than 50,000 times. It gives the VS Code agent one instruction: write a string to a file. No large codebase to understand, no test suite to debug, no architectural decision to make. It is our smoke test, a quick way to confirm that the end-to-end model ...
