Terminal-Bench 2.0 results