ClawEval Benchmark