a1: Steep Test-time Scaling Law via Environment Augmented Generation

Publication
arXiv