Prover-Verifier Games improve legibility of language model outputsMaking sure th…

Prover-Verifier Games improve legibility of language model outputsMaking sure that language models produce understandable text is crucial to making them helpful for people, especially when dealing with complex tasks like solving math problems. We found that when we optimize the problem-solving process of strong models solely for getting the correct answer, the resulting solutions can […]