Home
News
People
Publications
Seminar
Courses
Vacancies
Contact
Pulp Fictions
Light
Dark
Automatic
mathematical reasoning
The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?
Self-improving large language models (LLMs) – i.e., to improve the performance of an LLM by fine-tuning it with synthetic data …
Yutao Sun
,
Mingshuai Chen
,
Tiancheng Zhao
,
Ruochen Xu
,
Zilun Zhang
,
Jianwei Yin
PDF
Cite
Cite
×