Aldo Pacchiano
Principled Fine-tuning of LLMs from User-Edits: A Medley of Preference, Supervision, and Reward
Dipendra Misra*, Aldo Pacchiano*, Ta-Chung Chi, Ge Gao
September 2025
Type: Conference paper