Duc Q. Nguyen
Duc Q. Nguyen
Home
Posts
Projects
Talks
Publications
Contact
CV
Light
Dark
Automatic
Human-in-the-Loop
Thomas: Learning to Explore Human Preference via Probabilistic Reward Model
Recent breakthroughs in large language models and multimodal models underscore the impressive strides deep learning has made in …
Sang T. Truong
,
Duc Q. Nguyen
,
Tho Quan
,
Sanmi Koyejo
PDF
Cite
Cite
×