tongjingqi/AI-Can-Learn-Scientific-Taste

We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference modeling and alignment problem.
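To make the "preference modeling" framing concrete, here is a minimal sketch of a pairwise preference objective. It assumes a Bradley-Terry style loss and a single numeric community signal per item; neither is confirmed by the description above, and the names `preference_loss` and `build_pairs` are hypothetical, not taken from the repository.

```python
import math


def preference_loss(score_preferred: float, score_other: float) -> float:
    """Bradley-Terry negative log-likelihood for one preference pair.

    Assumption: a scoring model assigns each item a real-valued score, and
    the item favored by the community should receive the higher score.
    """
    margin = score_preferred - score_other
    # -log(sigmoid(margin)) == log(1 + exp(-margin)), written to avoid overflow.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


def build_pairs(items):
    """Turn (item_id, signal) records into (preferred_id, other_id) pairs.

    `signal` is a hypothetical numeric community signal; ties yield no pair.
    """
    pairs = []
    for i, (id_a, sig_a) in enumerate(items):
        for id_b, sig_b in items[i + 1:]:
            if sig_a > sig_b:
                pairs.append((id_a, id_b))
            elif sig_b > sig_a:
                pairs.append((id_b, id_a))
    return pairs
```

The loss is small when the scoring model agrees with the community ordering and grows as it disagrees, so minimizing it over many pairs pushes the model toward the community's ranking.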

agent, ai-innovator, ai-scientists, rl

Stars

251

+37 today, +38 /wk, +38 /mo

Forks

Issues

Watchers

251

Repository Info

License: Apache-2.0

Created: Mar 12, 2026

Last push: 3h ago

Homepage: tongjingqi.github.io/AI-Can-Learn-Scientific-Taste/