Posts
Comments
Comment by
ZZ Si (zhangzhang-si) on
On the Importance of Open Sourcing Reward Models ·
2023-03-02T21:29:56.700Z ·
LW ·
GW
I think this is a fair point that an open reward function is subject to "SEO" efforts to game it. But, how about a "training" reward function that is open, and a "test" reward function that is hidden?
I would love to know what are some other OSS efforts on reward function (I do follow Carper's development on RF), and love to contribute.