may i know is there any possible to see the dataset used to train this reward model?
· Sign up or log in to comment