feat(rewards): add VITA reward model integration with adaptation and outer-loop training #3269

Open
0xPraedico wants to merge 1 commit into huggingface:refactor/reward-models from 0xPraedico:feat/vita-reward-model