### 🚀 Feature

Consider supporting the training of our own reward models within LLM Studio. This should probably be a new task type, and it requires a different dataset type.

### Motivation

With https://github.com/h2oai/h2o-llmstudio/issues/78 being merged, we may want to consider training our own reward models within LLM Studio.