[FEATURE] Train a reward model #175

Closed as not planned
@pascal-pfeiffer

Description

🚀 Feature

Now that #78 has been merged, we may want to consider training our own reward models within LLM Studio.

This would probably be a new task type, and it would require a different dataset type.
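
To make the "different dataset type" concrete, here is a rough sketch of what a reward-model record and training objective could look like. It assumes pairwise preference data (a prompt with one preferred and one rejected answer) and a Bradley-Terry style loss. The `PreferencePair` class, its field names, and `pairwise_loss` are hypothetical illustrations, not existing LLM Studio APIs, and the actual design may well differ.

```python
import math
from dataclasses import dataclass


@dataclass
class PreferencePair:
    """Hypothetical dataset record: one prompt with a preferred and a rejected answer."""
    prompt: str
    chosen: str
    rejected: str


def pairwise_loss(chosen_score: float, rejected_score: float) -> float:
    """Bradley-Terry style loss: -log(sigmoid(chosen_score - rejected_score)).

    The scores stand in for the scalar outputs of a reward model (assumed here).
    """
    margin = chosen_score - rejected_score
    # -log(sigmoid(m)) equals softplus(-m); this form avoids overflow for large |m|.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


example = PreferencePair(
    prompt="What is the capital of France?",
    chosen="Paris.",
    rejected="I am not sure.",
)
# The loss is small when the chosen answer scores higher than the rejected one.
loss_good = pairwise_loss(chosen_score=2.0, rejected_score=0.0)
loss_bad = pairwise_loss(chosen_score=0.0, rejected_score=2.0)
```

A record with three text columns like this differs from the usual prompt/answer layout, which is one reason a separate task type may be the cleaner option.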

Motivation

Now that #78 has been merged, we may want to consider training our own reward models within LLM Studio.
