[Roadmap] 2025 Q4 Milestones

# AReaL 2025 Q4 Milestone Tracker

## Introduction

This document tracks major planned enhancements for AReaL through January 31, 2026. Our development roadmap is organized into two categories to help contributors identify where they can make the most impact:

**On-going** sections contain features currently under active development by the core AReaL team. These represent our immediate priorities.

**Planned but not in progress** sections list features with concrete implementation plans that we currently lack bandwidth to pursue. **We actively welcome community contributions for these items!** If you're interested in contributing to any planned feature, please reach out to discuss implementation details.

---

## Backends

### On-going

- [x] Single-controller mode #260
- [x] Detailed profiling for optimal performance across different scales #522 #527 #539 etc.
- [x] Low-precision RL training (Megatron FP8)
- [x] Data transfer optimization in single-controller mode
- [x] New PyTorch-native backend: Archon

### Planned but not in progress

- [ ] Multi-LLM training (different agents with different parameters)
- [ ] Auto-scaling inference engines in single-controller mode
- [ ] Elastic weight update setup and acceleration
- [ ] RL training with cross-node vLLM pipeline/context parallelism

---

## Usability

### Done

- [x] Add CI pipeline to build Docker images upon release #564 #574
- [x] Wrap training scripts into trainers
- [x] Refactor FSDP/Megatron engine/controller APIs to finer granularity
- [x] Fully respect allocation mode in trainers/training scripts

### On-going

- [ ] Flatten the import structure of areal modules

### Planned but not in progress

- [ ] Support distributed training and debugging in Jupyter notebooks
- [ ] Example of using a generative or critic-like reward model
- [ ] Support directly constructing inference/training engines without config objects

### Canceled

- [x] Rename `RemoteSGLang/vLLMEngine` as `SGLang/vLLMEngine`

---

## Documentation

### Done

- [x] Tutorial on how to write efficient async rollout workflows
- [x] Benchmarking and profiling guide
- [x] Use case guides: offline inference, offline evaluation
- [x] AReaL performance tuning guide
  - [x] Device allocation strategies for training and inference
  - [x] Parallelism strategy configuration for training and inference

### Planned but not in progress

- [ ] Use case guides: multi-agent training


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Roadmap] 2025 Q4 Milestones #542

AReaL 2025 Q4 Milestone Tracker

Introduction

Backends

On-going

Planned but not in progress

Usability

Done

On-going

Planned but not in progress

Canceled

Documentation

Done

Planned but not in progress

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Roadmap] 2025 Q4 Milestones #542

Description

AReaL 2025 Q4 Milestone Tracker

Introduction

Backends

On-going

Planned but not in progress

Usability

Done

On-going

Planned but not in progress

Canceled

Documentation

Done

Planned but not in progress

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions