PhD student @ NTU Singapore | My research focuses on reinforcement learning (RL), large language models (LLMs), LLM post-training, and LLM-based agents.
- Singapore
-
08:21
(UTC +08:00) - https://langfengq.github.io/
Highlights
- Pro
Pinned Loading
-
verl-agent
verl-agent Publicverl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
-
TimeMaster
TimeMaster PublicOfficial code for paper "TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning"
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.