🎯
Focusing
Undergraduate, Tsinghua University
-
Tsinghua University
- Beijing
-
06:01
(UTC -12:00) - https://zuojr.github.io
Pinned Loading
-
ddorm-llm-preference-benchmark
ddorm-llm-preference-benchmark PublicDDO-RM for LLM preference optimization: a minimal held-out benchmark against DPO
-
Deep-Decison-Optimize
Deep-Decison-Optimize PublicForked from TiantianZ399/Deep-Decison-Optimize
This repository implements the deep decision optimization method propose first in the https://arxiv.org/abs/2509.18138.
Python
-
-
-
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

