Skip to content
View mansicer's full-sized avatar
🎆
coding
🎆
coding

Highlights

  • Pro

Organizations

@LAMDA-RL

Block or report mansicer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. LAMDA-RL/ODIS LAMDA-RL/ODIS Public

    The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".

    Python 43 6

  2. MAIC MAIC Public

    The implementation of AAAI 2022 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".

    Python 56 10

  3. Q-Adapter Q-Adapter Public

    Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"

    Python 17 1

  4. LAMDA-RL/ReDA LAMDA-RL/ReDA Public

    The implementation of the AAMAS 2024 paper "Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation"

    Python 3 1