OpenMOSS (SII)

Introduction 👋

OpenMOSS Team is a research group under the Shanghai Innovation Institution (SII), working in close collaboration with Fudan University and MOSI Intelligence. Led by Prof. Xipeng Qiu, the team conducts research on large language models (LLMs), covering model architecture, evaluation, and application, with a strong commitment to open and collaborative AI research.

We warmly welcome researchers, students, and collaborators who share our vision to join us in pushing the boundaries of LLM technology. For inquiries or collaboration opportunities, please contact us at openmoss@sii.edu.cn.

🌐 Website: https://openmoss.github.io/ or http://openmoss.sii.edu.cn/

💻 GitHub: https://github.com/OpenMOSS

  • SII is dedicated to fostering innovation in education and research in the field of artificial intelligence.

Pinned

  1. MOSS Public

    An open-source tool-augmented conversational language model from Fudan University

    Python · 12.1k stars · 1.1k forks

  2. MOSS-VL Public

    MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.

    Python · 187 stars · 3 forks

  3. MOSS-TTS Public

    MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…

    Python · 1.2k stars · 110 forks

  4. MOVA Public

    MOVA: Towards Scalable and Synchronized Video–Audio Generation

    Python · 900 stars · 63 forks

  5. MOSS-TTSD Public

    MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, and multilingual support, while enablin…

    Python · 1.3k stars · 122 forks

  6. Language-Model-SAEs Public

    Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.

    Python · 211 stars · 28 forks

Repositories

Showing 10 of 48 repositories
  • MOSS-TTS-Nano Public

    MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run directly on CPU without a GPU, and keeps the deployment stack simple enough for local demos, web serving, and lightweight product integration.

    Python · 83 stars · Apache-2.0 · 2 forks · 4 open issues · 0 open pull requests · Updated Apr 13, 2026
  • Language-Model-SAEs Public

    Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.

    Python · 211 stars · 28 forks · 8 open issues · 0 open pull requests · Updated Apr 13, 2026
  • MOSS-TTS Public

    MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.

    Python · 1,202 stars · Apache-2.0 · 110 forks · 21 open issues · 0 open pull requests · Updated Apr 13, 2026
  • MOSS-Video-Preview Public

    A real-time video understanding foundation model built on Llama-3.2-Vision, featuring comprehensively extended video processing and multimodal reasoning capabilities.

    Python · 134 stars · Apache-2.0 · 4 forks · 0 open issues · 0 open pull requests · Updated Apr 13, 2026
  • MOSS-Audio-Tokenizer Public

    MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA reconstruction and strong performance in generation and understanding—serving as a unified interface for next-generation native audio language models.

    Python · 184 stars · Apache-2.0 · 12 forks · 3 open issues · 1 open pull request · Updated Apr 13, 2026
  • MOSS-TTS-Nano-Demo Public
    CSS · 0 stars · 0 forks · 0 open issues · 0 open pull requests · Updated Apr 12, 2026
  • MOSS-VL Public

    MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.

    Python · 187 stars · Apache-2.0 · 3 forks · 0 open issues · 0 open pull requests · Updated Apr 12, 2026
  • sglang Public
    Python · 2 stars · Apache-2.0 · 0 forks · 0 open issues · 0 open pull requests · Updated Apr 10, 2026
  • mlx-audio Public Forked from Blaizzy/mlx-audio

    A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

    Python · 5 stars · MIT · 551 forks · 0 open issues · 0 open pull requests · Updated Apr 9, 2026
  • MOSS-VL-Demo Public
    Vue · 5 stars · 0 forks · 0 open issues · 0 open pull requests · Updated Apr 9, 2026
