Skip to content
Change the repository type filter

All

    Repositories list

    • C++
      20900Updated Jun 18, 2026Jun 18, 2026
    • vllm-musa

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Other
      18k10112Updated Jun 17, 2026Jun 17, 2026
    • torchada

      Public
      An adapter layer that ensures torch_musa🔦 delivers a CUDA-compatible PyTorch experience.
      Python
      MIT License
      113711Updated Jun 17, 2026Jun 17, 2026
    • Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction
      C++
      Other
      506000Updated Jun 12, 2026Jun 12, 2026
    • Fast Library for Approximate Nearest Neighbors
      C++
      Other
      663000Updated Jun 11, 2026Jun 11, 2026
    • Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support
      C++
      BSD 3-Clause "New" or "Revised" License
      16000Updated Jun 11, 2026Jun 11, 2026
    • Forked from https://gitlab.com/libeigen/eigen
      C++
      Mozilla Public License 2.0
      0000Updated Jun 11, 2026Jun 11, 2026
    • muThrust

      Public
      The C++ parallel algorithms library.
      C++
      Other
      760400Updated Jun 8, 2026Jun 8, 2026
    • muAlg

      Public
      Cooperative primitives for MUSA C++.
      Cuda
      Other
      463700Updated Jun 8, 2026Jun 8, 2026
    • Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
      C++
      Other
      6065600Updated Jun 5, 2026Jun 5, 2026
    • tvm_musa

      Public
      Open Machine Learning Compiler Framework
      Python
      Apache License 2.0
      3.9k200Updated Jun 5, 2026Jun 5, 2026
    • TileOPs

      Public
      High-performance LLM operator library built on TileLang.
      Python
      Other
      42000Updated Jun 1, 2026Jun 1, 2026
    • mate

      Public
      MUSA AI Tensor Engine
      C++
      Apache License 2.0
      0910Updated May 26, 2026May 26, 2026
    • MTClaw

      Public
      Local tool-routing proxy for openclaw/opencode/hermes, accelerating tool calls by up to 7x before forwarding general requests to upstream models.
      Python
      MIT License
      42412Updated May 21, 2026May 21, 2026
    • tvm-ffi

      Public
      Open ABI and FFI for Machine Learning Systems
      C++
      Apache License 2.0
      81000Updated May 20, 2026May 20, 2026
    • Python
      Other
      0000Updated May 13, 2026May 13, 2026
    • Python
      Other
      0000Updated May 13, 2026May 13, 2026
    • SimuMax

      Public
      a static analytical model for LLM distributed training
      Python
      Other
      2815900Updated May 11, 2026May 11, 2026
    • C++
      Apache License 2.0
      0100Updated May 7, 2026May 7, 2026
    • TypeScript
      Other
      1101Updated May 7, 2026May 7, 2026
    • LiteGS

      Public
      A refactored codebase for Gaussian Splatting. Training 3DGS in 50 seconds!
      Cuda
      Other
      3337960Updated Apr 10, 2026Apr 10, 2026
    • mujoco_warp_musa is a Python package extending MuJoCo Warp with MUSA compute backend, enabling GPU-accelerated physics simulation on MT MUSA architecture.Forked…
      Python
      Other
      01320Updated Mar 30, 2026Mar 30, 2026
    • mujoco_musa is a C++ sub-repository providing native MUSA kernel libraries for GPU-accelerated physics simulation in mujoco_warp_musa.
      C++
      Other
      0100Updated Mar 27, 2026Mar 27, 2026
    • axinfra is a lightweight array and compute infrastructure library for MUSA/CPU, providing device/stream management, array operations, and zero-copy interoperabi…
      Python
      Other
      0000Updated Mar 27, 2026Mar 27, 2026
    • torch_musa is an open source repository based on PyTorch, which can make full use of the super computing power of MooreThreads graphics cards.
      Python
      Other
      36499300Updated Mar 17, 2026Mar 17, 2026
    • kineto

      Public
      HTML
      Other
      3100Updated Mar 16, 2026Mar 16, 2026
    • PyTorch Extension Library of Optimized Graph Cluster Algorithms
      C++
      MIT License
      164000Updated Mar 13, 2026Mar 13, 2026
    • Provides a Python interface to GPU management and monitoring functions. This is a wrapper around the MTML library.
      C
      MIT License
      5910Updated Mar 10, 2026Mar 10, 2026
    • A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better per…
      Python
      Apache License 2.0
      751901Updated Feb 5, 2026Feb 5, 2026
    • Python
      31300Updated Feb 5, 2026Feb 5, 2026
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.