ModelTC

Model Infra

Pinned

  1. MQBench Public

    Model Quantization Benchmark

    Python · 799 stars · 142 forks

  2. United-Perception Public

    United Perception

    Python · 432 stars · 67 forks

  3. Dipoorlet Public

    Offline Quantization Tools for Deploy.

    Python · 127 stars · 17 forks

  4. lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    Python · 3.1k stars · 247 forks

  5. llmc Public

    [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

    Python · 458 stars · 54 forks

  6. OmniBal Public

    Python · 20 stars · 3 forks

Repositories

Showing 10 of 48 repositories
  • lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    Python · 3,144 stars · Apache-2.0 · 247 forks · 76 issues · 7 pull requests · Updated Apr 23, 2025
  • Dockerfile 0 0 0 0 Updated Apr 23, 2025
  • lightx2v Public
    Python · 11 stars · 6 forks · 0 issues · 2 pull requests · Updated Apr 22, 2025
  • general-sam-py Public

    Python bindings for general-sam and some utilities

    Python · 3 stars · Apache-2.0 · 0 forks · 0 issues · 1 pull request · Updated Apr 22, 2025
  • MQBench Public

    Model Quantization Benchmark

    Python · 799 stars · Apache-2.0 · 142 forks · 7 issues · 5 pull requests · Updated Apr 20, 2025
  • llmc Public

    [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

    Python · 458 stars · Apache-2.0 · 54 forks · 28 issues · 0 pull requests · Updated Apr 18, 2025
  • flash-attention Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Python · 0 stars · BSD-3-Clause · 1,637 forks · 0 issues · 0 pull requests · Updated Apr 17, 2025
  • 0 0 0 0 Updated Apr 15, 2025
  • greedy-tokenizer Public

    Greedily tokenize strings with the longest tokens iteratively.

    Python · 0 stars · Apache-2.0 · 0 forks · 0 issues · 1 pull request · Updated Mar 24, 2025
  • mtc-token-healing Public

    Token healing implementation in Rust

    Rust · 4 stars · Apache-2.0 · 0 forks · 0 issues · 0 pull requests · Updated Mar 22, 2025
