Skip to content
@basetenlabs

Baseten

Machine learning infrastructure for developers

Welcome to Baseten

Baseten is an AI infrastructure platform. We combine applied performance research, distributed multi-cloud infrastructure, and developer tooling to run models of all modalities in production.

Get started:

  • Deploy an open-source model in two clicks from the model library.
  • Read our docs to package and serve a fine-tuned or custom model.

Pinned Loading

  1. truss truss Public

    The simplest way to serve AI/ML models in production

    Python 1.1k 91

  2. truss-examples truss-examples Public

    Examples of models deployable with Truss

    Python 211 53

Repositories

Showing 10 of 72 repositories
  • Megatron-Bridge Public Forked from NVIDIA-NeMo/Megatron-Bridge

    HuggingFace conversion and training library for Megatron-based models

    basetenlabs/Megatron-Bridge’s past year of commit activity
    Python 0 Apache-2.0 100 0 0 Updated Dec 15, 2025
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    basetenlabs/Megatron-LM’s past year of commit activity
    Python 0 3,454 0 0 Updated Dec 15, 2025
  • Model-Optimizer Public Forked from NVIDIA/Model-Optimizer

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    basetenlabs/Model-Optimizer’s past year of commit activity
    Python 0 Apache-2.0 215 0 1 Updated Dec 15, 2025
  • TensorRT-Model-Optimizer Public Forked from NVIDIA/Model-Optimizer

    A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.

    basetenlabs/TensorRT-Model-Optimizer’s past year of commit activity
    Python 1 Apache-2.0 215 0 4 Updated Dec 15, 2025
  • truss Public

    The simplest way to serve AI/ML models in production

    basetenlabs/truss’s past year of commit activity
    Python 1,096 MIT 91 6 40 Updated Dec 15, 2025
  • genai-bench Public Forked from sgl-project/genai-bench

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    basetenlabs/genai-bench’s past year of commit activity
    Python 1 MIT 40 0 1 Updated Dec 14, 2025
  • ml-cookbook Public

    Ready-to-use ML training recipes to help you build and deploy models on Baseten.

    basetenlabs/ml-cookbook’s past year of commit activity
    Python 37 MIT 1 0 8 Updated Dec 12, 2025
  • harmony Public Forked from openai/harmony

    Renderer for the harmony response format to be used with gpt-oss

    basetenlabs/harmony’s past year of commit activity
    Rust 0 Apache-2.0 238 0 1 Updated Dec 12, 2025
  • truss-examples Public

    Examples of models deployable with Truss

    basetenlabs/truss-examples’s past year of commit activity
    Python 211 MIT 53 14 58 Updated Dec 7, 2025
  • gorilla Public Forked from ShishirPatil/gorilla

    Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

    basetenlabs/gorilla’s past year of commit activity
    Python 0 Apache-2.0 1,375 0 6 Updated Dec 2, 2025

Top languages

Loading…

Most used topics

Loading…