serving

A flexible, high-performance serving system for machine learning models (by tensorflow)

Serving Alternatives

Similar projects and alternatives to serving

  1. llama.cpp

    LLM inference in C/C++

  3. julia

    376 serving VS julia

    The Julia Programming Language

  4. tensorflow

    An Open Source Machine Learning Framework for Everyone

  5. whisper.cpp

    Port of OpenAI's Whisper model in C/C++

  6. mlc-llm

    90 serving VS mlc-llm

    Universal LLM Deployment Engine with ML Compilation

  7. Keras

    89 serving VS Keras

    Deep Learning for humans

  8. exllama

    66 serving VS exllama

    A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

  10. maturin

    40 serving VS maturin

    Build and publish crates with pyo3, cffi and uniffi bindings as well as rust binaries as python packages

  11. lit-llama

    23 serving VS lit-llama

    Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

  12. pinferencia

    Python + Inference - Model Deployment library in Python. Simplest model inference server ever.

  13. darknet

    Convolutional Neural Networks

  14. serve

    14 serving VS serve

    Discontinued. Serve, optimize and scale PyTorch models in production (by pytorch)

  15. flake

    5 serving VS flake

    A Nix flake for many AI projects

  16. MNN

    5 serving VS MNN

    MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android app: MNN-LLM-Android (apps/Android/MnnLlmChat/README.md). MNN TaoAvatar Android, a local 3D avatar intelligence app: apps/Android/Mnn3dAvatar/README.md

  17. oneflow

    32 serving VS oneflow

    OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

  18. glow

    6 serving VS glow

    Discontinued. Compiler for Neural Network hardware accelerators (by pytorch)

  19. server

    30 serving VS server

    The Triton Inference Server provides an optimized cloud and edge inferencing solution. (by triton-inference-server)

  20. flashlight

    A C++ standalone library for machine learning (by flashlight)

  21. runtime

    A performant and modular runtime for TensorFlow (by tensorflow)

  22. llama_cpp.rb

    llama_cpp.rb provides Ruby bindings for llama.cpp

NOTE: The number shown for a project on this list counts its mentions in common posts plus user-suggested alternatives. A higher number therefore indicates a better alternative to serving or a more similar project.

serving discussion

serving reviews and mentions

Posts with mentions or reviews of serving. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-08-29.

Stats

Basic serving repo stats
Mentions: 13
Stars: 6,337
Activity: 9.5
Last commit: 7 days ago

tensorflow/serving is an open-source project licensed under the Apache License 2.0, which is an OSI-approved license.

The primary programming language of serving is C++.


