continuous-eval

Data-Driven Evaluation for LLM-Powered Applications (by relari-ai)

Continuous-eval Alternatives

Similar projects and alternatives to continuous-eval

  1. instructor

    structured outputs for llms

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. hyperdx

    26 continuous-eval VS hyperdx

    Resolve production issues, fast. An open source observability platform unifying session replays, logs, metrics, traces and errors powered by ClickHouse and OpenTelemetry.

  4. zep

    Discontinued Zep | The Memory Foundation For Your AI Stack

  5. R2R

    9 continuous-eval VS R2R

    SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.

  6. ContextCheck

    MIT-licensed Framework for LLMs, RAGs, Chatbots testing. Configurable via YAML and integrable into CI pipelines for automated testing.

  7. zep-js

    Build Agents That Recall What Matters. Systematically engineer relevant context from chat history & business data. (TypeScript Client)

  8. colpali

    The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

  9. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  10. text-to-image-eval

    Evaluate custom and HuggingFace text-to-image/zero-shot-image-classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. Metrics include Zero-shot accuracy, Linear Probe, Image retrieval, and KNN accuracy.

  11. garak

    the LLM vulnerability scanner

  12. PyDGN

    A research library for automating experiments on Deep Graph Networks

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better continuous-eval alternative or higher similarity.

continuous-eval discussion

continuous-eval reviews and mentions

Posts with mentions or reviews of continuous-eval. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-05-09.

Stats

Basic continuous-eval repo stats
4
515
7.7
11 months ago

Sponsored
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io

Did you know that Python is
the 2nd most popular programming language
based on number of references?