batuta

Orchestration framework for the Sovereign AI Stack — privacy-preserving ML infrastructure in pure Rust

Overview

Batuta coordinates the Sovereign AI Stack, a comprehensive pure-Rust ecosystem for organizations requiring complete control over their ML infrastructure. The stack enables privacy-preserving inference, model management, and data processing without external cloud dependencies.

Key Capabilities

Privacy Tiers: Sovereign (local-only), Private (VPC), Standard (cloud-enabled)
Model Security: Ed25519 signatures, ChaCha20-Poly1305 encryption, BLAKE3 content addressing
API Compatibility: OpenAI-compatible endpoints for drop-in replacement
Observability: Prometheus metrics, distributed tracing, A/B testing
Cost Control: Circuit breakers with configurable daily budgets

Installation

cargo install batuta

Or add to your Cargo.toml:

[dependencies] batuta = "0.1.3"

Quick Start

# Analyze project structure and dependencies batuta analyze --languages --dependencies --tdg # Query the Sovereign AI Stack batuta oracle "How do I serve a Llama model locally?" # Model registry operations batuta pacha pull llama3-8b-q4 batuta pacha sign model.gguf --identity alice@example.com batuta pacha verify model.gguf # Encrypt models for distribution batuta pacha encrypt model.gguf --password-env MODEL_KEY batuta pacha decrypt model.gguf.enc --password-env MODEL_KEY

Demo

Live Demo: paiml.github.io/batuta | API Docs

Example Output (batuta analyze --tdg):

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 📊 Technical Debt Gradient Analysis ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Project: my-project Language: Rust (confidence: 98%) Metrics: Cyclomatic Complexity: 4.2 avg (good) Test Coverage: 87% (A-) Documentation: 92% (A) Dependency Health: 95% (A+) TDG Score: 91.5/100 (A) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Stack Components

Batuta orchestrates a layered architecture of pure-Rust components:

┌─────────────────────────────────────────────────────────────┐ │ batuta v0.1.3 │ │ (Orchestration Layer) │ ├─────────────────────────────────────────────────────────────┤ │ realizar v0.2.2 │ pacha v0.1.1 │ │ (Inference Engine) │ (Model Registry) │ ├──────────────────────────┴──────────────────────────────────┤ │ aprender v0.14.1 │ │ (ML Algorithms & Formats) │ ├─────────────────────────────────────────────────────────────┤ │ trueno v0.7.4 │ │ (SIMD/GPU Compute Primitives) │ └─────────────────────────────────────────────────────────────┘

Core Components

Component	Version	Description
trueno	0.7.4	SIMD/GPU compute primitives with wgpu backend
aprender	0.14.1	ML algorithms: regression, trees, clustering, NAS
pacha	0.1.1	Model registry with signatures, encryption, lineage
realizar	0.2.2	Inference engine for GGUF/SafeTensors models
batuta	0.1.3	Stack orchestration and CLI tooling

Extended Ecosystem

Component	Description
trueno-db	GPU-accelerated analytics database
trueno-graph	Graph database for code analysis
alimentar	Data loading with encryption support
renacer	Syscall tracing for validation

Commands

`batuta analyze`

Analyze project structure, languages, and dependencies:

batuta analyze --languages --dependencies --tdg # Output: # Primary language: Python # Dependencies: pip (42 packages), ML frameworks detected # TDG Score: 73.2/100 (B) # Recommended: Use Aprender for ML, Realizar for inference

`batuta oracle`

Query the stack for component recommendations:

# Natural language queries batuta oracle "Train random forest on 1M samples" # List all components batuta oracle --list # Component details batuta oracle --show realizar # Interactive mode batuta oracle --interactive

`batuta pacha`

Model registry operations:

# Pull models from registry batuta pacha pull llama3-8b-q4 # Generate signing keys batuta pacha keygen --identity alice@example.com # Sign models for distribution batuta pacha sign model.gguf --identity alice@example.com # Verify model signatures batuta pacha verify model.gguf # Encrypt models at rest batuta pacha encrypt model.gguf --password-env MODEL_KEY # Decrypt for inference batuta pacha decrypt model.gguf.enc --password-env MODEL_KEY

`batuta content`

Generate structured content with quality constraints:

# Available content types batuta content types # Generate book chapter prompt batuta content emit --type bch --title "Error Handling" --audience "developers" # Validate content quality batuta content validate --type bch chapter.md

Privacy Tiers

The stack enforces data sovereignty through configurable privacy tiers:

Tier	Behavior	Use Case
Sovereign	Blocks ALL external API calls	Healthcare, Government
Private	VPC/dedicated endpoints only	Financial services
Standard	Public APIs allowed	General deployment

use batuta::serve::{BackendSelector, PrivacyTier}; let selector = BackendSelector::new() .with_privacy(PrivacyTier::Sovereign); // Returns only local backends: Realizar, Ollama, LlamaCpp let backends = selector.recommend();

Model Security

Digital Signatures (Ed25519)

Verify model integrity before loading:

use pacha::signing::{SigningKey, sign_model, verify_model}; let signing_key = SigningKey::generate(); let signature = sign_model(&model_data, &signing_key)?; // Verification fails if model tampered verify_model(&model_data, &signature)?;

Encryption at Rest (ChaCha20-Poly1305)

Protect models during distribution:

use pacha::crypto::{encrypt_model, decrypt_model}; let encrypted = encrypt_model(&model_data, "password")?; let decrypted = decrypt_model(&encrypted, "password")?;

Documentation

The Batuta Book — Comprehensive guide
Sovereign AI Stack Book — Complete stack tutorial with 22 chapters
API Documentation — Rust API reference
Specifications — Technical specifications

Design Principles

Batuta applies Toyota Production System principles:

Principle	Application
Jidoka	Automatic failover with context preservation
Poka-Yoke	Privacy tiers prevent data leakage
Heijunka	Spillover routing for load leveling
Muda	Cost circuit breakers prevent waste
Kaizen	Continuous metrics and optimization

Development

# Clone repository git clone https://github.com/paiml/batuta.git cd batuta # Build cargo build --release # Run tests cargo test # Build documentation mdbook build book

Contributing

Contributions are welcome! Please follow these guidelines:

Fork the repository and create your branch from main
Run tests before submitting: cargo test --all-features
Run lints: cargo clippy --all-targets --all-features -- -D warnings
Format code: cargo fmt --all
Update documentation for any API changes
Submit a pull request with a clear description

See our CI workflow for the full test suite.

License

MIT License — see LICENSE for details.

Links

Batuta — Orchestrating sovereign AI infrastructure.

Name		Name	Last commit message	Last commit date
Latest commit History 169 Commits
.cargo		.cargo
.claude/commands		.claude/commands
.github		.github
.pmat-metrics		.pmat-metrics
benches		benches
book		book
docs		docs
examples		examples
golden_traces		golden_traces
scripts		scripts
src		src
tests		tests
.clippy.toml		.clippy.toml
.dockerignore		.dockerignore
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
.mutants.toml		.mutants.toml
.pmat-gates.toml		.pmat-gates.toml
.pre-commit-config.yaml		.pre-commit-config.yaml
BATUTA-007-COMPLETE.md		BATUTA-007-COMPLETE.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
COVERAGE_REPORT.md		COVERAGE_REPORT.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
Dockerfile.dev		Dockerfile.dev
GOLDEN_TRACE_INTEGRATION_SUMMARY.md		GOLDEN_TRACE_INTEGRATION_SUMMARY.md
IMPLEMENTATION.md		IMPLEMENTATION.md
LICENSE		LICENSE
Makefile		Makefile
PMAT_BUG_REPORT.md		PMAT_BUG_REPORT.md
README.md		README.md
deny.toml		deny.toml
docker-compose.yml		docker-compose.yml
lcov.info		lcov.info
mutation_results_backend.txt		mutation_results_backend.txt
mutation_results_backend_improved.txt		mutation_results_backend_improved.txt
mutation_results_converters.txt		mutation_results_converters.txt
renacer.toml		renacer.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

batuta

Table of Contents

Overview

Key Capabilities

Installation

Quick Start

Demo

Stack Components

Core Components

Extended Ecosystem

Commands

`batuta analyze`

`batuta oracle`

`batuta pacha`

`batuta content`

Privacy Tiers

Model Security

Digital Signatures (Ed25519)

Encryption at Rest (ChaCha20-Poly1305)

Documentation

Design Principles

Development

Contributing

License

Links

About

Uh oh!

Releases 2

Packages

Contributors 2

Uh oh!

Languages

License

paiml/batuta

Folders and files

Latest commit

History

Repository files navigation

batuta

Table of Contents

Overview

Key Capabilities

Installation

Quick Start

Demo

Stack Components

Core Components

Extended Ecosystem

Commands

batuta analyze

batuta oracle

batuta pacha

batuta content

Privacy Tiers

Model Security

Digital Signatures (Ed25519)

Encryption at Rest (ChaCha20-Poly1305)

Documentation

Design Principles

Development

Contributing

License

Links

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 2

Uh oh!

Languages

`batuta analyze`

`batuta oracle`

`batuta pacha`

`batuta content`

Packages