🎉 ai-infra v1.0 is here — Production-ready AI/LLM infrastructure

Infrastructure that just works. Ship products, not boilerplate.

Frameworks

svc-infra
ai-infra
fin-infra
robo-infra

Resources

Getting Started
What's New
Contributing

Community

GitHub

© 2026 nfrax. All rights reserved.

Start Here What's New

ai-infra

API Reference

Auto-generated API documentation from Python docstrings. Browse classes by category or search for specific functionality.

34 classes•257 methods•55 async•9 categories

Core

5

In-memory storage backend

(tool_name: str, callback: Callable[[ProgressEvent], Any] | None)

Stream for sending progress updates from tools

(db_path: str | Path)

SQLite-based storage backend

TracingCallbacks

(tracer: Tracer | None)

Callbacks that create spans for ai-infra operations

WorkflowRecorder

(record_id: str, storage: Storage | None)

Records agent workflow steps for later replay

LLM

9

(tools: list[Any] | None, provider: str | None, model_name: str | None, ...)

Agent-oriented interface (tool calling, streaming updates, fallbacks)

(voice: str, format: str)

Configuration for audio output from LLMs

(callbacks: Callbacks | (CallbackManager | None))

Direct model convenience interface (no agent graph)

(backend: str, embedding_provider: str | None, embedding_model: str | None, ...)

Long-term memory store with semantic search

(name: str, prompt: str, tools: list[str] | None, ...)

Agent persona configuration

(provider: str | BaseRealtimeProvider | None, config: RealtimeConfig | None)

High-level facade for real-time voice conversations

(provider: str | None, model: str | None, language: str | None, ...)

Speech-to-Text with provider-agnostic API

(provider: str | None, voice: str | None, model: str | None, ...)

Text-to-Speech with provider-agnostic API

(root: str | Path, mode: WorkspaceMode)

Unified workspace configuration for all agent file operations

Agents

1

(nodes: dict[str, Any] | Sequence | None, edges: Sequence | None, entry: str | None, ...)

Production-ready workflow graph with zero-config building

MCP

4

CachingInterceptor

(ttl_seconds: int, _cache: dict[str, tuple[CallToolResult, float]])

Cache tool call results for a configurable TTL

(config: list[dict] | list[McpServerConfig], callbacks: Callbacks | CallbackManager | None, interceptors: list[ToolCallInterceptor] | None, ...)

MCP Client for connecting to one or more MCP servers

MCPSecuritySettings

(domains: Sequence[str] | None, enable_security: bool, allowed_hosts: Sequence[str] | None, ...)

Security settings for MCP servers with automatic environment detection

(strict: bool, health_path: str)

MCP Server for hosting one or more MCP endpoints

Embeddings & Retrieval

3

(provider: str | None, model: str | None, dimensions: int | None, ...)

Simple, provider-agnostic text embeddings

(provider: str | None, model: str | None, embeddings: Embeddings | None, ...)

Semantic search made simple

(embeddings: Embeddings, backend: Literal['memory', 'chroma', 'faiss'], collection_name: str, ...)

Simple vector store for semantic search

Evaluation

5

ContainsExpected

(case_sensitive: bool)

Check if output contains the expected output text

(min_length: int, max_length: int | None, count_words: bool)

Check if output length is within a specified range

RAGFaithfulness

(llm_judge: str | None, provider: str | None, context_key: str, ...)

Evaluate if an answer is grounded in the provided context

SemanticSimilarity

(provider: str | None, model: str | None, threshold: float, ...)

Evaluate semantic similarity between output and expected output

ToolUsageEvaluator

(expected_tools: list[str], forbidden_tools: list[str], require_all: bool, ...)

Evaluate that an agent called expected tools

Callbacks & Observability

5

CallbackManager

(callbacks: Sequence[Callbacks] | None, critical_callbacks: Sequence[Callbacks] | None)

Manages multiple callback handlers

Base class for callback handlers

LoggingCallbacks

Callback that logs all events to the ai-infra logger

MetricsCallbacks

Callback that collects metrics about operations

(verbose: bool)

Simple callback that prints events to stdout

Image Generation

1

(provider: str | None, model: str | None, api_key: str | None, ...)

Provider-agnostic image generation

Providers

1

ProviderRegistry

Central registry for all AI provider configurations