LLM Integrations

LLM-Agnostic by Design

Hyperdrome is not locked into a single AI provider. The agent architecture separates intent parsing, context gathering, and execution into independent layers — so the LLM backend can be swapped without changing anything else. This means Hyperdrome works with any large language model today, and will work with any model released tomorrow.

Supported Models

Claude

Opus, Sonnet, Haiku — Live

GPT-4

GPT-4o, GPT-4, GPT-4 Turbo — Live

Gemini

Gemini 2.5 Pro, 2.5 Flash — Live

Open Source

Llama, Mistral, Qwen, DeepSeek — Planned

All models are available at $0 cost to users. Hyperdrome covers inference costs.

Architecture

The agent pipeline is model-agnostic at every stage:

Step	Component	LLM Required?	Description
1	Intent Parser	Yes — any LLM	Analyzes the user’s message to determine intent (swap, LP, vote, etc.)
2	Context Gatherer	No	Reads on-chain state + off-chain data (wallet, pools, prices, APRs)
3	Responder	Yes — any LLM	Generates natural language explanation with rich UI cards
4	Executor	No	Builds and submits the transaction on-chain

The LLM is only involved in two steps: understanding what the user wants, and explaining the result. Everything else — reading blockchain state, building transactions, submitting on-chain — is deterministic code.

Intelligent Routing

The backend automatically selects the best model for each request based on:

Factor	Logic
Complexity	Simple queries (price checks, balance) → fastest model. Complex multi-step actions → most capable model.
Language	Some models perform better in specific languages. The router optimizes for the user’s detected language.
Latency	If the primary model is slow or unavailable, the router falls back to the next best option in < 500ms.
Cost	The router balances capability vs. cost to keep inference free for all users.

Bring Your Own LLM (Coming Soon)

This feature is on the roadmap and not yet available.

Users will be able to connect their own LLM provider:

API key — Plug in your own OpenAI, Anthropic, Google, or any OpenAI-compatible API key
Self-hosted models — Point the agent to your own Ollama, vLLM, or TGI endpoint running Llama, Mistral, Qwen, or any open-source model
Custom system prompts — Tailor the agent’s personality, risk tolerance, and response style
Full privacy — When using your own model, no data passes through Hyperdrome’s inference servers

Compatible Endpoints

Any endpoint that implements the OpenAI Chat Completions API format will work:

Runtime	Example Models
Ollama	Llama 3.3, Mistral, Qwen 2.5, DeepSeek V3, Gemma
vLLM	Any HuggingFace model
TGI	Any HuggingFace model
Together AI	Llama, Mixtral, Qwen
Groq	Llama, Mixtral, Gemma
Fireworks	Llama, Mixtral, DeepSeek

Why LLM-Agnostic Matters

No vendor lock-in — If a provider raises prices, degrades quality, or adds restrictions, Hyperdrome switches seamlessly.
Best model for the job — Different models excel at different tasks. Routing lets Hyperdrome use the best tool for each request.
Future-proof — New models are released monthly. LLM-agnostic architecture means Hyperdrome adopts them immediately without rewrites.
User sovereignty — With Bring Your Own LLM, users control their data and inference. No dependency on centralized AI providers.
Censorship resistance — Open-source models can’t be shut down or restricted. Users running their own Llama or Mistral instance have full autonomy.

​LLM Integrations

​LLM-Agnostic by Design

​Supported Models

Claude

GPT-4

Gemini

Open Source

​Architecture

​Intelligent Routing

​Bring Your Own LLM (Coming Soon)

​Compatible Endpoints

​Why LLM-Agnostic Matters

LLM Integrations

LLM-Agnostic by Design

Supported Models

Architecture

Intelligent Routing

Bring Your Own LLM (Coming Soon)

Compatible Endpoints

Why LLM-Agnostic Matters