LLM Integrations

LLM Integrations

LLM-Agnostic by Design

Hyperdrome is not locked into a single AI provider. The agent architecture separates intent parsing, context gathering, and execution into independent layers — so the LLM backend can be swapped without changing anything else. This means Hyperdrome works with any large language model today, and will work with any model released tomorrow.

Supported Models

/images/brands/anthropic.svg

Claude

Opus, Sonnet, Haiku — Live
/images/brands/openai.svg

GPT-4

GPT-4o, GPT-4, GPT-4 Turbo — Live
/images/brands/gemini.svg

Gemini

Gemini 2.5 Pro, 2.5 Flash — Live
/images/brands/meta.svg

Open Source

Llama, Mistral, Qwen, DeepSeek — Planned
All models are available at $0 cost to users. Hyperdrome covers inference costs.

Architecture

The agent pipeline is model-agnostic at every stage:
LLM-Agnostic Pipeline
StepComponentLLM Required?Description
1Intent ParserYes — any LLMAnalyzes the user’s message to determine intent (swap, LP, vote, etc.)
2Context GathererNoReads on-chain state + off-chain data (wallet, pools, prices, APRs)
3ResponderYes — any LLMGenerates natural language explanation with rich UI cards
4ExecutorNoBuilds and submits the transaction on-chain
The LLM is only involved in two steps: understanding what the user wants, and explaining the result. Everything else — reading blockchain state, building transactions, submitting on-chain — is deterministic code.

Intelligent Routing

The backend automatically selects the best model for each request based on:
FactorLogic
ComplexitySimple queries (price checks, balance) → fastest model. Complex multi-step actions → most capable model.
LanguageSome models perform better in specific languages. The router optimizes for the user’s detected language.
LatencyIf the primary model is slow or unavailable, the router falls back to the next best option in < 500ms.
CostThe router balances capability vs. cost to keep inference free for all users.

Bring Your Own LLM (Coming Soon)

This feature is on the roadmap and not yet available.
Users will be able to connect their own LLM provider:
  • API key — Plug in your own OpenAI, Anthropic, Google, or any OpenAI-compatible API key
  • Self-hosted models — Point the agent to your own Ollama, vLLM, or TGI endpoint running Llama, Mistral, Qwen, or any open-source model
  • Custom system prompts — Tailor the agent’s personality, risk tolerance, and response style
  • Full privacy — When using your own model, no data passes through Hyperdrome’s inference servers

Compatible Endpoints

Any endpoint that implements the OpenAI Chat Completions API format will work:
RuntimeExample Models
OllamaLlama 3.3, Mistral, Qwen 2.5, DeepSeek V3, Gemma
vLLMAny HuggingFace model
TGIAny HuggingFace model
Together AILlama, Mixtral, Qwen
GroqLlama, Mixtral, Gemma
FireworksLlama, Mixtral, DeepSeek

Why LLM-Agnostic Matters

  1. No vendor lock-in — If a provider raises prices, degrades quality, or adds restrictions, Hyperdrome switches seamlessly.
  2. Best model for the job — Different models excel at different tasks. Routing lets Hyperdrome use the best tool for each request.
  3. Future-proof — New models are released monthly. LLM-agnostic architecture means Hyperdrome adopts them immediately without rewrites.
  4. User sovereignty — With Bring Your Own LLM, users control their data and inference. No dependency on centralized AI providers.
  5. Censorship resistance — Open-source models can’t be shut down or restricted. Users running their own Llama or Mistral instance have full autonomy.