LLM Provider Reference

Per-provider configuration for all backends supported by kdeps. Backend and API keys go in ~/.kdeps/config.yaml. See LLM Backends for routing, allowlists, and streaming.

Local Backend

Ollama (Default)

Ollama is the default backend for local model serving.

yaml
# ~/.kdeps/config.yaml
llm:
  backend: ollama
  # base_url: http://custom-ollama:11434   # optional override

Docker image builds install Ollama automatically when backend: ollama is set. The installOllama workflow flag can force or suppress this:

yaml
# workflow.yaml
settings:
  agentSettings:
    installOllama: true  # Force install (default: auto-detect from chat resources)
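
The flag can presumably be set to false to skip the install, for example when the image talks to an Ollama server that runs elsewhere. A minimal sketch, assuming false is the value that suppresses installation (only true is shown above):

```yaml
# workflow.yaml
settings:
  agentSettings:
    installOllama: false  # Assumed value: skip installing Ollama in the image
```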

Cloud Backends

Any service that implements the OpenAI chat completions API works with kdeps.

OpenAI

yaml
# ~/.kdeps/config.yaml
llm:
  backend: openai
  openai_api_key: sk-...

| Model | Description |
| --- | --- |
| gpt-4o | Latest GPT-4 Omni |
| gpt-4o-mini | Smaller, faster GPT-4 |
| gpt-4-turbo | GPT-4 Turbo |
| gpt-3.5-turbo | Fast, cost-effective |

Anthropic (Claude)

yaml
# ~/.kdeps/config.yaml
llm:
  backend: anthropic
  anthropic_api_key: sk-ant-...

| Model | Description |
| --- | --- |
| claude-sonnet-4-20250514 | Latest Claude Sonnet 4 |
| claude-3-5-sonnet-20241022 | Claude 3.5 Sonnet |
| claude-3-opus-20240229 | Most capable Claude 3 |
| claude-3-haiku-20240307 | Fast, efficient |

Google (Gemini)

yaml
# ~/.kdeps/config.yaml
llm:
  backend: google
  google_api_key: ...

| Model | Description |
| --- | --- |
| gemini-1.5-pro | Latest Gemini Pro |
| gemini-1.5-flash | Fast inference |
| gemini-pro | Standard Gemini |

Mistral

yaml
# ~/.kdeps/config.yaml
llm:
  backend: mistral
  mistral_api_key: ...

| Model | Description |
| --- | --- |
| mistral-large-latest | Most capable |
| mistral-medium-latest | Balanced |
| mistral-small-latest | Fast, efficient |
| open-mistral-7b | Open-source 7B |
| open-mixtral-8x7b | MoE model |

Groq

Low-latency inference on Groq's LPU hardware.

yaml
# ~/.kdeps/config.yaml
llm:
  backend: groq
  groq_api_key: ...

| Model | Description |
| --- | --- |
| llama-3.1-70b-versatile | Llama 3.1 70B |
| llama-3.1-8b-instant | Llama 3.1 8B (fastest) |
| mixtral-8x7b-32768 | Mixtral with 32K context |
| gemma2-9b-it | Google Gemma 2 9B |

Together AI

yaml
# ~/.kdeps/config.yaml
llm:
  backend: together
  together_api_key: ...

| Model | Description |
| --- | --- |
| meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | Llama 3.1 70B |
| meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | Llama 3.1 8B |
| mistralai/Mixtral-8x7B-Instruct-v0.1 | Mixtral 8x7B |
| Qwen/Qwen2-72B-Instruct | Qwen2 72B |

Perplexity

Search-augmented LLM responses.

yaml
# ~/.kdeps/config.yaml
llm:
  backend: perplexity
  perplexity_api_key: ...

| Model | Description |
| --- | --- |
| llama-3.1-sonar-large-128k-online | Large with web search |
| llama-3.1-sonar-small-128k-online | Small with web search |
| llama-3.1-sonar-large-128k-chat | Large chat only |

Cohere

yaml
# ~/.kdeps/config.yaml
llm:
  backend: cohere
  cohere_api_key: ...

| Model | Description |
| --- | --- |
| command-r-plus | Most capable |
| command-r | Fast and capable |
| command | Standard |
| command-light | Fast, efficient |

DeepSeek

yaml
# ~/.kdeps/config.yaml
llm:
  backend: deepseek
  deepseek_api_key: ...

| Model | Description |
| --- | --- |
| deepseek-chat | General chat |
| deepseek-coder | Code generation |

Self-Hosted Solutions

kdeps works with any self-hosted server that implements the OpenAI API, such as vLLM, Text Generation Inference (TGI), LocalAI, or the llama.cpp server. Set backend: openai and point base_url at the server:

yaml
# ~/.kdeps/config.yaml
llm:
  backend: openai
  base_url: http://your-vllm-server:8000/v1
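
As a further sketch, the same two settings should cover a llama.cpp server running on the local machine. The address assumes llama-server's default port of 8080 and its OpenAI-compatible routes under /v1; adjust both to match the actual deployment. Whether kdeps also needs a placeholder openai_api_key for servers that do not check one is not stated here, so it is left out.

```yaml
# ~/.kdeps/config.yaml
llm:
  backend: openai
  base_url: http://localhost:8080/v1   # Assumed: llama-server default port, OpenAI-compatible routes under /v1
```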

Custom Base URL

Override the default API URL via base_url:

yaml
# Azure OpenAI
llm:
  backend: openai
  base_url: "https://my-resource.openai.azure.com/openai/deployments/my-deployment"
  openai_api_key: ...

Released under the Apache 2.0 License.