Skip to content

Getting Started

New to AI and not sure what any of this means? Start here. This page explains what hermes-router is, the handful of words you’ll see everywhere, and how to send your very first message — step by step.

It’s a free middleman between your program and AI models. Your program asks it a question; it quietly finds a free AI provider that’s available and gets you an answer — switching to another provider automatically if one is busy or rate-limited.

An analogy: imagine a phone operator with a stack of calling cards from different networks. You ask to make a call; the operator tries one card, and if it’s out of minutes, instantly tries the next — you never get a busy signal. hermes-router is that operator, and the “cards” are free API keys from providers like Google Gemini, Groq, and others.

  • It’s free. It uses the free tiers of many AI providers.
  • It doesn’t go down. When one provider hits its limit, it falls back to another, so your app keeps working.
  • It’s a drop-in. If you already use code that talks to OpenAI or Anthropic, you change one line (the address) and it works.

You don’t need to memorize these — skim them, and come back when one shows up. There’s a fuller list in concepts.md.

  • LLM (Large Language Model) — the actual AI “brain” that reads text and writes a reply (e.g. GPT, Gemini, Claude, Llama).
  • API key — a secret password that lets your program use a provider. You get these free from the providers (see providers.md).
  • Provider — a company that hosts LLMs you can call over the internet (Gemini, Groq…).
  • Token — roughly ¾ of a word. AI usage and limits are measured in tokens.
  • Prompt — the text you send to the AI.
Terminal window
curl -fsSL https://raw.githubusercontent.com/Shaf2665/Hermes-router/main/get.sh | bash

This downloads hermes-router and adds an hr command to your terminal.

Run the friendly setup wizard:

Terminal window
hr setup

It will ask which provider you have a key for and walk you through it. Don’t have one yet? Gemini is the easiest and most generous — get a free key at aistudio.google.com, then paste it when asked. (More options in providers.md.)

Terminal window
curl http://localhost:8319/health

You should see {"status":"ok",...}. That means the router is up and listening.

The router understands the same “language” as the popular OpenAI library, so any OpenAI example works — you just point it at the router. Install the library and run this:

Terminal window
pip install openai
from openai import OpenAI
# api_key here is the router's own password (default "sk-router-1"), NOT a provider key.
client = OpenAI(base_url="http://localhost:8319/v1", api_key="sk-router-1")
reply = client.chat.completions.create(
model="hermes-router",
messages=[{"role": "user", "content": "Explain what an AI agent is, simply."}],
)
print(reply.choices[0].message.content)

Run it — you just made your first AI call, for free, through hermes-router. 🎉

  • Want to build something that does things, not just chat?build-an-agent.md walks you from a chatbot to a real AI agent, step by step.
  • Confused by a term?concepts.md is a plain-language glossary.
  • Want more providers / more reliability?providers.md.
  • Want to change settings?configuration.md.