Hermes
Hermes can use a custom endpoint as the main model provider. Configure the endpoint as OpenAI-compatible Chat Completions, then select the Voxvey model.
1. Set the token
export VOXVEY_TOKEN="<access-token>"
2. Add Voxvey in the dashboard
- Open the Hermes dashboard.
- Go to Keys.
- Add a custom endpoint provider.
- Use
Voxveyas the display name. - Use
https://api.voxvey.com/v1as the base URL. - Use your Voxvey bearer token as the API key.
- Set the API mode to
chat_completions.
3. Select the main model
Open Models, click Change on the main model row, and select:
deepseek/deepseek-v4-flash
Hermes applies model changes to new sessions. Use /model inside a running chat
if you need to switch the current session immediately.
Direct config
You can also edit ~/.hermes/config.yaml.
model:
provider: voxvey
default: deepseek/deepseek-v4-flash
base_url: https://api.voxvey.com/v1
api_mode: chat_completions
api_key: ${VOXVEY_TOKEN}
If Hermes stores secrets separately on your install, add the API key through
the dashboard or hermes setup, then keep api_key empty in the config file.
Verify
hermes status
hermes chat
In the chat, send:
Reply with the exact word ready.
If Hermes reports no authenticated providers, re-open Keys and confirm the custom endpoint has both a base URL and a token.
Messages mode
The setup above uses Chat Completions. If your Hermes build exposes an Anthropic
Messages API mode, use the host root and a model returned by /v1/models for
that API surface:
model:
provider: voxvey
default: <messages-model-id>
base_url: https://api.voxvey.com
api_mode: messages
api_key: ${VOXVEY_TOKEN}
Use this mode when Hermes should call /v1/messages instead of
/v1/chat/completions.