summaryrefslogtreecommitdiff
path: root/docs
diff options
context:
space:
mode:
authorPaul Buetow <paul@buetow.org>2026-03-15 23:24:00 +0200
committerPaul Buetow <paul@buetow.org>2026-03-15 23:24:00 +0200
commit8ec8ee16e23081018e32dea122ecd9a3b8d8b2c7 (patch)
tree5a564bb36fc9750d3353435d2dd3cf2f28fa5261 /docs
parent10112d4b7a8150118e705b95df73c08824ac2b22 (diff)
Release v0.23.0v0.23.0
Diffstat (limited to 'docs')
-rw-r--r--docs/usage.md9
1 files changed, 9 insertions, 0 deletions
diff --git a/docs/usage.md b/docs/usage.md
index 1ecf30f..8404969 100644
--- a/docs/usage.md
+++ b/docs/usage.md
@@ -87,6 +87,10 @@ Defaults: concise answers. If the prompt asks for commands, Hexai outputs only c
Provider/model headers and run summaries are written to `stderr`, so `stdout` stays usable in pipes.
+Successful CLI responses are cached for 24 hours under Hexai's cache directory. The cache key includes the input text, provider, resolved model, and effective CLI prompt text, so prompt/config changes automatically invalidate old entries.
+
+To rerun a multi-provider prompt and print only one response cleanly, use the existing numbered provider flags such as `-0`, `-1`, etc. That reuses the cached response for just that provider when available, which avoids the side-by-side layout on `stdout`.
+
`--tps-simulation` accepts either a fixed rate such as `20` or a range such as `12-18`. It streams positional arguments, piped stdin, or built-in placeholder text when no input is provided, so you can preview perceived model latency without needing a real provider or local hardware.
### Examples
@@ -107,6 +111,11 @@ hexai 'install ripgrep on macOS'
# Verbose explanation
hexai 'install ripgrep on macOS and explain'
+# Warm the cache with two configured CLI providers, then print only the
+# second provider's cached response on a rerun
+hexai 'summarize this file'
+hexai -1 'summarize this file'
+
# Simulate 12-18 tokens per second with placeholder text
hexai --tps-simulation 12-18