author Paul Buetow <paul@buetow.org> 2025-12-24 10:52:20 +0200
committer Paul Buetow <paul@buetow.org> 2025-12-24 10:52:20 +0200
commit 68b83a05fe183b6271750e250d6a7ac02c51bd81
tree 1e0e0d4e6af014aedefbca44ab20dadba4ddb2a0 /gemfeed
parent 5b9df6aa49b806b23fc36c8bef443f563744671b
Update content for md
Diffstat (limited to 'gemfeed')
-rw-r--r-- gemfeed/2025-12-24-x-rag-observability-hackathon.md | 4
1 files changed, 2 insertions, 2 deletions
diff --git a/gemfeed/2025-12-24-x-rag-observability-hackathon.md b/gemfeed/2025-12-24-x-rag-observability-hackathon.md
index bcc23ef9..76d1eede 100644
--- a/gemfeed/2025-12-24-x-rag-observability-hackathon.md
+++ b/gemfeed/2025-12-24-x-rag-observability-hackathon.md
@@ -2,7 +2,7 @@
> Published at 2025-12-24T09:45:29+02:00
-This blog post describes my hackathon efforts adding observability to X-RAG, a distributed Retrieval-Augmented Generation (RAG) platform built by my brother Florian. I especially made time available over the weekend to join his 3-day hackathon (attending 2 days) with the goal of instrumenting his existing distributed system with observability. What started as "let's add some metrics" turned into a comprehensive implementation of the three pillars of observability: tracing, metrics, and logs.
+This blog post describes my hackathon efforts adding observability to X-RAG, the extensible Retrieval-Augmented Generation (RAG) platform built by my brother Florian. I especially made time available over the weekend to join his 3-day hackathon (attending 2 days) with the goal of instrumenting his existing distributed system with observability. What started as "let's add some metrics" turned into a comprehensive implementation of the three pillars of observability: tracing, metrics, and logs.
[X-RAG source code on GitHub](https://github.com/florianbuetow/x-rag)
@@ -48,7 +48,7 @@ This blog post describes my hackathon efforts adding observability to X-RAG, a d
## What is X-RAG?
-X-RAG is the extendendible RAG (Retrieval-Augmented Generation) platform running on Kubernetes. The idea behind RAG is simple: instead of asking an LLM to answer questions from its training data alone, you first retrieve relevant documents from your own knowledge base, then feed those documents to the LLM as context. The LLM synthesises an answer grounded in your actual content—reducing hallucinations and enabling answers about private or recent information the model was never trained on.
+X-RAG is the extensible RAG (Retrieval-Augmented Generation) platform running on Kubernetes. The idea behind RAG is simple: instead of asking an LLM to answer questions from its training data alone, you first retrieve relevant documents from your own knowledge base, then feed those documents to the LLM as context. The LLM synthesises an answer grounded in your actual content—reducing hallucinations and enabling answers about private or recent information the model was never trained on.
X-RAG handles the full pipeline: ingest documents, chunk them into searchable pieces, generate vector embeddings, store them in a vector database, and at query time, retrieve relevant chunks and pass them to an LLM for answer generation. The system supports both local LLMs (Florian runs his on a beefy desktop) and cloud APIs like OpenAI. I configured an OpenAI API key since my laptop's CPU and GPU aren't fast enough for decent local inference.
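
The pipeline stages named in the paragraph above (chunk, embed, store, retrieve, build a prompt for the LLM) can be illustrated with a small sketch. This is not X-RAG's code: every function name here is hypothetical, the "embedding" is a toy bag-of-words count vector standing in for a real embedding model, the "vector database" is a plain Python list, and the final LLM call is left out so the example runs offline.

```python
import math
from collections import Counter


def chunk(text: str, size: int = 8) -> list[str]:
    """Split text into chunks of at most `size` words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def embed(text: str) -> Counter:
    """Toy embedding (assumption): word counts instead of a learned vector."""
    return Counter(word.strip(".,?!").lower() for word in text.split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[term] * b[term] for term in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, store: list[tuple[str, Counter]], k: int = 2) -> list[str]:
    """Return the k stored chunks most similar to the query."""
    query_vec = embed(query)
    ranked = sorted(store, key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [text for text, _ in ranked[:k]]


def build_prompt(query: str, chunks: list[str]) -> str:
    """Assemble the retrieved chunks and the question into one LLM prompt."""
    context = "\n".join(f"- {c}" for c in chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


# Ingest: chunk each document and store (chunk, vector) pairs in memory.
documents = [
    "Traces follow a single request across services.",
    "Metrics are numeric measurements aggregated over time.",
    "Logs are timestamped records of discrete events.",
]
store = [(piece, embed(piece)) for doc in documents for piece in chunk(doc)]

# Query time: retrieve the best chunk and build the prompt for the LLM.
question = "What do traces follow?"
top = retrieve(question, store, k=1)
prompt = build_prompt(question, top)
print(prompt)
```

In a real deployment the `embed` function would call an embedding model and `store` would be a vector database; the control flow between the stages would likely stay similar.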