DEV Community

# rag

Retrieval-augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by retrieving relevant custom data and supplying it to the model as context.
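As a rough illustration of that idea, here is a minimal sketch of the retrieve-then-augment step, assuming a toy in-memory corpus and plain word-count cosine similarity in place of a real embedding model. The names `DOCUMENTS`, `retrieve`, and `build_prompt` are hypothetical, and the final call to an LLM is left out on purpose.

```python
import math
import re
from collections import Counter

# Hypothetical "custom data": a tiny in-memory corpus used only for illustration.
DOCUMENTS = [
    "The office is closed on public holidays.",
    "Refunds are processed within five business days.",
    "The support desk opens at nine in the morning.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two word-count vectors (Counter objects)."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, k=1):
    """Return the k documents most similar to the query."""
    query_vec = Counter(tokenize(query))
    ranked = sorted(
        documents,
        key=lambda doc: cosine(query_vec, Counter(tokenize(doc))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, documents, k=1):
    """Augment the user question with the retrieved context."""
    context = "\n".join(f"- {doc}" for doc in retrieve(query, documents, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


if __name__ == "__main__":
    # The resulting prompt is what would be sent to an LLM of your choice.
    print(build_prompt("When are refunds processed?", DOCUMENTS))
```

In a real system the word-count vectors would typically be replaced by embeddings stored in a vector database, but the overall shape (retrieve, then build the prompt) stays the same.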

Posts

Build a Serverless RAG Engine for $0
3 min read
Docling Speaks LaTeX: Unlocking Academic and Scientific Documents
23 min read
Show DEV: PardusDB – The "SQLite of Vector DBs" written in Rust
1 min read
Chatting with 3 Billion Base Pairs: Building a RAG Index for Your Personal Genome (WGS)
4 min read
What’s Actually Making Your LLM Costs Skyrocket?
2 min read
Scaling RAG : Demo to Production Ready
2 min read
Are We Over-Engineering LLM Stacks Too Early?
2 min read
What It Actually Takes to Run a RAG System in Production
2 min read
Building a Production RAG Server with Ollama, Open WebUI and Chroma DB
1 min read
Medicine Encyclopedia 2.0: Stop Guessing and Start Scanning with Multimodal RAG
4 min read
New here - Full Stack Engineer
1 min read
From Paper Trails to Health Insights: Building a Personal EHR Semantic Search Engine with Hybrid Search
4 min read
Stop Paying for APIs: Build a 100% Local AI Auditor with Python & Llama 3
4 min read
Reducing OCR Cost in RAG Pipelines with Page-Level Detection
1 min read
Graph RAG and Agentic RAG: The Next Evolution of Retrieval
16 min read