DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
How to Implement Prompt Caching on Amazon Bedrock and Cut Inference Costs in Half

How to Implement Prompt Caching on Amazon Bedrock and Cut Inference Costs in Half

Comments
12 min read
Securing AI-Powered Applications: A Comprehensive Guide to Protecting Your LLM-Integrated Web App

Securing AI-Powered Applications: A Comprehensive Guide to Protecting Your LLM-Integrated Web App

Comments
8 min read
Lost in the Middle: Why Bigger Context Windows Don’t Always Improve LLM Performance

Lost in the Middle: Why Bigger Context Windows Don’t Always Improve LLM Performance

Comments
3 min read
How I ran LLM + RAG fully offline on Android using MNN

How I ran LLM + RAG fully offline on Android using MNN

Comments
3 min read
Fundamental matters more in AI era

Fundamental matters more in AI era

Comments
3 min read
The Hidden Dangers of AI Agents: 11 Critical Security Risks in Model Context Protocol (MCP)

The Hidden Dangers of AI Agents: 11 Critical Security Risks in Model Context Protocol (MCP)

Comments
20 min read
The Real Reason AI Agents “Work” in Software

The Real Reason AI Agents “Work” in Software

Comments
6 min read
Secrets Management for LLM Tools: Don’t Let Your OpenAI Keys End Up on GitHub 🚨

Secrets Management for LLM Tools: Don’t Let Your OpenAI Keys End Up on GitHub 🚨

Comments
3 min read
Andrej Karpathy's microGPT Architecture — Complete Guide

Andrej Karpathy's microGPT Architecture — Complete Guide

Comments
9 min read
From Data to Dialogue: Creating a Technical Design for Smart FAQs using LLMs, Pinecone & Kafka

From Data to Dialogue: Creating a Technical Design for Smart FAQs using LLMs, Pinecone & Kafka

Comments
7 min read
Deploy AI Models Locally: Run LLMs on Your Machine Without API Costs

Deploy AI Models Locally: Run LLMs on Your Machine Without API Costs

Comments
5 min read
How caching helps in LLM Application?

How caching helps in LLM Application?

Comments
2 min read
Building AI Agents That Actually Work in Business Workflows

Building AI Agents That Actually Work in Business Workflows

Comments 1
7 min read
How I Cut My LLM Costs by 70% Without Losing Quality

How I Cut My LLM Costs by 70% Without Losing Quality

4
Comments 4
7 min read
LLM Steering: From Prompting Tricks to Activation Control

LLM Steering: From Prompting Tricks to Activation Control

1
Comments
6 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.