DEV Community

Lamhot Siagian profile picture

Lamhot Siagian

AI Engineer / AI Evaluation Engineer with 9+ years across software engineering and ML. I build RAG + agentic systems that are measurable, safe to ship, and observable end-to-end (evaluation)

Joined Joined on  twitter website
Beyond the Match: A Practitioner’s Guide to Biometric Authentication Metrics

Beyond the Match: A Practitioner’s Guide to Biometric Authentication Metrics

1
Comments
5 min read
LLM-as-a-Judge: Automated Scoring and Reliability vs. Human Evaluation

LLM-as-a-Judge: Automated Scoring and Reliability vs. Human Evaluation

1
Comments
6 min read
Benchmarks Are Breaking: Why Many ‘Top Scores’ Don’t Mean Production-Ready.

Benchmarks Are Breaking: Why Many ‘Top Scores’ Don’t Mean Production-Ready.

1
Comments
7 min read
If you don't red-team your LLM app, your users will

If you don't red-team your LLM app, your users will

1
Comments
7 min read
Evals Aren’t a One-Time Report: Build a Living Test Suite That Ships With Every Release.

Evals Aren’t a One-Time Report: Build a Living Test Suite That Ships With Every Release.

1
Comments
6 min read
Accuracy Is Expensive: How to Evaluate ‘Quality per $’ for Agents and RAG

Accuracy Is Expensive: How to Evaluate ‘Quality per $’ for Agents and RAG

1
Comments
6 min read
loading...