◆
·11 min read
Detecting Hallucinations in LLM Summaries
LLMs write convincingly but fabricate facts. A practical tour of automated detection techniques: BERTScore, embedding similarity, ROUGE/n-gram overlap, NER-based cross-referencing, and QAEVAL.
Tag
2 posts tagged with this.
LLMs write convincingly but fabricate facts. A practical tour of automated detection techniques: BERTScore, embedding similarity, ROUGE/n-gram overlap, NER-based cross-referencing, and QAEVAL.
An orchestration platform for coding agents — Kanban board, live terminals, MCP server per agent, and real-time notifications so you know the moment an agent gets stuck and can act immediately. Agents run on AWS VMs sandboxed with ptracer and minivisor for security and full observability.
All tags