Today, I am super excited to introduce Lynx, the leading hallucination detection model! 🚀

Hallucinations make it really hard to ship AI products. To address this problem, developers have begun to use general-purpose LLMs like GPT-4 to evaluate other LLMs (‘LLM-as-a-judge’). However, general-purpose LLMs weren’t designed to be great at evaluation. Prior research shows that current LLM-as-a-judge approaches are unreliable and inconsistent.

Enter Lynx. Lynx beats GPT-4o and all state-of-the-art LLMs on RAG hallucination tasks. And we’ve open-sourced it ✨

You can run quantized Lynx-8B locally, run Lynx-70B on GPUs, or just reach out to Patronus AI for easy API access 😃

We are thrilled to launch Lynx with our Day 1 Integration Partners: NVIDIA, MongoDB, and Nomic AI. ⚡

At Patronus AI, our mission is to make high-quality LLM evaluation accessible to everyone. The best is yet to come.

Download quantized Lynx-8B on Hugging Face: https://lnkd.in/eaTpM3u6
Download Lynx-70B on Hugging Face: https://lnkd.in/eauKirMc
Read the arXiv paper: https://lnkd.in/e--_R8Cg
Read our blog: https://lnkd.in/eJ4AgX8T
Use Lynx with NVIDIA's NeMo Guardrails: https://lnkd.in/ec2yJFrZ
Read about our Nomic AI Atlas integration: https://lnkd.in/eKxeq65D