
ymarkov 12 hours ago | on: Launch HN: Voygr (YC W26) – A better maps API for ...

We use LLM judges with web grounding, plus manual grading. We recently benchmarked LLM quality across the major AI providers. We plan to open source that benchmark soon, and will probably open source our API quality check benchmark too: https://qht.co/item?id=47366423
