3d27's submissions | Hacker News

1.		How to evaluate multi-turn LLM chatbots (confident-ai.com)
		3 points by 3d27 on Oct 8, 2024 \| past
2.		We wrote a comprehensive guide on LLM security (confident-ai.com)
		1 point by 3d27 on Aug 20, 2024 \| past
3.		How to generate synthetic data using SOTA data evolution methods (confident-ai.com)
		1 point by 3d27 on May 21, 2024 \| past
4.		How to build your own LLM evaluation framework (confident-ai.com)
		2 points by 3d27 on April 15, 2024 \| past
5.		Overview of All Major LLM Benchmarks (confident-ai.com)
		1 point by 3d27 on March 22, 2024 \| past
6.		I wrote an article about everything I know about LLM metrics (confident-ai.com)
		2 points by 3d27 on March 12, 2024 \| past \| 1 comment
7.		Best practices I learnt from helping health tech enterprise test LLMs (confident-ai.com)
		1 point by 3d27 on Feb 27, 2024 \| past
8.		Am I too needy? From a data science perspective (medium.com/swlh)
		1 point by 3d27 on Feb 26, 2024 \| past \| 1 comment
9.		Best Practices for Unit Testing RAG Systems in Prod (confident-ai.com)
		4 points by 3d27 on Feb 6, 2024 \| past
10.		Tried Apple's Vision Pros, would not recommend it (theverge.com)
		2 points by 3d27 on Feb 5, 2024 \| past
11.		Everything I know about LLM evaluation metrics (confident-ai.com)
		7 points by 3d27 on Jan 24, 2024 \| past
12.		Google 2024 Layoffs on a rolling-basis
		1 point by 3d27 on Jan 20, 2024 \| past
13.		Meta Going All in on GenAI (datacenterdynamics.com)
		3 points by 3d27 on Jan 19, 2024 \| past \| 2 comments
14.		I used QAG to implement an LLM text summarization evals (confident-ai.com)
		3 points by 3d27 on Dec 19, 2023 \| past
15.		I found a way to code like Shakespear (shakespearelang.com)
		1 point by 3d27 on Dec 15, 2023 \| past \| 1 comment
16.		I implemented 12+ LLM evaluation metrics so you don't have to (reddit.com)
		4 points by 3d27 on Dec 13, 2023 \| past \| 1 comment
17.		AI Makes Commercial Masterpiece [video] (youtube.com)
		2 points by 3d27 on Dec 13, 2023 \| past
18.		Show HN: I implemented evals metrics for LLMs that runs locally on your machine (github.com/confident-ai)
		22 points by 3d27 on Dec 11, 2023 \| past \| 3 comments
19.		Overcoming the biggest barrier to practical quantum computers (breakingdefense.com)
		1 point by 3d27 on Dec 11, 2023 \| past
20.		Google's new model is good but the demo's not reproducible in Bard (boingboing.net)
		1 point by 3d27 on Dec 7, 2023 \| past
21.		What Is RAG? (With Examples) (confident-ai.com)
		1 point by 3d27 on Dec 1, 2023 \| past
22.		Found this weird programming language (wikipedia.org)
		2 points by 3d27 on Nov 27, 2023 \| past