| 1. | | How to evaluate multi-turn LLM chatbots (confident-ai.com) |
| 3 points by 3d27 on Oct 8, 2024 | past |
|
| 2. | | We wrote a comprehensive guide on LLM security (confident-ai.com) |
| 1 point by 3d27 on Aug 20, 2024 | past |
|
| 3. | | How to generate synthetic data using SOTA data evolution methods (confident-ai.com) |
| 1 point by 3d27 on May 21, 2024 | past |
|
| 4. | | How to build your own LLM evaluation framework (confident-ai.com) |
| 2 points by 3d27 on April 15, 2024 | past |
|
| 5. | | Overview of All Major LLM Benchmarks (confident-ai.com) |
| 1 point by 3d27 on March 22, 2024 | past |
|
| 6. | | I wrote an article about everything I know about LLM metrics (confident-ai.com) |
| 2 points by 3d27 on March 12, 2024 | past | 1 comment |
|
| 7. | | Best practices I learnt from helping health tech enterprise test LLMs (confident-ai.com) |
| 1 point by 3d27 on Feb 27, 2024 | past |
|
| 8. | | Am I too needy? From a data science perspective (medium.com/swlh) |
| 1 point by 3d27 on Feb 26, 2024 | past | 1 comment |
|
| 9. | | Best Practices for Unit Testing RAG Systems in Prod (confident-ai.com) |
| 4 points by 3d27 on Feb 6, 2024 | past |
|
| 10. | | Tried Apple's Vision Pros, would not recommend it (theverge.com) |
| 2 points by 3d27 on Feb 5, 2024 | past |
|
| 11. | | Everything I know about LLM evaluation metrics (confident-ai.com) |
| 7 points by 3d27 on Jan 24, 2024 | past |
|
| 12. | | Google 2024 Layoffs on a rolling-basis |
| 1 point by 3d27 on Jan 20, 2024 | past |
|
| 13. | | Meta Going All in on GenAI (datacenterdynamics.com) |
| 3 points by 3d27 on Jan 19, 2024 | past | 2 comments |
|
| 14. | | I used QAG to implement an LLM text summarization evals (confident-ai.com) |
| 3 points by 3d27 on Dec 19, 2023 | past |
|
| 15. | | I found a way to code like Shakespear (shakespearelang.com) |
| 1 point by 3d27 on Dec 15, 2023 | past | 1 comment |
|
| 16. | | I implemented 12+ LLM evaluation metrics so you don't have to (reddit.com) |
| 4 points by 3d27 on Dec 13, 2023 | past | 1 comment |
|
| 17. | | AI Makes Commercial Masterpiece [video] (youtube.com) |
| 2 points by 3d27 on Dec 13, 2023 | past |
|
| 18. | | Show HN: I implemented evals metrics for LLMs that runs locally on your machine (github.com/confident-ai) |
| 22 points by 3d27 on Dec 11, 2023 | past | 3 comments |
|
| 19. | | Overcoming the biggest barrier to practical quantum computers (breakingdefense.com) |
| 1 point by 3d27 on Dec 11, 2023 | past |
|
| 20. | | Google's new model is good but the demo's not reproducible in Bard (boingboing.net) |
| 1 point by 3d27 on Dec 7, 2023 | past |
|
| 21. | | What Is RAG? (With Examples) (confident-ai.com) |
| 1 point by 3d27 on Dec 1, 2023 | past |
|
| 22. | | Found this weird programming language (wikipedia.org) |
| 2 points by 3d27 on Nov 27, 2023 | past |
|