N00b Question - how do you measure performance for AI agents like the way they d...

		sghiassy 4 days ago \| parent \| context \| favorite \| on: AGENTS.md outperforms skills in our agent evals N00b Question - how do you measure performance for AI agents like the way they did in this article? Are there frameworks to support this type of work?