
tobias2014 17 days ago \| on: GPT-5.2

And who believes that the difference between 91.9% and 92.4% is significant in these benchmarks? Clearly these have margins of error that are swept under the rug.
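
To put rough numbers on that margin of error, here is a minimal sketch that treats each score as a binomial proportion and runs a two-proportion z-test. The benchmark sizes are hypothetical, since the comment does not say how many items each benchmark has, and the function names are made up for illustration. It also assumes the two scores come from independent samples; when both models answer the same items, a paired test such as McNemar's would be the better fit.

```python
import math


def accuracy_standard_error(p: float, n: int) -> float:
    """Binomial standard error of an accuracy p measured on n items."""
    return math.sqrt(p * (1.0 - p) / n)


def two_proportion_z(p1: float, p2: float, n1: int, n2: int) -> float:
    """z statistic for the difference p2 - p1, using the pooled proportion."""
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    se = math.sqrt(pooled * (1.0 - pooled) * (1.0 / n1 + 1.0 / n2))
    return (p2 - p1) / se


def two_sided_p_value(z: float) -> float:
    """Two-sided p-value under the standard normal distribution."""
    return math.erfc(abs(z) / math.sqrt(2.0))


# Scores quoted in the comment.
score_a, score_b = 0.919, 0.924

# Hypothetical benchmark sizes (assumption: not given in the thread).
for n_items in (200, 1000, 5000):
    se = accuracy_standard_error(score_a, n_items)
    z = two_proportion_z(score_a, score_b, n_items, n_items)
    p = two_sided_p_value(z)
    print(f"n={n_items}: SE={se:.4f}, z={z:.2f}, p={p:.3f}")
```

Under these assumptions, a 0.5-point gap sits well inside one standard error for benchmarks with a few hundred to a few thousand items, so it would not clear the usual 5% significance threshold.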