Study finds many tests don't measure the right things AI companies regularly tout their models' performance on benchmark tests as a sign of technological and intellectual superiority. But those results, widely used in marketing, may not be meaningful....
Articles
from Hacker News on (#71AXC)
Comments
1