Lugh@futurology.todayM to

Futurology@futurology.todayEnglish · 4 months ago

When AI is tested on questions it can't model from pre-existing answers on the internet, it only scores 10% in the test.

79

When AI is tested on questions it can't model from pre-existing answers on the internet, it only scores 10% in the test.

Lugh@futurology.todayM to

Futurology@futurology.todayEnglish · 4 months ago

Researchers just stumped AI with their most difficult test — but for how long?

A new AI benchmark called "Humanity's Last Exam" stumped top models

Chat

Lugh@futurology.todayOPM
link
fedilink
English
arrow-up
8·
4 months ago
They say the answer to this issue is they’ve released public question samples, but the real questions are kept private.

https://agi.safe.ai/