The problem with AI alignment is that humans aren't aligned

preasket@lemy.lol · edited 1 year ago

The problem with AI alignment is that humans aren't aligned

Quatity_Control@lemm.ee · 1 year ago

Align means two very different things here, despite being the same word.

preasket@lemy.lol · edited 1 year ago

Does it? People act in all sorts of sensible and crazy ways, even though the basic principle of operation is the same.

Quatity_Control@lemm.ee · 1 year ago

What loss function do you want AI to align on?

If I have a language model AI and an AI designed to function as a nurse, what are they going to align on?

fubo@lemmy.world · edited 1 year ago

Some of the human-alignment projects look like “religions” and some look like “economies” and some look like “just talking to each other and trying to be halfway decent folks and not flipping out or some shit”.

Heck, arguably the United Nations is a human-alignment project for x-risk mitigation.

DeVaolleysAdVocate@lemmy.world · 1 year ago

We’d like to bring all those and their existing versions together with the A-Better-World Consensus-Engine idea.

Tell me more about some of these other projects, though, please.