
> See: basically every instance of Goodhart's Law

No, I asked for specific examples. This is part of the handwaving I'm talking about. Can you give me something other than the maniacal paperclip optimizer?



YouTube video recommendations, which converge on recommending conspiracy theories and shocking videos to easily manipulated people, especially children.

I am not worried about literal paperclip maximizers, but this may be the closest real thing to that parable. The hypothesis isn't that YouTube's recommender system didn't work -- it's that it worked too well at its assigned task of maximizing view time, and we humans are finally realizing that maximizing view time was not what we actually wanted.
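
To make the proxy-metric failure concrete, here is a toy sketch in Python. Every name and number is invented; the point is just that a ranker optimizing only predicted watch time will surface whatever keeps people watching, sensational or not:

  # Toy illustration -- all names and numbers are invented.
  videos = [
      {"title": "calm gardening tutorial", "avg_watch_minutes": 4.2},
      {"title": "SHOCKING conspiracy EXPOSED!!", "avg_watch_minutes": 11.7},
      {"title": "local news segment", "avg_watch_minutes": 2.1},
  ]

  def recommend(candidates, k=2):
      # The proxy objective: maximize expected view time, nothing else.
      ranked = sorted(candidates, key=lambda v: v["avg_watch_minutes"], reverse=True)
      return ranked[:k]

  for video in recommend(videos):
      print(video["title"])
  # The conspiracy video wins -- not because the system failed,
  # but because it optimized exactly the metric it was given.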


Do you think limiting research on recommender systems (or limiting access to such research) would help in this case?

What would be a solution to the YouTube recommendation problem?


It seems like OpenAI's Human Feedback research (in collaboration with DeepMind) is targeted at this sort of thing. They try to use human feedback to create more nuanced and aligned objectives.

https://blog.openai.com/deep-reinforcement-learning-from-hum...
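
For what it's worth, the core idea in that paper is to fit a reward model to pairwise human preference judgments instead of hand-writing a reward function. A minimal sketch of that loss in Python/PyTorch, with invented shapes and layer sizes:

  # Rough sketch of reward learning from pairwise human preferences,
  # as in the linked paper. Shapes, sizes, and data are invented.
  import torch
  import torch.nn as nn

  reward_model = nn.Sequential(nn.Linear(16, 64), nn.ReLU(), nn.Linear(64, 1))
  opt = torch.optim.Adam(reward_model.parameters(), lr=1e-3)

  def preference_loss(preferred, rejected):
      # Bradley-Terry: P(preferred beats rejected) = sigmoid(r_p - r_r).
      # A human labeled the first clip as better, so we push the
      # learned reward to agree, rather than hand-coding a reward.
      r_p = reward_model(preferred).sum()
      r_r = reward_model(rejected).sum()
      return -torch.log(torch.sigmoid(r_p - r_r))

  # Fake comparison: two trajectories of 8 steps, 16 features each.
  traj_a, traj_b = torch.randn(8, 16), torch.randn(8, 16)
  loss = preference_loss(traj_a, traj_b)
  opt.zero_grad(); loss.backward(); opt.step()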


I think this is probably the most reasonable example I've been given in this thread. However, as you admit, this is still very far off from a hypothetical AI hellbent on destroying us while we watch helplessly.



