The paper clearly is about how a specific form of training on a model causes the outcome.
The article is actively disinformation then, it frames it as a user and not a scientific experiment, and it says it was Facebook llama model, but it wasn’t.
It was a further altered model of llama that was further trained to do this
So, as I said, utter garbage journalism.
The actual title should be “Scientific study shows training a model based off user feedback can produce dangerous results”
Cool
The paper clearly is about how a specific form of training on a model causes the outcome.
The article is actively disinformation then, it frames it as a user and not a scientific experiment, and it says it was Facebook llama model, but it wasn’t.
It was a further altered model of llama that was further trained to do this
So, as I said, utter garbage journalism.
The actual title should be “Scientific study shows training a model based off user feedback can produce dangerous results”