Trying to train any kind of text or image model on a dataset without any icky -ist or -phobic content would give you something completely incoherent and nonfunctional. The only way to get close to what they want with Safety is to train it on normal data and then make sure there are unstoppable computer sentinels keeping the naughty words from ever reaching the part that processes requests and responds to them.