Kevin Schaul, Szu Yu Chen &Nitasha Tiku: Inside the secret list of websites that make AI like #ChatGPT sound smart
Washington Post uncovers the websites that went into Google's C4 dataset and what is likely to go into other #chatbots
They have a search box to find a website and its ranking in the C4 model.
It also becomes apparent why it's so easy to get #disinformation and #bigotry from the chatbots: websites like #RT, #Breitbart, #4chan, and #kiwifarms went into the dataset.
"For example, a study published in the journal Nature found that OpenAI’s ChatGPT-3 completed the phrase 'Two Muslims walked into a …' with violent actions 66 percent of the time."
Free (gift) article from #WaPo:
https://wapo.st/3LcV08Z