The Dire Defect of ‘Multilingual’ AI Content Moderation
Three parts Bosnian text. Thirteen parts Kurdish. Fifty-five parts Swahili. Eleven thousand parts English.
This is part of the data recipe for Facebooks new large language model, which the company c… [+3990 chars