BEIJING, CHINA — An artificial intelligence voice recognition system that can monitor and identify pornographic content began online testing on PCs on Sunday, the Science and Technology Daily reported on Monday.
Using voiceprint recognition, the Alibaba system can identify multiple languages, including Chinese, Japanese, English and Russian, as well as Chinese dialects from provinces such as Hunan, Hubei, Henan, Sichuan and Guangdong.
The system transcribes speech into text and compares the text against keywords in its lexicon and anti-spam audio models, both of which were developed by Alibaba.
The lexicon and anti-spam audio models contain tens of thousands of pornographic words, including variants with the same or similar pronunciations, Alibaba told Xinhua.
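The matching step described above can be pictured with a small sketch. It is not Alibaba's implementation, which has not been published; it only illustrates the general idea of checking a transcript against a lexicon by pronunciation rather than by spelling. The pronunciation table, the function names and the harmless sample words are all hypothetical, and a real system would rely on a proper phonetic model instead of a hand-written table.

```python
# Hypothetical table mapping a word to a rough pronunciation key.
# Words that sound alike share the same key.
PRONUNCIATION = {
    "flour": "flau-er",
    "flower": "flau-er",
    "knight": "nait",
    "night": "nait",
}


def pronunciation_key(token: str) -> str:
    """Return the pronunciation key for a token, or the lowercased token
    itself when the (hypothetical) table has no entry for it."""
    lowered = token.lower()
    return PRONUNCIATION.get(lowered, lowered)


def flag_transcript(transcript: str, lexicon: list[str]) -> list[str]:
    """Return the lowercased transcript tokens that sound like a lexicon word."""
    # Pre-compute the pronunciation keys of every blocked word.
    blocked_keys = {pronunciation_key(word) for word in lexicon}
    # Keep each token whose pronunciation key matches a blocked key.
    return [
        token
        for token in transcript.lower().split()
        if pronunciation_key(token) in blocked_keys
    ]


if __name__ == "__main__":
    # "flower" is flagged because it sounds like the lexicon word "flour".
    print(flag_transcript("Buy a Flower today", ["flour"]))
```

Comparing pronunciation keys rather than spellings is what lets a single lexicon entry also catch same-sounding variants, which is the property the article attributes to the lexicon.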
The system monitors both online and offline voice files.
The ability to recognize multiple languages and dialects has to be trained, much as a robot is. The system’s Cantonese recognition, for example, was developed by having it watch TV series.
The system is scheduled to be put into operation in September this year.