
[Skynet] This AI-based lip reader could spell the end of privacy

Hamartia Antidote


http://www.dnaindia.com/scitech/rep...reader-could-spell-the-end-of-privacy-2271807

Scientists develop a computer program that can read lips with superhuman accuracy.


A new computer program can read a person’s lips with a higher level of accuracy than even the best-trained professional lip-readers. Scientists at the University of Oxford, whose work has also received funding from Alphabet’s DeepMind, have developed software that can read lips correctly 93.4 per cent of the time.

LipNet, a system based on artificial intelligence, takes a video of a person speaking and matches the movement of their lips to words with an unprecedented level of accuracy. According to the paper shared by the scientists, “LipNet is trained end-to-end to make speaker-independent sentence-level predictions.” It is said to have ‘enormous practical potential’, with applications ranging from improved hearing aids, silent dictation in public spaces, transcribing covert conversations, speech recognition in noisy environments, biometric identification, silent-movie processing and more.

The scientists present LipNet as the first model to apply deep learning to end-to-end learning of a mapping from sequences of image frames of a speaker’s mouth to entire sentences. This model eliminates the need to segment videos into words before predicting a sentence.

Machine lip reading is usually difficult because it requires extracting ‘spatiotemporal’ features from the video. However, recent deep learning approaches attempt to extract those features end-to-end. As with modern deep-learning-based automatic speech recognition (ASR), LipNet is described as being trained end-to-end to make speaker-independent sentence-level predictions.

Most importantly, LipNet does not require hand-engineered templates of speech patterns or visuals of lip movement as the system is capable of learning and self-evolving, all the while growing better at its predictions.

The flipside, though, is that the program could also be used for mass surveillance: it could potentially be misused to eavesdrop on public conversations when paired with CCTV, for example. This could literally spell the end of private conversations in public places.

 
How can I trust or believe it until I experience it myself? It is possible that the participants said the exact sentences that had already been fed to the computer.
 

This is Oxford... not some no-name community college. Plus, they are using rather simple, short monosyllabic or disyllabic words, not words like "supercalifragilisticexpialidocious".
 


?
 
Hahaha, have you seen any football? The players and managers mostly speak to each other with a hand in front of their mouths. How will this AI bypass that?
 
