A new computer classification system developed by Penn State computer scientists can correctly identify a person’s gender – based only on eyes, nose, mouth and voice cues – better than human beings can.
Dr. Rajeev Sharma, associate professor of computer science and engineering, said the new system is right nearly 100 percent of the time. Human beings consistently score in the low 90 percents.
The new system, which is the first computer system to combine both face and voice cues, has potential for use in security systems, market research and human/computer interaction systems as well as other applications.
For example, the system could be used to signal when unauthorised individuals tried to enter a restroom, fitting room or dormitory. The new system could also be used to collect market research data automatically.
The new system is based on powerful pattern recognition software technique, called support vector machines (SVM) that have the ability to learn. SVMs have previously been used to scan cell samples for abnormalities or in other applications where patterns are very similar and difficult to separate.
Sharma and his research group adapted SVMs separately for face and voice recognition. The Penn State researchers trained the software dedicated to faces on 1755 thumbnail images of human faces from a standard database. The thumbnails showed only the section of the face that includes eyes, nose and mouth.
Another SVM was trained on voice samples. The voice samples also came from a standard database and included just fractions of a second of voice data.
When the face software and voice software each had been trained separately to the level of human proficiency at classifying gender, Sharma and his group added a SVM ‘manager’ to fuse the results, make the final gender classification and improve the system’s accuracy.
For example, if the face software and voice software disagree on a particular gender classification, the SVM manager reviews both decision processes – with a view to the specific weaknesses of the face and voice software – and makes the final decision. The result is a system that classifies gender correctly more often than human beings supplied with the same data.
Penn State has submitted a provisional application for a patent for the invention that has been licensed to Sharma’s company, Advanced Interface Technologies.
Sharma is confident the new system will find applications as part of a wide variety of ‘intelligent’ systems. He imagines, for example, a passive security check point system for a ‘smart building,’ where people would not have to stop to provide identification or even slow down. They could simply walk past the checkpoint and keep on going – unless they were intruders.