{"id":832,"date":"2018-07-24T14:00:00","date_gmt":"2018-07-24T14:00:00","guid":{"rendered":"https:\/\/www.aiproblog.com\/index.php\/2018\/07\/24\/helping-computers-perceive-human-emotions\/"},"modified":"2018-07-24T14:00:00","modified_gmt":"2018-07-24T14:00:00","slug":"helping-computers-perceive-human-emotions","status":"publish","type":"post","link":"https:\/\/www.aiproblog.com\/index.php\/2018\/07\/24\/helping-computers-perceive-human-emotions\/","title":{"rendered":"Helping computers perceive human emotions"},"content":{"rendered":"<p>Author: Rob Matheson | MIT News Office<\/p>\n<div>\n<p>MIT Media Lab researchers have developed a machine-learning model that takes computers a step closer to interpreting our emotions as naturally as humans do.<\/p>\n<p>In the growing field of \u201caffective computing,\u201d robots and computers are being developed to analyze facial expressions, interpret our emotions, and respond accordingly. Applications include, for instance, monitoring an individual\u2019s health and well-being, gauging student interest in classrooms, helping diagnose signs of certain diseases, and developing helpful robot companions.<\/p>\n<p>A challenge, however, is people express emotions quite differently, depending on many factors. General differences can be seen among cultures, genders, and age groups. But other differences are even more fine-grained: The time of day, how much you slept, or even your level of familiarity with a conversation partner leads to subtle variations in the way you express, say, happiness or sadness in a given moment.<\/p>\n<p>Human brains instinctively catch these deviations, but machines struggle. Deep-learning techniques were developed in recent years to help catch the subtleties, but they\u2019re still not as accurate or as adaptable across different populations as they could be.<\/p>\n<p>The Media Lab researchers have developed a machine-learning model that outperforms traditional systems in capturing these small facial expression variations, to better gauge mood while training on thousands of images of faces. Moreover, by using a little extra training data, the model can be adapted to an entirely new group of people, with the same efficacy. The aim is to improve existing affective-computing technologies.<\/p>\n<p>\u201cThis is an unobtrusive way to monitor our moods,\u201d says Oggi Rudovic, a Media Lab researcher and co-author on a paper describing the model, which was presented last week at the Conference on Machine Learning and Data Mining. \u201cIf you want robots with social intelligence, you have to make them intelligently and naturally respond to our moods and emotions, more like humans.\u201d<\/p>\n<p>Co-authors on the paper are: first author Michael Feffer, an undergraduate student in electrical engineering and computer science; and Rosalind Picard, a professor of media arts and sciences and founding director of the Affective Computing research group.<\/p>\n<p><strong>Personalized experts<\/strong><\/p>\n<p>Traditional affective-computing models use a \u201cone-size-fits-all\u201d concept. They train on one set of images depicting various facial expressions, optimizing features \u2014 such as how a lip curls when smiling \u2014 and mapping those general feature optimizations across an entire set of new images.<\/p>\n<p>The researchers, instead, combined a technique, called \u201cmixture of experts\u201d (MoE), with model personalization techniques, which helped mine more fine-grained facial-expression data from individuals. This is the first time these two techniques have been combined for affective computing, Rudovic says.<\/p>\n<p>In MoEs, a number of neural network models, called \u201cexperts,\u201d are each trained to specialize in a separate processing task and produce one output. The researchers also incorporated a \u201cgating network,\u201d which calculates probabilities of which expert will best detect moods of unseen subjects. \u201cBasically the network can discern between individuals and say, \u2018This is the right expert for the given image,\u2019\u201d Feffer says.<\/p>\n<p>For their model, the researchers personalized the MoEs by matching each expert to one of 18 individual video recordings in the RECOLA database, a public database of people conversing on a video-chat platform designed for affective-computing applications. They trained the model using nine subjects and evaluated them on the other nine, with all videos broken down into individual frames.<\/p>\n<p>Each expert, and the gating network, tracked facial expressions of each individual, with the help of a residual network (\u201cResNet\u201d), a neural network used for object classification. In doing so, the model scored each frame based on level of valence (pleasant or unpleasant) and arousal (excitement) \u2014\u00a0commonly used metrics to encode different emotional states. Separately, six human experts labeled each frame for valence and arousal, based on a scale of -1 (low levels) to 1 (high levels), which the model also used to train.<\/p>\n<p>The researchers then performed further model personalization, where they fed the trained model data from some frames of the remaining videos of subjects, and then tested the model on all unseen frames from those videos. Results showed that, with just 5 to 10 percent of data from the new population, the model outperformed traditional models by a large margin \u2014 meaning it scored valence and arousal on unseen images much closer to the interpretations of human experts.<\/p>\n<p>This shows the potential of the models to adapt from population to population, or individual to individual, with very few data, Rudovic says. \u201cThat\u2019s key,\u201d he says. \u201cWhen you have a new population, you have to have a way to account for shifting of data distribution [subtle facial variations]. Imagine a model set to analyze facial expressions in one culture that needs to be adapted for a different culture. Without accounting for this data shift, those models will underperform. But if you just sample a bit from a new culture to adapt our model, these models can do much better, especially on the individual level. This is where the importance of the model personalization can best be seen.\u201d<\/p>\n<p>Currently available data for such affective-computing research isn\u2019t very diverse in skin colors, so the researchers\u2019 training data were limited. But when such data become available, the model can be trained for use on more diverse populations. The next step, Feffer says, is to train the model on \u201ca much bigger dataset with more diverse cultures.\u201d<\/p>\n<p><strong>Better machine-human interactions<\/strong><\/p>\n<p>Another goal is to train the model to help computers and robots automatically learn from small amounts of changing data to more naturally detect how we feel and better serve human needs, the researchers say.<\/p>\n<p>It could, for example, run in the background of a computer or mobile device to track a user\u2019s video-based conversations and learn subtle facial expression changes under different contexts. \u201cYou can have things like smartphone apps or websites be able to tell how people are feeling and recommend ways to cope with stress or pain, and other things that are impacting their lives negatively,\u201d Feffer says.<\/p>\n<p>This could also be helpful in monitoring, say, depression or dementia, as people\u2019s facial expressions tend to subtly change due to those conditions. \u201cBeing able to passively monitor our facial expressions,\u201d Rudovic says, \u201cwe could over time be able to personalize these models to users and monitor how much deviations they have on daily basis \u2014\u00a0deviating from the average level of facial expressiveness \u2014 and use it for indicators of well-being and health.\u201d<\/p>\n<p>A promising application, Rudovic says, is human-robotic interactions, such as for personal robotics or robots used for educational purposes, where the robots need to adapt to assess the emotional states of many different people. One version, for instance, has been used in <a href=\"http:\/\/news.mit.edu\/2018\/personalized-deep-learning-equips-robots-autism-therapy-0627\">helping robots<\/a> better interpret the moods of children with autism.<\/p>\n<p>Roddy Cowie, professor emeritus of psychology at the Queen\u2019s University Belfast and an affective computing scholar, says the MIT work \u201cillustrates where we really are\u201d in the field. \u201cWe are edging toward systems that can roughly place, from pictures of people\u2019s faces, where they lie on scales from very positive to very negative, and very active to very passive,\u201d he says. \u201cIt seems intuitive that the emotional signs one person gives are not the same as the signs another gives, and so it makes a lot of sense that emotion recognition works better when it is personalized. The method of personalizing reflects another intriguing point, that it is more effective to train multiple \u2018experts,\u2019 and aggregate their judgments, than to train a single super-expert. The two together make a satisfying package.\u201d<\/p>\n<\/div>\n<p><a href=\"http:\/\/news.mit.edu\/2018\/helping-computers-perceive-human-emotions-0724\">Go to Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Author: Rob Matheson | MIT News Office MIT Media Lab researchers have developed a machine-learning model that takes computers a step closer to interpreting our [&hellip;] <span class=\"read-more-link\"><a class=\"read-more\" href=\"https:\/\/www.aiproblog.com\/index.php\/2018\/07\/24\/helping-computers-perceive-human-emotions\/\">Read More<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":469,"comment_status":"registered_only","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[24],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/832"}],"collection":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/comments?post=832"}],"version-history":[{"count":0,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/832\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media\/471"}],"wp:attachment":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media?parent=832"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/categories?post=832"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/tags?post=832"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}