{"id":5576,"date":"2022-04-21T04:00:00","date_gmt":"2022-04-21T04:00:00","guid":{"rendered":"https:\/\/www.aiproblog.com\/index.php\/2022\/04\/21\/anticipating-others-behavior-on-the-road\/"},"modified":"2022-04-21T04:00:00","modified_gmt":"2022-04-21T04:00:00","slug":"anticipating-others-behavior-on-the-road","status":"publish","type":"post","link":"https:\/\/www.aiproblog.com\/index.php\/2022\/04\/21\/anticipating-others-behavior-on-the-road\/","title":{"rendered":"Anticipating others\u2019 behavior on the road"},"content":{"rendered":"<p>Author: Adam Zewe | MIT News Office<\/p>\n<div>\n<p>Humans may be one of the biggest roadblocks keeping fully autonomous vehicles off city streets.<\/p>\n<\/p>\n<p>If a robot is going to navigate a vehicle safely through downtown Boston, it must be able to predict what nearby drivers, cyclists, and pedestrians are going to do next.<\/p>\n<\/p>\n<p>Behavior prediction is a tough problem, however, and current artificial intelligence solutions are either too simplistic (they may assume pedestrians always walk in a straight line), too conservative (to avoid pedestrians, the robot just leaves the car in park), or can only forecast the next moves of one agent (roads typically carry many users at once.)\u00a0\u00a0<\/p>\n<\/p>\n<p>MIT researchers have devised a deceptively simple solution to this complicated challenge. They break a multiagent behavior prediction problem into smaller pieces and tackle each one individually, so a computer can solve this complex task in real-time.<\/p>\n<\/p>\n<p>Their behavior-prediction framework first guesses the relationships between two road users \u2014 which car, cyclist, or pedestrian has the right of way, and which agent will yield \u2014 and uses those relationships to predict future trajectories for multiple agents.<\/p>\n<\/p>\n<p>These estimated trajectories were more accurate than those from other machine-learning models, compared to real traffic flow in an enormous dataset compiled by autonomous driving company Waymo. The MIT technique even outperformed Waymo\u2019s recently published model. And because the researchers broke the problem into simpler pieces, their technique used less memory.<\/p>\n<\/p>\n<p>\u201cThis is a very intuitive idea, but no one has fully explored it before, and it works quite well. The simplicity is definitely a plus. We are comparing our model with other state-of-the-art models in the field, including the one from Waymo, the leading company in this area, and our model achieves top performance on this challenging benchmark. This has a lot of potential for the future,\u201d says co-lead author Xin \u201cCyrus\u201d Huang, a graduate student in the Department of Aeronautics and Astronautics and a research assistant in the lab of Brian Williams, professor of aeronautics and astronautics and a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL).<\/p>\n<\/p>\n<p>Joining Huang and Williams on <a href=\"https:\/\/arxiv.org\/pdf\/2202.11884.pdf\" target=\"_blank\" rel=\"noopener\">the paper<\/a> are three researchers from Tsinghua University in China: co-lead author Qiao Sun, a research assistant; Junru Gu, a graduate student; and senior author Hang Zhao PhD \u201919, an assistant professor. The research will be presented at the Conference on Computer Vision and Pattern Recognition.<\/p>\n<\/p>\n<p><strong>Multiple<\/strong><strong> small<\/strong><strong> models<\/strong><\/p>\n<\/p>\n<p>The researchers\u2019 machine-learning method, called M2I, takes two inputs: past trajectories of the cars, cyclists, and pedestrians interacting in a traffic setting such as a four-way intersection, and a map with street locations, lane configurations, etc.<\/p>\n<\/p>\n<p>Using this information, a relation predictor infers which of two agents has the right of way first, classifying one as a passer and one as a yielder. Then a prediction model, known as a marginal predictor, guesses the trajectory for the passing agent, since this agent behaves independently.<\/p>\n<\/p>\n<p>A second prediction model, known as a conditional predictor, then guesses what the yielding agent will do based on the actions of the passing agent. The system predicts a number of different trajectories for the yielder and passer, computes the probability of each one individually, and then selects the six joint results with the highest likelihood of occurring.<\/p>\n<\/p>\n<p>M2I outputs a prediction of how these agents will move through traffic for the next eight seconds. In one example, their method caused a vehicle to slow down so a pedestrian could cross the street, then speed up when they cleared the intersection. In another example, the vehicle waited until several cars had passed before turning from a side street onto a busy, main road.<\/p>\n<\/p>\n<p>While this initial research focuses on interactions between two agents, M2I could infer relationships among many agents and then guess their trajectories by linking multiple marginal and conditional predictors.<\/p>\n<p><img decoding=\"async\" alt=\"prediction\" data-align=\"center\" data-entity-type=\"file\" data-entity-uuid=\"4afef51a-96fb-4300-a90e-caf7ae4a58a1\" src=\"https:\/\/news.mit.edu\/sites\/default\/files\/images\/inline\/driving-prediction-1.gif\"><img decoding=\"async\" alt=\"prediction\" data-align=\"center\" data-caption=\"These simulations show how the system the researchers developed can predict the future trajectories (shown using red lines) of the blue vehicles in complex traffic situations involving other cars, bicyclists, and pedestrians.\" data-entity-type=\"file\" data-entity-uuid=\"d9e330b0-0226-4ba6-98e4-bff030db9906\" src=\"https:\/\/news.mit.edu\/sites\/default\/files\/images\/inline\/driving-prediction-2.gif\"><\/p>\n<p><strong>Real-world d<\/strong><strong>riving tests<\/strong><\/p>\n<\/p>\n<p>The researchers trained the models using the Waymo Open Motion Dataset, which contains millions of real traffic scenes involving vehicles, pedestrians, and cyclists recorded by lidar (light detection and ranging) sensors and cameras mounted on the company\u2019s autonomous vehicles. They focused specifically on cases with multiple agents.<\/p>\n<\/p>\n<p>To determine accuracy, they compared each method\u2019s six prediction samples, weighted by their confidence levels, to the actual trajectories followed by the cars, cyclists, and pedestrians in a scene. Their method was the most accurate. It also outperformed the baseline models on a metric known as overlap rate; if two trajectories overlap, that indicates a collision. M2I had the lowest overlap rate.<\/p>\n<\/p>\n<p>\u201cRather than just building a more complex model to solve this problem, we took an approach that is more like how a human thinks when they reason about interactions with others. A human does not reason about all hundreds of combinations of future behaviors. We make decisions quite fast,\u201d Huang says.<\/p>\n<\/p>\n<p>Another advantage of M2I is that, because it breaks the problem down into smaller pieces, it is easier for a user to understand the model\u2019s decision making. In the long run, that could help users put more trust in autonomous vehicles, says Huang.<\/p>\n<\/p>\n<p>But the framework can\u2019t account for cases where two agents are mutually influencing each other, like when two vehicles each nudge forward at a four-way stop because the drivers aren\u2019t sure who should be yielding.<\/p>\n<\/p>\n<p>They plan to address this limitation in future work. They also want to use their method to simulate realistic interactions between road users, which could be used to verify planning algorithms for self-driving cars or create huge amounts of synthetic driving data to improve model performance.<\/p>\n<\/p>\n<p>\u201cPredicting future trajectories of multiple, interacting agents is under-explored and extremely challenging for enabling full autonomy in complex scenes. M2I provides a highly promising prediction method with the relation predictor to discriminate agents predicted marginally or conditionally which significantly simplifies the problem,\u201d wrote Masayoshi Tomizuka, the Cheryl and John Neerhout, Jr. Distinguished Professor of Mechanical Engineering at University of California at Berkeley and Wei Zhan, an assistant professional researcher, in an email. \u201cThe prediction model can capture the inherent relation and interactions of the agents to achieve the state-of-the-art performance.\u201d The two colleagues were not involved in the research.<\/p>\n<\/p>\n<p>This research is supported, in part, by the Qualcomm Innovation Fellowship. Toyota Research Institute also provided funds to support this work.<\/p>\n<\/div>\n<p><a href=\"https:\/\/news.mit.edu\/2022\/machine-learning-anticipating-behavior-cars-0421\">Go to Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Author: Adam Zewe | MIT News Office Humans may be one of the biggest roadblocks keeping fully autonomous vehicles off city streets. If a robot [&hellip;] <span class=\"read-more-link\"><a class=\"read-more\" href=\"https:\/\/www.aiproblog.com\/index.php\/2022\/04\/21\/anticipating-others-behavior-on-the-road\/\">Read More<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":5577,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[24],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/5576"}],"collection":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/comments?post=5576"}],"version-history":[{"count":0,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/5576\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media\/5577"}],"wp:attachment":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media?parent=5576"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/categories?post=5576"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/tags?post=5576"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}