{"id":2540,"date":"2019-09-06T06:36:53","date_gmt":"2019-09-06T06:36:53","guid":{"rendered":"https:\/\/www.aiproblog.com\/index.php\/2019\/09\/06\/object-detection-using-deep-learning-faster-r-cnnyolossd\/"},"modified":"2019-09-06T06:36:53","modified_gmt":"2019-09-06T06:36:53","slug":"object-detection-using-deep-learning-faster-r-cnnyolossd","status":"publish","type":"post","link":"https:\/\/www.aiproblog.com\/index.php\/2019\/09\/06\/object-detection-using-deep-learning-faster-r-cnnyolossd\/","title":{"rendered":"Object Detection using Deep Learning: Faster R-CNN,YOLO,SSD"},"content":{"rendered":"<p>Author: Andrea Manero-Bastin<\/p>\n<div>\n<p><em><span>This article was written by <a href=\"https:\/\/disqus.com\/by\/sankit1\/\" target=\"_blank\" rel=\"noopener noreferrer\">Ankit Sachan<\/a>.<\/span><\/em><\/p>\n<p><em><span>\u00a0<\/span><\/em><span>In this post, I shall explain object detection and various algorithms like Faster R-CNN, YOLO, SSD. We shall start from beginners\u2019 level and go till the state-of-the-art in object detection, understanding the intuition, approach and salient features of each method.<\/span><\/p>\n<p><span><a href=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/3532354017?profile=original\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" src=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/3532354017?profile=RESIZE_710x\" class=\"align-center\"><\/a><\/span><\/p>\n<p><strong>What is Image Classification?<\/strong><\/p>\n<p><span>Image classification takes an image and predicts the object in an image.\u00a0<\/span><\/p>\n<p><span>The problem of identifying the location of an object (given the class) in an image is called\u00a0localization. However, if the object class is not known, we have to not only determine the location but also predict the class of each object.<\/span><\/p>\n<p><span>Predicting the location of the object along with the class is called object Detection.<strong>\u00a0<\/strong>In place of predicting the class of object from an image, we now have to predict the class as well as a rectangle (called bounding box) containing that object. It takes 4 variables to uniquely identify a rectangle. So, for each instance of the object in the image, we shall predict following variables:<\/span><\/p>\n<ul>\n<li>class_name,\u00a0<\/li>\n<li>bounding_box_top_left_x_coordinate,<\/li>\n<li>bounding_box_top_left_y_coordinate,<\/li>\n<li>bounding_box_width,<\/li>\n<li>bounding_box_height<\/li>\n<\/ul>\n<p><span><strong>This article explains the following techniques<\/strong>:<\/span><\/p>\n<ul>\n<li><span>Object Detection using Hog Features<br \/><\/span><\/li>\n<li><span>Region-based Convolutional Neural Networks(R-CNN)<br \/><\/span><\/li>\n<li><span>Spatial Pyramid Pooling(SPP-net)<br \/><\/span><\/li>\n<li><span>Fast R-CNN<br \/><\/span><\/li>\n<li><span>Faster R-CNN and Regression-based Detectors<br \/><\/span><\/li>\n<li><span>YOLO(You only Look Once)<br \/><\/span><\/li>\n<li><span>Single Shot Detector(SSD)<\/span><\/li>\n<\/ul>\n<p><span>The full article is available <a href=\"https:\/\/cv-tricks.com\/object-detection\/faster-r-cnn-yolo-ssd\/\" target=\"_blank\" rel=\"noopener noreferrer\">here<\/a>.\u00a0<\/span><\/p>\n<p><span>\u00a0<\/span><\/p>\n<\/div>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/xn\/detail\/6448529:BlogPost:867175\">Go to Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Author: Andrea Manero-Bastin This article was written by Ankit Sachan. \u00a0In this post, I shall explain object detection and various algorithms like Faster R-CNN, YOLO, [&hellip;] <span class=\"read-more-link\"><a class=\"read-more\" href=\"https:\/\/www.aiproblog.com\/index.php\/2019\/09\/06\/object-detection-using-deep-learning-faster-r-cnnyolossd\/\">Read More<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":462,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[26],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/2540"}],"collection":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/comments?post=2540"}],"version-history":[{"count":0,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/2540\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media\/465"}],"wp:attachment":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media?parent=2540"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/categories?post=2540"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/tags?post=2540"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}