{"id":2154,"date":"2019-05-17T06:31:53","date_gmt":"2019-05-17T06:31:53","guid":{"rendered":"https:\/\/www.aiproblog.com\/index.php\/2019\/05\/17\/free-book-classification-and-regression-in-a-weekend\/"},"modified":"2019-05-17T06:31:53","modified_gmt":"2019-05-17T06:31:53","slug":"free-book-classification-and-regression-in-a-weekend","status":"publish","type":"post","link":"https:\/\/www.aiproblog.com\/index.php\/2019\/05\/17\/free-book-classification-and-regression-in-a-weekend\/","title":{"rendered":"Free Book: Classification and Regression In a Weekend"},"content":{"rendered":"<p>Author: Vincent Granville<\/p>\n<div>\n<p><em>By\u00a0Ajit Jaokar and Dan Howarth. With contributions from Ayse Mutlu.<\/em><\/p>\n<p>Exclusively for Data Science Central members, with free access. You can download this book (PDF)<span>\u00a0<\/span><a href=\"https:\/\/www.datasciencecentral.com\/page\/free-books-1\" target=\"_blank\" rel=\"noopener noreferrer\">here<\/a>.\u00a0<\/p>\n<p><span>This tutorial began as a series of weekend workshops created by Ajit Jaokar and Dan Howarth. The idea was to work through a specific (longish) program and explore as much of it as possible in one weekend. This book is an attempt to take this idea online. The best way to use this book is to work with the Python code as much as you can. The code is commented, and you can extend those comments with the concepts explained here.<\/span><\/p>\n<p><span><a href=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/2612991015?profile=original\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" src=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/2612991015?profile=RESIZE_710x\" class=\"align-center\"><\/a><\/span><\/p>\n<p><span style=\"font-size: 14pt;\"><strong>Content<\/strong><\/span><\/p>\n<p><strong>1. Introduction and approach<\/strong> 4<\/p>\n<p><strong>2. 
Background, tools and philosophy<\/strong> 6<\/p>\n<ul>\n<li>What will you learn from this book? 6<\/li>\n<li>Components for book 7<\/li>\n<li>Big Picture Diagram 7<\/li>\n<\/ul>\n<p><strong>3. Code outline<\/strong> 7<\/p>\n<ul>\n<li>Regression code outline 7<\/li>\n<li>Classification Code Outline 8<\/li>\n<\/ul>\n<p><strong>4. Exploratory data analysis and graphics<\/strong> 8<\/p>\n<ul>\n<li>Numeric descriptive statistics 8<\/li>\n<li>Interpreting descriptive statistics 9<\/li>\n<li>Understanding the distribution 10<\/li>\n<li>Histograms 10<\/li>\n<li>Boxplots and IQR 10<\/li>\n<li>Correlation 11<\/li>\n<li>Heatmaps for correlation 12<\/li>\n<li>Analysing the target variable 13<\/li>\n<\/ul>\n<p><strong>5. Pre-processing data<\/strong> 13<\/p>\n<ul>\n<li>Dealing with missing values 13<\/li>\n<li>Treatment of categorical values 13<\/li>\n<li>Normalise the data 14<\/li>\n<li>Split the data 15<\/li>\n<\/ul>\n<p><strong>6. Choose a Baseline algorithm<\/strong> 15<\/p>\n<ul>\n<li>Defining \/ instantiating the baseline model 15<\/li>\n<li>Fitting the model we have developed to our training set 16<\/li>\n<li>Define the evaluation metric 16<\/li>\n<li>Predict scores against our test set and assess how good it is 18<\/li>\n<\/ul>\n<p><strong>7. Evaluation metrics for classification<\/strong> 18<\/p>\n<ul>\n<li>Improving a model \u2013 from baseline models to final models 21<\/li>\n<li>Understanding cross validation 21<\/li>\n<li>Feature engineering 24<\/li>\n<li>Regularization to prevent overfitting 24<\/li>\n<li>Ensembles \u2013 typically for classification 26<\/li>\n<li>Test alternative models 27<\/li>\n<li>Hyperparameter tuning 28<\/li>\n<\/ul>\n<p><strong>8. Conclusion<\/strong> 28<\/p>\n<p><strong>A1. Regression Code<\/strong> 29<\/p>\n<p><strong>A2. 
Classification Code<\/strong> 36<\/p>\n<p><em>To access the book if you are not yet a DSC member, you can register as a member by<span>\u00a0<\/span><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/check-out-our-dsc-newsletter\" target=\"_blank\" rel=\"noopener noreferrer\">following this link<\/a>.<\/em><\/p>\n<\/div>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/xn\/detail\/6448529:BlogPost:825485\">Go to Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Author: Vincent Granville By\u00a0Ajit Jaokar and Dan Howarth. With contributions from Ayse Mutlu. Exclusively for Data Science Central members, with free access. You can download [&hellip;] <span class=\"read-more-link\"><a class=\"read-more\" href=\"https:\/\/www.aiproblog.com\/index.php\/2019\/05\/17\/free-book-classification-and-regression-in-a-weekend\/\">Read More<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":456,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[26],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/2154"}],"collection":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/comments?post=2154"}],"version-history":[{"count":0,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/2154\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"htt
ps:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media\/458"}],"wp:attachment":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media?parent=2154"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/categories?post=2154"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/tags?post=2154"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}