{"id":2205,"date":"2019-05-30T06:31:57","date_gmt":"2019-05-30T06:31:57","guid":{"rendered":"https:\/\/www.aiproblog.com\/index.php\/2019\/05\/30\/data-science-coding-in-a-weekend-series-of-books\/"},"modified":"2019-05-30T06:31:57","modified_gmt":"2019-05-30T06:31:57","slug":"data-science-coding-in-a-weekend-series-of-books","status":"publish","type":"post","link":"https:\/\/www.aiproblog.com\/index.php\/2019\/05\/30\/data-science-coding-in-a-weekend-series-of-books\/","title":{"rendered":"Data science Coding in a weekend series of books \u2026"},"content":{"rendered":"<p>Author: ajit jaokar<\/p>\n<div>\n<p><a href=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/2732744989?profile=original\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" src=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/2732744989?profile=RESIZE_710x\" class=\"align-full\"><\/a><\/p>\n<p>After testing this idea for the last few months, we have formally launched this concept<\/p>\n<p>\u00a0<\/p>\n<p>The idea of <strong>\u2018Data Science Coding in a weekend\u2019<\/strong> originated from meetups we conducted in London<\/p>\n<p>\u00a0<\/p>\n<p>The idea is simple but effective<\/p>\n<p>\u00a0<\/p>\n<p>We choose a complex section of code and try to learn it in detail over the weekend<\/p>\n<p>\u00a0<\/p>\n<p>We work backwards i.e. try to drill down the concepts behind the main ideas<\/p>\n<p>\u00a0<\/p>\n<p>This led to the philosophy which I articulated in \u00a0learn <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/learn-machinelearning-coding-basics-in-a-weekend-a-new-approach\">machine learning coding basics in a weekend a new approach<\/a><\/p>\n<p>\u00a0<\/p>\n<p>And the first book <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/free-book-classification-and-regression-in-a-weekend\">free book classification and regression in a weekend<\/a><\/p>\n<p>\u00a0<\/p>\n<p>The \u201cin a weekend\u201d series of books on Data Science Central can be seen as an online version of our London based meetups. All the books have a single community <a href=\"https:\/\/www.datasciencecentral.com\/group\/ai-deep-learning-machine-learning-coding-in-a-week\">HERE<\/a>. Like a meetup, the books are free to use. The code is in open source. We have drawn upon many sources which we have referenced in the books<\/p>\n<p>\u00a0<\/p>\n<p>For this first book, the steps in the code are<\/p>\n<p>\u00a0<\/p>\n<p><strong>Regression<\/strong><\/p>\n<p>Load and describe the data<\/p>\n<p>Exploratory Data Analysis<\/p>\n<p>\u00a0\u00a0\u00a0\u00a0\u00a0 Exploratory data analysis \u2013 numerical<\/p>\n<p>\u00a0\u00a0\u00a0\u00a0\u00a0 Exploratory data analysis &#8211; visual<\/p>\n<p>\u00a0\u00a0\u00a0\u00a0\u00a0 Analyse the target variable<\/p>\n<p>\u00a0\u00a0\u00a0\u00a0\u00a0 compute the correlation<\/p>\n<p>Pre-process the data<\/p>\n<p>\u00a0\u00a0\u00a0\u00a0\u00a0 Dealing with missing values<\/p>\n<p>\u00a0\u00a0\u00a0\u00a0\u00a0 Treatment of categorical values<\/p>\n<p>\u00a0\u00a0\u00a0\u00a0\u00a0 Remove the outliers<\/p>\n<p>\u00a0\u00a0\u00a0\u00a0\u00a0 Normalise the data<\/p>\n<p>Split the data<\/p>\n<p>Choose a Baseline algorithm<\/p>\n<p>defining \/ instantiating the baseline model<\/p>\n<p>fitting the model we have developed to our training set<\/p>\n<p>Define the evaluation metric<\/p>\n<p>predict scores against our test set and assess how good it is<\/p>\n<p>Refine our dataset with additional columns<\/p>\n<p>Test Alternative Models<\/p>\n<p>Choose the best model and optimise its parameters<\/p>\n<p>Gridsearch<\/p>\n<p>\u00a0<\/p>\n<p><strong>Classification<\/strong><\/p>\n<p>Load the data<\/p>\n<p>Exploratory data analysis<\/p>\n<p>\u00a0 \u00a0 \u00a0Analyse the target variable\u00a0<br \/>\u00a0 \u00a0 Check if the data is balanced<\/p>\n<p>\u00a0 \u00a0 \u00a0Check the co-relations<br \/> Split the data<br \/> Choose a Baseline algorithm<br \/> Train and Test the Model<br \/> Choose an evaluation metric<br \/> Refine our dataset<\/p>\n<p>Feature engineering<\/p>\n<p>Test Alternative Models<br \/> Ensemble models\u00a0<br \/> Choose the best model and optimise its parameters<\/p>\n<p>\u00a0<\/p>\n<p>The second book \u2013 coming by next week \u2013 is entitled \u201cAzure machine learning in a weekend\u201d.<\/p>\n<p>\u00a0<\/p>\n<p>I introduced the book in this blog \u2013 <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/azure-machine-learning-concepts-an-introduction\">Azure machine learning concepts \u2013 an introduction.<\/a> \u00a0Most of us start learning development using a language like Python or R. But when you work professionally, you typically end up working with a Cloud platform. The top three Cloud platforms today in terms of market share are AWS, Azure and GCP(Google). These platforms are similar. Our goal is to learn the how to develop for these platform. \u00a0We start with Azure and then with Google next month.<\/p>\n<p>\u00a0<\/p>\n<p>We welcome your comments on the books and approach<\/p>\n<p>You can download the first book <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/free-book-classification-and-regression-in-a-weekend\">free book classification and regression in a weekend<\/a> and join the community <a href=\"https:\/\/www.datasciencecentral.com\/group\/ai-deep-learning-machine-learning-coding-in-a-week\">HERE<\/a><\/p>\n<\/div>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/xn\/detail\/6448529:BlogPost:831931\">Go to Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Author: ajit jaokar After testing this idea for the last few months, we have formally launched this concept \u00a0 The idea of \u2018Data Science Coding [&hellip;] <span class=\"read-more-link\"><a class=\"read-more\" href=\"https:\/\/www.aiproblog.com\/index.php\/2019\/05\/30\/data-science-coding-in-a-weekend-series-of-books\/\">Read More<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":472,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[26],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/2205"}],"collection":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/comments?post=2205"}],"version-history":[{"count":0,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/2205\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media\/459"}],"wp:attachment":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media?parent=2205"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/categories?post=2205"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/tags?post=2205"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}