{"id":2376,"date":"2019-07-18T06:31:14","date_gmt":"2019-07-18T06:31:14","guid":{"rendered":"https:\/\/www.aiproblog.com\/index.php\/2019\/07\/18\/free-book-the-dataengineering-cookbook-by-andreas-kretz\/"},"modified":"2019-07-18T06:31:14","modified_gmt":"2019-07-18T06:31:14","slug":"free-book-the-dataengineering-cookbook-by-andreas-kretz","status":"publish","type":"post","link":"https:\/\/www.aiproblog.com\/index.php\/2019\/07\/18\/free-book-the-dataengineering-cookbook-by-andreas-kretz\/","title":{"rendered":"Free book: The #dataengineering cookbook by Andreas Kretz"},"content":{"rendered":"<p>Author: ajit jaokar<\/p>\n<div>\n<p>\u00a0<a href=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/3322261133?profile=original\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" src=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/3322261133?profile=RESIZE_710x\" class=\"align-full\"><\/a><\/p>\n<p>\u00a0<\/p>\n<p>I found an interesting, free book which is still a work in progress book \u2013 <a href=\"https:\/\/github.com\/andkret\/Cookbook\">The Data Engineering Cookbook<\/a><\/p>\n<p>\u00a0<\/p>\n<p>I will be contributing through the author (<a href=\"https:\/\/andreaskretz.com\/\">Andreas Kretz.com<\/a>) patreon site :\u00a0(<a href=\"https:\/\/patreon.com\/plumbersofds\">Link to his \u00a0Patreon<\/a> ) because I see data engineering as a topic which is not fully covered.<\/p>\n<p>The book is being built on an ongoing basis with a wide scope (for free as I understand it but with a patreon model of supporters)<\/p>\n<p><strong>The book is split into five parts<\/strong><\/p>\n<ul>\n<li>introduction<\/li>\n<li>basic data engineering skills<\/li>\n<li>a real world data engineering example<\/li>\n<li>over 30 case studies with links from companies like Netflix, Twitter, Spotify<\/li>\n<li>collection of interview questions<\/li>\n<\/ul>\n<p><strong>Topics covered include<\/strong><\/p>\n<ul>\n<li>Data Engineer vs Data Scientists<\/li>\n<li>Basic Data Engineering Skills<\/li>\n<li>Git<\/li>\n<li>Agile development<\/li>\n<li>Learn how a Computer Works<\/li>\n<li>Computer Networking<\/li>\n<li>Security and Privacy<\/li>\n<li>Linux<\/li>\n<li>The Cloud<\/li>\n<li>Security Zone Design<\/li>\n<li>Big Data<\/li>\n<li>My Big Data Platform Blueprint<\/li>\n<li>Lambda Architecture<\/li>\n<li>Data Warehouse vs Data Lake<\/li>\n<li>Docker<\/li>\n<li>REST APIs<\/li>\n<li>Databases<\/li>\n<li>Data Processing and Analytics &#8211; Frameworks<\/li>\n<li>Apache Kafka<\/li>\n<li>Machine Learning<\/li>\n<li>Data Visualization<\/li>\n<li>Data Engineering Course: Building A Data Platform<\/li>\n<li>Case Studies: AirBnB, spotify, Uber, Twitter and a range of others<\/li>\n<\/ul>\n<p>\u00a0<\/p>\n<p>In my teaching at Oxford University \u2013 <a href=\"https:\/\/www.conted.ox.ac.uk\/courses\/artificial-intelligence-cloud-and-edge-implementations\">Artificial intelligence \u2013 cloud and edge implementations<\/a> &#8211; I have taken an engineering led approach to data science. Many courses miss that depth and its not easy to teach because you need to cover three job roles: Data Engineering, Data Science and Devops. Its easy to miss many small topics in this vast scope.<\/p>\n<p>Hence, I hope this book will be a useful reference \u00a0<\/p>\n<p>The book link is \u2013 <a href=\"https:\/\/github.com\/andkret\/Cookbook\">The Data Engineering Cookbook<\/a><\/p>\n<p>\u00a0<\/p>\n<\/p>\n<p>\u00a0<\/p>\n<\/div>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/xn\/detail\/6448529:BlogPost:857909\">Go to Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Author: ajit jaokar \u00a0 \u00a0 I found an interesting, free book which is still a work in progress book \u2013 The Data Engineering Cookbook \u00a0 [&hellip;] <span class=\"read-more-link\"><a class=\"read-more\" href=\"https:\/\/www.aiproblog.com\/index.php\/2019\/07\/18\/free-book-the-dataengineering-cookbook-by-andreas-kretz\/\">Read More<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":471,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[26],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/2376"}],"collection":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/comments?post=2376"}],"version-history":[{"count":0,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/2376\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media\/462"}],"wp:attachment":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media?parent=2376"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/categories?post=2376"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/tags?post=2376"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}