{"id":3670,"date":"2020-07-16T06:34:05","date_gmt":"2020-07-16T06:34:05","guid":{"rendered":"https:\/\/www.aiproblog.com\/index.php\/2020\/07\/16\/statistics-used-in-data-science-a-dictionary-in-one-picture\/"},"modified":"2020-07-16T06:34:05","modified_gmt":"2020-07-16T06:34:05","slug":"statistics-used-in-data-science-a-dictionary-in-one-picture","status":"publish","type":"post","link":"https:\/\/www.aiproblog.com\/index.php\/2020\/07\/16\/statistics-used-in-data-science-a-dictionary-in-one-picture\/","title":{"rendered":"Statistics Used in Data Science (A Dictionary in One Picture)"},"content":{"rendered":"<p>Author: Stephanie Glen<\/p>\n<div>\n<p><strong>Naming conventions are often quite different in statistics and data science<\/strong>, which causes quite a bit of confusion. Part of the problem with naming conventions is&nbsp; that &#8220;&#8230;<em>data science<\/em> <span><em>is the child of statistics and computer science<\/em>&rdquo; (Blei &amp; Symth, 2017) . In essence, data science then is the child of two parents who speak different languages. In one sense, this makes the job of the data scientist not only to apply the knowledge from both &#8220;parents&#8221;, but to also act as a translator between the two, which is where a dictionary comes in handy.<\/span><\/p>\n<p>Of, course, <strong>many terms are universal.<\/strong> For example,&nbsp; <a href=\"https:\/\/www.statisticshowto.com\/probability-and-statistics\/regression-analysis\/\" target=\"_blank\" rel=\"noopener noreferrer\">regression analysis<\/a>&nbsp;and <em>Bayesian Inference<\/em> mean the same thing no matter what field you&#8217;re working in.&nbsp;<strong>Others are completely different<\/strong>, like the X<sub>i<\/sub>&#8216;s in data: if you&#8217;re a statistician, those are <a href=\"https:\/\/www.statisticshowto.com\/covariate\/\" target=\"_blank\" rel=\"noopener noreferrer\">covariates<\/a>. A data scientist would call those <em>features<\/em>, a name borrowed from computer science.<\/p>\n<p> So, let&#8217;s <strong>clear up a few of the more common mismatches.<\/strong> The following picture is adapted from Larry Wasserman&#8217;s <em>All of Statistics<\/em>. His dictionary was an eye opener for me when I first started studying data science; Hopefully it will clear up some of those fuzzy definitions for you as well.<\/p>\n<p><a href=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/5310591275?profile=original\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" src=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/5310591275?profile=RESIZE_710x\" class=\"align-full\"><\/a><\/p>\n<h2>References<\/h2>\n<p>Blei, D. and Symth, P. &ldquo;Science and Data Science,&rdquo;<span>&nbsp;<\/span><i>Proceedings of the National Academies of Sciences<\/i>, vol. 114, no. 33, June 2017, pp. 8689-8692.<\/p>\n<p>Wasserman, L. (2004). All of Statistics: A Concise Course in Statistical Inference. Springer.<\/p>\n<\/div>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/xn\/detail\/6448529:BlogPost:954036\">Go to Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Author: Stephanie Glen Naming conventions are often quite different in statistics and data science, which causes quite a bit of confusion. Part of the problem [&hellip;] <span class=\"read-more-link\"><a class=\"read-more\" href=\"https:\/\/www.aiproblog.com\/index.php\/2020\/07\/16\/statistics-used-in-data-science-a-dictionary-in-one-picture\/\">Read More<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":458,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[26],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/3670"}],"collection":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/comments?post=3670"}],"version-history":[{"count":0,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/3670\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media\/463"}],"wp:attachment":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media?parent=3670"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/categories?post=3670"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/tags?post=3670"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}