{"id":4164,"date":"2020-12-06T06:31:00","date_gmt":"2020-12-06T06:31:00","guid":{"rendered":"https:\/\/www.aiproblog.com\/index.php\/2020\/12\/06\/explaining-data-science-to-a-non-data-scientist\/"},"modified":"2020-12-06T06:31:00","modified_gmt":"2020-12-06T06:31:00","slug":"explaining-data-science-to-a-non-data-scientist","status":"publish","type":"post","link":"https:\/\/www.aiproblog.com\/index.php\/2020\/12\/06\/explaining-data-science-to-a-non-data-scientist\/","title":{"rendered":"Explaining Data Science to a Non-Data Scientist"},"content":{"rendered":"<p>Author: William Vorhies<\/p>\n<div>\n<p><strong><em>Summary:<\/em><\/strong><em>&nbsp; Explaining data science to a non-data scientist isn&rsquo;t as easy as it sounds.&nbsp; You may know a lot about math, tools, techniques, data, and computer architecture but the question is how do you explain this briefly without getting buried in the detail.&nbsp; You might try this approach.<\/em><\/p>\n<p>&nbsp;<\/p>\n<p><a href=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/5521687477?profile=original\" target=\"_blank\" rel=\"noopener noreferrer\"><img decoding=\"async\" src=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/5521687477?profile=RESIZE_710x\" width=\"350\" class=\"align-right\"><\/a>We&rsquo;ve all been there.&nbsp; You&rsquo;re at a party or maybe striking up a conversation with that pretty girl at the bar and sooner or later the question comes up, &ldquo;what do you do?&rdquo;&nbsp; Since you have what is reported to be the sexiest job in the world you proudly respond &ldquo;I&rsquo;m a data scientist&rdquo;.<\/p>\n<p>OK, what happens next depends on exactly what you say.&nbsp; Do your fellow party goers hang on your every word in anticipation?&nbsp; Do you, as they say, get the pretty girl&rsquo;s digits?&nbsp; You respond:<\/p>\n<p><em>&ldquo;I&rsquo;m working with deep neural nets with dozens of hidden layers on cloud based TPUs using Tensorflow.&nbsp; Right now I&rsquo;m working to put bounding boxes around images of people so I can create multi-class deep learning models to predict their&hellip;&rdquo;<\/em><\/p>\n<p>Never mind.&nbsp; Your host&rsquo;s eyes have glazed over.&nbsp; The cute girl has turned to the guy on her other side who looks like a personal trainer at your gym.&nbsp; TMI! TMI!&nbsp; How do you keep it simple, brief, and still explain to a non-data scientist the essence of what you do without losing their interest in the first dozen words.&nbsp; Next time you vow to keep it simple.<\/p>\n<p>The next party comes.&nbsp; You think, OK I&rsquo;ll skip the specifics and just talk about the categories of tools that I use.&nbsp; After the obligatory &ldquo;I&rsquo;m a data scientist&rdquo; you continue:<\/p>\n<p><em>&ldquo;I use mathematical algorithms to answer questions in ranking, recommendation, classification, regression, clustering, and anomaly detection.&nbsp; First we gather up massive data sets about the question we want to answer.&nbsp; Getting that data and getting it ready for the algorithms is a whole different conversation.&nbsp; But the fun part begins when I start creating models and testing them with different optimization methods like stochastic gradient descent to see which one is most accurate.&nbsp; Then I score the unseen data&hellip;&rdquo;<\/em><\/p>\n<p>Never mind.&nbsp; Same result.<\/p>\n<p>After several years of trying, I&rsquo;ve settled on a very simple explanation based mostly on <em><a href=\"https:\/\/brohrer.github.io\/five_questions_data_science_answers.html\">Brandon Roher&rsquo;s<\/a><\/em> remarkable 2015 five-question explanation of machine learning.&nbsp; Even with the additional complexity of Big Data and deep learning this is the explanation I&rsquo;ve found most successful.&nbsp; It basically has three parts following &ldquo;I&rsquo;m a data scientist&rdquo;.<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Part 1 You&rsquo;re a Wizard<\/strong><\/span><\/p>\n<p>I help people answer question or make predictions about what will happen in the future.&nbsp; So data scientists are kind of like fortune tellers except that we do it with math and data.&nbsp; And most important, unlike fortune tellers we can get the right answer pretty often.<\/p>\n<p>Keep in mind that 50% accuracy is the same as a coin toss, so generally we&rsquo;re pretty happy when we get the answer right about 70% of the time and sometimes we can get it right upwards of 90% of the time.<\/p>\n<p>(Ok I&rsquo;m taking some liberties here but remember the audience).<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Part 2 What You Work On is Easy to Understand &ndash; Sort of<\/strong><\/span><\/p>\n<p>There are really only five types of questions that all data scientists deal with.<\/p>\n<ol>\n<li>Is this A or B?<\/li>\n<li>Is this weird?<\/li>\n<li>How much &ndash; or &ndash; How many?<\/li>\n<li>How is this organized?<\/li>\n<li>What should I do next?<\/li>\n<\/ol>\n<p>Now, if they&rsquo;re still with you, you can move on to Part 3 for some examples &ndash; but keep it short.<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Part 3 Some Examples &ndash; Keep it Short<\/strong><\/span><\/p>\n<ol>\n<li>Is this A or B?<\/li>\n<\/ol>\n<p>These questions are like predicting who will buy and who won&rsquo;t.&nbsp; Or with machines we might try to predict is that machine going to break down in the next week.<\/p>\n<ol start=\"2\">\n<li>Is this weird?<\/li>\n<\/ol>\n<p>We help your bank and credit card company a lot with this type.&nbsp; Is the transaction that just showed up on your credit card unusual for you so that maybe we should make sure it was really you.&nbsp; This is also where the world of cybersecurity comes in.&nbsp; We can look at individual incoming signals from outside your system and flag the ones that look suspicious.<\/p>\n<ol start=\"3\">\n<li>How much &ndash; or &ndash; How many?<\/li>\n<\/ol>\n<p>These questions are about numbers in the future.&nbsp; What will the price of oil be next month?&nbsp; What will be my sales in each of the next 12 months?<\/p>\n<ol start=\"4\">\n<li>How is this organized?<\/li>\n<\/ol>\n<p>Turns out that a lot of data, particularly about people naturally breaks into groups but those groups aren&rsquo;t necessarily easy to see without some math.&nbsp; So if we&rsquo;re going to recommend what movie to see, what music you might like, or even who you should consider dating we&rsquo;d answer them here.<\/p>\n<ol start=\"5\">\n<li>What should I do next?<\/li>\n<\/ol>\n<p>Some of these questions have only a few logical answers.&nbsp; Like, given two factors, like potential sales and the cost of the sale what&rsquo;s the optimum combination of the two that maximizes profit.&nbsp; The other types of questions here are even more interesting since they&rsquo;re how we program self-driving cars where the question might be, the light just turned yellow, should I brake or accelerate through.<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Part 4<\/strong><\/span><\/p>\n<p>Well there really isn&rsquo;t any perfectly designed Part 4.&nbsp; If you&rsquo;ve been a great story teller then maybe your audience is ready to ask you some questions.&nbsp; Maybe it&rsquo;s time to just listen and make room for the next speaker.<\/p>\n<p>You&rsquo;ve devoted thousands of hours to perfecting your skills.&nbsp; You&rsquo;re proud of your knowledge and can speak at length about math, tools, data, computer architecture, deep learning, IoT, and even AGI.&nbsp; What I&rsquo;ve found is that what most non-data scientist want is your elevator pitch.&nbsp; So keep it simple, keep it brief, and maybe try this approach to still get across most of the magic in what you do.<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blog\/list?user=0h5qapp2gbuf8\"><em><u>Other articles by Bill Vorhies<\/u><\/em><\/a><\/p>\n<p><em><u>&nbsp;<\/u><\/em><\/p>\n<p>About the author:&nbsp; Bill is Contributing Editor for Data Science Central.&nbsp; Bill is also President &amp; Chief Data Scientist at Data-Magnum and has practiced as a data scientist since 2001.&nbsp; His articles have been read more than 2.1 million times.<\/p>\n<p><a href=\"mailto:Bill@DataScienceCentral.com\">Bill@DataScienceCentral.com<\/a> <span>or<\/span> <a href=\"mailto:Bill@Data-Magnum.com\">Bill@Data-Magnum.com<\/a><\/p>\n<\/div>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/xn\/detail\/6448529:BlogPost:955491\">Go to Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Author: William Vorhies Summary:&nbsp; Explaining data science to a non-data scientist isn&rsquo;t as easy as it sounds.&nbsp; You may know a lot about math, tools, [&hellip;] <span class=\"read-more-link\"><a class=\"read-more\" href=\"https:\/\/www.aiproblog.com\/index.php\/2020\/12\/06\/explaining-data-science-to-a-non-data-scientist\/\">Read More<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":463,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[26],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/4164"}],"collection":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/comments?post=4164"}],"version-history":[{"count":0,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/4164\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media\/458"}],"wp:attachment":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media?parent=4164"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/categories?post=4164"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/tags?post=4164"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}