{"id":1521,"date":"2019-01-01T06:34:29","date_gmt":"2019-01-01T06:34:29","guid":{"rendered":"https:\/\/www.aiproblog.com\/index.php\/2019\/01\/01\/5-predictions-about-data-science-machine-learning-and-ai-for-2019\/"},"modified":"2019-01-01T06:34:29","modified_gmt":"2019-01-01T06:34:29","slug":"5-predictions-about-data-science-machine-learning-and-ai-for-2019","status":"publish","type":"post","link":"https:\/\/www.aiproblog.com\/index.php\/2019\/01\/01\/5-predictions-about-data-science-machine-learning-and-ai-for-2019\/","title":{"rendered":"5 Predictions about Data Science, Machine Learning, and AI for 2019"},"content":{"rendered":"<p>Author: William Vorhies<\/p>\n<div>\n<p><strong><em>Summary:<\/em><\/strong><em>\u00a0 Here are our 5 predictions for data science, machine learning, and AI for 2019.\u00a0 We also take a look back at last year\u2019s predictions to see how we did.<\/em><\/p>\n<p>\u00a0<\/p>\n<p><a href=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/401132209?profile=original\" target=\"_blank\" rel=\"noopener\"><img decoding=\"async\" src=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/401132209?profile=original&#038;width=250\" width=\"250\" class=\"align-right\"><\/a>It\u2019s that time of year again when we do a look back in order to offer a look forward.\u00a0 What trends will speed up, what things will actually happen, and what things won\u2019t in the coming year for data science, machine learning, and AI.<\/p>\n<p>We\u2019ve been watching and reporting on these trends all year and we scoured the web and some of our professional contacts to find out what others are thinking.\u00a0<\/p>\n<p>\u00a0<\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Here\u2019s a Quick Look at Last Year\u2019s Predictions and How We Did.<\/strong><\/span><\/p>\n<ol>\n<li><em>What we said: Both model production and data prep will become increasingly automated. Larger data science operations will converge on a single platform (of many available).\u00a0 Both of these trends are in response to the groundswell movement for efficiency and effectiveness.\u00a0 In a nutshell allowing fewer data scientists to do the work of many.<\/em>\u00a0<\/li>\n<\/ol>\n<p>Clearly a win.\u00a0 <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/practicing-no-code-data-science\"><em><u>No code data scie<\/u><\/em><\/a><em><u>nce<\/u><\/em> is on the rise as is end-to-end integration in advanced analytic platforms.<\/p>\n<ol start=\"2\">\n<li><em>What we said: Data Science continues to develop specialties that mean the mythical \u2018full stack\u2019 data scientist will disappear.<\/em><\/li>\n<\/ol>\n<p>Yep.\u00a0 And there\u2019s <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/data-engineer-and-business-analyst-might-be-the-best-data-science\"><em><u>more talk about Data Engineers<\/u><\/em><\/a> now than Data Scientists.\u00a0 Data Engineers are the folks who are supposed make those data science models work in the real world.<\/p>\n<ol start=\"3\">\n<li><em>What we said: Non-Data Scientists will perform a greater volume of fairly sophisticated analytics than data scientists.<\/em><\/li>\n<\/ol>\n<p>It\u2019s true.\u00a0 See the popularity of Data Viz and Visual Analytics as bridge technologies to let non-data scientists extract more value from sophisticated data science tools.<\/p>\n<ol start=\"4\">\n<li><em>What we said: Deep learning is complicated and hard. Not many data scientists are skilled in this area and that will hold back the application of AI until the deep learning platforms are significantly simplified and productized.\u00a0<\/em><\/li>\n<\/ol>\n<p>Both Microsoft and Google rolled out <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/automated-deep-learning-so-simple-anyone-can-do-it\"><em><u>automated deep learning platforms<\/u><\/em><\/a> in 2018.\u00a0 These started with transfer learning but are headed toward full AutoDL.\u00a0 There are also a number of startup integrated AutoDL platforms.\u00a0 We reviewed <a href=\"https:\/\/www.oneclick.ai\/\"><em><u>OneClick.AI<\/u><\/em><\/a> earlier this year with both a full AutoML and AutoDL platform.\u00a0 Gartner recently nominated <a name=\"_Toc526321075\"><\/a><a href=\"https:\/\/dimensionalmechanics.com\/\"><em><u>DimensionalMechanics<\/u><\/em><\/a> as one of its \u201c5 Cool Companies\u201d with an AutoDL platform.<\/p>\n<ol start=\"5\">\n<li><em>What we said: Despite the hype, penetration of AI and deep learning into the broader market will be relatively narrow and slower than you think.<\/em><\/li>\n<\/ol>\n<p>With the exception of chatbots which are becoming ever present, applications of real AI in business are narrow.\u00a0 They\u2019re coming but there not there yet.\u00a0 The <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/the-case-for-just-getting-your-feet-wet-with-ai\"><em><u>most comprehensive studies I\u2019ve seen<\/u><\/em><\/a> show that among large companies only 1\/5<sup>th<\/sup> to 1\/3<sup>rd<\/sup> are implementing AI \u2018at scale\u2019, meaning they are putting on a full court press.\u00a0 Among smaller companies the percentage is much much smaller.\u00a0 And we\u2019re not really sure if they mean <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/the-two-conflicting-definitions-of-ai\"><em><u>\u2018true\u2019 AI<\/u><\/em><\/a>.<\/p>\n<ol start=\"6\">\n<li><em>What we said: The public (and the government) will start to take a hard look at social and privacy implications of AI, both intended and unintended.<\/em><\/li>\n<\/ol>\n<p>Just take a look at the news and the stream of top social media executives being called on the carpet by our government and the EU.\u00a0 Begins to look like a perp walk.\u00a0 Government regulation is coming if not at the federal level, then in a more chaotic state-by-state balkanization like the California privacy rules about to go into effect or the Australian mandatory anti-encryption requirements.<\/p>\n<p>So we\u2019re calling that 6 for 6 wins on last year.\u00a0 While many of last year\u2019s predictions could serve as well for next year we\u2019ll try to be a little more specific.\u00a0<\/p>\n<p>\u00a0<\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Here\u2019s What We See for 2019.<\/strong><\/span><\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Prediction 1: Data Becomes More Important Than Algorithms<\/strong><\/span><\/p>\n<p>It\u2019s been more than a year since we had any major new breakthroughs in either deep learning or classical machine learning.\u00a0 There\u2019s been some incremental improvement like using <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/temporal-convolutional-nets-tcns-take-over-from-rnns-for-nlp-pred\"><em><u>Temporal Convolutional Nets (TCNs)<\/u><\/em><\/a> instead of RNNs to reduce latency in NLP but no huge new innovations.\u00a0 The top performing algorithms are pretty well known or can easily be discovered using Automated Machine Learning.<\/p>\n<p>We\u2019re well into the period when having more and better data is the key to success as companies make their journey of digital transformation.\u00a0 As a practical matter this opens up opportunities for competition in providing data-related solutions that\u2019s moving in several directions at once.<\/p>\n<p>One axis is that getting accurately labeled training data for either image or text is still extremely expensive and time consuming.\u00a0 Companies like Figure Eight which specialize in labeling data are promoting smart cost effective strategies like <a href=\"https:\/\/www.datasciencecentral.com\/video\/dsc-webinar-series-ai-models-and-active-learning\"><em><u>Active Learning<\/u><\/em><\/a> that let you determine the optimum tradeoff between labeled data and model accuracy.\u00a0 That involves multiple iterations of adding either human-labeled or machine-labeled data followed by retraining to find the best tradeoff.<\/p>\n<p>Another axis is access to third party data.\u00a0 Service companies like DymstData have entered the field as clearing houses for many <a href=\"https:\/\/www.datasciencecentral.com\/video\/dsc-webinar-series-transforming-3rd-party-data-into-actionable\"><em><u>hundreds of types of append data<\/u><\/em><\/a>.\u00a0 They\u2019re also taking on the onerous role of making sure sensitive PII is protected and that their users can enforce role based accessed to certain sensitive information particularly important in financial and health services.<\/p>\n<p>The third axis is automatically tracking and recording the provenance of data used in models.\u00a0 Particularly when streaming data from many sources is being integrated and changes over time, knowing where it originated and how it has been used is critical to both accuracy and to compliance.\u00a0 Tibco and some other analytic platforms are incorporating this feature.<\/p>\n<p>Look for service offerings around data to expand significantly this year.<\/p>\n<p>\u00a0<\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Prediction 2: Everything Gets Easier as AI\/ML Moves Off Analytic Platforms Onto Industry or Process Specific Applications.<\/strong><\/span><\/p>\n<p>A quick look around the world of AI\/ML startups shows that competition is moving to industry or process specific applications.\u00a0 These applications or mini-platforms are focused on solving industry specific problems in businesses as diverse as marketing, B2B sales, healthcare, fintech, and roughly a dozen other defined groupings.\u00a0 For a quick view take a look at <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/the-best-opportunities-in-ai-for-data-scientists\"><em><u>CB Insights annual AI 100 winners<\/u><\/em><\/a> in the nearby chart and the way they are grouped by industry or process.<\/p>\n<p><a href=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/401141907?profile=original\" target=\"_blank\" rel=\"noopener\"><img decoding=\"async\" src=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/401141907?profile=original&#038;width=250\" width=\"250\" class=\"align-right\"><\/a>These new applications are focused on embedding the AI\/ML so that the user\u2019s organization does not need support from a large in-house group of data scientists and can rely on these developers to continue to provide updates and improvements.<\/p>\n<p>Some call this the commoditization of AI\/ML but more accurately you might think of this as the specialization of AI\/ML.\u00a0<\/p>\n<p>If you\u2019ve been around long enough to remember the transition from Reengineering to ERPs in the late 90s this is very much the same thing.\u00a0 Initially Reengineering called on companies to improve processes with complex custom developed IT solutions because standardized solutions didn\u2019t yet exist.\u00a0 That rapidly opened the door to the rise of the major integrated ERPs from Oracle, PeopleSoft, SAP and others, and also for specialized applications like CRMs.\u00a0 Our industry is undergoing this same change right now.<\/p>\n<p>These new vendors all strive to provide broad solutions in their particular niches but inevitably end up with less than grand ERP-scale platforms.\u00a0 Watch for many waves of consolidation among developers in each of these industry groups.\u00a0<\/p>\n<p>Watch also for accelerating rates of AI\/ML adoption in mid-size and smaller companies who no longer have to have large data science teams or to rely exclusively on custom developed models.<\/p>\n<p>\u00a0<\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Prediction 3: Rise of the Data Engineer and Data Analyst<\/strong><\/span><\/p>\n<p>It\u2019s not that the world has fallen out of love with data scientists.\u00a0 Far from it.\u00a0 But when you have a shortage of a skill then the market moves to ease that pain by filling it in different ways.<\/p>\n<p>One way is through the industry and process specific smart applications we discussed above that don\u2019t require great squads of in house data scientists.<\/p>\n<p>The second is what\u2019s going on in all the major analytic platforms and the dozens of Automated Machine Learning (AML) platforms that are rapidly emerging.\u00a0 That is to be more efficient in data science meaning that fewer data scientists can do the work of many.<\/p>\n<p>Since the volume of models does not decrease, in fact increases, this <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/data-engineer-and-business-analyst-might-be-the-best-data-science\"><em><u>moves the work load to data engineers<\/u><\/em><\/a> who have two primary functions.\u00a0<\/p>\n<p>First, to be able to create the required infrastructure required for data science like data lakes and Spark instances.\u00a0<\/p>\n<p>The second is to be the one who takes the models and ensures they are implemented in operational systems and tracked for accuracy and refresh.\u00a0<\/p>\n<p>Some data engineers are also <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/dataops-it-s-a-secret\"><em><u>responsible for DataOps<\/u><\/em><\/a>, ensuring a clean and preprocessed data stream.<\/p>\n<p>The other evolution of analytic platforms is the growth of Visual Analytics and Data Visualization tools.\u00a0 These are now mostly fully integrated alongside the data science toolset and allow data analysts and LOB managers to extract more value and even guide efforts in analytics.\u00a0 They don\u2019t replace data scientists.\u00a0 It reinforces the team aspect that advanced analytics is becoming.<\/p>\n<p>\u00a0<\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Prediction 4: Neuromorphic Chips:\u00a0 AI Comes to the Edge for IoT<\/strong><\/span><\/p>\n<p>Two different technologies are reaching semi-maturity at the same time to solve a long standing problem.\u00a0 That problem is latency.<\/p>\n<p>Consider for example that when you want to use your mobile device to automatically translate those text or image foreign words into English, or vice versa that your device actually sends that signal all the way back to the app in the cloud where the translation occurs, then all the way back to your device.\u00a0<\/p>\n<p>Google and others providing instant translation services have been moving from RNN to specialized CNN structures called <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/temporal-convolutional-nets-tcns-take-over-from-rnns-for-nlp-pred\"><em><u>Temporal Convolutional Nets<\/u><\/em><\/a> because RNNs don\u2019t adapt well to MPP but CNNs do.\u00a0 That switch reduced latency, but the signal still had to make the full round trip.<\/p>\n<p>The first of two technologies to solve this problem is 5G networks.\u00a0 You may be aware that 5G is faster but its real benefit is the density of traffic it can carry.\u00a0 This really opens the door to letting pretty much everything in your life communicate on the internet.\u00a0 How many of those will prove worthwhile remains to be seen.<\/p>\n<p><a href=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/401146372?profile=original\" target=\"_blank\" rel=\"noopener\"><img decoding=\"async\" src=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/401146372?profile=original&#038;width=250\" width=\"250\" class=\"align-right\"><\/a>The second solution is the introduction of new and better neuromorphic chips (aka <a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/more-on-3rd-generation-spiking-neural-nets\"><em><u>spiking neural networks<\/u><\/em><\/a>).\u00a0 We hope that these totally new architectures for neural nets may be the gateway to achieving Artificial General Intelligence (AGI).\u00a0 That\u2019s still quite a way off.\u00a0<\/p>\n<p>However, both the major chip makers and several startups are now releasing spiking neuromorphic chips specially optimized to do CNN and RNN style models on the chip (without the round trip signal).\u00a0 Some of these are also optimized for very low power consumption.<\/p>\n<p>Together these features are ideal for moving deep learning onto chips that can exist at the true edge of a network.\u00a0 Watch IoT and other streaming data apps explode with these new capabilities starting this year.<\/p>\n<p>\u00a0<\/p>\n<p><span style=\"font-size: 12pt;\"><strong>Prediction 5:\u00a0 Different AI Frameworks Learn to Speak to Each Other<\/strong><\/span><\/p>\n<p>Now that text, speech, image, and video models are going mainstream we\u2019ve hit an unexpected barrier.\u00a0 Models built on one framework (Caffe2, PyTorch, Apache MXNet, Microsoft Cognitive Toolkit, and TensorFlow) can\u2019t easily be ported to a different framework.\u00a0 We can translate speech, but we did it with a veritable Tower of Babel.<\/p>\n<p>Fortunately pain points like this drive innovation.\u00a0 AWS, Facebook and Microsoft have collaborated to build Open Neural Network Exchange (ONNX), making models interoperable on different frameworks.<\/p>\n<p>ONNX is shaping up to be a key technology this coming year as the number of models being shared among developers, apps, and devices becomes larger and larger.<\/p>\n<p>Well that\u2019s it for this year.\u00a0 Tune in next year and we\u2019ll see how we did.<\/p>\n<p>\u00a0<\/p>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blog\/list?user=0h5qapp2gbuf8\"><em><u>Other articles by Bill Vorhies.<\/u><\/em><\/a><\/p>\n<p>\u00a0<\/p>\n<p>About the author:\u00a0 Bill is Editorial Director for Data Science Central.\u00a0 Bill is also President &#038; Chief Data Scientist at Data-Magnum and has practiced as a data scientist since 2001.\u00a0\u00a0\u00a0 He can be reached at:<\/p>\n<p><a href=\"mailto:Bill@DataScienceCentral.com\">Bill@DataScienceCentral.com<\/a> <span>or<\/span> <a href=\"mailto:Bill@Data-Magnum.com\">Bill@Data-Magnum.com<\/a><\/p>\n<\/div>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/xn\/detail\/6448529:BlogPost:786089\">Go to Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Author: William Vorhies Summary:\u00a0 Here are our 5 predictions for data science, machine learning, and AI for 2019.\u00a0 We also take a look back at [&hellip;] <span class=\"read-more-link\"><a class=\"read-more\" href=\"https:\/\/www.aiproblog.com\/index.php\/2019\/01\/01\/5-predictions-about-data-science-machine-learning-and-ai-for-2019\/\">Read More<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":463,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[26],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/1521"}],"collection":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/comments?post=1521"}],"version-history":[{"count":0,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/1521\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media\/465"}],"wp:attachment":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media?parent=1521"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/categories?post=1521"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/tags?post=1521"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}