{"id":4525,"date":"2021-03-31T06:34:39","date_gmt":"2021-03-31T06:34:39","guid":{"rendered":"https:\/\/www.aiproblog.com\/index.php\/2021\/03\/31\/dsc-weekly-digest-29-march-2021\/"},"modified":"2021-03-31T06:34:39","modified_gmt":"2021-03-31T06:34:39","slug":"dsc-weekly-digest-29-march-2021","status":"publish","type":"post","link":"https:\/\/www.aiproblog.com\/index.php\/2021\/03\/31\/dsc-weekly-digest-29-march-2021\/","title":{"rendered":"DSC Weekly Digest 29 March 2021"},"content":{"rendered":"<p>Author: Kurt A Cagle<\/p>\n<div>\n<div style=\"width: 720px; font-family: Arial; margin-left: auto; margin-right: auto;\">\n<table style=\"width: 726px; height: autopx;\">\n<tbody>\n<tr>\n<td>\n<div style=\"width: 720px;\">\n<hr>\n<div class=\"dsc_editorialBlock\" style=\"width: 720px;\">\n<a href=\"https:\/\/www.education.datasciencecentral.com\/?utm_source=DSC&amp;utm_medium=tab\" target=\"_blank\" rel=\"noopener\"><img decoding=\"async\" src=\"https:\/\/scitechdaily.com\/images\/Amazing-Hubble-Image-of-Spiral-Galaxy-NGC-7331.jpg?profile=RESIZE_710x\" width=\"620\" class=\"align-full\"><\/a><\/p>\n<h2><strong>Data As A Galaxy<\/strong><\/h2>\n<div class=\"dsc_editorial\" style=\"text-align: justify;\">\n<p>One of the more significant &#8220;quiet&#8221; trends that I&#8217;ve observed in the last few years has been the migration of data to the cloud and with it the rise of Data as a Service (DaaS). This trend has had an interesting impact, in that it has rendered moot the question of whether it is better to centralize or decentralize data.<\/p>\n<p>There have always been pros and cons on both sides of this debate, and they are generally legitimate concerns. Centralization usually means greater control by an authority, but it can also force a bottleneck as everyone attempts to use the same resources. Decentralization, on the other hand, puts the data at the edges where it is most useful, but at the cost of potential pollution of namespaces, duplication and contamination. Spinning up another MySQL instance might seem like a good idea at the time, but inevitably the moment that you bring a database into existence, it takes on a life of its own.<\/p>\n<p>What seems to be emerging in the last few years is the belief that an enterprise data architecture should consist of multiple, concentric tiers of content, from highly curated and highly indexed data that represents the objects that are most significant to the organization, then increasingly looser, less curated content that represents the operational lifeblood of an organization, and outward from there to data that is generally not controlled by the organization and exists primarily in a transient state.<\/p>\n<p>Efficient data management means recognizing that there is both a cost and a benefit to data authority. A manufacturer&#8217;s data about its products is unique to that company, and as such, it should be seen as being authoritative. This data and metadata about what it produces has significant value both to itself and to the users of those products, and this tier usually requires significant curational management but also represents the greatest value to that company&#8217;s customers.<\/p>\n<p>Customer databases, on the other hand, may seem like they should be essential to an organization, but in practice, they usually aren&#8217;t. This is because customers, while important to a company from a revenue standpoint, are also fickle, difficult to categorize, and frequently subject to change their minds based upon differing needs, market forces, and so forth beyond the control of any single company. This data is usually better suited for the mills of machine learning, where precision takes a back seat to gist.<\/p>\n<p>Finally, on the outer edges of this galactic data, you get into the manifestation of data as social media. There is no benefit to trying to consume all of Google or even Twitter without taking on all of the headaches of being Google or Twitter without any of the benefits. This is data that is sampled, like taking soundings or wind measurements in the middle of a boat race. The individual measurements are relatively unimportant, only the broader term implications.<\/p>\n<p>From an organizational standpoint, it is crucial to understand the fact that the value of data differs based upon its context, authority, and connectedness. Analytics, ultimately, exists to enrich the value of the authoritative content that an organization has while determining what information has only transient relevance. A data lake or operational warehouse that contains the tailings from social media is likely a waste of time and effort unless the purpose of that data lake is to hold that data in order to glean transient trends, something that machine learning is eminently well suited for.\u00a0<\/p>\n<p class=\"MsoNormal\">This is why we run Data Science Central, and why we are expanding its focus to consider the width and breadth of digital transformation in our society.\u00a0<a href=\"https:\/\/www.datasciencecentral.com\/\">Data Science Central<\/a>\u00a0is your community. It is a chance to learn from other practitioners, and a chance to communicate what you know to the data science community overall. I encourage you to\u00a0<a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/so-you-want-to-write-for-dsc-1\">submit original articles<\/a>\u00a0and to make your name known to the people that are going to be hiring in the coming year. As always let us know what you think.<\/p>\n<p><span style=\"font-family: arial, helvetica, sans-serif;\">In media res,<\/span><br \/><span style=\"font-family: arial, helvetica, sans-serif;\"><a href=\"mailto:kcagle@techtarget.com\">Kurt Cagle<\/a><\/span><br \/><span style=\"font-family: arial, helvetica, sans-serif;\">Community Editor,<\/span><br \/><span style=\"font-family: arial, helvetica, sans-serif;\"><a href=\"https:\/\/datasciencecentral.com\/\">Data Science Central<\/a><\/span><\/p>\n<\/div>\n<\/div>\n<div>\n<hr>\n<div>\n<div class=\"dsc_section\" style=\"width: 720px;\">\n<h2 class=\"dsc_subtitle\"><span style=\"font-family: arial, helvetica, sans-serif;\">DSC Featured Articles<\/span><\/h2>\n<div class=\"dsc_sectionBody\">\n<ul>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/data-agility-and-popularity-vs-data-quality-in-self-serve-bi-an-1#KartikPatel44284\">Data Agility and &#8216;Popularity&#8217; vs. Data Quality in Self-Serve BI and Analytics<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/on-demand-mobile-apps-must-have-features-and-trends#YuliaKondratyuk44284\">On-demand Mobile Apps: Must-have Features and Trends<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/big-data-in-the-healthcare-industry-definition-implementation#VarvaraMasalitina44284\">Big Data in the Healthcare Industry: Definition, Implementation, Risks<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/important-skills-needed-to-become-a-successful-data-scientist-in#MikeAlreend44284\">Important Skills Needed to Become a Successful Data Scientist in 2021<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/how-big-data-can-improve-your-golf-game-1#JordanFuller44284\">How Big Data Can Improve Your Golf Game<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/r-vs-python-vs-julia-how-easy-it-is-to-write-efficient-code#DanielMoura44284\">R vs. Python vs.\u00a0Julia: How easy it is to write efficient code?<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/the-role-of-iot-and-big-data-in-payroll-process-for-businesses#SwetaSharma44284\">The Role Of IoT and Big Data In Payroll Process For Businesses<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/causal-ai-dictum-a-dataset-is-model-free#RobertRTucci44284\">Causal AI dictum: A dataset is model-free<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/homework-assignment-create-a-covid19-at-risk-score#BillSchmarzo44284\">Homework Assignment: Create a COVID19 At-Risk Score<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/3d-imaging-market-is-expected-to-generate-a-revenue-55-77-billion#AbhishekPeter44284\">3D imaging Market is Expected to Generate a revenue $ 55.77 Billion by 2027, Despite the Covid-19 Outbreak<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/how-agile-methodology-assists-the-development-of-smart-and#wilsonalton96144284\">How Agile Methodology Assists the Development of Smart and Working Software Products?<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/all-the-skills-require-for-a-data-scientist#ShekhSadliAlZadid44284\">All the Skills Require For A Data Scientist<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/markov-decision-processes-1#MonikaSangwan44284\">Markov Decision Processes<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/6448529:BlogPost:1045452#AndreasBlumauer44284\">Knowledge Organization: Make Semantics explicit<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/a-simple-way-for-getting-started-with-fast-ai#ajitjaokar44284\">A simple way for getting started with fast.ai for pytorch<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/unleashing-the-business-value-of-technology-part-2-connecting-to#BillSchmarzo44284\">Unleashing the Business Value of Technology Part 2: Connecting to Value<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/defining-and-measuring-chaos-in-data-sets-why-and-how-in-simple-w#VincentGranville44284\">Defining and Measuring Chaos in Data Sets: Why and How, in Simple Words<\/a><\/li>\n<li><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/dsc-weekly-digest-22-march-2021#KurtCagle44284\">DSC Weekly Digest 22 March 2021<\/a><\/li>\n<\/ul>\n<\/div>\n<div>\n<hr>\n<div><span style=\"font-family: arial, helvetica, sans-serif;\"><strong><span style=\"font-size: 1.5em;\">TechTarget Articles<\/span><\/strong><\/span><\/div>\n<div class=\"dsc_section\" style=\"width: 720px;\">\n<div class=\"dsc_sectionBody\">\n<ul>\n<li><a href=\"https:\/\/searchenterpriseai.techtarget.com\/definition\/supervised-learning#DavidPetersson44284\">supervised learning<\/a><\/li>\n<li><a href=\"https:\/\/searchbusinessanalytics.techtarget.com\/feature\/A-look-at-the-DataOps-engineer-role-and-responsibilities#LisaMorgan44284\">A look at the DataOps engineer role and responsibilities<\/a><\/li>\n<li><a href=\"https:\/\/searchitchannel.techtarget.com\/news\/252498495\/Blue-Yonder-tackles-supply-chain-digital-transformation#JohnMoore44284\">Blue Yonder tackles supply chain digital transformation<\/a><\/li>\n<li><a href=\"https:\/\/whatis.techtarget.com\/reference\/5-things-to-think-about-before-switching-IT-career-paths#SheekhaSingh44284\">5 things to think about before switching IT career paths<\/a><\/li>\n<li><a href=\"https:\/\/internetofthingsagenda.techtarget.com\/blog\/IoT-Agenda\/Use-a-zero-trust-approach-to-combat-IoT-security-risks#PeterNewton44284\">Use a zero trust approach to combat IoT security risks<\/a><\/li>\n<\/ul>\n<\/div>\n<div>\n<hr>\n<div><span style=\"font-family: arial, helvetica, sans-serif;\"><strong><span style=\"font-size: 1.5em;\">Picture of the Week<\/span><\/strong><\/span><\/div>\n<div class=\"dsc_section\" style=\"width: 720px;\">\n<div class=\"dsc_sectionBody\">\n<div class=\"dsc_imageFull\"><span style=\"font-family: arial, helvetica, sans-serif;\"><span style=\"font-size: 13px;\"><a href=\"https:\/\/multimedia.getresponse360.com\/datascience-B\/photos\/ab38910a-861e-40d5-b9db-3f3c522bce05.png\" target=\"_blank\" rel=\"noopener\"><img decoding=\"async\" src=\"https:\/\/multimedia.getresponse360.com\/datascience-B\/photos\/ab38910a-861e-40d5-b9db-3f3c522bce05.png?profile=RESIZE_710x\" width=\"720\" class=\"align-full\"><\/a><\/span><\/span><\/div>\n<div>\n<center><span style=\"font-family: arial, helvetica, sans-serif;\"><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blog\/list?tag=dsc_cxo\" target=\"_blank\" rel=\"noopener\">Data Intensity Is Increasing<\/a><\/span><\/center>\n<\/div>\n<\/div>\n<div>\n<hr>\n<div class=\"dsc_section\" style=\"width: 720px;\">\n<div class=\"dsc_sectionBody\">\n<p style=\"text-align: center;\"><span style=\"font-family: arial, helvetica, sans-serif;\"><i>\u00a0<\/i><\/span><\/p>\n<div align=\"center\" style=\"font-family: Arial, Helvetica, sans-serif; font-size: 13px;\">\n<hr style=\"font-family: 'Times New Roman'; font-size: medium; text-align: start;\">\n<div style=\"margin: 1em 1em; font-family: Arial, Helvetica, sans-serif; font-size: 13px;\"><span style=\"font-family: arial, helvetica, sans-serif;\"><i>To make sure you keep getting these emails, please add mail@newsletter.datasciencecentral.com to your address book or whitelist us.<\/i><\/span><\/div>\n<hr style=\"font-family: 'Times New Roman'; font-size: medium; text-align: start;\">\n<div style=\"margin: 1em 1em;\">\n<span style=\"font-family: arial, helvetica, sans-serif;\"><a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/check-out-our-dsc-newsletter\">Join Data Science Central<\/a>\u00a0|\u00a0<a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blogs\/comprehensive-repository-of-data-science-and-ml-resources\">Comprehensive Repository of Data Science and ML Resources<\/a><\/p>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/video\/video\/listFeatured\">Videos<\/a>\u00a0|\u00a0<a href=\"https:\/\/www.datasciencecentral.com\/page\/search?q=Python\">Search DSC<\/a>\u00a0|\u00a0<a href=\"https:\/\/www.datasciencecentral.com\/profiles\/blog\/new\">Post a Blog<\/a>\u00a0|\u00a0<a href=\"https:\/\/www.datasciencecentral.com\/forum\/topic\/new\">Ask a Question<\/a><\/span> <\/p>\n<p><span style=\"font-family: arial, helvetica, sans-serif; font-size: 14px;\"><strong>Follow us on Twitter<\/strong>:\u00a0<a href=\"https:\/\/www.twitter.com\/datasciencectrl\">@DataScienceCtrl<\/a>\u00a0|\u00a0<a href=\"https:\/\/www.twitter.com\/analyticbridge\">@AnalyticBridge<\/a><\/span>\n<\/div>\n<div style=\"margin: 1em 1em;\"><span style=\"font-family: arial, helvetica, sans-serif;\"><span style=\"font-size: 10.5pt; color: #172b4d;\">This email, and all related content, is published by\u00a0Data Science Central, a division of<\/span> <span style=\"font-size: 10.5pt;\"><a href=\"https:\/\/go.techtarget.com\/r\/139811236\/26795532\/28\" target=\"_blank\" rel=\"noopener\"><span style=\"color: #0052cc; text-decoration-line: none;\">TechTarget, Inc.<\/span><\/a><\/span><\/span><\/div>\n<p style=\"margin: 0in 0in 0in .5in;\"><span style=\"font-family: arial, helvetica, sans-serif;\"><span style=\"font-size: 10.5pt; color: #172b4d;\">275 Grove Street, Newton, Massachusetts, 02466 US<\/span><\/span><\/p>\n<p style=\"margin: 0in 0in 0in .5in;\"><span style=\"font-family: arial, helvetica, sans-serif;\"><span style=\"font-size: 10.5pt;\"><br \/><\/span> <span style=\"font-size: 10.5pt; color: #172b4d;\">You are receiving this email because you are a member of TechTarget. When you access content from this email, your information may be shared with the sponsors or future sponsors of that content and with our Partners, see up-to-date\u00a0<\/span> <span style=\"font-size: 10.5pt;\"><a href=\"https:\/\/go.techtarget.com\/r\/139811243\/26795532\/29\" target=\"_blank\" rel=\"noopener\"><span style=\"color: #0052cc; text-decoration-line: none;\">Partners List<\/span><\/a><\/span> <span style=\"font-size: 10.5pt; color: #172b4d;\">\u00a0below, as described in our\u00a0<\/span> <span style=\"font-size: 10.5pt;\"><a href=\"https:\/\/go.techtarget.com\/r\/139811244\/26795532\/30\" target=\"_blank\" rel=\"noopener\"><span style=\"color: #0052cc; text-decoration-line: none;\">Privacy Policy<\/span><\/a><\/span> <span style=\"font-size: 10.5pt; color: #172b4d;\">. For additional assistance, please contact:\u00a0<\/span> <span style=\"font-size: 10.5pt;\"><a href=\"http:\/\/feeds.feedburner.com\/mailt&amp;utmwebmaster@techtarget.com\" target=\"_blank\" rel=\"noopener\"><span style=\"color: #0052cc; text-decoration-line: none;\">webmaster@techtarget.com<\/span><\/a><\/span><\/span><\/p>\n<p style=\"margin: 0in 0in 0in .5in;\"><span style=\"font-family: arial, helvetica, sans-serif;\"><span style=\"font-size: 10.5pt;\"><br \/><\/span> <span style=\"font-size: 10.5pt; color: #172b4d;\">copyright 2021 TechTarget, Inc. all rights reserved. Designated trademarks, brands, logos and service marks are the property of their respective owners.<\/span><\/span><\/p>\n<p class=\"MsoNormal\" style=\"margin-left: .5in;\"><span style=\"font-family: arial, helvetica, sans-serif;\"><span style=\"font-size: 10.5pt;\"><a href=\"https:\/\/go.techtarget.com\/r\/139811244\/26795532\/31\" target=\"_blank\" rel=\"noopener\"><span style=\"color: #0052cc; text-decoration-line: none;\">Privacy Policy<\/span><\/a><\/span> <span style=\"font-size: 10.5pt; color: #172b4d;\">\u00a0|\u00a0<\/span> <span style=\"font-size: 10.5pt;\"><a href=\"https:\/\/go.techtarget.com\/r\/139811243\/26795532\/32\" target=\"_blank\" rel=\"noopener\"><span style=\"color: #0052cc; text-decoration-line: none;\">Partners List<\/span><\/a><\/span><\/span><\/p>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<p><a href=\"https:\/\/newsletter.datasciencecentral.com\/message_create_message.html?stid=&amp;messages_id=2698&amp;_messageId=2698#nextStep\" class=\"button blue\"><br \/><\/a><\/p>\n<\/div>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/xn\/detail\/6448529:BlogPost:1045859\">Go to Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Author: Kurt A Cagle Data As A Galaxy One of the more significant &#8220;quiet&#8221; trends that I&#8217;ve observed in the last few years has been [&hellip;] <span class=\"read-more-link\"><a class=\"read-more\" href=\"https:\/\/www.aiproblog.com\/index.php\/2021\/03\/31\/dsc-weekly-digest-29-march-2021\/\">Read More<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":469,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[26],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/4525"}],"collection":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/comments?post=4525"}],"version-history":[{"count":0,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/4525\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media\/474"}],"wp:attachment":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media?parent=4525"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/categories?post=4525"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/tags?post=4525"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}