{"id":4236,"date":"2020-12-26T06:34:20","date_gmt":"2020-12-26T06:34:20","guid":{"rendered":"https:\/\/www.aiproblog.com\/index.php\/2020\/12\/26\/gross-underestimates-and-overestimates-from-the-same-data-covid-19-death-rates-example\/"},"modified":"2020-12-26T06:34:20","modified_gmt":"2020-12-26T06:34:20","slug":"gross-underestimates-and-overestimates-from-the-same-data-covid-19-death-rates-example","status":"publish","type":"post","link":"https:\/\/www.aiproblog.com\/index.php\/2020\/12\/26\/gross-underestimates-and-overestimates-from-the-same-data-covid-19-death-rates-example\/","title":{"rendered":"Gross Underestimates and Overestimates from the Same Data: Covid-19 Death Rates Example"},"content":{"rendered":"<p>Author: Stephanie Glen<\/p>\n<div>\n<ul>\n<li>Ten-fold differences have been reported in Covid-19 death rates.<\/li>\n<li>Estimate issues are because of how the data is calculated.<\/li>\n<li>Why the same data can yield gross overestimates <em>and<\/em> underestimates.<a href=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/8327310869?profile=original\" target=\"_blank\" rel=\"noopener noreferrer\"><br \/><img decoding=\"async\" src=\"https:\/\/storage.ning.com\/topology\/rest\/1.0\/file\/get\/8327310869?profile=RESIZE_710x\" class=\"align-full\"><\/a><\/li>\n<\/ul>\n<p><strong>What is the actual death rate for Covid-19?<\/strong> After nearly a year of the pandemic, <em>no one can agree on an answer.<\/em> Depending on which expert you ask, it&#8217;s somewhere between 0.53% and 6% for the general population and possibly as high as 35% for older patients with certain pre-existing conditions.&nbsp;These widely varying figures illustrate how difficult it is to take data and make predictions&#8211;even with the best data and <a href=\"https:\/\/searchenterpriseai.techtarget.com\/definition\/machine-learning-ML?_ga=2.130900104.644581112.1608769691-835279483.1603980226\" target=\"_blank\" rel=\"noopener noreferrer\">machine learning<\/a> tools at your disposal.<\/p>\n<p>Part of the problem is clarity: news sources and blogs in particular cite statistics without clarifying exactly what statistic they are talking about. For example, one MedPageToday article [1] mentions a &#8220;fatality rate&#8221;; The article doesn&#8217;t make it immediately clear if that fatality rate applies to hospitalized cases, cases who have tested positive, or the population as a whole (each of these rates would be vastly different).<\/p>\n<p>Despite Covid-19- fueling one of the largest explosions of scientific literature in history [2], we&#8217;re not even close to accurately figuring out what percentage of the population the virus actually kills. All of the reported figures amount to nothing more than data-driven guesswork.<\/p>\n<p><strong>The Anatomy of a Death Rate Calculation<\/strong><\/p>\n<p>A recent episode of The Guardian&#8217;s Science Weekly Update [3] addressed the question of why Covid-19 fatality rates vary so much. In the program, Paul Hunter, professor of medicine at the University of East Anglia,&nbsp; explains that the figures such as the World Health Organization&#8217;s reported death rate of 3.4 percent was calculated by the number of Covid-19 related deaths (as recorded on death certificates) divided by the number of confirmed cases (based on positive Covid-19 tests). That figure, called the &#8220;case fatality rate (CFR)&#8221; is is a statistic that Dr. Robert Pearl M.D. calls &#8220;inaccurate and misleading&#8221; [4]. Why? Depending on how you look at it, it&#8217;s either a gross underestimate <em>or<\/em> overestimate.<\/p>\n<p>The&nbsp; estimation issues arise because of how the figures are calculated. The&nbsp;<em>CFR<\/em> is calculated by recording people at the beginning of their illness and at the end of their illness&#8211;people who are still ill when the data is being recorded may still go on to die after the figures have been tallied. The 3.4%&nbsp; then is an <strong>underestimate<\/strong>&#8211;the people who are currently sick and go on to die will push that percentage up to around 5 to 6%.&nbsp; Although around 5% might seem like a reasonable estimate (probably one that matches figures you&#8217;ve often seen in the news), note how the above figure is obtained in the first place&#8211; the number of deaths divided by the number of cases. There are an unknown number of people, possibly up to 10 times higher than the official counts [5] of people with the virus who <em>don&#8217;t<\/em> get tested. If we could count all of these cases, most of which are probably mild or asymptomatic, then the death rate would be significantly lower, meaning that 3.4% is actually an <strong>overestimate<\/strong>. The actual number of deaths as it relates to the actual number of cases in the population is called the &#8220;true infection fatality rate (<em>IFR<\/em>)&#8221; and may be as low as&nbsp; 0.53% [7].&nbsp;<\/p>\n<p>The solution to more accurate reporting seems clear: find more cases. But this isn&#8217;t as easy as it sounds. One recent study showed that in France, a paltry 10% of Covid-19 cases were actually detected [8].<\/p>\n<p><strong>Throwing in a Few More Complications<\/strong><\/p>\n<p>Complicating matters even further is that, geographically speaking, detection rates also vary widely . When you try to compare death rate between countries, there may be more than a 20-fold difference in identified cases [5].&nbsp;<\/p>\n<p>Other issues that have lead to overestimates include&nbsp;not accounting for an aging population [6] or the presence of pre-existing medical conditions; The fatality rate for younger, healthier individuals is significantly lower than for older individuals with pre-existing conditions.&nbsp;Researchers at Johns Hopkins used machine learning to discover that age is the strongest predictor of who dies from Covid-19, ranging from a 1% fatality rate for the under-50s to a whopping 34 percent for those over age 85. However, those figures were also based on patients who are symptomatic and is therefore also an overestimate of death risk.&nbsp;<\/p>\n<p>Calculating Covid-19 death rates in the population is a challenge, and case counts are unreliable. In general, we can say that the&nbsp;organizations that are better at identifying mild cases will have the most accurate figures. However, identifying which organization is more &#8220;accurate&#8221; at this task is a challenge in itself.<\/p>\n<p><strong>Data Doesn&#8217;t Always Tell the Right Picture<\/strong><\/p>\n<p>The fact is, an analysis is only going to be as good as the data at hand. Collecting and analyzing data opens up a myriad of possible <a href=\"https:\/\/www.statisticshowto.com\/what-is-bias\/\" target=\"_blank\" rel=\"noopener noreferrer\">statistical biases<\/a>, [no term] all of which can completely ruin your analysis. And then&#8211;assuming you have <a href=\"https:\/\/www.statisticshowto.com\/reliability-validity-definitions-examples\/#Rel\" target=\"_blank\" rel=\"noopener noreferrer\">reliable&nbsp;<\/a>[no term] data&#8211;it then becomes a matter of clearly communicating your results to the general public: a matter which as the above example shows, is no easy task.<\/p>\n<\/p>\n<p><strong>References<\/strong><\/p>\n<p>[1]&nbsp;<a href=\"https:\/\/www.medpagetoday.com\/infectiousdisease\/covid19\/89750\" target=\"_blank\" rel=\"noopener noreferrer\">Here&#8217;s Why COVID-19 Mortality Has Dropped<\/a><\/p>\n<p>[2]&nbsp;<a href=\"https:\/\/www.sciencemag.org\/news\/2020\/05\/scientists-are-drowning-covid-19-papers-can-new-tools-keep-them-afloat\" target=\"_blank\" rel=\"noopener noreferrer\">Scientists are drowning in COVID-19 papers. Can new tools keep them afloat?<\/a><\/p>\n<p>[3] <a href=\"https:\/\/www.theguardian.com\/science\/audio\/2020\/mar\/17\/covid-19-why-are-there-different-fatality-rates\" target=\"_self\" rel=\"noopener noreferrer\">Covid-19: why are there different fatality rates? &ndash; Science Weekly Podcast<\/a><\/p>\n<p>[4]&nbsp;<a href=\"https:\/\/www.forbes.com\/sites\/robertpearl\/2020\/09\/22\/3-misleading-dangerous-coronavirus-statistics\/?sh=66ffc85b7169\" target=\"_blank\" rel=\"noopener noreferrer\">Three Misleading, Dangerous Coronavirus Statistics<\/a><\/p>\n<p>[5]&nbsp;<a href=\"https:\/\/hdsr.mitpress.mit.edu\/pub\/9421kmzi\/release\/3\" target=\"_blank\" rel=\"noopener noreferrer\">Estimating the Number of SARS-CoV-2 Infections and the Impact of Mitigation Policies in the United States<\/a><\/p>\n<p>[6]&nbsp;<a href=\"https:\/\/www.acpjournals.org\/doi\/10.7326\/M20-7385\" target=\"_blank\" rel=\"noopener noreferrer\">Impact of Population Growth and Aging on Estimates of Excess U.S. Deaths During the COVID-19 Pandemic, March to August 2020<\/a><\/p>\n<p>[7]&nbsp;<a href=\"https:\/\/www.sciencedirect.com\/science\/article\/pii\/S1201971220321809\" target=\"_blank\" rel=\"noopener noreferrer\">A systematic review and meta-analysis of published research data on COVID-19 infection fatality rates<\/a><\/p>\n<p>[8]&nbsp;<a href=\"https:\/\/www.nature.com\/articles\/d41586-020-00502-w\" target=\"_blank\" rel=\"noopener noreferrer\">COVID research updates: How 90% of French COVID cases evaded detection<\/a><\/p>\n<p>Image: CDC (Public Domain)<\/p>\n<\/div>\n<p><a href=\"https:\/\/www.datasciencecentral.com\/xn\/detail\/6448529:BlogPost:1005494\">Go to Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Author: Stephanie Glen Ten-fold differences have been reported in Covid-19 death rates. Estimate issues are because of how the data is calculated. Why the same [&hellip;] <span class=\"read-more-link\"><a class=\"read-more\" href=\"https:\/\/www.aiproblog.com\/index.php\/2020\/12\/26\/gross-underestimates-and-overestimates-from-the-same-data-covid-19-death-rates-example\/\">Read More<\/a><\/span><\/p>\n","protected":false},"author":1,"featured_media":468,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[26],"tags":[],"_links":{"self":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/4236"}],"collection":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/comments?post=4236"}],"version-history":[{"count":0,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/posts\/4236\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media\/470"}],"wp:attachment":[{"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/media?parent=4236"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/categories?post=4236"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aiproblog.com\/index.php\/wp-json\/wp\/v2\/tags?post=4236"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}