Site icon Real-Time Statistical Model for COVID-19 – Jack Syage

8. A Simple Model for Forecasting Final Fatalities

(4/3/20) I believe models should be as simple as possible and rely as much as possible on hard data, e.g., deaths

Always check my post called “Daily Rumblings” for late breaking updates.

The Gaussian Model introduced in my last blog (#7) can be extended to forecast the number of fatalities that will occur as the epidemic in a particular population reaches recovery. If one is monitoring the death rate per unit time (days, weeks, etc.) then one can match the shape of that curve to a Gaussian growth and recovery curve and determine how far up or down the curve the actual data lies. Based on the number of fatalities that has occurred on the latest date, one can extrapolate how many more deaths will occur after traversing the entire curve to recovery. The Figure below shows how this works.

Gaussian fatality model using the observed death rate data for Italy. The horizontal axis numbers represent week for convenience, but in fact this model is not dependent on time.

This model assumes that the rate of deaths (and case prevalence also, if one could only measure that well) will follow a rise and then a fall. A Gaussian model works well because it begins to rise exponentially then becomes relatively linear before rolling over and peaking. The recovery is then assumed to following a similar trend in reverse as shown in the bottom plot above. The death rate data for China (Post #7) bears this out. Now the total number of deaths up to a particular point on the Gaussian rate curve is obtained by integrating all the deaths to that point and is shown by the middle plot above. Now if one knows where on the Gaussian rate curve a particular population lies, then the final death count can be extrapolated from the current death count. The factors that convert current deaths to final deaths are shown in the top plot above and the Table on the right.

We show by example the case for Italy. The death rate in Italy has been rising exponentially, but is beginning to show a perceptible slowing from pure exponential growth (pseudo-linear region). These daily death counts are overlaid on the Gaussian rate curve as best as we can visualize. There is a large uncertainty particularly in the near-linear region of the Gaussian such that we could easily place the Italy data such that the last date overlays with week 4.5 rather than week 5.5 as shown. We therefore define this as the uncertainty boundaries for extrapolating to final fatality forecasts. The dotted lines represent these two limits and by tracing up to the multiplicative factor on the top plot we can calculate a final fatality based on the current total deaths.

The results of this model for our highlighted countries and U.S. states is tabulated in the Table below for observed death rate and total deaths as of 04/01/2020 (See Post #7 for these results).

For a particular population, the lower the number of the week on the curve the further from recovery is that population and the greater is the fatality factor relative to the current total deaths. The following observations can be made:

There have been a number of reports of projected deaths in the news, some outlandish as they do not assume any social isolation reductions and many that include a host of variables. Our U.S. administration is now projecting 100,000 to 240,000 total deaths, which fits between our uncertainty limits. The University of Washington updates their projections nearly daily and currently forecasts the following (https://covid19.healthdata.org/projections):

There are several caveats and assumptions to this model:

The utility of this model is that it is based solely on hard data, namely deaths and doesn’t rely on less certain variables. As each country and state moves up and over the curve, we will be able to refine the final fatality projections and reduce the uncertainty.

Exit mobile version