The COVID Cubic

The other night, like a lot of people, I learned about the so-called "cubic model" Trump's White House has been using to model the COVID outbreak. This is just plain bad math. For anyone who is comfortable with a modest amount of mathematics, perhaps Pre-Calculus, I want to share why Trump’s cubic model would get a big red "F" on this paper if they turned it into me.

Modeling Basics

Single Variable Models

The key to building a mathematical or statistical model is to include variables that represent the driving phenomenon. For instance, most econometric and epidemiological models include a variable for time because things are expected to change over time. Think flu cases. It changes over time. If we call tulip population  (for "number of cases"), then our model would be at least a function f of time t and we would write:

math-blog_The_COVID_Cubic_ipynb_at_master_·_VerdantAI_math-blog_·_GitHub.png

This model assumes that cold cases vary only in time. Put a pin in that thought if you will.

Multi-Variable Models

Suppose we wanted to make this a global model people could use anywhere on earth. Then season would be included in but we'd still need to know if the user is in the northern or southern hemisphere. If we add a variable l for location, then N would be a function of both t and l - we'd write:

math-blog_The_COVID_Cubic_ipynb_at_master_·_VerdantAI_math-blog_·_GitHub.png

Now we have a model for that varies by both time, maybe just season, and by location. The question is: How does it vary by time? When we ask "how does something vary by something else?" we are asking a question about the dynamics of the system. When we talk about a function changing in time, the dynamics, we are talking about the derivative of the function N and we use the notation:

math-blog_The_COVID_Cubic_ipynb_at_master_·_VerdantAI_math-blog_·_GitHub.png

There are a few other ways to write this. We use d N when the function only includes time t and if there are other variables included. The δ is a Greek delta but mathematicians usually just say "partial N".

So, dear reader, ask yourself:

  • Does the number of flu cases tomorrow depend on the number of flu cases today?

  • Will I get a different number tomorrow cases tomorrow if I live in New York or Montana?

  • When building my model, do I need to consider how many people might already be infected?

Of course it will.

It is at the very core of the f (n) problem. (Pronouce f (n) as "eff-n" and you'll see I'm politely swearing. When shouting, I use F (N). Anyone with a reasonable understanding of mathematical modeling will see that N (today) changes by N (yesterday). Since d N is the change, we get a differential equation of the form

math-blog_The_COVID_Cubic_ipynb_at_master_·_VerdantAI_math-blog_·_GitHub.png

where c is a number we use for scaling. If you followed the Wikipedia link, they sometimes use N ‘  or y ‘ for d N.

Unpin the single variable models and the Council of Economic Advisers (CEA) has already failed the class. But it gets worse.

Hold this pin.

Polynomial Modeling

In the parlance of mathematics, cubic means a function where the highest power of t is 3. E.g. f ( r ) = t ³ - 1. These graphs include the negative as they represent things that are presumed to decrease over time. I have

  • Linear: f ( t ) = t - 5

  • Quadratic: f ( t ) = - (t - 5) ²

  • Cubic: f ( t ) = - (t - 5) ³

  • Quartic: f ( t ) = - (t - 5) ⁴

for you to enjoy. Ignore the 5 and think about the shapes.

In [10]:

math-blog_The_COVID_Cubic_ipynb_at_master_·_VerdantAI_math-blog_·_GitHub.png

See that third one there?

It's a cubic polynomial model. And it sucks in a million ways for epidemics.

  • It starts up and to the left, one day with 150 cases out of nowhere.

  • It assumes exactly one change in dynamics where it flattens out. Take a look at the actuals and tell me what you think about that.

  • It doesn't consider the current number of cases, ONLY TIME. As if a virus keeps only a watch and has no idea how many people are infected.

  • It's an easy Excel plugin, so "modelers" can appeal to authority and say it must be right because Microsoft gave it to them. Ignorance is no excuse.

Cubic Model Variants

So the reader could follow it, our cubic model is boring. But here is an idea of how else a negative cubic can look.

In [18]:

covid_cubic_verdant.ai_brian_dolan


If you want to know more about how to properly use cubic models, honestly, just ask me. I love to talk about math. But I will say that polynomial models in general are okay JUST NOT HERE.

People Are Dying, Do it F (N) RIGHT

The proper, empirically valid way of modeling epidemics is using differential equations. Typically people use SIR Models. If you aren't familiar with these methods, you should not be doing epidemiological models for public policy. Period. End. Full stop.

How It's Supposed to Look

The number of infected people is usually denoted as I ( t ) in S-I-R models. I want to show you how a proper model should look, but it's too much to go over a full course on differential equations here. But damn, they are fun.

Here are some key properties to the generally expected progress of I ( t ) to people who study it professionally.

  • I ( 0 ) means the number of people infected at the outset. E.g. when t = 0. This has to be at least 1, otherwise there is no transmission.

  • The initial growth looks like a quadratic function. That is there is a small curve up from t = 0. It does not start high, it doesn't shoot straight up.

  • After the peak, the decay is MUCH SLOWER than the growth.

  • It has at least two inflection points (where it kinda bends from going up fast to going up slowly)

  • It was constructed by someone who understands epidemiology.

  • When t → ⏦  (time goes on), I ( t ) goes to 0.

When you read this, understand that the cubic model is a complete piece of crap in this context. It comes down from infinity, changes once, then goes to negative infinity, as if you can have "negative infinity" infections.

In [62]:

math_verdant.AI_BrianDolan

Wrap Up

Listen to experts. If you haven't wasted your life studying mathematics, ask someone who has. There are a lot of us and we like to talk about it. Don't believe everything you hear, even if it comes from a fancy title.


Who Am I?

I'm Brian Dolan, a mathematician and cyberneticist that cares about math, artificial intelligence and good analytics practices. I have my biases and preferences, of course. One of those is to rely on careful analysis rather than gut reactions or political diatribes. Get in touch with me at Verdant AI if you want to talk about solving complicated problems with sophisticated methods. Or if you just love the Irish banjo or ice hockey.

Be healthy, take care of each other and try to be nice.