HHS makes it easier to compare hospitalizations by age

Since mid-December, the Department of Health and Human Services has published a dataset on how the pandemic is impacting individual hospitals across the country. (You can read the CDD’s detailed description of that dataset here.) One of the most useful—and, in my opinion, most under-utilized—aspects of this facility dataset is that it provides COVID-19 hospital admissions broken out by age, allowing data users to discern which age groups are getting hardest hit by severe COVID-19 cases in different parts of the country.

This week, the HHS made it much easier to do that analysis. The agency added hospital admissions by age to its state-level hospitalization dataset. Now, if you want to see a patient breakdown for your state, you can simply look at the state-level info already compiled by HHS data experts, rather than summing up numbers from the facility-level info yourself.

Besides that convenience factor, there are two big advantages of the state-level info:

  • The state-level dataset is updated daily, while the facility-level dataset is updated weekly. More frequent updates allow for finer-grained time series analysis.
  • Low patient numbers aren’t suppressed. In the facility-level dataset, patient numbers between 1 and 4 are suppressed with an error value (-999999) to protect patient privacy. In the age data, this happens at a lot of facilities, so it’s impossible for an outside data user to calculate accurate totals for a given city, county, or state. On the other hand, with HHS experts doing the aggregation in the state-level dataset, no values need to be obscured—basically, these state-level figures are much more accurate.
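To make the suppression problem concrete, here is a minimal Python sketch. The facility counts are made up for illustration, and the function and variable names are my own assumptions, not anything defined by HHS. The only details taken from the dataset are the placeholder value (-999999) and the fact that it stands in for a count between 1 and 4.

```python
# Placeholder the facility-level dataset uses for counts between 1 and 4
SUPPRESSED = -999999


def state_total_bounds(facility_counts):
    """Return (low, high) bounds on the true total for a list of facility counts.

    Each suppressed value hides a real count of 1 to 4, so it adds 1 to the
    lower bound and 4 to the upper bound. Unsuppressed counts add as-is.
    """
    low = high = 0
    for count in facility_counts:
        if count == SUPPRESSED:
            low += 1
            high += 4
        else:
            low += count
            high += count
    return low, high


# Hypothetical weekly admissions for one age group at five facilities
example_counts = [12, SUPPRESSED, 0, SUPPRESSED, 7]

# Summing the raw column treats the placeholder as a real number
naive_sum = sum(example_counts)
low, high = state_total_bounds(example_counts)

print(naive_sum)  # -1999979
print(low, high)  # 21 27
```

With only two suppressed facilities, the best an outside user can say is that the true total falls somewhere between 21 and 27. The gap widens by three for every additional suppressed value, which is why the pre-aggregated state-level figures are the better source for totals.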

The age groups in the state-level dataset match those available in the facility-level dataset: pediatric COVID-19 patients, patients age 18-19, patients in ten-year age ranges from 20 to 79, and patients age 80 or older. HHS also splits the patient counts into those who have confirmed COVID-19 cases (meaning their diagnosis is verified by a PCR test) and those who have suspected cases (meaning the patients have COVID-19 symptoms or a positive result on a non-PCR test).

You can find these new data in two places:

Also, Conor Kelly, COVID Tracking Project volunteer and COVID-19 visualizer extraordinaire, has added these new data to his COVID-19 Tableau dashboard. (See “Hosp. Admissions Over Time,” then “Admissions by Age.”) I highly recommend checking out that dashboard and exploring the trends for your state.

(Finally, it is possible I’m a little annoyed that the HHS made this lovely update immediately after I turned in an assignment in which I did this analysis the long way, with the facility-level dataset. Look out for that story early next week.)
