Tag: CMS

  • Source shout-out: Centers for Medicare & Medicaid Services

    Source shout-out: Centers for Medicare & Medicaid Services

    CMS now allows users to search directly for COVID-19 data on specific nursing homes. Screenshot taken on August 15.

    A recent issue of Al Tompkins’ newsletter led me to check out the Centers for Medicare & Medicaid Services (CMS) COVID-19 data page for the first time in a few months. And it has gotten a serious renovation during that time.

    Among many other responsibilities, CMS oversees the nation’s nursing homes—providing funding, monitoring quality standards, and more. As a result, CMS has tracked COVID-19 cases and deaths in nursing homes throughout the pandemic, along with tracking how well nursing homes adhere to standards such as PPE use and regular testing.

    The agency’s public data COVID-19 in nursing homes used to be displayed as a very basic page with links out to spreadsheet downloads. In fact, back in September 2020, I called out the website for its poor data accessibility, explaining that the site was set up in a manner challenging for screenreaders.

    Now, the site includes a customized data explorer, easy-to-read overarching statistics, and an API that developers can use to readily access nursing home data. Perhaps most useful for the average reader (and for local journalists) is a data visualization section. Users can search an interactive map for nursing homes in their area, then view those facilities’ COVID-19 stats. The section also includes maps displaying nursing home COVID-19 cases, deaths, and vaccinations by state.

    In short, CMS’s COVID-19 data are now both easier to access (from the journalist’s or researcher’s perspective) and easier to explore right on the agency’s website. Thank you, CMS developers!

  • Featured sources, Jan. 17

    These sources, along with all sources from past issues, are listed here.

    • 6-month consequences of COVID-19 in patients discharged from hospital (Huang et al., The Lancet): I don’t usually feature scientific papers here, but this new study is important. It’s the biggest paper so far on COVID-19 long haulers, those patients who struggle with the disease for months after their diagnosis (or after not getting a diagnosis at all). This study followed about 1,700 patients over 6 months.
    • Global.health COVID-19 dataset: Global.health describes itself as a “global data repository and visualization platform that enables open access to real-time epidemiological anonymized line list data.” Its COVID-19 dataset—which promises information on 5 million anonymized cases—is not yet published, but is definitely a source to look out for.
    • COVID-19 survey of Medicare beneficiaries: This week, the Centers for Medicare & Medicaid Services published results of a survey of Medicare beneficiaries, focused on their experiences with COVID-19. The data include American seniors’ perceptions of vaccines, perceptions of COVID-19 safety, care experiences, and more.

  • Federal data updates, Dec. 13

    Rounding out the week with a couple of updates on federal data, unrelated to hospitalizations and vaccines.

    • New app for testing data: The Centers for Disease Control & Prevention (CDC) have developed an app called SimpleReport, which allows COVID-19 test providers to quickly report data to their local public health departments. An assisted living center in Tucson, Arizona was the first to pilot the app this week. The center’s Community Director said this app helped her quickly file data that would otherwise need to be entered in three different places.
    • CMS proposes that providers build standard databases: This past Thursday, the Centers for Medicare & Medicaid Services (CMS) announced a new rule to streamline data sharing between the agency and individual healthcare providers. Under this rule, providers would need to build application programming interfaces, or APIs. APIs are essentially data-sharing systems that provide a standardized format for information. Such standardization, CMS claims, would make it easier for patients to get medical treatments and prescriptions authorized by Medicaid.
    • Bill to make federal court filings free passes the House: PACER, or Public Access to Court Electronic Records, is an antiquated federal database of court filings which journalists and other researchers must pay to use. It costs 10 cents a page to access court dockets and other documents through PACER—and since court documents can get long, that cost adds up. The Open Courts Act, a bill which would make PACER free to the public, passed in the House of Representatives this past week. It now heads to the Senate. This bill may not be directly COVID-related right now, but I anticipate that journalists will be covering COVID-19 lawsuits for years after the pandemic ends.
  • CMS data and reporting updates

    CMS data and reporting updates

    The county-level testing dataset published by CMS has become a regular topic for this newsletter since it was released in early September. As a refresher for newer readers: CMS publishes both total PCR tests and test positivity rates for every county in the country; the dataset is intended as a resource for nursing home administrators, who are required to test their residents and staff at regular intervals based on the status of their county.

    This past Monday, October 5, I was pleasantly surprised to find a new update posted on CMS’ COVID-19 data page. I say “surprised” because I had been led to believe, both by past dataset updates and by reporting when the dataset was first published, that this source would be updated once every two weeks. And yet, here was a new update, with only one week’s delay (the last update before this was on Monday, September 28). CMS is also now posting weekly updates on an Archive page which goes back to August 19; some of these updates are older, while others were posted or edited in the past week.

    I always appreciate more frequent data, even when the data provider in question is not particularly transparent about their update strategy. Frequent updates are particularly useful for testing data; the nursing home administrators monitoring testing in their counties will be able to see information that better reflects the level of COVID-19 risk around them.

    I’ve updated my Tableau dashboard which visualizes these county-level data:

    As you can see, the majority of the Northeast and much of the West Coast continues to be in the green (positivity rates under 5%), while areas in the South and Midwest are not faring so well. Twelve counties have extremely high positivity rates (over 30%), eleven of which are in Midwestern states. This table allows you to rank and sort the test positivity rates by state.

    Also, a note on my methodology for this dashboard: in earlier iterations, I used state-level data from the COVID Tracking Project to calculate state test positivity rates for the same time period as the CMS has provided county-level rates. I then compared the county-level rates against state-level rates; this was the source of the “[x]% above state positivity rate” tooltips on the dashboard. After reading a new COVID Tracking Project blog post about the challenges of calculating and standardizing positivity rates, however, I realized that combining positivity rates from two different sources might misrepresent the COVID-19 status in those counties. So, I switched my method: the county-to-state comparisons are now based on averages of all the CMS-reported county-level positivity rates in each state.

    Finally, out of curiosity (and to practice my Tableau skills), I compared the CMS-reported test positivity rates for the five counties of New York City to the city-level rate reported by the NYC Department of Health.

    The positivity rates reported by the two sources follow the same general direction, but it’s interesting to see how the rates diverge when the five counties split up. Manhattan remaining far below 1% while Brooklyn surges up to 2%? Not surprising.

    Meanwhile, CMS is cracking down on COVID-19 reporting from hospitals: NPR reported this week that hospitals which fail to report complete, daily data to HHS can lose money from Medicare and Medicaid, starting this coming January.

  • Another update to county-level testing data

    Another update to county-level testing data

    This past Monday, September 28, the Centers for Medicare & Medicaid Services (CMS) updated the county-level testing dataset which the agency is publishing as a resource for nursing home administrators.

    I’ve discussed this dataset in detail in two past issues: after it was published in early September, and when it was first updated two weeks ago. The most recent update includes data from September 10 to September 23; CMS is continuing to analyze two weeks’ worth of testing data at a time, in order to improve the stability of these values. And this update came on a Monday, rather than a Thursday, decreasing the data lag from one week to five days.

    CMS press release from this past Tuesday describes one update to how CMS assigns test positivity categories, which nursing home administrators look at to determine how often they are required to test their patients and staff:

    Counties with 20 or fewer tests over 14 days will now move to “green” in the color-coded system of assessing COVID-19 community prevalence. Counties with both fewer than 500 tests and fewer than 2,000 tests per 100,000 residents, and greater than 10 percent positivity over 14 days – which would have been “red” under the previous methodology – will move to “yellow.”

    This change intends to address the concerns of rural states which are not doing much testing due to their small populations.

    I’ve updated my Tableau visualization with the most recent county data. The majority of the Northeast continues to be in the green, while areas in the South and Midwest pose higher concerns.

  • County-level test data gets an update

    County-level test data gets an update

    I spent the bulk of last week’s issue unpacking a new testing dataset released by the Centers for Medicare & Medicaid Services which provides test positivity rates for U.S. counties. At that point, I had some unanswered questions, such as “When will the dataset next be updated?” and “Why didn’t CMS publicize these data?”

    The dataset was updated this past week—on Thursday, September 17, to be precise. So far, it appears that CMS is operating on a two-week update schedule (the dataset was first published on Thursday, September 3). The data themselves, however, lag this update by a week: the spreadsheet’s documentation states that these data are as of September 9.

    CMS has also changed their methodology since the dataset’s first publication. Rather than publishing 7-day average positivity rates for each county, the dataset now presents 14-day average positivity rates. I assume that the 14 days in question are August 27 through September 9, though this is not clearly stated in the documentation.

    This choice was reportedly made “in order to use a greater amount of data to calculate percent test positivity and improve the stability of values.” But does it come at the cost of more up-to-date data? If CMS’s future updates continue to include one-week-old data, this practice would be antithetical to the actual purpose of the dataset: letting nursing home administrators know what the current testing situation is in their county so that they can plan testing at their facility accordingly.

    Additional documentation and methodology updates include:

    • The dataset now includes raw testing totals for each county (aggregated over 14 days) and 14-day test rates per 100,000 population. Still, without total positive tests for the same time period, it is impossible to replicate the CMS’s positivity calculations.
    • As these data now reflect a 14-day period, counties with under 20 tests in the past 14 days are now classified as Green and do not have reported positivity rates.
    • Counties with low testing volume, but high positivity rates (over 10%), are now sometimes reassigned to Yellow or Green tiers based on “additional criteria.” CMS does not specify what these “additional criteria” may be.

    I’ve made updated versions of my county-level testing Tableau visualizations, including the new total test numbers:

    This chart is color-coded according to CMS’s test positivity classifications. As you can see, New England is entirely in the green, while parts of the South, Midwest, and West Coast are spottier.

    Finally: CMS has a long way to go on data accessibility. A friend who works as a web developer responded to last week’s newsletter explaining how unspecific hyperlinks can make life harder for blind users and other people who use screenreaders. Screenreaders can be set to read all the links on a page as a list, rather than reading them in-text, to give users an idea of their navigation options. But when all the links are attached to the same text, users won’t know what their options are. The CMS page that links to this test positivity dataset is a major offender: I counted seven links that are simply attached to the word “here.”

    This practice is challenging for sighted users as well—imagine skimming through a page, looking for links, and having to read the same paragraph four times because you see the words “click here” over and over. (This is my experience every time I check for updates to the test positivity dataset.)

    “This is literally a test item in our editor training, that’s how important it is,” my friend said. “And yet people still get it wrong. ALL THE TIME.”

    One would think an agency dedicated to Medicare and Medicaid services would be better at web accessibility. And yet.

  • County-level testing data from an unexpected source

    County-level testing data from an unexpected source

    On September 3, 2020, the Center for Medicare & Medicaid Services (CMS) posted a county-level testing dataset. The dataset specifically provides test positivity rates for every U.S. county, for the week of August 27 to September 2.

    This is huge. It’s, like, I had to lie down after I saw it, huge. No federal health agency has posted county-level testing data since the pandemic started. Before September 3, if a journalist wanted to analyze testing data at any level more local than states, they would need to aggregate values from state and county public health departments and standardize them as best they could. The New York Times did just that for a dashboard on school reopening, as I discussed in a previous issue, but even the NYT’s data team was not able to find county-level values in some states. Now, with this new release, researchers and reporters can easily compare rates across the county and identify hotspot areas which need more testing support.

    So Betsy, you might ask, why are you reporting on this new dataset now? It’s been over a week since the county-level data were published. Well, as is common with federal COVID-19 data releases, this dataset was so poorly publicized that almost nobody noticed it.

    It didn’t merit a press release from CMS or the Department of Health and Human Services (HHS), and doesn’t even have its own data page: the dataset is posted towards the middle of this CMS page on COVID-19 in nursing homes:

    Highlighting mine.

    The dataset’s release was, instead, brought to my attention thanks to a tweet by investigative reporter Liz Essley Whyte of the Center for Public Integrity:

    In today’s issue, I’ll share my analysis of these data and answer, to the best of my ability, a couple of the questions that have come up about the dataset for me and my colleagues in the past few days.

    Analyzing the data

    Last week, I put together two Stacker stories based on these data. The first includes two county-level Tableau visualizations; these dashboards allow you to scroll into the region or state of your choice and see county test positivity rates, how those county rates compare to overall state positivity rates (calculated based on COVID Tracking Project data for the same time period, August 27 to September 2), and recent case and death counts in each county, sourced from the New York Times’ COVID-19 data repository. You can also explore the dashboards directly here.

    The second story takes a more traditional Stacker format: it organizes county test positivity rates by state, providing information on the five counties with the highest positivity rates in each. The story also includes overall state testing, case, and outcomes data from the COVId Tracking Project.

    As a reminder, a test positivity rate refers to the percent of COVID-19 tests for a given population which have returned a positive result over a specific period of time. Here’s how I explained the metric for Stacker:

    These positivity rates are typically reported for a short period of time, either one day or one week, and are used to reflect a region’s testing capacity over time. If a region has a higher positivity rate, that likely means either many people there have COVID-19, the region does not have enough testing available to accurately measure its outbreak, or both. If a region has a lower positivity rate, on the other hand, that likely means a large share of the population has access to testing, and the region is diagnosing a more accurate share of its infected residents.

    Test positivity rates are often used as a key indicator of how well a particular region is controlling its COVID-19 outbreak. The World Health Organization (WHO) recommends a test positivity rate of 5% or lower. This figure, and a more lenient benchmark of 10%, have been adopted by school districts looking to reopen and states looking to restrict out-of-state visitors as a key threshold that must be met.

    Which counties are faring the worst, according to this benchmark? Let’s take a look:

    This screenshot includes the 33 U.S. counties with the highest positivity rates. I picked the top 33 to highlight here because their rates are over 30%—six times the WHO’s recommended rate. The overall average positivity rate across the U.S. is 7.7%, but some of these extremely high-rate counties are likely driving up that average. Note that two counties, one in South Dakota and one in Virginia, have positivity rates of almost 90%.

    Overall, 1,259 counties are in what CMS refers to as the “Green” zone: their positivity rates are under 5%, or they have conducted fewer than 10 tests in the seven-day period represented by this dataset. 874 counties are in the “Yellow” zone, with positivity rates between 5% and 10%. 991 counties are in the “Red” zone, with positivity rates over 10%. South Carolina, Alabama, and Missouri have the highest shares of counties in the red, with 93.5%, 61.2%, and 50.4%, respectively:

    Meanwhile, eight states and the District of Columbia, largely in the northeast, have all of their counties in the green:

    My Tableau visualizations of these data also include an interactive table, which you can use to examine the values for a particular state. The dashboards are set up so that any viewers can easily download the underlying data, and I am, as always, happy to share my cleaned dataset and/or answer questions from any reporters who would like to use these data in their own stories. The visualizations and methodology are also open for syndication through Stacker’s RSS feed—I can share more details on this if anyone is interested.

    Answering questions about the data

    Why is the CMS publishing this dataset? Why not the CDC or HHS overall?

    These test positivity rates were published as a reference for nursing home administrators, who are required to test their staff regularly based on the prevalence of COVID-19 in a facility’s area. A new guidance for nursing homes dated August 26 explains the minimum testing requirement: nursing homes in green counties must test all staff at least once a month, those in yellow counties must test at least once a week, and those in red counties must test at least twice a week.

    It is important to note that facilities are only required to test staff, not residents. In fact, the guidance states that “routine testing of asymptomatic residents is not recommended,” though administrators may consider testing those residents who leave their facilities often.

    Where did the data come from?

    The CMS website does not clearly state a source for these data. Digging into the downloadable spreadsheet itself, however, reveals that the testing source is a “unified testing data set,” which is clarified in the sheet’s Documentation field as data reported by both state health departments and HHS:

    COVID-19 Electronic Lab Reporting (CELR) state health department-reported data are used to describe county-level viral COVID-19 laboratory test (RT-PCR) result totals when information is available on patients’ county of residence or healthcare providers’ practice location. HHS Protect laboratory data (provided directly to Federal Government from public health labs, hospital labs, and commercial labs) are used otherwise.

    What are the units?

    As I discussed at length in last week’s newsletter, no testing data can be appropriately contextualized without knowing the underlying test type and units. This dataset reports positivity rates for PCR tests, in units of specimens (or, as the documentation calls them, “tests performed.”) HHS’s public PCR testing dataset similarly reports in units of specimens.

    How are tests assigned to a county?

    As is typical for federal datasets, not every field is exactly what it claims to be. The dataset’s documentation elaborates that test results may be assigned to the county where a. a patient lives, b. the patient’s healthcare provider facility is located, c. the provider that ordered the test is located, or d. the lab that performed the test is located. Most likely, the patient’s address is used preferentially, with these other options used in absence of such information. But the disparate possiblities lead me to recommend proceeding with caution in using this dataset for geographical comparisons—I would expect the positivity rates reported here to differ from the county-level positivity rates reported by a state or county health department, which might have a different documentation procedure.

    How often will this dataset be updated?

    Neither the CMS page nor the dataset’s documentation itself indicate an update schedule. A report from the American Health Care Association suggests that the file will be updated on the first and third Mondays of each month—so, maybe it will be updated on the 21st, or maybe it will be updated tomorrow. Or maybe it won’t be updated until October. I will simply have to keep checking the spreadsheet and see what happens.

    Why won’t the dataset be updated every week, when nursing homes in yellow- and red-level counties are expected to test their staff at least once a week? Why is more public information about an update schedule not readily available? These are important questions which I cannot yet answer.

    Why wasn’t this dataset publicized?

    I really wish I could concretely answer this one. I tried submitting press requests and calling the CMS’ press line this past week; their mailbox, when I called on Friday, was full.

    But here’s my best guess: this dataset is intended as a tool for nursing home facilities. In that intention, it serves a very practical purpose, letting administrators know how often they should test their staff. If CMS or HHS put out a major press release, and if an article was published in POLITICO or the Wall Street Journal, the public scrutiny and politically-driven conspiracy theorists which hounded HHS during the hospitalization data switch would return in full force. Nursing home administrators and staff have more pressing issues to worry about than becoming part of a national political story—namely, testing all of their staff and residents for the novel coronavirus.

    Still, even for the sake of nursing homes, more information about this dataset is necessary to hold accountable both facilities and the federal agency that oversees them. How were nursing home administrators, the intended users of this dataset, notified of its existence? Will the CMS put out further notices to facilities when the data are updated? Is the CMS or HHS standing by to answer questions from nursing home staff about how to interpret testing data and set up a plan for regular screening tests?

    For full accountability, it is important for journalists like myself to be able to access not only data, but also the methods and processes around its collection and use.

  • New reporting requirements for nursing homes, hospitals, labs

    This week’s Department of Health and Human Services (HHS) update is a little indirect, in that it’s actually a Centers for Medicare & Medicaid Services (CMS) update.

    Last Tuesday, CMS announced that nursing homes are required to test their staff and offer to test their residents when a COVID-19 outbreak occurs. Facilities that don’t test adequately can be fined. These new requirements follow the recent large-scale distribution of antigen tests to nursing homes, a move I’ve discussed in previous issues. It is still unclear, however, how exactly the results of all this testing will be reported. While the CMS COVID-19 dataset includes some new fields on testing this week, these are primarily yes/no fields, such as “Tested Asymptomatic Staff and/or Personnel Facility-Wide After a New Case.” Actual test counts are not yet reported.

    The new CMS directive is also notable in that it includes new reporting directives for hospitals and COVID-19 labs. Hospitals were already required to report their COVID-19 patients, PPE needs, and other metrics to HHS, but now they are extra required to report them:

    In March, Vice President Mike Pence sent a letter to all hospitals requesting that they provide the results of COVID-19 tests performed in their in-house laboratories to help better understand and track disease patterns. CMS’ new rules require such reporting of test results in order to ensure a more complete picture in the nationwide surveillance of COVID-19, as well as a more efficient allocation of PPE and other vital supplies. Hospitals will face possible termination of Medicare and Medicaid payment if unable to correct reporting deficiencies.

    Outside laboratories conducting COVID-19 tests can similarly face monetary penalties if they fail to report to HHS: CMS will impose a fine of $1,000 a day for the first day a lab fails to report and $500 for each subsequent day. I and other folks at the COVID Tracking Project will be watching carefully to see how this directive impacts the completeness and accuracy of HHS data.