Population-Scale Patient Safety Data Reveal Inequalities in Adverse Events Before and During COVID-19 Pandemic

Adverse events from medications accounted for over 110,000 deaths in the US alone in 2019. The impact of the COVID-19 pandemic on patient safety and how it exacerbated pre-existing inequalities across diverse patient cohorts holds urgent clinical questions. However, intricate dependencies between the pandemic’s effects, safety profiles of drugs, and patient characteristics pose challenges to extracting clinically actionable insights.

An algorithmic approach to investigate the impact of COVID-19 on adverse drug safety and identify at risk cohorts is missing from the literature. In this project, we develop a model to reveal impacts of the pandemic on drug safety and identify at risk demographics from 10,443,476 adverse event reports spanning 19,193 adverse events and 3,624 drugs collected by the FDA Adverse Event Reporting System (FAERS).

Urgent need for algorithms aimed at improving safe medication use

Adverse events from medications accounted for over 110,000 deaths in the U.S. alone in 2019. The pandemic has further challenged healthcare systems’ ability to ensure safe medication use. Despite these urgent implications, the pandemic’s effects on adverse drugs effects remain unknown. Further, intricate dependencies between the pandemic’s effects, drugs, and patient characteristics present unique challenges for understanding patient safety during a public health emergency:

a) Algorithmic approaches are needed to unveil how the patient safety landscape changed with the pandemic onset. Such studies would reveal what inequalities in patient populations are exacerbated more than expected had the pandemic not occurred.

b) Algorithmic approaches are needed to compare patient safety to its pre-pandemic levels across patient groups and the entire range of human diseases and approved drugs.

Addressing this challenge can (i) inform drug prescription, (ii) improve patient safety by identifying individuals at high risk for adverse events and those who are disproportionally affected by preventable inequities, and (iii) enable comparison of COVID-19 pandemic to other health emergencies to unveil the disruptive nature of public health crises and inform health policy.

Novel algorithmic approach to drug safety

This study addresses the above challenge, representing the first study to unveil the disruptive nature of a public health crisis on patient safety. Using the largest dataset so far, consisting of 10,443,476 adverse drug event reports spanning 7 years (Jan. 2013-Sept 2020) and involving 3,624 drugs and 19,193 adverse events, we develop an algorithmic approach to investigate negative outcomes associated with medication use and how they changed during the pandemic.

In contrast to previous methods focused on small subsets of the adverse event landscape, our model investigates the entire range of human diseases and approved drugs. Our approach contains three key components: identifying adverse events whose incidence has significantly changed after the pandemic, removing temporal confounding factors in reporting trajectories, and pinpointing adverse events with considerable associations to medications.

Advantages of our algorithmic approach

1) Detect subtle patterns of drug safety: By correcting for confounding factors like temporal reporting trajectories, our model can detect impacts of the pandemic even in rare adverse events.

2) Generalizable: Our three step approach can flexibly identify differential reporting patterns in patient cohorts formed as a function of gender, age, adverse events, and drug.

3) New resource of adverse events and drug-event associations: This resource can be readily applied for use in pharmacoepidemiology and public health policy to inform medication use in diverse populations.

Key results and findings

Our algorithmic effort leads to several key findings:

  • We find substantial variation in adverse drug events before and during the pandemic. Among 64 adverse events identified by our analyses, we find 54 have increased incidence rates during the pandemic, even though adverse event reporting decreased by 4.4% overall.

  • We find that pre-pandemic gender differences are exaggerated during the pandemic. Women suffer from more drug adverse events than men relative to pre-pandemic levels, across all age cohorts. Comparing to male patients, women report 47.0% more adverse events whose occurrence significantly increased during the pandemic relative to pre-pandemic levels. Out of 53 adverse events with the pre-pandemic gender gap, 33 have increased gap during the pandemic more than expected had the pandemic not occurred.

  • We also find relevant clinical differences in adverse drug events outcomes across age groups. For example, acute kidney injury has seen a surprising increase in adult but not in elderly patients, suggesting an age-related disparity in the pandemic’s impact on drug-related kidney injury. In contrast, mental health-related events (hallucination, delusion, aggression, abnormal behavior, and dementia) have disproportionately increased in women and the elderly, indicating they constitute at risk patient cohorts. In contrast to narrowly focused prior studies, our work can unmix a population to identify patients at higher risk for adverse events during the pandemic than in the pre-pandemic time.

  • The algorithmic approach also identifies how the impact of adverse drug reactions on human organs changed during the pandemic. For example, while musculoskeletal and metabolic side effects are disproportionately found in women during the pandemic, immune- related adverse events are enriched only in men.

  • Detection of rare adverse events for Remdesivir. We detect novel rare adverse events such as hypoxia for Remdesivir, highlighting the role for algorithmic models for medications granted emergency approval.

  • Finally, we present a new resource of adverse drug effects and drug-event associations for use in pharmacoepidemiology and public health policy to inform safe medication use.

Broad impact

Our findings have implications for safe medication use and highlight the role of variation in adverse events for improving patient safety during a public health emergency.

a) Our algorithmic approach can identify differential reporting patterns in patient cohorts formed as a function of gender, age, adverse events, and drugs. With additional information on medical and non-medical characteristics, the approach is suitable for systematic safety surveillance to pinpoint individuals at high risk for safety events based on risk-altering interactions.

b) We expect this algorithmic approach to enable comparison of the COVID-19 pandemic to other health emergencies (like the nationwide opioid crisis in the U.S. and emergencies resulting hurricanes and wildfires) to unveil the disruptive nature of public health crises on patient safety.

c) Finally, our research can inform safe medication use by identifying populations at high risk for adverse events and proactive vigilance of vaccine programs in large and diverse populations.


Population-scale patient safety data reveal inequalities in adverse events before and during COVID-19 pandemic
Xiang Zhang, Marissa Sumathipala, Marinka Zitnik
In review 2021 [medRxiv]


Python implementation of the methodology developed and used in this project is available via GitHub repository.


All data used in the paper, including the raw and processed adverse event report dataset, adverse event ontology, drug ontology, the final and intermediate results of the analyses are shared with research community via Harvard Dataverse repository. The dataset is uniquely identified as https://doi.org/10.7910/DVN/G9SHDA.

Direct access to pre-processed datasets

Direct access to results


Latest News

Jul 2021:   Best Paper Award at ICML Interpretable ML for Healthcare

Our short paper on Interactive Visual Explanations for Deep Drug Repurposing received the Best Paper Award at ICML 2021 Interpretable ML in Healthcare Workshop. Stay tuned for more news on this evolving project.

Jul 2021:   Five presentations at ICML 2021

Jun 2021:   Theory and Evaluation for Explanations

We introduce the first axiomatic framework for theoretically analyzing, evaluating, and comparing GNN explanation methods. We formalize key properties that all methods should satisfy to generate reliable explanations: faithfulness, stability, and fairness.

Jun 2021:   Deep Contextual Learners for Protein Networks

New preprint on contextualized protein embeddings aims to characterize genes with disease-specific interactions and elucidate disease manifestation in specific cell types.

May 2021:   New Paper Accepted at UAI

Our unified framework for fair and stable graph representation learning has just been accepted at UAI. We establish a theoretical connection between counterfactual fairness and stability and use it in a framework that can be used with any GNN to learn fair and stable embeddings.

Apr 2021:   Hot Off the Press: COVID-19 Repurposing in PNAS

Hot off the press! We deployed AI/ML and network medicine algorithms to rank 6,340 drugs for their expected efficacy against SARS-CoV-2. We screened in human cells the top-ranked drugs, identifying six drugs that reduced viral infection, four of which could be repurposed to treat COVID-19.

Apr 2021:   Representation Learning for Biomedical Nets

In our survey on representation learning for biomedical networks we discuss how long-standing principles of network biology and medicine provide the conceptual grounding for representation learning, explain its successes, and inform future advances.

Mar 2021:   Receiving Amazon Research Award

We are excited about receiving Amazon Faculty Research Award on Actionable Graph Learning for Finding Cures for Emerging Diseases. Thank you to Amazon Science for supporting our research.

Mar 2021:   Michelle's Graduate Research Fellowship

Michelle M. Li won the NSF Graduate Research Fellowship Award. Congratulations!

Mar 2021:   Hot Off the Press: Multiscale Interactome

Hot off the press! We develop a multiscale interactome approach to explain disease treatments. The approach can predict drug-disease treatments, identify proteins and biological functions related to treatment, and identify genes that alter treatment’s efficacy and adverse reactions.

Mar 2021:   Graph Networks in Computational Biology

We are excited to share slides from our recent lecture on Graph Neural Networks in Computational Biology, which we gave at Stanford ML for Graphs course.

Mar 2021:   Fair and Stable Graph Representation Learning

We are thrilled to share the latest preprint on fair and stable graph representation learning.

Feb 2021:   New Preprint on Therapeutics Data Commons

Jan 2021:   An Algorithmic Approach to Patient Safety

The new algorithmic approach investigates population-scale patient safety data and reveals inequalities in adverse events before and during COVID-19 pandemic.

Jan 2021:   Workshop on AI in Health at the Web Conference

We are excited to co-organize Workshop on AI in Health: Transferring and Integrating Knowledge for Better Health at the Web (WWW) conference. The call for papers is open! We also announce the AI in Health Data Challenge.

Jan 2021:   Tutorial on ML for Drug Development

We will present a tutorial on ML/AI for drug discovery and development at IJCAI conference. See the tutorial website.

Dec 2020:   Two New Papers Published

Dec 2020:   Bayer Early Excellence in Science Award

Our research won the Bayer Early Excellence in Science Award. We are honored to have received this recognition!

Nov 2020:   Therapeutics Data Commons (TDC)

We are thrilled to announce Therapeutics Data Commons (TDC)! We invite you to join TDC. TDC is an open-source and community-driven effort.

Nov 2020:   National Symposium on the Future of Drugs

On behalf of the NSF, we are organizing the National Symposium on Drug Repurposing for Future Pandemics. We have a stellar lineup of invited speakers! Register at www.drugsymposium.org.

Zitnik Lab  ·  Harvard  ·  Department of Biomedical Informatics