Review 1: "Firearm Purchasing and Firearm Violence in the First Months of the Coronavirus Pandemic in the United States"

Jared D. Fisher; David W. Puelz

doi:doi:10.1162/2e3983f5.187d4cf5

Firearm Purchasing and Firearm Violence in the First Months of the Coronavirus Pandemic in the United States

by Julia P. Schleimer, Christopher D. McCort, Veronia A. Pear, Aaron Shev, Elizabeth Tomsich, Rameesha Asif-Sattar, Shani Buggs, Hannah S. Laqueur, and Garen J. Wintemute

Show Description
Published on Jul 11, 2020
www.medrxiv.org

Description

Importance. Firearm violence is a significant public health and safety problem in the United States. A surge in firearm purchases following the onset of the coronavirus pandemic may increase rates of firearm violence. Objective. To estimate the association between changes in firearm purchasing and interpersonal firearm violence during the coronavirus pandemic. Design. Cross-sectional time series study. We estimate the difference between observed rates of firearm purchases and those predicted by seasonal autoregressive integrated moving average models. Using negative binomial models, we then estimate the association between excess firearm purchases and rates of interpersonal firearm violence within states, controlling for confounders. Setting. The 48 contiguous states and the District of Columbia. Hawaii and Alaska are excluded due to missing or incomplete data. Exposure. The difference between observed and expected rates of firearm purchases in March through May 2020, approximated by National Instant Criminal Background Check System records. Main Outcome and Measure. Fatal and nonfatal injuries from interpersonal firearm violence, recorded in the Gun Violence Archive. Results. We estimate that there were 2.1 million excess firearm purchases from March through May 2020--a 64.3% increase over expected volume, and an increase of 644.4 excess purchases per 100,000 population. We estimate a relative rate of death and injury from firearm violence of 1.015 (95% Confidence Interval (CI): 1.005 to 1.025) for every 100 excess purchases per 100,000, in models that incorporate variation in purchasing across states and control for effects of the pandemic common to all states. This reflects an increase of 776 fatal and nonfatal injuries (95% CI: 216 to 1,335) over the number expected had no increase in purchasing occurred. Conclusions and Relevance. We find a significant increase in firearm violence in the United States associated with the coronavirus pandemic-related surge in firearm purchasing. Our findings are consistent with existing research. Firearm violence prevention strategies may be particularly important during the pandemic.

RR:C19 Evidence Scale rating by reviewer:

Potentially informative. The main claims made are not strongly justified by the methods and data, but may yield some insight. The results and conclusions of the study may resemble those from the hypothetical ideal study, but there is substantial room for doubt. Decision-makers should consider this evidence only with a thorough understanding of its weaknesses, alongside other evidence and theory. Decision-makers should not consider this actionable, unless the weaknesses are clearly understood and there is other theory and evidence to further support it.

***************************************

Review:

This paper examines the relationship between firearm purchases and firearm violence during the COVID-19 pandemic using a two-stage model. Using state-level firearm purchasing data from January 2011 to February 2020, the authors forecast what firearm purchasing behavior would have been in March, April, and May of 2020. The forecast residuals (the differences between these forecasts and the actual purchasing levels) are used to estimate the excess firearm purchases as a result of the shock of the pandemic. Finally, they model states’ incidences of firearm violence using the excess firearm purchases and some control variables. This model finds a statistically significant relationship between excess firearm purchases and excess firearm violence.

We are statisticians, thus for our Rapid Review, we will be focusing on the modeling decisions, statistical models, and causal inference components of this paper. On Rapid Review’s scale, we find this paper to be between not informative and potentially informative. It is possible that the results may be correct, or that the truth is the opposite of the results. In fact, the authors concede as much in the limitations section of the paper. If the opposite direction of association is true (i.e., if firearm violence is causing unexpected firearm purchases), the entire discussion should be tempered. Decision-makers should not throw out this work but should ask for further investigation since results of association are rarely meaningful. This paper can shed light on what to look for to garner appropriate evidence to make a strong conclusion, as close to causal as possible, in future work.

Here, we focus on four issues. The first issue is regarding the selection of their seasonal autoregressive integrated moving average (SARIMA) model used to predict firearm purchases. The complexity of their SARIMA model was chosen by a published¹ stepwise algorithm, and its fit tested with a standard test. However, both of these model checks are in-sample, while the crucial use of the model is out-of-sample: how well does the model forecast data it has not seen? To this end, we would recommend that the authors demonstrate that the model indeed has good forecast performance before the pandemic. Forward-rolling time series cross-validation would be an appropriate way to measure this, at each of the one-/two-/three-month horizons. This is important, because the remainder of the paper and the remainder of the model rely on the excess firearm purchases being appropriately predicted out-of-sample. Their SARIMA model may perform great out-of-sample, but presently there is no evidence presented of this nature.

Second, the study focuses on state-level data, missing out on heterogeneity within each state. The ideal study scenario would be one where the analyst is able to check whether all who commit firearm violence (during March-May 2020) personally had increased firearm availability due to purchasing activity. The authors note this is difficult to establish (“we have no information on whether the excess firearms acquired were those used in violence”) and have gathered what is likely the best-case data they can get on a state-level. However, consider a likely scenario in which most unexpected firearm purchases are in the suburbs and most firearm violence is located in cities. The authors’ approach would still find an association at an aggregate (state) level, but is this result meaningful if the purchasers are largely independent of the groups that are committing violence? To mitigate this, more granular data could be used. While individual-level data is unlikely, is the data available on the level of county or census tract?

Third, whether the data is state-level, individual-level, or anywhere in between, we have concerns about the covariates the authors have opted to leave out of the final model. In the exploratory analysis subsection, the authors comment on having tried many variables but left them out of the final model due to insignificance. However, some of the covariates left in the final model also lack statistical significance. We would ask that the authors include results with these other variables included in the models, perhaps in a table or set of tables. This would aid the clarity and transparency of the paper’s main point and robustness checks. This practice is common in economics papers when they use regression models to wrestle with associative and causal problems via observational data, as is the case here. Furthermore, it would seem that the pandemic’s effect on mental/emotional health and economic indicators would play large roles in this causal scenario, but these are not included in the final model nor the output.

Our last point regards causal inference. The authors do a good job of ensuring that non-causal words are used (“association”) and even include the blanket disclaimer in the limitations section. It might be worth noting this earlier in the paper, instead of near the end, and use language that is appropriate for the many non-technical readers interested in the paper (i.e. beyond “causal” and “association”). Related to this and our discussion above, with the current set of covariates and data, we are not entirely convinced the association is even meaningful. As a type of robustness check that could be performed, if “excess firearm purchases” was replaced with “excess online retail purchases”, would we still find an association between Amazon.com’s revenue and firearm violence using this method? It would be helpful for the authors to address these critiques through a more thorough data analysis and discussion in the text, beyond just citing other papers with similar conclusions and leaving these concerns for the limitations section at the end of the paper.

In conclusion, we suggest the authors include more descriptions of their statistical methods to convince readers of the soundness of their modeling choices. We need to see if the SARIMA model produces reasonable forecasts. We also need to see what the inclusion/exclusion of different covariates do to the statistical model and, importantly, the main exposure. Of course, if possible, data at a more granular level would help the argument along.

References:

Hyndman RJ, Khandakar Y. Automatic time series forecasting: the forecast package for R. J Stat Softw. 2008;27(3). doi:10.18637/jss.v027.i03