The Causal Review

1 What Problem Does It Solve?

Standard difference-in-differences (DiD) methods assume a binary treatment: units are either treated or not. But many real-world policies involve a continuous dose: a firm's share of costs covered by an investment subsidy, the number of training hours received by a worker, or the percentage of a community's area subject to environmental regulation. When the treatment dose varies continuously across units and over time, standard binary DiD estimates average over dose heterogeneity in an uncontrolled way.

The contdid package (Callaway et al., 2024) implements the Callaway et al. (2024) estimator for the Average Dose-Response Function (ADRF) in settings with staggered continuous treatment adoption . It combines the group-time average treatment effect framework of Callaway and Sant'Anna (2021) with doubly robust estimation and kernel smoothing to trace out the full dose-response curve.

2 The Estimand

Let Dit ∈ [0, ¯d] be the continuous treatment dose received by unit i in period t. Define Yit(d)as the potential outcome under dose d. The ADRF at dose level d is:

ADRF(d) = E[Y_it(d) − Y_it(0)], (1)

the average treatment effect of receiving dose d relative to no treatment. When d is binary, (1) reduces to the standard ATT.

The ADRF can also be defined at the level of dose changes (causal response function):

ADRF^Δ(d, d') = E[Y_it(d') − Y_it(d)], (2)

the average effect of moving from dose d to dose d'.

3 Identifying Assumptions

Identification of the ADRF requires:

Parallel trends for continuous doses: for all dose levels d, the pre-treatment trend in $Y(0)$ is independent of the dose received: ‍

E[Y_it(0) − Y_i,t−1(0) | D_i] = E[Y_it(0) − Y_i,t−1(0)]. (3)

This is stronger than the binary parallel trends condition because it must hold for every dose level d, not just for a treated vs. control comparison.

No anticipation: outcomes are unaffected by future treatment. ‍
Staggered adoption structure: once a unit receives a positive dose, it remains treated (irreversibility), analogous to the staggered binary adoption setting. ‍
Overlap: for each dose level d, there are sufficient control (untreated or not-yet-treated) units with similar pre-treatment characteristics.

4 Installation and Setup

# Install contdid from GitHub (not yet on CRAN as of 2026)
install.packages("remotes")
remotes::install_github("bcallaway11/contdid")

# Load the package
library(contdid)
library(ggplot2) # for plotting results

5 A Minimal Working Example

The following example uses a simulated panel dataset with a continuous treatment dose to illustrate the main workflow.

library(contdid)
set.seed(42)

# Simulate panel data with continuous treatment
n_units <- 300
n_periods <- 6
n_obs <- n_units * n_periods

# Unit-level dose (0 for never-treated, random [0,1] for treated)
unit_id <- rep(1:n_units, each = n_periods)
period <- rep(1:n_periods, n_units)

# Treatment: units 1-200 adopt treatment in period 4
# Dose varies across treated units
dose_unit <- ifelse(unit_id <= 200, runif(n_obs, 0, 1), 0)
dose <- ifelse(period >= 4, dose_unit, 0)

# Covariates and outcome
X <- rnorm(n_obs)
Y <- dose + 0.5 * X + rnorm(n_obs)

dat <- data.frame(
  id = unit_id, t = period, D = dose,
  Y = Y, X = X
)

# Estimate the ADRF
result <- cont_did(
  yname = "Y",
  tname = "t",
  idname = "id",
  dname = "D",
  xformla = ~ X, # covariates for DR estimation
  data = dat,
  control_group = "nevertreated", # or "notyettreated"
  nboot = 199 # bootstrap reps for SE
)

# Summarise and plot
summary(result)
plot(result)

The cont_did() function returns an object with ADRF estimates at a grid of dose values, together with bootstrap standard errors and 95% uniform confidence bands.

6 Key Options and Their Meaning`‍`

Argument	Description
control_group	"nevertreated" uses only units with zero dose throughout; "notyettreated" also uses units treated later
xformla	R formula for covariates used in doubly robust propensity score and outcome regression
nboot	Number of bootstrap repetitions (default 999) for standard errors and uniform bands
bw	Bandwidth for kernel smoothing of the ADRF (default: plug-in selector)
anticipation	Number of periods of anticipation (default 0)

Table 1: Key arguments to cont_did() [cite: 1868-1869]

7 Comparison to Alternative Approaches

Method	Key assumption	Package
contdid	Parallel trends (continuous dose)	contdid
Generalised PS (Hirano-Imbens)	Unconfoundedness	CausalGPS
Doubly robust ADRF	Unconfoundedness	npcausal
Standard binary DiD	Parallel trends (binary)	did

Table 2: Methods for continuous treatment causal effects [cite: 1871-1872]

The key distinction is the identification assumption. GPS and npcausal require unconfoundedness—that all confounders are observed in X. The contdid estimator instead relies on parallel trends, which is more plausible in panel settings with unit and time fixed effects. If reliable panel data are available with multiple pre-treatment periods, contdid is the preferred choice.

8 Pitfalls and Practical Advice

Insufficient dose variation. The ADRF is poorly identified near dose levels d with few observations. Inspect the empirical distribution of D before interpretation; do not trust estimates at the tails of the dose distribution. ‍
Balanced panel requirement. The current implementation requires a balanced panel (all units observed in all periods). Unbalanced panels need preprocessing to fill in missing observations or to identify and restrict the estimation sample. ‍
Parallel trends is harder to verify. With binary treatment, pre-trend tests check a single condition. With continuous treatment, parallel trends must hold for all dose levels—a richer assumption that is harder to test nonparametrically. ‍
Choice of bandwidth. The ADRF estimate is sensitive to the kernel bandwidth. Report estimates at multiple bandwidths and check stability.

9 Conclusion

The contdid package extends the staggered DiD framework to continuous treatment doses, filling a significant gap in the applied econometrics toolkit. By estimating the full Average Dose-Response Function rather than a single binary ATT, it provides richer information about treatment effect heterogeneity across dose levels. Researchers working with policies that have continuous intensity variation—subsidies, training programmes, environmental regulations—should consider contdid as their primary estimation tool, conditional on having a valid parallel trends argument.

References

Callaway, B. and Sant'Anna, P. H. C. (2021). Difference-in-differences with multiple time periods. Journal of Econometrics, 225(2):200-230.
Callaway, B., Goodman-Bacon, A., and Sant'Anna, P. H. C. (2024). Difference-in-differences with a continuous treatment. NBER Working Paper No. 32117.
Hirano, K. and Imbens, G. W. (2004). The propensity score with continuous treatments. In A. Gelman and X.-L. Meng (eds.), Applied Bayesian Modeling and Causal Inference from Incomplete-Data Perspectives, pages 73-84. Wiley .
Kennedy, E. H. (2017). Non-parametric methods for doubly robust estimation of continuous treatment effects. Journal of the Royal Statistical Society, Series B, 79(4):1229-1245 .

The contdid Package in R: Estimating Dose-Response Functions with Continuous Treatments

1 What Problem Does It Solve?

2 The Estimand

3 Identifying Assumptions

4 Installation and Setup

5 A Minimal Working Example

6 Key Options and Their Meaning`‍`

7 Comparison to Alternative Approaches

8 Pitfalls and Practical Advice

9 Conclusion

References

Continue Reading

The causalml Package in Python: Uplift Modeling and CATE Meta-Learners

The gsynth Package in R: Generalized Synthetic Control with Interactive Fixed Effects

Recent Results: Immigration, Migration, and Labour Markets

Natural Experiments: Finding Causal Evidence Without Randomisation

Regression Discontinuity Design: Sharp, Fuzzy, and the CCT Bandwidth

The Credibility Revolution in Econometrics: Thirty Years of Causal Inference

Article Title

The contdid Package in R: Estimating Dose-Response Functions with Continuous Treatments

1 What Problem Does It Solve?

2 The Estimand

3 Identifying Assumptions

4 Installation and Setup

5 A Minimal Working Example

6 Key Options and Their Meaning‍

7 Comparison to Alternative Approaches

8 Pitfalls and Practical Advice

9 Conclusion

References

Continue Reading

The causalml Package in Python: Uplift Modeling and CATE Meta-Learners

The gsynth Package in R: Generalized Synthetic Control with Interactive Fixed Effects

Recent Results: Immigration, Migration, and Labour Markets

Natural Experiments: Finding Causal Evidence Without Randomisation

Regression Discontinuity Design: Sharp, Fuzzy, and the CCT Bandwidth

The Credibility Revolution in Econometrics: Thirty Years of Causal Inference

Stay current with causal inference

Article Title

6 Key Options and Their Meaning`‍`