The Causal Review

1 Introduction

Two of the most widely used methods for evaluating policies and interventions in social science are difference-in-differences (DiD) and the synthetic control method (SC). Both use control units to construct a counterfactual for a treated unit. Both require an assumption about the absence of treatment in the control group. Yet they are built on different foundations, make different assumptions, and suit different research designs.

The choice between them is consequential. Applying DiD when SC is more appropriate- or vice versa can yield badly misleading estimates. This article steelmans both approaches, identifies the conditions under which each is preferred, and discusses hybrid methods that combine their strengths.

2 The Case for Difference-in-Differences

2.1 Simplicity and Transparency

DiD has a compelling simplicity: it subtracts the pre-post change in control units from the pre-post change in treated units. The estimator is:

$$ \hat{\tau}^{DiD} = (\bar{Y}_{T,post} - \bar{Y}_{T,pre}) - (\bar{Y}_{C,post} - \bar{Y}_{C,pre}) $$

(1)

This can be estimated by OLS and extended to multiple time periods, multiple groups, and covariates straightforwardly. The identifying assumption parallel trends is transparent and, to some degree, empirically testable via pre-trend tests.

2.2 Multiple Treated Units

DiD shines when there are many treated units. With a large treatment group, the DiD estimator has low variance, and heterogeneity across treated units can be studied. The modern staggered DiD literature Callaway-Sant'Anna, Goodman-Bacon, de Chaisemartin- D'Haultfoeuille is built on DiD and provides tools for settings with staggered treatment timing across many units [Roth et al., 2023].

SC, in contrast, was designed for a single (or very few) treated units. With many treated units, SC is computationally cumbersome and requires solving a separate optimisation for each treated unit.

2.3 Inference

DiD inference is straightforward under standard panel regression: cluster standard errors at the unit level, or use block bootstrap. Asymptotic theory is well-developed for large N.

SC inference via placebo permutations is valid but has low power when the donor pool is small. With fewer than 40-50 donor units, the permutation distribution is coarse, and meaningful p-values are hard to achieve.

3 The Case for Synthetic Control

3.1 One (or Few) Treated Units

The most important advantage of SC is its suitability for comparative case studies with a single treated unit a country, state, or city. DiD with a single treated unit relies on a single control or an unweighted average of controls, both of which may be poor comparators. SC constructs a weighted comparison that closely matches the treated unit in the pre-treatment period, reducing pre-treatment bias by design.

As Abadie [2021] argues, when the treated unit is unique California's tobacco programme, Germany's reunification, an individual country's trade reform-SC is more appropriate than DiD because there is no natural comparison unit.

3.2 Transparent Pre-Treatment Fit

A critical advantage of SC over DiD is that the pre-treatment fit is directly observable. The researcher can inspect how closely the synthetic control tracks the treated unit before treatment and report the pre-treatment MSPE (mean squared prediction error). Poor pre-treatment fit is a visible warning sign that the counterfactual is unreliable.

In DiD, the parallel trends assumption concerns the counterfactual trend post-treatment, which is unobservable. The pre-trend test is an imperfect proxy. SC makes the analogous check much more transparent.

3.3 Handling Non-Parallel Trends

DiD requires that treated and control units would have followed parallel trends in the absence of treatment. SC relaxes this: it allows treated and control units to have different pre-treatment levels and trends, as long as a weighted combination of control units can replicate the treated unit's pre-treatment path. This is a weaker assumption when there are systematic differences between the treated unit and any single control.

4 Key Differences

Table 1: Synthetic control vs. difference-in-differences: key comparisons
Dimension	DiD	Synthetic Control
# treated units	Many (preferred)	One or few
# control units	Few is fine	More is better
Identifying assumption	Parallel trends	Factor model / pre-treatment fit
Inference	Cluster SE / bootstrap	Permutation placebo tests
Pre-treatment fit	Tested (imperfectly)	Explicitly minimised
Extrapolation	Less sensitive	Sensitive to convex hull
Dynamic effects	Straightforward	Plotted post-treatment
Staggered treatment	Modern methods available	Cumbersome for many treated
Software	did, fixest	Synth, augsynth

5 The Hybrid Approach: Synthetic DiD

A compelling synthesis is Synthetic DiD (SDiD) proposed by Arkhangelsky et al. [2021]. SDiD constructs unit weights (as in SC) to remove pre-treatment level differences and time weights to down-weight early pre-treatment periods mimicking the "local" comparison of DiD.

The estimator satisfies a double robustness property: it is consistent if either the unit weights or the time weights are correctly specified.

SDiD handles multiple treated units more naturally than SC, and it outperforms standard DiD when control units have heterogeneous trends. It is implemented in the synthdid R package.

6 A Decision Framework

Researchers facing a design choice should consider:

How many treated units?

One or two: Synthetic control (or SDiD).

Many (e.g., 20+ treated states): DiD (possibly staggered).

3-15: Both are viable; SDiD is a strong default.

Is parallel trends plausible without weighting?

If the treated unit is visibly different in pre-treatment levels: SC or SDiD.

If treated and control units look similar pre-treatment: DiD is efficient and straightforward.

Is the treated unit within the convex hull of control units?

If the treated unit is an extreme case (highest income country, largest city): SC extrapolates poorly. Use DiD or ASCM.

Are treatment effects dynamic (growing or decaying)?

Both methods can accommodate dynamics; event-study plots are informative for both.

7 An Empirical Illustration: The Mariel Boatlift

The Mariel Boatlift controversy illustrates the sensitivity of the method choice. Card [1990] used DiD with a single treated city (Miami) and a comparison group of four cities. Borjas [2017] re-analysed the data with different control groups and found larger wage effects. Peri and Yasenov [2019] used SC to construct an optimal synthetic Miami and found effects closer to Card's original null result.

The disagreement turned partly on which comparisons are most valid exactly the question that SC and DiD answer differently. SC's data-driven weighting produced a synthetic Miami with much better pre-treatment fit than the simple DiD comparator, suggesting the SC result is more credible in this case [Abadie, 2021].

8 Conclusion

DiD and SC are complements, not substitutes. DiD is the method of choice when there are many treated units, parallel trends is plausible, and a rich pre-treatment period is available to test for trend differences. SC is the method of choice for comparative case studies with one or few treated units, where constructing a data-driven counterfactual improves on any natural comparison. Synthetic DiD combines the strengths of both and is an attractive default in intermediate settings. The key is to choose the method based on the research design not based on which produces the preferred result.

References

Abadie, A., Diamond, A., and Hainmueller, J. (2010). Synthetic control methods for comparative case studies. Journal of the American Statistical Association, 105(490):493-505.
Abadie, A. (2021). Using synthetic controls: Feasibility, data requirements, and methodological aspects. Journal of Economic Literature, 59(2):391-425.
Arkhangelsky, D., Athey, S., Hirshberg, D.A., Imbens, G.W., and Wager, S. (2021). Synthetic difference-in-differences. American Economic Review, 111(12):4088-4118.
Card, D. (1990). The impact of the Mariel boatlift on the Miami labor market. Industrial and Labor Relations Review, 43(2):245-257.
Borjas, G.J. (2017). The wage impact of the Marielitos: A reappraisal. ILR Review, 70(5):1077-1110.
Peri, G. and Yasenov, V. (2019). The labor market effects of a refugee wave: Synthetic control method meets the Mariel boatlift. Journal of Human Resources, 54(2):267-309.
Roth, J., Sant'Anna, P.H.C., Bilinski, A., and Poe, J. (2023). What's trending in difference-in-differences? A synthesis of the recent econometrics literature. Journal of Econometrics, 235(2):2218-2244.

Synthetic Control vs. Difference-in-Differences: When to Use Which?

1 Introduction

2 The Case for Difference-in-Differences

2.1 Simplicity and Transparency

2.2 Multiple Treated Units

2.3 Inference

3 The Case for Synthetic Control

3.1 One (or Few) Treated Units

3.2 Transparent Pre-Treatment Fit

3.3 Handling Non-Parallel Trends

4 Key Differences

5 The Hybrid Approach: Synthetic DiD

6 A Decision Framework

7 An Empirical Illustration: The Mariel Boatlift

8 Conclusion

References

Continue Reading

The causalml Package in Python: Uplift Modeling and CATE Meta-Learners

The gsynth Package in R: Generalized Synthetic Control with Interactive Fixed Effects

Recent Results: Immigration, Migration, and Labour Markets

Natural Experiments: Finding Causal Evidence Without Randomisation

Regression Discontinuity Design: Sharp, Fuzzy, and the CCT Bandwidth

The Credibility Revolution in Econometrics: Thirty Years of Causal Inference

Article Title

Synthetic Control vs. Difference-in-Differences: When to Use Which?

1 Introduction

2 The Case for Difference-in-Differences

2.1 Simplicity and Transparency

2.2 Multiple Treated Units

2.3 Inference

3 The Case for Synthetic Control

3.1 One (or Few) Treated Units

3.2 Transparent Pre-Treatment Fit

3.3 Handling Non-Parallel Trends

4 Key Differences

5 The Hybrid Approach: Synthetic DiD

6 A Decision Framework

7 An Empirical Illustration: The Mariel Boatlift

8 Conclusion

References

Continue Reading

The causalml Package in Python: Uplift Modeling and CATE Meta-Learners

The gsynth Package in R: Generalized Synthetic Control with Interactive Fixed Effects

Recent Results: Immigration, Migration, and Labour Markets

Natural Experiments: Finding Causal Evidence Without Randomisation

Regression Discontinuity Design: Sharp, Fuzzy, and the CCT Bandwidth

The Credibility Revolution in Econometrics: Thirty Years of Causal Inference

Stay current with causal inference

Article Title