Suppose we are interested in estimating the average treatment effect (ATE), defined in potential-outcome notation as
where the outer expectation is over . Assuming strong ignorability, so that
and
the ATE can be written as
where the outer expectation is over covariates . Likewise, recall that the ATE can also be written as
where the outer expectation is over the pair .
Now, consider the estimand
where the expectation is taken over the joint distribution of and
and
are fixed functions. We will consider what happens for particular specifications of
and
. In particular, we will consider two cases.
Suppose . In this case, our estimand becomes
By iterated expectation, the middle term can be rewritten as
which we see will cancel with the third term, leaving only the first term, which is equivalent to .
Suppose . In this case, our estimand becomes
By iterated expectation, the third term becomes
which cancels with the first term, leaving only the second term, which in this case is equivalent to .
If both of the above conditions hold, one gets the same result — that this estimand is equivalent to — but only one of them is necessary. Applying similar reasoning to
allows us to estimate the ATE.
If neither nor
, then one is of course simply out of luck (i.e., won’t be able to estimate ATE via this estimand).
