Faithfulness and causal discovery

Causal discovery refers to the process of inferring an underlying causal graph from data. To do this, we need to make an assumption called "faithfulness". From Shalizi's book: The joint distribution has all of the conditional independence relations implied by the causal Markov property, and only those conditional independence relations. The point of the faithfulness… Continue reading Faithfulness and causal discovery

Unmeasured confounder bias

Today we take a look at the classic linear regression model and observe the well-known phenomenon that regression coefficient estimates can be biased if relevant "confounding" variables are not included in the regression. We will revisit this leading example many times during the course of the semester, both to reinforce ideas and to critique the… Continue reading Unmeasured confounder bias

Stochastic search variable selection

The topic of today's post is Bayesian "variable selection" using point-mass mixture priors. This builds of off the previous post concretely, adapting the ideas to the linear regression setting. The key reference for this approach to variable selection is George and McCulloch; see also the literature review of Hahn and Carvalho. The model is simply… Continue reading Stochastic search variable selection

Mixtures of conjugate priors

Conjugate models (likelihood-prior pairs) refer to parametric Bayesian models where the posterior distribution is expressible in the same parametric form as the prior. Conjugate models can be given a deep theoretical characterizations; see, for example, here. The origins of the work, however, can be found in the textbook of Raiffa and Schlaifer, who invented it as a… Continue reading Mixtures of conjugate priors

Gaussian factor models

By a Gaussian factor model, I refer to the following specification: $latex X_i = \mathbf{B}f_i + \epsilon_i; \epsilon_i \sim N(0, \Psi); f_i \sim N(0, I_k).&s=1$ Each observation $latex X_i$ is a p-dimensional column vector, $latex \mathbf{B}$ is a p-by-k real-valued matrix of "factor loadings", and the "factor scores" $latex f_i$ are k-dimensional column vectors. The… Continue reading Gaussian factor models