Stack Exchange

votes

Generalization of degrees of freedom for t distribution for coefficients after multiple imputation

Posted on 6/21/2016•Solved on 8/12/2025

UQ ID:462

Science

Cross Validated

Donald Rubin has shown that regression coefficient estimates have fatter tails after multiple imputation and has provided a formula for the degrees of freedom to use as a t-distribution approximation to the coefficient estimates resulting from Rubin's rule for combining multiple imputations. I would like a generalization of this approach that handles more than single degree of freedom tests, e.g., an adjustment to the multiple degree of freedom distribution for "chunk tests" that test combinations of parameters. Has anyone seen such a procedure? The goal is to improve (raise) the coverage probability of confidence intervals after multiple imputation, and to better preserve type-I error.

Ultimately it would be nice to have a way to convert covariance matrices arising from Rubin's rule so that normal and distributions can still be used, and confidence coverage will be more accurate. In the single parameter case one could just bump up the standard error of by a factor equal to the critical value divided by the normal critical value.

Follow-up

As I've gotten more into Bayesian modeling I came to know about posterior stacking for incorporating multiple imputation into Bayesian analysis using MCMC posterior sampling. This makes the tails of the posterior distribution properly heavier, automatically, and avoids complexities of the approximate Bayesian method of multiple inputation requiring the use of Rubin's rule. An example is in the RMS course notes.

Keywords & Tags

bayesian

missing-data

multiple-imputation

degrees-of-freedom

t-distribution

Model Answers (9)

Submitted answers from various AI models

Solved

No reviews yet

K2-Think

Pending UQ Validation

Needs Human Verification

MBZUAI-IFM

Qwen2.5-32B

Sep 18, 2025, 01:50 PM

To address the generalization of degrees of freedom adjustments for chunk tests (multivariate hypothesis tests) after multiple imputation (MI), we build on Rubin's original framework and extensions proposed in the statistical literature. Here's a structured approach:

1. Rubin's Framework for Single Parameters

Rubin's rules combine estimates from ( m ) imputed datasets. For a single coef...

0 model reviews

0 human reviews

No reviews yet

GPT-5

Failed UQ Validation

Needs Human Verification

OpenAI

Sep 2, 2025, 06:56 AM

Yes. The scalar t-approximation that Rubin derived generalizes to “chunk” (multi-df) tests via an F-approximation, with a denominator degrees-of-freedom (df) that depends on the between- and within-imputation variability for the joint hypothesis. This is standard in the multiple-imputation (MI) literature and is implemented in major software (SAS PROC MIANALYZE, Stata mi test, R mice/mitools/Ameli...

0 model reviews

0 human reviews

No reviews yet

Claude Opus 4

Failed UQ Validation

Needs Human Verification

Anthropic

Aug 1, 2025, 12:00 AM

This is an excellent question about extending Rubin's degrees of freedom adjustment from univariate to multivariate tests after multiple imputation.

The Challenge

Rubin's original work provided the adjusted degrees of freedom for univariate t-tests:

where is the number of imputations, is th...

0 model reviews

0 human reviews

No reviews yet

o3 Pro

Failed UQ Validation

Needs Human Verification

OpenAI

Aug 1, 2025, 12:00 AM

Yes – the single-degree-of-freedom (“univariate”) t–approximation that Rubin (1987) derived has been generalised to any set of linear hypotheses.
The generalisations were worked out in a series of papers by Xiao-Ling Meng, Don Rubin and collaborators and are implemented in standard MI software (e.g. mice::pool, mitools, SAS PROC MIANALYZE, Stata mi test, …).

Key references

Meng, X.-L. & Rub...

0 model reviews

0 human reviews

No reviews yet

o3

Failed UQ Validation

Needs Human Verification

OpenAI

Aug 1, 2025, 12:00 AM

Yes – the univariate “fraction‐of‐missing‐information’’ argument that leads to the scalar t–distribution can be extended to any linear hypothesis that involves several regression coefficients simultaneously. Exactly the same quantities that are used in Rubin’s combining rules (the within– and between–imputation covariance matrices) are needed, one only has to form them for the linear combination o...

0 model reviews

0 human reviews

No reviews yet

o4 mini

Failed UQ Validation

Needs Human Verification

OpenAI

Aug 1, 2025, 12:00 AM

After you do your m imputations and fit the same regression model to each one you end up (via Rubin’s‐rules) with

Q_i = the p×1 vector of estimates from imputation i,
U_i = the p×p “within‐imputation” covariance for Q_i.

You form
Q̄ = (1/m)∑{i=1}^m Q_i
Ū = (1/m)∑{i=1}^m U_i
B = (1/(m−1))∑_{i=1}^m (Q_i−Q̄)(Q_i−Q̄)′

and the total covariance
T = Ū + (1+1/m) B.

Un...

0 model reviews

0 human reviews

No reviews yet

Gemini 2.5 Pro

UQ Validated

Human Verified

Google

Aug 1, 2025, 12:00 AM

Excellent question. You've correctly identified a key challenge in frequentist inference after multiple imputation (MI) and are asking about the standard, and most accepted, solution.

Yes, a generalization for multi-parameter tests (often called "chunk tests" or omnibus tests) exists, and it's a crucial part of the MI inference framework developed by Rubin and his colleagues. The resulting test i...

0 model reviews

0 human reviews

No reviews yet

DeepSeek R1

Failed UQ Validation

Needs Human Verification

DeepSeek

deepseek-ai/DeepSeek-R1

Aug 1, 2025, 12:00 AM

<think> We are given a question about generalizing the degrees of freedom adjustment for t-distributions in the context of multiple imputation to handle multiple parameters (e.g., for chi-square tests). The user also mentions a follow-up about Bayesian methods (posterior stacking) that avoid the need for Rubin's rule.

The original question is about extending Rubin's method for combining multiple ...

0 model reviews

0 human reviews

No reviews yet

Claude 3.7 Sonnet

Failed UQ Validation

Needs Human Verification

Anthropic

Aug 1, 2025, 12:00 AM

Generalizing Degrees of Freedom After Multiple Imputation

Your question addresses an important nuance in multiple imputation (MI) methodology, particularly when moving from single-parameter inference to multivariate or "chunk" tests. Let me address this systematically.

Background: Rubin's Rules and the t-Distribution

As you noted, Rubin showed that after multiple imputation, the total vari...

0 model reviews

0 human reviews

Submit Your Model's Answer

Your submission will be validated by our LLM validator and then by human experts

You need to be logged in to submit an answer

: Assessing Language Models on Unsolved Questions

UQ: Assessing Language Models on Unsolved Questions

Generalization of degrees of freedom for t distribution for coefficients after multiple imputation

K2-Think

1. Rubin's Framework for Single Parameters

GPT-5

Claude Opus 4

The Challenge

o3 Pro

o3

o4 mini

Gemini 2.5 Pro

DeepSeek R1

Claude 3.7 Sonnet

Generalizing Degrees of Freedom After Multiple Imputation

Background: Rubin's Rules and the t-Distribution