Google 學術搜尋

[PDF][PDF] Confidence intervals and hypothesis testing for high-dimensional regression

A Javanmard, A Montanari - The Journal of Machine Learning Research, 2014 - jmlr.org

The Journal of Machine Learning Research, 2014•jmlr.org

Fitting high-dimensional statistical models often requires the use of non-linear parameter
estimation procedures. As a consequence, it is generally impossible to obtain an exact
characterization of the probability distribution of the parameter estimates. This in turn implies
that it is extremely challenging to quantify the uncertainty associated with a certain
parameter estimate. Concretely, no commonly accepted procedure exists for computing
classical measures of uncertainty and statistical significance as confidence intervals or …

Abstract

Fitting high-dimensional statistical models often requires the use of non-linear parameter estimation procedures. As a consequence, it is generally impossible to obtain an exact characterization of the probability distribution of the parameter estimates. This in turn implies that it is extremely challenging to quantify the uncertainty associated with a certain parameter estimate. Concretely, no commonly accepted procedure exists for computing classical measures of uncertainty and statistical significance as confidence intervals or pvalues for these models.

We consider here high-dimensional linear regression problem, and propose an efficient algorithm for constructing confidence intervals and p-values. The resulting confidence intervals have nearly optimal size. When testing for the null hypothesis that a certain parameter is vanishing, our method has nearly optimal power. Our approach is based on constructing a ‘de-biased’version of regularized M-estimators. The new construction improves over recent work in the field in that it does not assume a special structure on the design matrix. We test our method on synthetic data and a highthroughput genomic data set about riboflavin production rate, made publicly available by Bühlmann et al.(2014).

jmlr.org

顯示更多顯示較少

儲存引用被引用 1081 次相關文章全部共 14 個版本 HTML 版

引用

進階搜尋

已儲存至「我的圖書館」

[PDF][PDF] Confidence intervals and hypothesis testing for high-dimensional regression