This work was supported by National Natural Science Foundation of China (Grant No. 11671059).
Appendix Note that given $~j~$, The following notation will be used throughout the proof of Theorem For the event in Lemma Now we prove Theorem Note that given the property of the solution of LASSO, we have $~\mbox{Cardinality}(\hat~A)~<~n\cdot~p$, and thus $\hat~A$ satisfies (C.3). Furthermore, according to Theorem This result is similar to By construction, set $\widetilde{Z}_{jj'}~=~\frac{1}{\lambda_2}~(-~\hat~\Sigma_{jj'}~+~[\bar{\Theta}^{-1}]_{jj'})$, which makes $~\widetilde{Z}~$ satisfy Let $~\Delta~=~\bar{\Theta}-~\Theta~$, and $~R(\Delta)~$ be the difference between the gradient $~\nabla~(\log~|\bar~\Theta|)~=~\bar{\Theta}^{-1}~$ and its first-order Taylor expansion around $~\Theta~$ by using the first and second derivatives of the log-determinant function, which is Then the solution of We now consider the upper bounds of $~\|\bar~H_A\|_{\infty}$ and $~\|\bar~R_A\|_{\infty}~$. First, consider a multivariate Gaussian vector; the deviation of the sample covariance matrix $~\hat~\Sigma~$ has an exponential type tail bound. Let $~\delta~=~(\frac{\tau~\log(np)}{n})^{1/2}~$. We have Given $j$, since $~\mbox{Cardinality}(A_j)~\leqslant~n~$, the maximum likelihood estimate of $~\Theta~$ based on $~A~$ exists and is unique Define $~\mathcal{B}~$ as Now we prove the upper bound of $~|\bar{\widetilde{Z}}_{A/S}|~$ in By a direct calculation, we have Let $~M~=~M_1~\times~M_2~$. With probability at least $~1-~\frac{1}{p^{\tau-1}}~$, we have The minimum absolute value $~\theta_{\min}~$ of nonzero entries of $\Theta$ satisfies In addition, according to the result of Lemma Since
[1] Banerjee O, Ghaoui L-E, d'Aspremont A. Model selection through sparse maximum likelihood estimation for multivariate Gaussian or binary data. J Mach Learn Res, 2008, 9: 485--516. Google Scholar
[2] Bickel P J, Levina E. Regularized estimation of large covariance matrices. Ann Statist, 2008, 36: 199-227 CrossRef Google Scholar
[3] Burnatowska-Hledin M A, Kossoris J B, Van Dort C J. T47D breast cancer cell growth is inhibited by expression of VACM-1, a cul-5 gene.. Biochem BioPhys Res Commun, 2004, 319: 817-825 CrossRef PubMed Google Scholar
[4] Chang H-Y, Nuyten D-S, Sneddon J-B, et al. Robustness, scalability, and integration of a wound-response gene expression signature in predicting breast cancer survival. Proc Natl Acad Sci USA, 2001, 102: 3738--343. Google Scholar
[5] Chen M, Ren Z, Zhao H. Asymptotically Normal and Efficient Estimation of Covariate-Adjusted Gaussian Graphical Model.. J Am Statistical Association, 2016, 111: 394-406 CrossRef PubMed Google Scholar
[6] Dempster A-P. Covariance selection. Biometrics, 1972, 28: 157--175. Google Scholar
[7] Fan J-Q, Feng Y, Wu Y-C. Network exploration via the adaptive LASSO and scad penalties. Ann Appl Stat, 2009, 1: 521--541. Google Scholar
[8] Fan J, Li R. Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties. J Am Statistical Association, 2001, 96: 1348-1360 CrossRef Google Scholar
[9] Fan J, Lv J. Sure independence screening for ultrahigh dimensional feature space. J R Statistical Soc-Ser B (Statistical Methodology), 2008, 70: 849-911 CrossRef Google Scholar
[10] Ferguson T S. An Inconsistent Maximum Likelihood Estimate. J Am Statistical Association, 1982, 77: 831-834 CrossRef Google Scholar
[11] Friedman J, Hastie T, Tibshirani R. Sparse inverse covariance estimation with the graphical lasso.. Biostatistics, 2008, 9: 432-441 CrossRef PubMed Google Scholar
[12] Han X, Poon R Y C. Critical differences between isoforms of securin reveal mechanisms of separase regulation.. Mol Cell Biol, 2013, 33: 3400-3415 CrossRef PubMed Google Scholar
[13] Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed. New York: Springer, 2009. Google Scholar
[14] Hayward B E, Moran V, Strain L. Bidirectional Imprinting of a Single Gene: GNAS1 Encodes Maternally, Paternally, and Biallelically Derived Proteins. Proc Natl Acad Sci USA, 1998, 95: 15475-15480 CrossRef PubMed ADS Google Scholar
[15] Janková J, van de Geer S. Confidence intervals for high-dimensional inverse covariance estimation. Electron J Statist, 2015, 9: 1205-1229 CrossRef Google Scholar
[16] Lam C, Fan J-Q. Sparsistency and rates of convergence in large covariance matrix estimation. Ann Statist, 2009, 37: 42--54. Google Scholar
[17] Li N, Zhang J, Liao D. Association between C4, C4A, and C4B copy number variations and susceptibility to autoimmune diseases: a meta-analysis. Sci Rep, 2017, 7: 42628 CrossRef PubMed ADS Google Scholar
[18] Meinshausen N. Relaxed Lasso. Comput Stat Data Anal, 2007, 52: 374-393 CrossRef Google Scholar
[19] Meinshausen N, Bühlmann P. High-dimensional graphs and variable selection with the Lasso. Ann Statist, 2006, 34: 1436-1462 CrossRef Google Scholar
[20] Meinshausen N, Yu B. Lasso-type recovery of sparse representations for high-dimensional data. Ann Statist, 2009, 37: 246-270 CrossRef Google Scholar
[21] Miyoshi Y, Iwao K, Egawa C. Association of centrosomal kinase STK15/BTAK mRNA expression with chromosomal instability in human breast cancers.. Int J Cancer, 2001, 92: 370-373 CrossRef PubMed Google Scholar
[22] Negahban S, Ravikumar P, Wainwright M-J, et al. A unified framework for high-dimensional analysis of $m$-estimators with decomposable regularizers. Statist Sci, 2012, 27: 1348--1356. Google Scholar
[23] Ning Y, Liu H. A general theory of hypothesis tests and confidence regions for sparse high dimensional models. Ann Statist, 2017, 45: 158-195 CrossRef Google Scholar
[24] Otterbach F, Callies R, Frey U H. The T393C polymorphism in the gene GNAS1 of G protein is associated with survival of patients with invasive breast carcinoma.. Breast Cancer Res Treat, 2007, 105: 311-317 CrossRef PubMed Google Scholar
[25] Peng J, Wang P, Zhou N. Partial Correlation Estimation by Joint Sparse Regression Models.. J Am Statistical Association, 2009, 104: 735-746 CrossRef PubMed Google Scholar
[26] Raskutti G, Wainwright M-J, Yu B. Restricted eigenvalue properties for correlated gaussian designs. J Mach Learn Res, 2010, 11: 2241--2259. Google Scholar
[27] Raskutti G, Wainwright M J, Yu B. Minimax Rates of Estimation for High-Dimensional Linear Regression Over $\ell_q$-Balls. IEEE Trans Inform Theor, 2011, 57: 6976-6994 CrossRef Google Scholar
[28] Ravikumar P, Wainwright M J, Raskutti G. High-dimensional covariance estimation by minimizing $\ell_1$-penalized log-determinant divergence. Electron J Statist, 2011, 5: 935-980 CrossRef Google Scholar
[29] Rothman A J, Bickel P J, Levina E. Sparse permutation invariant covariance estimation. Electron J Statist, 2008, 2: 494-515 CrossRef Google Scholar
[30] Taylor J, Tibshirani R. Can J Stat, 2018, 46: 41-61 CrossRef PubMed Google Scholar
[31] Uhler C, Raskutti G, Bühlmann P. Geometry of the faithfulness assumption in causal inference. Ann Statist, 2013, 41: 436-463 CrossRef Google Scholar
[32] van de Geer S, Bühlmann P, Ritov Y. On asymptotically optimal confidence regions and tests for high-dimensional models. Ann Statist, 2014, 42: 1166-1202 CrossRef Google Scholar
[33] van de Vijver M-J, He Y-D, van't Veer L-J, et al. A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med, 2002, 37: 1999--2009. Google Scholar
[34] Wang H-C, Chiu C-F, Tsai R-Y, et al. Association of genetic polymorphisms of EXO1 gene with risk of breast cancer in taiwan. Anticancer Res, 2009, 29: 3897--3901. Google Scholar
[35] West M, Blanchette C, Dressman H. Predicting the clinical status of human breast cancer by using gene expression profiles. Proc Natl Acad Sci USA, 2001, 98: 11462-11467 CrossRef PubMed ADS Google Scholar
[36] Yuan M. Efficient computation of $\ell_1$ regularized estimates in Gaussian graphical models. J Comput Graphical Stat, 2008, 17: 809-826 CrossRef Google Scholar
[37] Yuan M, Lin Y. Model selection and estimation in the Gaussian graphical model. Biometrika, 2007, 94: 19-35 CrossRef Google Scholar
[38] Zhang C H, Zhang S S. Confidence intervals for low dimensional parameters in high dimensional linear models. J R Stat Soc B, 2014, 76: 217-242 CrossRef Google Scholar
[39] Zhou S-H, Rütimann P, Xu M, et al. High-dimensional covariance estimation based on Gaussian graphical models. J Mach Learn Res, 2011, 12: 2975--3026. Google Scholar
Figure 3
(Color online) The inferred graph from the breast cancer microarray data set by the proposed method. The yellow nodes correspond to the seven genes that have the highest estimated degrees
$n=100$ | $p=50$ | ||||
Two-step method | Adaptive GLASSO | Gelato | SCAD | ||
Model 1 | 0.16 (0.06) | 0.17 (0.07) | 0.17 (0.09) | 0.38 (0.05) | |
Model 2 | 2.23 (0.12) | 2.32 (0.16) | 2.91 (0.11) | 2.29 (0.07) | |
Model 3a | 1.41 (0.14) | 2.63 (0.11) | 1.68 (0.31) | 2.16 (0.12) | |
Model 3b | 2.10 (0.14) | 2.79 (0.11) | 2.65 (0.41) | 2.54 (0.09) | |
$n=50$ | $p=100$ | ||||
Two-step method | Adaptive GLASSO | Gelato | SCAD | ||
Model 1 | 0.19 (0.07) | 0.26 (0.13) | 0.24 (0.14) | 0.82 (0.02) | |
Model 2 | 4.36 (0.16) | 4.61 (0.07) | 4.86 (0.04) | 5.81 (0.02) | |
Model 3a | 4.08 (0.25) | 4.43 (0.25) | 4.22 (0.36) | 5.65 (0.35) | |
Model 3b | 5.69 (0.33) | 6.19 (0.17) | 5.86 (0.88) | 6.98 (0.15) | |
$n=200$ | $p=400$ | ||||
Two-step method | Adaptive GLASSO | Gelato | SCAD | ||
Model 1 | 0.15 (0.02) | 0.82 (0.03) | 0.61 (0.02) | 0.60 (0.02) | |
Model 2 | 4.25 (0.07) | 5.19 (0.08) | 4.83 (0.06) | 5.23 (0.05) | |
Model 3a | 6.55 (0.05) | 6.70 (0.09) | 5.41 (0.08) | 6.56 (0.06) | |
Model 3b | 7.16 (0.19) | $10.09~(0.04)~\,$ | 9.17 (0.05) | 9.83 (0.05) | |
$n=400$ | $p=800$ | ||||
Two-step method | Adaptive GLASSO | Gelato | SCAD | ||
Model 1 | 0.08 (0.04) | 0.78 (0.02) | 0.25 (0.02) | 1.03 (0.01) | |
Model 2 | 4.07 (0.04) | 6.40 (0.03) | 4.13 (0.02) | 6.45 (0.04) | |
Model 3a | 9.28 (0.20) | $14.77~(0.36)~\,$ | $15.99~(0.12)~\,$ | 9.93 (0.08) | |
Model 3b | $10.51~(0.04)~\,$ | $16.57~(0.17)~\,$ | $18.21~(0.10)~\,$ | $14.39~(0.03)~\,$ |