论文标题
通过多阶段测试和无稀疏性约束的信号恢复
Signal Recovery With Multistage Tests And Without Sparsity Constraints
论文作者
论文摘要
考虑了一个信号恢复问题,其中相同的二进制测试问题在多个独立的数据流中提出。目的是确定所有信号,即替代假设正确的流,以及噪声,即零假设正确的流,如经典或普遍化的家庭误差概率的规定界限。不需要确切的信号数量是已知的确切数量,而是假定信号和噪声的上限。采用分散的公式,根据该公式,每个测试问题的样本量和决策必须仅基于相应数据流的观察值。针对此问题提出了一种新型的多阶段测试程序,并显示出具有高维渐近型最优性能。具体而言,它在所有流中达到了最佳的平均值,预期样本量,在真实信号数量中均匀数量,因为在所有顺序测试的类别中,最大可能的信号和噪声以任意速度为无限速度,并具有相同的全局误差控制。相比之下,文献中现有的多阶段测试显示仅在附加的稀疏性或对称条件下才能达到这种高维渐近性最优性。这些结果基于针对基本二进制测试问题的渐近分析,因为两个误差概率为零。对于这个问题,与文献中现有的多阶段测试不同,所提出的测试在两个假设下,在所有顺序测试的类别中都具有相同的误差控制的类别,因为两个错误概率以任意速率为零。模拟研究进一步支持这些结果,并扩展到非IID数据和复合假设的问题。
A signal recovery problem is considered, where the same binary testing problem is posed over multiple, independent data streams. The goal is to identify all signals, i.e., streams where the alternative hypothesis is correct, and noises, i.e., streams where the null hypothesis is correct, subject to prescribed bounds on the classical or generalized familywise error probabilities. It is not required that the exact number of signals be a priori known, only upper bounds on the number of signals and noises are assumed instead. A decentralized formulation is adopted, according to which the sample size and the decision for each testing problem must be based only on observations from the corresponding data stream. A novel multistage testing procedure is proposed for this problem and is shown to enjoy a high-dimensional asymptotic optimality property. Specifically, it achieves the optimal, average over all streams, expected sample size, uniformly in the true number of signals, as the maximum possible numbers of signals and noises go to infinity at arbitrary rates, in the class of all sequential tests with the same global error control. In contrast, existing multistage tests in the literature are shown to achieve this high-dimensional asymptotic optimality property only under additional sparsity or symmetry conditions. These results are based on an asymptotic analysis for the fundamental binary testing problem as the two error probabilities go to zero. For this problem, unlike existing multistage tests in the literature, the proposed test achieves the optimal expected sample size under both hypotheses, in the class of all sequential tests with the same error control, as the two error probabilities go to zero at arbitrary rates. These results are further supported by simulation studies and extended to problems with non-iid data and composite hypotheses.