第7章 样本量、功效与稳健选项


I. 目的与范围(Purpose & Scope)

与分层设计指引,使不确定度评估与质量门阈值(Chapter 8)对齐。稳健选项情形下的样本量(N)与统计功效(1−β)的计算与配置口径,覆盖均值差、相关系数、比例/通过率等常见指标;提供重尾/异方差/相关给出

II. 前置条件与输入(Prerequisites & Inputs)


III. 连续型终点:均值差(Difference in Means)


IV. 相关性终点:r(Phase Consistency 等)


V. 通过率/比例类指标(合格率、门限通过)


VI. 相关结构与设计效能修正(Design Effects)


VII. 稳健选项(Heavy-Tail / Heteroskedasticity / Outliers)


VIII. 自举/置换的功效评估(Non-Parametric Power)


IX. 顺序与自适应方案(Sequential / Adaptive)


X. 与质量门阈值映射(Alignment with Gates)


XI. 机读模板(Machine-Readable)


A. power_plan.yaml

version: "1.0.0"

targets:

- name: "DeltaT_arr"

type: "means_diff"

alpha: 0.01

power: 0.90

effect_size_d: 0.35

design: { paired: true, deff: 1.00 }

- name: "r_phi"

type: "correlation"

alpha: 0.01

power: 0.90

r_target: 0.60

fisher_z: true

- name: "epsilon_flux"

type: "threshold_rate"

alpha: 0.05

power: 0.80

p0: 0.15

p1: 0.07

robust:

loss: "huber"

inflate_factor: 1.20

stratification: ["batch","device","region"]

correlation:

path: { kernel: "exp", L_c_m: 25.0 }

sequential:

scheme: "obrien_fleming"

looks: 2

see:

- "EFT.WP.Core.Metrology v1.0:check_dim"

- "Data.Benchmarks v1.0:PROTO"


B. sample_size_report.md(最小要素)

# Sample Size & Power Report

- Targets: DeltaT_arr (d=0.35), r_phi (r=0.60), epsilon_flux (p0→p1).

- Assumptions: alpha=0.01/0.05, power=0.90/0.80, DEFF=1.10, L_c=25 m.

- Methods: z-approx / Fisher-z / exact proportion; robust inflate ×1.20.

- Outputs: N per group/stratum, sequential spending, bootstrap power curve.


XII. 与其它章节衔接(Cross-Refs)

第11章;结果页与评分:见第8章;质量门与停止:见第6章;传播方法:见第5章;协方差与相关结构:见第3章控制式与观测量:见

XIII. 执行勾选清单(Checklist)