Document Index - Technical White Paper 44 - EFT.WP.Data.ModelCards v1.0

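Chapter 12 Calibration and Uncertainty


I. Purpose and Scope of This Chapter

This chapter fixes, for the calibration and uncertainty sections of the model card, the normative definitions, the evaluation and reporting conventions, the coverage intervals and significance criteria, and the rules for correlation handling and combination. It ensures consistency with "Evaluation Protocol and Metrics", "Objective Functions, Optimization, and Hyperparameters", "Preprocessing and Feature Engineering", "Training Data and Sampling Binding", and the metrology chapter.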

II. Terms and Dependencies


III. Fields and Structure (Normative)

calibration:
  method: "<temperature|vector_scale|histogram_binning|isotonic|bayesian|custom>"
  params: {t: 1.7?}                  # example: temperature scaling
  eval:                              # evaluation of calibration quality
    report: ["ece", "brier", "calibration_curve"]
    ece_bins: 15
    significance: {test: "bootstrap", alpha: 0.05}
  coverage:                          # coverage / tolerance interval
    target_p: 0.95
    method: "<tolerance|bayes>"
    interval: "<two-sided|one-sided>"
  notes?: "<non-normative>"
uncertainty:
  model: "<GUM|linear|montecarlo|bayesian>"
  components:                        # components (systematic / random)
    - {name: "thermal", type: "random", value: 2.1, unit: "K", distribution: "normal", coverage: {k: 1.0}}
    - {name: "cal_gain", type: "systematic", value: 0.8, unit: "%", distribution: "normal", coverage: {k: 2.0}, corr_group: "instrument"}
  correlation:                       # correlation convention
    posture: "<groups|covariance>"
    groups: [{name: "instrument", pairwise: "rho=0.6"}]
    covariance?: {Sigma: []}
  propagation:                       # propagation and combination
    rule: "<rss|linear|montecarlo|bayesian>"
    linearization?: "first-order"
    samples?: 0
  coverage_policy:
    target_p: 0.95
    k: 2.0
  report:
    significant_figures: 3
    unit_consistency: true
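As an illustration of the calibration fields above (method "temperature", report entries "ece" and "brier", ece_bins: 15), the following is a minimal sketch, not a normative implementation. It assumes equal-width confidence bins for ECE and a binary form of the Brier score; this chapter does not prescribe either choice, and the function names are hypothetical.

```python
import math

def softmax(logits, t=1.0):
    """Temperature-scaled softmax: t > 1 softens, t < 1 sharpens the distribution."""
    scaled = [z / t for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def expected_calibration_error(confidences, correct, n_bins=15):
    """ECE with equal-width bins on [0, 1] (an assumed binning scheme):
    sum over bins of (n_b / N) * |accuracy_b - mean_confidence_b|."""
    n = len(confidences)
    bins = [[] for _ in range(n_bins)]
    for c, ok in zip(confidences, correct):
        idx = min(int(c * n_bins), n_bins - 1)  # c == 1.0 falls in the last bin
        bins[idx].append((c, ok))
    ece = 0.0
    for b in bins:
        if not b:
            continue
        mean_conf = sum(c for c, _ in b) / len(b)
        acc = sum(1 for _, ok in b if ok) / len(b)
        ece += (len(b) / n) * abs(acc - mean_conf)
    return ece

def brier_score(probs, labels):
    """Binary Brier score: mean squared error between probability and 0/1 outcome."""
    return sum((p - y) ** 2 for p, y in zip(probs, labels)) / len(probs)

# Temperature t = 1.7 (the example value from the schema) lowers the top confidence.
p_raw = softmax([2.0, 0.0])
p_cal = softmax([2.0, 0.0], t=1.7)

# Four predictions at confidence 0.9, three of them correct.
ece = expected_calibration_error([0.9, 0.9, 0.9, 0.9], [True, True, True, False])
```

In this toy case all predictions share one bin, so the ECE reduces to the gap between the mean confidence (0.9) and the accuracy (0.75).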


IV. Calibration Methods and Evaluation


V. Uncertainty Modeling and Propagation

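  1. Component classification: list random (Type A) and systematic (Type B) components separately; for each component, record name/type/value/unit/distribution/coverage/method.
  2. Propagation rules
    • rss: combination of independent standard uncertainties, u_c = ( sqrt( Σ_i u_i^2 ) );
    • linear: first-order Taylor expansion, u_c = ( sqrt( J Σ J^T ) ), with J = ( ∂f / ∂x ) and Σ here denoting the input covariance matrix (not a summation);
    • montecarlo|bayesian: state the sample count or the prior/likelihood, and report the coverage interval together with target_p.
  3. Expanded uncertainty: U = ( k * u_c ); under a normal approximation, k≈2 corresponds to roughly 95% coverage.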
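The propagation rules and the expanded uncertainty listed above can be sketched as follows. This is an illustrative example with made-up input values (standard uncertainties 3.0 and 4.0, unit sensitivities, rho = 0.6); the helper names are hypothetical and the model output is assumed to be a scalar.

```python
import math
from statistics import NormalDist

def combine_rss(u):
    """rss rule: u_c = sqrt(sum_i u_i^2), valid for independent components."""
    return math.sqrt(sum(x * x for x in u))

def combine_linear(jac, cov):
    """linear rule for a scalar output: u_c = sqrt(J * Sigma * J^T).
    jac is the row of sensitivities df/dx_i, cov the input covariance matrix."""
    n = len(jac)
    var = sum(jac[i] * cov[i][j] * jac[j] for i in range(n) for j in range(n))
    return math.sqrt(var)

def coverage_factor(target_p):
    """Two-sided coverage factor k under a normal approximation."""
    return NormalDist().inv_cdf(0.5 + target_p / 2.0)

# Hypothetical inputs: two standard uncertainties in the same unit.
u = [3.0, 4.0]
rho = 0.6
cov = [
    [u[0] ** 2, rho * u[0] * u[1]],
    [rho * u[0] * u[1], u[1] ** 2],
]

u_c_independent = combine_rss(u)                  # sqrt(9 + 16)
u_c_correlated = combine_linear([1.0, 1.0], cov)  # sqrt(9 + 16 + 2 * 7.2)

k = coverage_factor(0.95)        # close to 1.96, which is why k = 2 is a common rounding
U = 2.0 * u_c_independent        # expanded uncertainty with k = 2
```

With zero correlation and unit sensitivities the linear rule reduces to the rss rule; a positive correlation increases the combined uncertainty.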

VI. Correlation Handling


VII. Metrology and Units


VIII. Conventions for Path-Dependent Quantities (e.g., T_arr)

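  1. Two equivalent forms
    • T_arr = ( 1 / c_ref ) * ( ∫ n_eff d ell )
    • T_arr = ( ∫ ( n_eff / c_ref ) d ell )
  2. Recording requirements: record delta_form, path="gamma(ell)", and measure="d ell" in the model card; include the uncertainties of sources such as n_eff and c_ref in the propagation expression, and pass check_dim.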
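The equivalence of the two forms of T_arr can be checked numerically. The sketch below is illustrative only: the sampled path, the n_eff profile, and the value chosen for c_ref are hypothetical, and the two forms agree here because c_ref is treated as constant along the path.

```python
import math

def trapezoid(ys, xs):
    """Composite trapezoidal rule for samples ys taken at points xs."""
    return sum(
        0.5 * (ys[i] + ys[i + 1]) * (xs[i + 1] - xs[i])
        for i in range(len(xs) - 1)
    )

C_REF = 299_792_458.0  # assumed constant reference speed in m/s (hypothetical choice)

# Hypothetical path gamma(ell): 101 samples of the path parameter from 0 to 1000 m.
ell = [i * 10.0 for i in range(101)]
# Hypothetical effective-index profile along the path.
n_eff = [1.0 + 1e-4 * math.sin(x / 100.0) for x in ell]

# Form 1: T_arr = (1 / c_ref) * integral of n_eff d ell
t_form_1 = trapezoid(n_eff, ell) / C_REF
# Form 2: T_arr = integral of (n_eff / c_ref) d ell
t_form_2 = trapezoid([n / C_REF for n in n_eff], ell)
```

The two results differ only by floating-point rounding. If c_ref varied along the path, only the second form would remain well defined, which is one reason to record the chosen form explicitly.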

IX. Machine-Readable Fragment (Directly Embeddable)

calibration:
  method: "temperature"
  params: {t: 1.7}
  eval: {report: ["ece", "brier", "calibration_curve"], ece_bins: 15, significance: {test: "bootstrap", alpha: 0.05}}
  coverage: {target_p: 0.95, method: "tolerance", interval: "two-sided"}
uncertainty:
  model: "linear"
  components:
    - {name: "thermal", type: "random", value: 2.1, unit: "K", distribution: "normal", coverage: {k: 1.0}}
    - {name: "cal_gain", type: "systematic", value: 0.8, unit: "%", distribution: "normal", coverage: {k: 2.0}, corr_group: "instrument"}
  correlation: {posture: "groups", groups: [{name: "instrument", pairwise: "rho=0.6"}]}
  propagation: {rule: "linear", linearization: "first-order"}
  coverage_policy: {target_p: 0.95, k: 2.0}
  report: {significant_figures: 3, unit_consistency: true}
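A fragment like the one above lends itself to simple consistency checks once loaded into a dictionary. The sketch below is a hypothetical validator, not part of this specification. It assumes that each component's value is stated at its own coverage factor k (so the standard uncertainty is value / k), and that coverage_policy.k should lie within a tolerance of the normal-approximation factor for target_p; both are assumptions made for illustration.

```python
from statistics import NormalDist

# The uncertainty part of the fragment, transcribed as a Python dict.
card = {
    "uncertainty": {
        "components": [
            {"name": "thermal", "type": "random", "value": 2.1, "unit": "K",
             "coverage": {"k": 1.0}},
            {"name": "cal_gain", "type": "systematic", "value": 0.8, "unit": "%",
             "coverage": {"k": 2.0}, "corr_group": "instrument"},
        ],
        "correlation": {"posture": "groups",
                        "groups": [{"name": "instrument", "pairwise": "rho=0.6"}]},
        "coverage_policy": {"target_p": 0.95, "k": 2.0},
    }
}

def check_card(card, k_tol=0.1):
    """Return a list of human-readable problems (empty if none were found)."""
    problems = []
    unc = card["uncertainty"]
    known_groups = {g["name"] for g in unc["correlation"].get("groups", [])}
    for comp in unc["components"]:
        grp = comp.get("corr_group")
        if grp is not None and grp not in known_groups:
            problems.append(f"{comp['name']}: unknown corr_group {grp!r}")
    policy = unc["coverage_policy"]
    k_normal = NormalDist().inv_cdf(0.5 + policy["target_p"] / 2.0)
    if abs(policy["k"] - k_normal) > k_tol:  # tolerance is an assumed threshold
        problems.append(f"k={policy['k']} is far from normal factor {k_normal:.3f}")
    return problems

def standard_uncertainties(card):
    """Assumption: value is quoted at coverage factor k, so u = value / k."""
    return {c["name"]: c["value"] / c["coverage"]["k"]
            for c in card["uncertainty"]["components"]}

problems = check_card(card)
u_std = standard_uncertainties(card)
```

Note that the two components carry different units (K and %), so they cannot be combined directly; sensitivity coefficients and a unit check would be needed first.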


X. Consistency with the Evaluation Protocol, Objective Function, and Resources


XI. Export Manifest and Audit Trail

export_manifest:
  artifacts:
    - {path: "calibration/report.md", sha256: "..."}
    - {path: "calibration/curve.png", sha256: "..."}
    - {path: "uncertainty/breakdown.csv", sha256: "..."}
    - {path: "uncertainty/covariance.npy", sha256: "..."}
  references:
    - "EFT.WP.Core.DataSpec v1.0:EXPORT"
    - "EFT.WP.Core.Metrology v1.0:check_dim"

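Artifacts related to calibration and uncertainty must be verifiable and consistent with the model card fields; references carry the form "volume name vX.Y:anchor".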
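One way to make the artifacts verifiable is to recompute each file's SHA-256 digest and compare it with the manifest entry. The following is a minimal sketch using Python's standard hashlib; the function names and the layout of the artifacts list (dictionaries with path and sha256 keys, resolved against a root directory) are assumptions that mirror the manifest above.

```python
import hashlib
from pathlib import Path

def sha256_of(path, chunk_size=1 << 20):
    """Hex SHA-256 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def verify_manifest(artifacts, root="."):
    """Return the paths that are missing or whose digest does not match."""
    failures = []
    for item in artifacts:
        target = Path(root) / item["path"]
        if not target.is_file() or sha256_of(target) != item["sha256"]:
            failures.append(item["path"])
    return failures
```

An empty return value means every listed artifact exists and matches its recorded digest; any returned path should be treated as an audit failure.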

XII. Chapter Compliance Self-Check


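Copyright and License (CC BY 4.0)

Copyright notice: Unless otherwise stated, the copyright in "Energy Filament Theory" (including text, tables, illustrations, symbols, and formulas) belongs to the author, Mr. 屠广林.
License: This work is licensed under the Creative Commons Attribution 4.0 International license (CC BY 4.0). Provided that the author and source are credited, it may be copied, reposted, excerpted, adapted, and redistributed for commercial or non-commercial purposes.
Suggested attribution: Author: "屠广林"; Work: "Energy Filament Theory"; Source: energyfilament.org; License: CC BY 4.0.

First published: 2025-11-11 | Current version: v5.1
License link: https://creativecommons.org/licenses/by/4.0/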