Chapter 9. Bias, Ethics, and Safety


I. Purpose & Scope


II. Prerequisites & Inputs


III. Bias Identification


IV. Bias Mitigation


V. Ethics Compliance


VI. Safety & Misuse Prevention

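  1. Risk scenarios: unauthorized inference, group discrimination, malicious automation, and out-of-bounds deployment (outside the coherence window, loss of lock, or missing path).
  2. Protection strategies
    • Input guards: schema validation and unit/dimension checks (I70-dim_check); every path block must include gamma/measure/delta_form;
    • Output guards: threshold gates and conservative intervals; out-of-bounds results are downgraded or labeled [Restricted];
    • Runtime: rate limits/quotas, idempotency keys, deny/gray lists, and circuit breaking;
    • Model side: unsafe-prompt filtering and adversarial/unauthorized-use detection (e.g., confidence gates and drift alerts).
  3. Incidents and rollback: anomalies trigger rollback_fsm.yaml and are recorded in audit.jsonl.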
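The input guard, output guard, and audit step listed above might be sketched in Python as follows. This is only an illustration under stated assumptions: the function names, the audit record fields (`ts`, `event`), and the choice to treat a confidence below 0.6 as the out-of-bounds condition are hypothetical and are not defined by this chapter.

```python
import json
from datetime import datetime, timezone

# Keys every path block must carry, per the input-guard bullet above.
REQUIRED_PATH_KEYS = ("gamma", "measure", "delta_form")
ALLOWED_DELTA_FORMS = {"general", "factored"}
RESTRICTED_LABEL = "[Restricted]"


def check_path_block(path: dict) -> list[str]:
    """Return a list of problems; an empty list means the block passes."""
    problems = [f"missing {key}" for key in REQUIRED_PATH_KEYS if key not in path]
    if "delta_form" in path and path["delta_form"] not in ALLOWED_DELTA_FORMS:
        problems.append(f"unknown delta_form: {path['delta_form']!r}")
    return problems


def guard_output(value: str, confidence: float, min_conf: float = 0.6) -> str:
    """Pass the value through, or substitute the restricted label below the gate."""
    return value if confidence >= min_conf else RESTRICTED_LABEL


def audit_record(event: str, detail: dict) -> str:
    """Serialize one event as a single JSON line, suitable for appending to audit.jsonl.

    The field names here are assumptions made for illustration.
    """
    record = {"ts": datetime.now(timezone.utc).isoformat(), "event": event, **detail}
    return json.dumps(record, ensure_ascii=False)
```

A caller would typically run `check_path_block` before inference, wrap the result with `guard_output`, and append `audit_record(...)` lines whenever either guard fires.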

VII. Normative Path Forms

The body text states the path and the measure explicitly; the data side records delta_form; path arrays satisfy len(gamma_ell) = len(d_ell) = len(n_eff) ≥ 2.
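The array constraint above can be checked mechanically. The following is a minimal sketch, assuming the three arrays arrive as ordinary Python sequences; the function name `check_path_arrays` is hypothetical.

```python
def check_path_arrays(gamma_ell, d_ell, n_eff, min_len: int = 2) -> int:
    """Verify len(gamma_ell) = len(d_ell) = len(n_eff) >= min_len.

    Returns the common length, or raises ValueError describing the violation.
    """
    lengths = {
        "gamma_ell": len(gamma_ell),
        "d_ell": len(d_ell),
        "n_eff": len(n_eff),
    }
    if len(set(lengths.values())) != 1:
        raise ValueError(f"path arrays differ in length: {lengths}")
    n = lengths["gamma_ell"]
    if n < min_len:
        raise ValueError(f"path arrays need at least {min_len} samples, got {n}")
    return n
```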


VIII. Gate Mapping


IX. Machine-Readable Policies
A. safety_policy.yaml

version: "1.0.0"
inputs:
  schema_check: true
  unit_dim_check: true
  path_required: { gamma: true, measure: true, delta_form: ["general", "factored"] }
outputs:
  confidence_guard: { min_conf: 0.6 }
  restricted_label: "[Restricted]"
runtime:
  rate_limit: { rps: 100, burst: 200 }
  quota: { daily_calls: 100000 }
  circuit_breaker: { error_ratio: 0.2, window_s: 60 }
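To illustrate how the `circuit_breaker` entry could be enforced at runtime, here is a hedged sketch of a sliding-window breaker. The class name, the injectable `clock` parameter, and the decision to trip only when the error ratio strictly exceeds `error_ratio` are assumptions; the policy file does not specify these details.

```python
import time
from collections import deque


class CircuitBreaker:
    """Sliding-window breaker: opens when the recent error ratio exceeds a threshold."""

    def __init__(self, error_ratio: float = 0.2, window_s: float = 60.0,
                 clock=time.monotonic):
        self.error_ratio = error_ratio
        self.window_s = window_s
        self.clock = clock          # injectable so tests can control time
        self._events = deque()      # (timestamp, is_error) pairs, oldest first

    def _evict(self, now: float) -> None:
        # Drop events that have fallen out of the window.
        while self._events and now - self._events[0][0] > self.window_s:
            self._events.popleft()

    def record(self, is_error: bool) -> None:
        now = self.clock()
        self._events.append((now, is_error))
        self._evict(now)

    def is_open(self) -> bool:
        self._evict(self.clock())
        if not self._events:
            return False
        errors = sum(1 for _, is_error in self._events if is_error)
        return errors / len(self._events) > self.error_ratio
```

A production breaker would usually also require a minimum number of calls in the window before tripping and add a half-open probing state; both are omitted here to keep the sketch short.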


B. bias_report.md (outline)

# Bias Report

- Stratified coverage + CIs
- Measurement bias: δt_abs/Δτ_ch/σ_y(τ)/n_eff residuals
- Labeling consistency: κ/MAE/DTW
- High-risk slices & mitigation
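As one example of the labeling-consistency metrics in the outline, the sketch below computes an agreement coefficient, assuming that κ denotes Cohen's kappa for two annotators (the outline does not say which kappa variant is intended). The function name and the convention of returning 1.0 when chance agreement is already total are assumptions.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b) -> float:
    """Cohen's kappa for two annotators labeling the same items."""
    if len(labels_a) != len(labels_b) or len(labels_a) == 0:
        raise ValueError("label sequences must be non-empty and equally long")
    n = len(labels_a)
    # Observed agreement: fraction of items with identical labels.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Expected chance agreement from each annotator's label frequencies.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    p_e = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if p_e == 1.0:
        # Both annotators used a single identical label; kappa is undefined,
        # so this sketch reports full agreement by convention.
        return 1.0
    return (p_o - p_e) / (1.0 - p_e)
```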


C. ethics.md (outline)

# Ethics Statement

- Purpose limitation & consent
- Minimization & de-identification
- Governance roles & escalation
- Third-party license & redistribution terms


X. Anti-Patterns & Fixes


XI. Cross-References


XII. Checklist