% Bibliography for hypothesis h-13dc63ff74 % Title: whether debate-structured causal reasoning improves calibration over direct LLM baselines requires proximal validation % Generated: 2026-04-28T16:23:36Z