Build a pipeline that extracts 3-5 specific mechanistic claims from each of the top 50 hypotheses (ranked by composite score), searches PubMed for supporting and contradicting evidence, and updates evidence_validation_score plus the evidence_for/evidence_against JSON arrays with verified citations.
WHY: Hypothesis scores are currently based on debate outcomes, not systematic PubMed grounding. Claims like "TDP-43 nuclear depletion leads to STMN2 mis-splicing" may or may not have direct PubMed support. A claim verifier adds the quality layer that makes the scores meaningful, and it flags hypotheses that need experimental verification.
WHAT TO DO:
1. Query top 50 hypotheses ordered by composite_score DESC.
2. For each hypothesis, extract 3-5 specific mechanistic claims using llm.py.
3. For each claim: search PubMed via paper_cache.search_papers() (last 10 years, up to 5 papers).
4. Classify each paper as: 'supports', 'contradicts', 'partial', or 'unrelated' using LLM.
5. Update hypothesis: append verified citations to evidence_for (supports) and evidence_against (contradicts). Update evidence_validation_score = (claims with ≥1 support) / total claims. Update citations_count.
6. Skip hypotheses whose evidence_validation_score is no longer the default of 0.5 (i.e., already processed).
7. Log summary: hypothesis_id, claims checked, support rate.
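The steps above can be sketched as follows. `extract_claims`, `search_papers`, and `classify_paper` are hypothetical stand-ins for the project's llm.py and paper_cache.search_papers() calls (their real signatures should be confirmed against the spec); the scoring and append-only update logic is what steps 3-5 specify.

```python
import json

def extract_claims(hypothesis_text):
    # Hypothetical stand-in for the llm.py call that pulls 3-5
    # mechanistic claims; sentence splitting is only a placeholder.
    return [s.strip() for s in hypothesis_text.split(".") if s.strip()][:5]

def verify_hypothesis(hypothesis, search_papers, classify_paper):
    """Evaluate one hypothesis; returns append-only updates to its evidence fields."""
    claims = extract_claims(hypothesis["text"])
    evidence_for = json.loads(hypothesis.get("evidence_for") or "[]")
    evidence_against = json.loads(hypothesis.get("evidence_against") or "[]")
    supported_claims = 0
    for claim in claims:
        # Step 3: last 10 years, up to 5 papers per claim (assumed kwargs).
        papers = search_papers(claim, years=10, limit=5)
        # Step 4: one label per paper: supports/contradicts/partial/unrelated.
        labels = {p["pmid"]: classify_paper(claim, p) for p in papers}
        if any(label == "supports" for label in labels.values()):
            supported_claims += 1
        # Step 5: append only; never drop or overwrite existing citations.
        evidence_for += [pmid for pmid, lab in labels.items()
                         if lab == "supports" and pmid not in evidence_for]
        evidence_against += [pmid for pmid, lab in labels.items()
                             if lab == "contradicts" and pmid not in evidence_against]
    return {
        "evidence_for": json.dumps(evidence_for),
        "evidence_against": json.dumps(evidence_against),
        "evidence_validation_score": supported_claims / len(claims) if claims else 0.0,
        "citations_count": len(evidence_for) + len(evidence_against),
    }
```

The returned dict maps directly onto the hypothesis columns updated in step 5; 'partial' and 'unrelated' papers are deliberately discarded rather than stored.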
READ FIRST: docs/planning/specs/agora_mechanistic_claim_verifier_spec.md (merged after 2026-04-28)
DO NOT: overwrite existing evidence (append only); add unrelated papers to the evidence arrays; modify composite_score, novelty_score, or any score other than evidence_validation_score; write to scidex.db.
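The append-only constraint can be isolated in one helper, assuming evidence arrays are stored as JSON text (`append_pmids` is a hypothetical name, not an existing project function):

```python
import json

def append_pmids(existing_json, new_pmids):
    """Append-only merge for a JSON-encoded evidence array: every
    existing citation is kept, and only PMIDs not already present
    are added. Nothing is ever removed or overwritten."""
    existing = json.loads(existing_json or "[]")
    merged = existing + [p for p in new_pmids if p not in existing]
    return json.dumps(merged)
```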
SUCCESS: 50 hypotheses processed (or the iteration limit reached), each with ≥3 claims evaluated, evidence arrays updated with verified PMIDs, and evidence_validation_score updated.