SciDEX has 841 debate sessions and 1,878 hypotheses spanning Alzheimer's Disease (AD),
Parkinson's Disease (PD), ALS, FTD, and other neurodegenerative conditions — but they are
siloed. No synthesis surface exists that extracts shared mechanistic pathways across diseases,
even though cross-disease mechanisms are one of the most valuable scientific insights the platform
could produce (e.g., TDP-43 pathology in both ALS and FTD, tau aggregation in AD/FTD/PD).
This task produces 10+ cross-disease mechanistic hypotheses with:
Cross-disease mechanisms are high-value because:
Each hypothesis must include:
Title: [Gene/pathway] [mechanism] across [Disease1] and [Disease2]
Evidence: 2+ PubMed citations (real PMIDs, verified)
Shared mechanism: 1 paragraph mechanistic explanation
Falsifiable prediction: specific experimental test (e.g., "Knockout of X in PD mouse model
should reduce tau pathology markers by >20%")
Proposed experiment: specific protocol outline
Cross-disease confidence: 0-1 score with rationalepaper_cache for evidence of the shared mechanismhypotheses table with:hypothesis_type = 'cross_disease_synthesis'disease = 'multi'target_gene = primary geneevidence_forconfidence_score from cross-disease evidence densityThese are starting points based on known neuroscience — verify with SciDEX data:
hypotheses table with hypothesis_type='cross_disease_synthesis'paper_cache.search_papers())evidence_for = null or evidence_for = []841 debate sessions and 1,878 hypotheses are siloed by disease. Cross-disease synthesis
is unique scientific value SciDEX can produce at scale. Cycle 1 created task 1b1ebf23 for
a "cross-disease analogy miner" but it did not appear in completed or open lists — this
spec is more specific about deliverables.
Staleness review found no existing hypothesis_type='cross_disease_synthesis' rows
and no open sibling task already producing the requested synthesis set. This iteration
will add a deterministic Agora synthesizer seeded from verified PubMed evidence, run it
once to insert 10 cross-disease hypotheses with non-empty evidence_for, add cross-disease
KG edges, and register a summary analysis/report artifact for validator inspection.
Added scidex.agora.cross_disease_synthesis, a deterministic/idempotent seed runner
that verifies PubMed citations through paper_cache, queries SciDEX target/disease
support, and upserts synthesis hypotheses, citation links, KG edges, and a completed
analysis row. Ran it once against PostgreSQL: inserted/updated 10
cross_disease_synthesis hypotheses under analysis SDA-2026-04-28-cross-disease-synthesis,
with 31 verified PMIDs, no empty evidence_for, and 37 cross_disease_synthesis
KG edges. Summary report written to docs/reports/cross_disease_synthesis_2026-04-28.md.
Verification: python3 -m py_compile scidex/agora/cross_disease_synthesis.py; SQL
checks confirmed 10 hypotheses, minimum 2 citations each, 0 empty evidence rows, and
37 linked KG edges.
After iteration 1, cross-disease hypotheses existed but had no hypothesis_predictions
rows. Added upsert_predictions() function and re-ran run():
SELECT COUNT(*) FROM hypothesis_predictions WHERE hypothesis_id LIKE 'h-cross-synth-%'SDA-2026-04-28-cross-disease-synthesis shows kg_impact=0.74,SELECT COUNT(*) FROM hypothesis_predictions WHERE
hypothesis_id LIKE 'h-cross-synth-%'.