Completion criteria are empty JSON with no defined thresholds, making it impossible to verify whether 11 or any specific number of hypotheses have been adequately enriched with PubMed evidence
Attach real PubMed-backed evidence to hypotheses whose evidence_for field is empty. This improves scientific grounding and prevents debate, ranking, and market workflows from relying on unsupported claims.
evidence_for entriesevidence_for, prioritizing active and high-impact rows.paper_cache.search_papers or paper_cache.get_paper to find relevant PubMed evidence.c488a683-47f - Agora questpaper_cache PubMed lookup helpers030034d6-752e-4ac9-9935-36489c7ec792.evidence_for, but all 43 are archived placeholder rows titled [Archived Hypothesis]; there are 0 active/non-placeholder hypotheses requiring PubMed evidence.scripts/add_pubmed_evidence.py --dry-run --limit 5 and confirmed it would query the literal placeholder title and attach the same unrelated PMIDs to archived rows.scripts/add_pubmed_evidence.py so dry runs and live runs ignore archived placeholders and write structured PubMed evidence objects instead of bare PMID arrays.scripts/backfill_evidence_pubmed.py for the existing unit tests and PostgreSQL-aware backfill helper behavior.python3 scripts/add_pubmed_evidence.py --dry-run --limit 5 reports 0 actionable hypotheses and 43 archived placeholders ignored.quest_engine.discover_gaps(get_db()) no longer emits the hypothesis-pubmed-evidence gap for the archived placeholder backlog.pytest -q tests/test_backfill_evidence_pubmed.py -> 18 passed; python3 -m py_compile scripts/add_pubmed_evidence.py scripts/backfill_evidence_pubmed.py quest_engine.py passed.python3 scripts/add_pubmed_evidence.py --limit 5 shows 0 actionable hypotheses needing evidence.[Archived Hypothesis] placeholders (archived status) — not valid enrichment targets.h-a2b3485737 (CAPN1/CAPN2, score=0.4199, status=proposed): 5 PubMed PMIDs attached with structured evidence ({pmid, doi, claim, source, year, url, strength, caveat}).SELECT COUNT(*)).5eb210854 merged to main; top 25 hypotheses by composite_score all have non-empty evidence_for (verified via DB query). Sample PMIDs verified via paper_cache.get_paper(): 41491101, 41530860, 41714746, 41804841 — all return real papers. All 43 empty-evidence rows are [Archived Hypothesis] placeholders.d02ec580-83c8-4bc0-8495-17a069138c6aevidence_for; 43 archived placeholders ignored.scripts/add_pubmed_evidence.py --limit 20.evidence_for = 0; with evidence populated = 1144.scripts/add_pubmed_evidence.py with --thin-evidence N flag to also enrich hypotheses with fewer than N evidence entries (merges without overwriting existing entries).evidence_for = [] (empty JSON array).scripts/add_pubmed_evidence.py --limit 10 --pmids-per 5 — all 4 enriched.SELECT COUNT(*) WHERE jsonb_array_length(evidence_for)=1 → 52; WHERE evidence_for IS NULL OR =[] → 0.scripts/add_pubmed_evidence.py --thin-evidence 2 --limit 11 --pmids-per 5 — enriched 10/11 hypotheses (1 malformed row h-2f43b42f with title="..." had no search results).active_empty = 0.--ids targeting plus curated override support in scripts/add_pubmed_evidence.py so specific hypotheses can be rewritten when a previous generic search attached low-fit PMIDs.scripts/pubmed_evidence_overrides.json with hand-checked evidence sets for:h-31ca9240f9fc (TBK1): 40858618, 30146158, 25803835, 33031745h-f373e16bb108 (EZH2): 31202798, 32933418, 32553389, 31048495h-530326b97069 (MMP-9/TDP-43): 39067491, 30458231, 33300249, 21209826h-177d9cb05108 (CHI3L1/CHIT1): 31123140, 32762702, 30134252, 24295388, 28989002
python3 scripts/add_pubmed_evidence.py --ids ... to replace the low-quality citations in-place.citations_count bookkeeping so replacement writes now match the actual number of evidence entries instead of preserving stale higher counts."" (empty string) or [None] or malformed long-text strings where PMIDs should be.scripts/pubmed_evidence_overrides.json with 12 new curated entries (total now 16 hypotheses):h-fe1dfe730e (PINK1-PRKN mitophagy): 5 PMIDs including 36503124, 33168089, 25697963h-95a1adb645 (APOE/clusterin CSF): 5 PMIDs including 39510798, 38176942h-86d0aa1ede (rutin/tau): 34116706h-0ca9a295f6 (rutin/p62 autophagy): 4 PMIDsh-5744614d14 (HSF1/MAPT): 40769451h-56af4a2b91 (exosome/alpha-synuclein): 25425650h-8e3748fe5c (mTORC1/ULK1): 5 PMIDs including 33906557, 28686223h-ecfaa2cbb2 (parthenolide/ADORA2A): 41795299h-6ca2dbc5f0 (REST/MAPT): 5 PMIDs including 12130773, 37919281h-6f6f920e83 (heparan sulfate/SNCA): 5 PMIDs including 35790300, 32824376h-63ef3ee258 (alpha7 nAChR/amyloid): 5 PMIDs including 20164328, 25959067h-3c562f5aff (astrocytes/cholinesterase): 17640880
tests/test_add_pubmed_evidence.py validating override file structure.evidence_for; 105 with < 3 evidence entries; 1278 total with evidence.scripts/add_pubmed_evidence.py --thin-evidence 3 --limit 20 --pmids-per 5.evidence_for; thin-evidence (< 3) dropped from 105 → 89 (16 upgraded to ≥3 entries); strong-evidence (≥3) = 1189.--save-report PATH flag to scripts/add_pubmed_evidence.py to persist enrichment results as a JSON file for auditability.scripts/add_pubmed_evidence.py --thin-evidence 3 --limit 20 --save-report data/evidence_reports/pubmed_backfill_2026-04-26.json.data/evidence_reports/pubmed_backfill_2026-04-26.json (20 processed, 19 updated, 79 PMIDs total).evidence_for, 22 with 1 entry); 43 archived placeholders ignored.scripts/add_pubmed_evidence.py --thin-evidence 2 --limit 20 --pmids-per 5 (merge mode).hypothesis_papers junction table (0–2 linked papers).paper_cache.search_papers() with 2-3 targeted PubMed queries using hypothesis title + target genes as search terms.papers table (paper_id = paper-pmid-{pmid}) and linked via hypothesis_papers (evidence_direction='for', strength='medium').hypothesis_papers entries; 138 total new paper links inserted.{
"_gate_retry_count": 2,
"_gate_last_decision": "REVISE",
"_gate_last_reason": "Auto-deploy blocked: branch push failed: To https://github.com/SciDEX-AI/SciDEX.git\n ! [rejected] orchestra/task/e92be9ec-add-pubmed-evidence-to-11-hypotheses-lac -> orchestra/task/e92be9ec-add-pubmed-evidence-to-11-hypotheses-lac",
"_gate_branch": "orchestra/task/e92be9ec-add-pubmed-evidence-to-11-hypotheses-lac",
"_gate_changed_files": [
"docs/planning/specs/quest_engine_hypothesis_pubmed_evidence_spec.md",
"scripts/add_pubmed_evidence.py",
"scripts/pubmed_evidence_overrides.json",
"tests/test_add_pubmed_evidence.py"
],
"_gate_diff_stat": "...quest_engine_hypothesis_pubmed_evidence_spec.md | 34 ++\n scripts/add_pubmed_evidence.py | 127 +++--\n scripts/pubmed_evidence_overrides.json | 594 +++++++++++++++++++++\n tests/test_add_pubmed_evidence.py | 33 ++\n 4 files changed, 757 insertions(+), 31 deletions(-)",
"_gate_history": [
{
"ts": "2026-04-26 09:44:56",
"decision": "REVISE",
"reason": "Auto-deploy blocked: branch push failed: To https://github.com/SciDEX-AI/SciDEX.git\n ! [rejected] orchestra/task/e92be9ec-add-pubmed-evidence-to-11-hypotheses-lac -> orchestra/task/e92be9ec-add-pubmed-evidence-to-11-hypotheses-lac",
"instructions": "",
"judge_used": "",
"actor": "minimax:70",
"retry_count": 1
},
{
"ts": "2026-04-26 09:50:12",
"decision": "REVISE",
"reason": "Auto-deploy blocked: branch push failed: To https://github.com/SciDEX-AI/SciDEX.git\n ! [rejected] orchestra/task/e92be9ec-add-pubmed-evidence-to-11-hypotheses-lac -> orchestra/task/e92be9ec-add-pubmed-evidence-to-11-hypotheses-lac",
"instructions": "",
"judge_used": "",
"actor": "codex:52",
"retry_count": 2
}
]
}