Define structured JSON schemas for each neuroscience experiment type that SciDEX extracts
from papers. Each schema captures the minimum fields needed to represent the experiment's
design, results, and statistical evidence in a machine-readable, comparable format.
The schemas must be:
genetic_association (GWAS, candidate gene, Mendelian)protein_interaction (co-IP, mass spec, proximity labeling)gene_expression (RNA-seq, qPCR, microarray, scRNA-seq)animal_model (transgenic mice, behavioral, histology)cell_biology (cell culture, organoids, iPSC)clinical (biomarkers, imaging, cognitive, drug trials)neuropathology (histology, immunostaining, EM)
schemas/experiments/ directoryvalidate_experiment_metadata(metadata, experiment_type) in artifact_registry.pyvalidate_experiment_metadata() functionatl-ex-02-PIPE — Extraction pipeline uses these schemas as output targetsatl-ex-07-BKFL — Backfill uses schemas to structure existing experiment artifactsq-schema-governance — Schemas registered in governance system once builtEXPERIMENT_TYPES, CONTROLLED_VOCABULARIES constants and validate_experiment_metadata() function in artifact_registry.py. Added EXPERIMENT_TYPES (7 experiment types), CONTROLLED_VOCABULARIES (model_system, species, effect_direction, replication_status, tissue, brain_region), and validate_experiment_metadata(metadata, experiment_type) validation function with type-specific checks. Validation tested and working. Rebased onto latest origin/main. [task:atl-ex-01-SCHM]{
"requirements": {
"analysis": 5
},
"completion_shas": [
"f949cb3865b3261f5644d891652a3307ec44dd49",
"e8278bd573fffd4286a5d02e6a36a6d467d128c0"
],
"completion_shas_checked_at": "2026-04-14T03:52:05.754968+00:00"
}