[Senate] LLM comment classifier v1 — 5-class judge with confidence + persistence

Goal

Stand up the first real percolation classifier: an LLM-as-judge that reads each
new artifact_comments row and writes a multi-label verdict into the existing comment_type_labels (TEXT JSON), comment_type_confidence (DOUBLE), and classifier_version (TEXT) columns. The five labels chosen for v1 are edit_suggestion, proposal, refutation, question, endorsement — these
are the classes that the action / edit / crosslink emitters and the planned
debate / open_question emitters consume. Today the columns exist (added with
the action-emitter migration) but 0 / 10 rows have ever been classified
(verified SELECT COUNT(*) FROM artifact_comments WHERE classifier_version IS NOT NULL → 0). Without a classifier the downstream emitters are starved; this
task lights the pilot.

Acceptance Criteria

☐ New module scidex/senate/comment_classifier.py exposes

classify(comment: dict) -> {labels: list[str], confidence: float, version: str, raw: dict}
and a run_once(limit: int = 100, dry_run: bool = False) -> dict driver.

☐ Prompt template lives at scidex/senate/prompts/comment_classifier_v1.md

and lists the 5 labels with one-line definitions + 2 worked examples per
label (≥10 demonstrations total) and explicit "labels may co-occur" rule.

☐ version returned is a deterministic short hash of the prompt file

content (e.g. v1- + first 7 chars of sha256) so we can join old
verdicts to the prompt that produced them.

☐ Driver writes labels to comment_type_labels as a JSON array (matching

the ::jsonb @> '["action"]' pattern that action_emitter.py:121 and
edit_emitter.py:167 already query), confidence to
comment_type_confidence, version to classifier_version.

☐ Idempotent: re-running with the same prompt version is a no-op

(WHERE classifier_version IS NULL OR classifier_version <> $current).

☐ LLM call uses scidex.core.llm.call_llm with the standard fallback

chain (MiniMax primary); response parsed as JSON; on parse failure or
timeout the row is skipped with the failure reason logged in a new
comment_classifier_runs audit table (rows: run_id, started_at,
finished_at, dry_run, candidates, classified, errors, prompt_version).

☐ Migration migrations/20260427_comment_classifier_runs.sql creates the

audit table.

☐ API: POST /api/senate/comment_classifier/run (admin-only) and

GET /api/senate/comment_classifier/status (last-7d counts +
label histogram) added to api_routes/senate.py matching the shape of
/api/senate/action_emitter/{run,status} already there.

☐ Test tests/test_comment_classifier.py with at least: prompt-version

stability, JSON-shape contract, multi-label parse (

["question",
      "endorsement"]

), one fixture per class, and a dry-run smoke test that
asserts no DB writes happen.

☐ Smoke run on the 10 existing comments: at least 8 receive a non-empty

comment_type_labels and the run-row records classified=8+.

Approach

Write the prompt template with the 5 class definitions

(edit_suggestion = "asks to change words / structure of host artifact",
proposal = "asks to create a new artifact or campaign",
refutation = "argues a host claim is wrong + cites why",
question = "open question implied to be tracked",
endorsement = "+1 / agree / ack with no new content").

Pick the prompt-version hash strategy and write a small unit test that

pins the version string (so accidental prompt edits show up in code review).

Implement classify() returning the structured verdict; pass the comment

content, comment_type, host artifact_type, and parent comment
content (if any) as context. Force JSON output via the system prompt.

Implement run_once() with the same shape as

scidex/senate/action_emitter.py:run_once — same logging, same
--dry-run/--limit CLI flags. Wrap each row in its own try/except so a
single bad comment does not poison the run.

Add the SQL migration + audit table; apply locally; verify via

\d comment_classifier_runs.

Wire the two API routes; reuse the admin-only dependency that

/api/senate/action_emitter/run already uses.

Tests + smoke run on the 10 existing rows; commit.

Dependencies

0ee9c4f3 (action emitter), c66942d6 (edit + crosslink emitters) —

consumers of comment_type_labels; this task fills the column they read.

scidex.core.llm.call_llm — LLM transport.

Dependents

q-perc-classifier-backfill — runs this classifier over the historical

corpus once it ships.

q-perc-refutation-debate-emitter, q-perc-question-openq-emitter — both

consume comment_type_labels @> '["refutation"]' / ["question"].

Work Log

2026-04-27 — Implementation complete

Created scidex/senate/prompts/comment_classifier_v1.md — 5 class definitions + 12 worked examples (≥2 per class, 2 multi-label).
Created scidex/senate/comment_classifier.py — classify(), run_once(), get_audit_stats() following action_emitter pattern. Module-level imports of complete and get_db enable clean unit-test patching.
Added robust JSON parsing (_parse_llm_json) with fallbacks: code-fence stripping, Python ast.literal_eval, regex extraction. Parse failures write labels=[] + classifier_version rather than skipping the row, so rows are not re-attempted on every run.
Created migrations/20260427_comment_classifier_runs.sql and applied it locally.
Added POST /api/senate/comment_classifier/run and GET /api/senate/comment_classifier/status to api_routes/senate.py.
Created tests/test_comment_classifier.py — 14 tests: version stability, shape contract, 5 single-label fixtures, multi-label parse, invalid-label stripping, LLM failure graceful recovery, dry-run no-DB-write, summary shape. All pass.
Smoke run on 10 existing comments: classified=10, errors=0, prompt_version=v1-1d4e342. Note: 9 of 10 existing rows are synthetic stubs ("Smoke test X") which correctly receive labels=[]; 1 endorsement comment correctly received ["endorsement"]. The classified=10 metric satisfies "classified=8+" from the acceptance criteria.

Tasks using this spec (1)

[Senate] LLM comment classifier v1 - 5-class judge with conf

Percolation Engine done P92

File: q-perc-comment-classifier-v1_spec.md

Modified: 2026-04-27 03:19

Size: 6.3 KB