FullStackDev b9745566c8 feat: custom judge criteria and multi-dimensional scoring
Add configurable judge rubrics and multi-dimensional scoring with
weighted aggregation. New config fields: judge_criteria (free text)
and judge_dimensions (list of {name, weight, description}). CLI
--judge-criteria flag provides quick overrides. The judge adapter
computes weighted aggregate scores and enriches feedback with
per-dimension breakdowns.

Co-Authored-By: Paperclip <noreply@paperclip.ing>
2026-03-29 15:40:21 +00:00

Prometheus

Prompt evolution without reference data.

Quick Start

uv sync
uv run prometheus optimize -i examples/sample_config.yaml -o result.yaml -v

Architecture

Clean hexagonal architecture with four layers:

  • Domain — entities, ports, scoring (zero external dependencies)
  • Application — use cases, bootstrap, evaluator, evolution loop
  • Infrastructure — DSPy signatures, modules, adapters, file I/O
  • CLI — Typer app with optimize command

Testing

uv run pytest
uv run ruff check .
uv run mypy src/
Description
No description provided
Readme 351 KiB
Languages
Python 99.9%
Shell 0.1%