kenneth.computer

Tag: multi-judge

2 items with this tag.

  • Feb 15, 2026

    Multi-judge behavioral evaluation of GLM-5

    • multi-judge
    • llm-as-judge
    • judge-reliability
    • behavioral-evals
    • petri
    • alignment
    • glm-5
  • Jan 03, 2026

    LLM-as-judge for behavior evaluation

    • machine-psychology
    • evaluation
    • methodology
    • literature-review
    • multi-judge
    • judge-reliability

Created with Quartz v4.5.2 | Kenneth Francis Cavanagh © 2026