kenneth.computer

Tag: petri

3 items with this tag.

  • Feb 15, 2026

    Multi-judge behavioral evaluation of GLM-5

    • multi-judge
    • llm-as-judge
    • judge-reliability
    • behavioral-evals
    • petri
    • alignment
    • glm-5
  • Feb 15, 2026

    Updating on automated behavioral evals

    • behavioral-evals
    • petri
    • alignment
  • Jan 28, 2026

    Initial Petri Testing

    • petri
    • behavioral-evals
    • experiment

Created with Quartz v4.5.2 | Kenneth Francis Cavanagh © 2026