kenneth.computer

Tag: evaluation

1 item with this tag.

  • Jan 03, 2026

    LLM-as-judge for behavior evaluation

    • machine-psychology
    • evaluation
    • methodology
    • literature-review
    • multi-judge
    • judge-reliability

Kenneth Francis Cavanagh © 2026