AI SummaryPrismBench enables developers to create specialized LLM agents through YAML configuration for systematic evaluation of model capabilities using Monte Carlo Tree Search. Useful for ML engineers, researchers, and teams building production LLM systems who need comprehensive benchmarking and evaluation frameworks.
Description
PrismBench: A comprehensive framework for evaluating Large Language Model capabilities through Monte Carlo Tree Search. Systematically maps model strengths, automatically discovers challenging concept combinations, and provides detailed performance analysis with containerized deployment and OpenAI-compatible API support.
Install
# Add AGENTS.md to your project root curl -o AGENTS.md "https://raw.githubusercontent.com/PrismBench/PrismBench/main/docs/Custom-Agents.md"
Quality Score
Acceptable
63/100
Trust & Transparency
No License Detected
Review source code before installing
Verified Open Source
Hosted on GitHub — publicly auditable
Maintained
Last commit 4mo ago
3 stars
0 forks