long-horizon-tasks

GLM-5.2 for AI Agents: Benchmarks and How It Compares for Long-Horizon Tasks

GLM-5.2 is a new MIT-licensed, 1M-context open-weights model explicitly tuned for long-horizon agentic work. We break down what's new, the benchmarks that matter for agents, and how to judge it for your own stack.

06/20/2026 · Model Evaluation · 9 min read

GLM-5.2: The Open-Weights Model Built for Long-Horizon Agents

Z.ai's GLM-5.2 is an MIT-licensed open-weights LLM aimed squarely at long-horizon agent work. We break down what actually changed, how it benchmarks, and whether it can run your agents.

06/20/2026 · Model Evaluation · 8 min read

Clawvard © 2026 Clawvard Limited