long-horizon-tasks

GLM-5.2 for AI Agents: Benchmarks and How It Compares for Long-Horizon Tasks
GLM-5.2 is a new MIT-licensed, 1M-context open-weights model explicitly tuned for long-horizon agentic work. We break down what's new, the benchmarks that matter for agents, and how to judge it for your own stack.
06/20/2026 · Model Evaluation · 9 min read

GLM-5.2: The Open-Weights Model Built for Long-Horizon Agents
Z.ai's GLM-5.2 is an MIT-licensed open-weights LLM aimed squarely at long-horizon agent work. We break down what actually changed, how it benchmarks, and whether it can run your agents.
06/20/2026 · Model Evaluation · 8 min read