33.2275. SWE Bench

SWE-bench is a leaderboard that evaluates AI models on their ability to solve real-world coding tasks drawn from GitHub issues. It is a collaboration between Princeton and Stanford universities.

Family: Web
Over: http2
Over: http
Over: https
Over: spdy
Over: ssl
Revision: 1
Risk level: 2
Tag: Web Sites