33.2275. SWE Bench

Home Prev	cOS Core 14.00.19 Application Control Signatures	Next

SWE-Bench is a leader board that evaluates AI models by testing their ability to solve real-world coding tasks from GitHub issues. This is a collaboration between Princeton and Standford Universities.

Family:	Web
Over:	http2
Over:	http
Over:	https
Over:	spdy
Over:	ssl
Revision:	1
Risk level:	2
Tag:	Web Sites