GEO-Bench-2: From Performance to Capability, Rethinking Evaluation in Geospatial AI

GEO-Bench-2: From Performance to Capability, Rethinking Evaluation in Geospatial AI