CL_DEVICE_MAX_WORK_GROUP_SIZE
should return a single size_t
value (for example 512, but I don’t know what it’d be on your system). This is the maximum number of work-items in a work-group, not the maximum in each dimension. So in your case you are trying to make a 2D work-group with 32*32 = 1024 work-items, and presumably CL_DEVICE_MAX_WORK_GROUP_SIZE
is less than 1024 on your system.
See the OpenCL 1.1 spec, table 4.3, page 37, the definition of CL_DEVICE_MAX_WORK_GROUP_SIZE
:
Maximum number of work-items in a work-group executing a kernel using the data parallel execution model.