On the GPU Performance of 3D Stencil Computations Implemented in OpenCL