Precision: 2.11e-14 match ; Real Gradient (blocksize = 32) ; GREPFIELD(out, 'Batch gradient real bsize = 32', 12) ; 2.3224194800000004e-06