Precision: 3.67e-08 match ; Real Gradient (blocksize = 16) ; GREPFIELD(out, 'Batch gradient real bsize = 16', 12) ; 3.65e-05