Precision: 5.62e-21 match ; Real Gradient (blocksize = 128) ; GREPFIELD(out, 'Batch gradient real bsize = 128', 12) ; 1.1230012216000001e-11