Precision: 2.21e-12 match ; Real Gradient (blocksize = 64) ; GREPFIELD(out, 'Batch gradient real bsize = 64', 12) ; 1.3239927129000002e-11