Avoid the transposed X branch in the Z = X * Y matrix multiplication (#439)
Should make results reproducible for different number of threads and batch sizespull/444/head master-483bab2
parent
404e1da38e
commit
483bab2e3d
Loading…
Reference in New Issue