I am coding an implementation for Fox Algorithm with MPI in C. I already subdivised my global_matrix into smaller blocks. So each process has a little block of matrix A and matrix B. However I have trouble understanding how to implement Fox algorithm : a lot of code found on the internet are doing the following.
Implementation for Fox Algorithm
What I don't understand : In the last slides, there is C code that should implement the algorithm. But it seems that the temp array is never properly initialized and thus should result in weird behaviours when used in MPI_Bcast() and in the matrix multiplication.
I think I have the algo almost working but my result values are definitely wrong.
(I can provide code if you need)
Thanks for your answers !