Abstract: To exploit the full potential of SMP clusters, this study investigates how to improve the performance of the Fox algorithm for parallel matrix multiplication by combining single-processor optimization, OpenMP, and MPI, which target the instruction, shared-memory, and distributed-memory levels, respectively. By invoking a mathematical library and using hybrid programming, satisfactory numerical results were obtained on the DeepComp 6800.
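For readers unfamiliar with the Fox algorithm, its block schedule can be illustrated with a minimal serial sketch in Python. This is not the implementation used in this work: the q x q process grid is simulated with nested lists, and the helper names (`split_blocks`, `join_blocks`, `fox_multiply`) are hypothetical. It assumes square matrices whose order is divisible by q.

```python
def matmul(X, Y):
    """Naive dense product of two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]


def add_into(C, P):
    """Accumulate P into C element by element (C += P)."""
    for r in range(len(C)):
        for c in range(len(C[r])):
            C[r][c] += P[r][c]


def split_blocks(M, q):
    """Split an n x n matrix into a q x q grid of (n/q) x (n/q) blocks."""
    b = len(M) // q
    return [[[row[j * b:(j + 1) * b] for row in M[i * b:(i + 1) * b]]
             for j in range(q)]
            for i in range(q)]


def join_blocks(blocks):
    """Reassemble a q x q grid of blocks into one matrix."""
    q = len(blocks)
    rows = []
    for i in range(q):
        for r in range(len(blocks[i][0])):
            rows.append([v for j in range(q) for v in blocks[i][j][r]])
    return rows


def fox_multiply(A, B, q):
    """Serially simulate the Fox schedule on a q x q grid of 'processes'."""
    n = len(A)
    if n % q != 0:
        raise ValueError("matrix order must be divisible by q")
    b = n // q
    a_blk = split_blocks(A, q)   # a_blk[i][j] lives on process (i, j)
    b_blk = split_blocks(B, q)   # b_blk[i][j] is the B block currently held
    c_blk = [[[[0] * b for _ in range(b)] for _ in range(q)]
             for _ in range(q)]
    for stage in range(q):
        for i in range(q):
            k = (i + stage) % q  # column of the A block "broadcast" along row i
            for j in range(q):
                # Local update: C_ij += A_ik * (B block currently held)
                add_into(c_blk[i][j], matmul(a_blk[i][k], b_blk[i][j]))
        # Cyclic upward shift of B: grid row i receives the blocks of row i+1
        b_blk = [b_blk[(i + 1) % q] for i in range(q)]
    return join_blocks(c_blk)
```

In a distributed setting, the row-wise selection of `a_blk[i][k]` would typically correspond to a broadcast within each process row, and the reassignment of `b_blk` to a cyclic shift along each process column (for example, `MPI_Bcast` and `MPI_Sendrecv_replace` on row and column communicators). The local block product is the step where a tuned library routine or an OpenMP-parallel loop would apply. Which calls this study actually uses is not stated in the abstract.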