共 19 条
- [1] Agarwal R.C.(1994)Exploiting functional parallelism of POWER2 to design high-performance numerical algorithms IBM J. Res. Develop. 38 563-576
- [2] Gustavson F.G.(2000)A portable programming interface for performance evaluation on modern processors Int. J. High Perfor. Comput. Appl. 14 189-204
- [3] Zubair M.(1997)Highly Scalable Parallel Algorithms for Sparse Matrix Factorization IEEE Trans. Parallel Distrib. Syst. 8 502-520
- [4] Browne S.(1998)GEMM-Based Level 3 BLAS: model implementations and performance evaluation benchmark ACM Trans. Math. Softw. 24 268-302
- [5] Dongarra J.(1998)GEMM-based level 3 BLAS: portability and optimization issues ACM Trans. Math. Softw. 24 303-316
- [6] Garner N.(2003)SuperLU_DIST: a scalable distributed-memory sparse direct solver for unsymmetric linear systems ACM Trans. Math. Softw. 29 110-140
- [7] Ho G.(undefined)undefined undefined undefined undefined-undefined
- [8] Mucci P.(undefined)undefined undefined undefined undefined-undefined
- [9] Gupta A.(undefined)undefined undefined undefined undefined-undefined
- [10] Karypis G.(undefined)undefined undefined undefined undefined-undefined