Matrix Multiplication in Python Program

Fused FP8 4-Way Dot Product With Scaling and FP32 Accumulation

Abstract: For a variety of ML applications, generalized matrix multiply (GEMM) with dot product is the most computationally intensive operation. This paper presents a microarchitecture exploration of ...
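
To make the idea in this abstract concrete, the following is a minimal pure-Python sketch of GEMM built from dot products taken four terms at a time, with a scale factor and an accumulator rounded to FP32 after each step. It is an illustration only, not the paper's design: the function names (to_fp32, dot4_scaled, gemm) and the single per-call scale are assumptions, and the inputs are ordinary Python floats rather than real FP8 values.

```python
import struct


def to_fp32(x):
    """Round a Python float (FP64) to the nearest FP32 value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def dot4_scaled(a4, b4, scale, acc):
    """Hypothetical 4-way dot product step.

    Sums up to four products, applies the scale, adds the result to the
    running accumulator, and rounds the accumulator to FP32.
    """
    partial = sum(x * y for x, y in zip(a4, b4))
    return to_fp32(acc + scale * partial)


def gemm(a, b, scale=1.0):
    """Compute C = scale * (A @ B) for matrices given as lists of rows."""
    m, k, n = len(a), len(b), len(b[0])
    if any(len(row) != k for row in a):
        raise ValueError("inner dimensions do not match")
    c = [[0.0] * n for _ in range(m)]
    for i in range(m):
        for j in range(n):
            acc = 0.0
            # Walk the shared dimension in chunks of four terms.
            for p in range(0, k, 4):
                a4 = a[i][p:p + 4]
                b4 = [b[q][j] for q in range(p, min(p + 4, k))]
                acc = dot4_scaled(a4, b4, scale, acc)
            c[i][j] = acc
    return c


if __name__ == "__main__":
    print(gemm([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
```

Rounding the accumulator after every chunk is one simple way to imitate a fixed-width FP32 accumulator in software; hardware may round at different points.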

IEEE

Sparse Matrix-Vector Multiplication with Reduced-Precision Memory Accessor

Abstract: Mixed-precision computation, which uses multiple different precisions in a single code, is being studied to increase computational speed and energy efficiency. It typically uses the IEEE ...
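
As a rough illustration of mixed precision in sparse matrix-vector multiplication, the sketch below stores the nonzero values of a CSR matrix in FP32 (using the standard-library array module) and accumulates each row in FP64 (a Python float). This is an assumed stand-in for the general idea only; it does not reproduce the memory accessor described in the paper, and the helper names csr_from_dense and spmv are hypothetical.

```python
from array import array


def csr_from_dense(dense):
    """Build CSR arrays from a dense matrix given as a list of rows."""
    values = array("f")          # nonzeros stored in FP32 (reduced precision)
    col_idx = array("l")         # column index of each stored value
    row_ptr = array("l", [0])    # start offset of each row in values
    for row in dense:
        for j, v in enumerate(row):
            if v != 0:
                values.append(float(v))
                col_idx.append(j)
        row_ptr.append(len(values))
    return values, col_idx, row_ptr


def spmv(values, col_idx, row_ptr, x):
    """Compute y = A @ x, reading FP32 values and accumulating in FP64."""
    y = []
    for r in range(len(row_ptr) - 1):
        acc = 0.0  # Python floats are FP64
        for k in range(row_ptr[r], row_ptr[r + 1]):
            # Reading values[k] widens the stored FP32 value to FP64.
            acc += values[k] * x[col_idx[k]]
        y.append(acc)
    return y


if __name__ == "__main__":
    vals, cols, ptrs = csr_from_dense([[2, 0, 0], [0, 0, 3], [1, 4, 0]])
    print(spmv(vals, cols, ptrs, [1.0, 2.0, 3.0]))
```

Storing values in a narrower type reduces memory traffic, while the wider accumulator limits the rounding error added during the row sums.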
