HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Beating NumPy matrix multiplication in 150 lines of C
salykova.github.io
81 comments
2 years ago
p1esk
392 points
2.
▲
Matrix Core Programming on AMD GPUs
salykova.github.io
5 comments
9 months ago
skidrow
116 points
3.
▲
Beating cuBLAS in Single-Precision General Matrix Multiplication
salykova.github.io
8 comments
a year ago
skidrow
98 points
4.
▲
Advanced Matrix Multiplication Optimization on Multi-Core Processors (2024)
salykova.github.io
3 comments
9 months ago
skidrow
85 points
5.
▲
Matrix Core Programming on AMD CDNA3 and CDNA4 Architecture
salykova.github.io
3 comments
9 months ago
skidrow
24 points
6.
▲
Beating NumPy's matrix multiplication in 150 lines of C code
salykova.github.io
discuss
2 years ago
alexmolas
11 points
7.
▲
Beating OpenBLAS in FP32 Matrix Multiplication
salykova.github.io
discuss
a year ago
chmaynard
7 points
8.
▲
Beating cuBLAS in Single-Precision General Matrix Multiplication
salykova.github.io
discuss
a year ago
chmaynard
7 points
9.
▲
Beating NumPy's matrix multiplication in 150 lines of C code
salykova.github.io
discuss
2 years ago
salykova
5 points
10.
▲
Beating OpenBLAS in FP32 Matrix Multiplication
salykova.github.io
1 comment
a year ago
skidrow
4 points
11.
▲
Beating OpenBLAS in FP32 Matrix Multiplication
salykova.github.io
discuss
a year ago
skidrow
4 points
12.
▲
Beating cuBLAS in Single-Precision General Matrix Multiplication
salykova.github.io
discuss
a year ago
EvgeniyZh
4 points
13.
▲
Beating NumPy's matrix multiplication in 150 lines of C code
salykova.github.io
discuss
2 years ago
thunderbong
4 points
14.
▲
Beating cuBLAS in Single-Precision General Matrix Multiplication
salykova.github.io
discuss
a year ago
skidrow
3 points
15.
▲
Beating cuBLAS in Single-Precision General Matrix Multiplication
salykova.github.io
discuss
a year ago
skidrow
3 points
16.
▲
Matrix Core Programming on AMD GPUs
salykova.github.io
discuss
9 months ago
skidrow
2 points
17.
▲
Introduction to Matrix Core Programming on AMD CDNA3 and CDNA4 Architecture
salykova.github.io
discuss
9 months ago
skidrow
2 points
18.
▲
Beating OpenBLAS in FP32 Matrix Multiplication
salykova.github.io
discuss
a year ago
skidrow
2 points
19.
▲
Beating Nvidia's cuBLAS in GEMM
salykova.github.io
discuss
a year ago
lemonsq
2 points
20.
▲
Beating OpenBLAS in FP32 Matrix Multiplication
salykova.github.io
discuss
a year ago
skidrow
2 points
21.
▲
Show HN: Beating cuBLAS in Single-Precision General Matrix Multiplication
salykova.github.io
discuss
a year ago
skidrow
2 points
22.
▲
Beating OpenBLAS in FP32 Matrix Multiplication
salykova.github.io
discuss
a year ago
skidrow
2 points
23.
▲
Beating NumPy's matrix multiplication in 150 lines of C code
salykova.github.io
discuss
2 years ago
salykova
2 points
24.
▲
Beating OpenBLAS in FP32 Matrix Multiplication
salykova.github.io
discuss
a year ago
skidrow
1 points
25.
▲
Beating OpenBLAS in Matrix Multiplication
salykova.github.io
discuss
a year ago
skidrow
1 points