HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Aiter: AI Tensor Engine for ROCm
rocm.blogs.amd.com
88 comments
a year ago
hochmartinez
179 points
2.
▲
AMD ROCm Software Blogs
rocm.blogs.amd.com
54 comments
2 years ago
jrepinc
118 points
3.
▲
Matrix Core Programming on AMD CDNA Architecture
rocm.blogs.amd.com
22 comments
7 months ago
salykova
61 points
4.
▲
Boosting Computational Fluid Dynamics Performance with AMD MI300X
rocm.blogs.amd.com
45 comments
a year ago
latchkey
47 points
5.
▲
C++17 Parallel Algorithms and Hipstdpar
rocm.blogs.amd.com
1 comment
2 years ago
mathiasgredal
8 points
6.
▲
FP8 GEMM Optimization on AMD CDNA4 Architecture
rocm.blogs.amd.com
discuss
5 days ago
skidrow
4 points
7.
▲
FP8 GEMM Optimization on AMD CDNA4 Architecture
rocm.blogs.amd.com
discuss
5 days ago
skidrow
3 points
8.
▲
Deep Dive into 4-Wave Interleave FP8 GEMM
rocm.blogs.amd.com
discuss
5 days ago
skidrow
3 points
9.
▲
ROCm 7.13: Expanding Hardware, Tools, and Reach
rocm.blogs.amd.com
discuss
a month ago
mindcrime
3 points
10.
▲
Introducing hipThreads: A C++ - Style Concurrency Library for AMD GPUs
rocm.blogs.amd.com
discuss
4 months ago
pjmlp
3 points
11.
▲
Matrix Core Programming on AMD CDNA4 Architecture
rocm.blogs.amd.com
discuss
7 months ago
salykova
3 points
12.
▲
Large language model inference optimizations on AMD GPUs
rocm.blogs.amd.com
discuss
2 years ago
atomlib
3 points
13.
▲
Instella from AMD open 3B-parameter language models
rocm.blogs.amd.com
1 comment
a year ago
programd
2 points
14.
▲
FP8 GEMM Optimization on AMD CDNA4 Architecture
rocm.blogs.amd.com
discuss
3 days ago
skidrow
2 points
15.
▲
Unlocking Extreme AMD Instinct Inference with Software-Hardware Co-Optimization
rocm.blogs.amd.com
discuss
5 days ago
mooreds
2 points
16.
▲
Primus Projection: Estimate Memory and Performance Before You Train
rocm.blogs.amd.com
discuss
2 months ago
matt_d
2 points
17.
▲
Matrix Core Programming on AMD GPUs
rocm.blogs.amd.com
discuss
7 months ago
salykova
2 points
18.
▲
Matrix Core Programming on AMD CDNA3 and CDNA4 Architecture
rocm.blogs.amd.com
discuss
7 months ago
salykova
2 points
19.
▲
Matrix Core Programming on AMD CDNA 3 and CDNA 4 Architecture
rocm.blogs.amd.com
discuss
9 months ago
ashvardanian
2 points
20.
▲
Installing ROCm from Source with Spack
rocm.blogs.amd.com
discuss
a year ago
fngarrett
2 points
21.
▲
AMD GPU Operator and Metrics Exporter
rocm.blogs.amd.com
discuss
a year ago
ankitg12
2 points
22.
▲
Supercharge DeepSeek-R1 Inference on AMD Instinct MI300X
rocm.blogs.amd.com
discuss
a year ago
pama
2 points
23.
▲
Unlock DeepSeek-R1 Inference Performance on AMD Instinct MI300X GPU
rocm.blogs.amd.com
discuss
a year ago
breadislove
2 points
24.
▲
FP8 GEMM Optimization on AMD CDNA4 Architecture
rocm.blogs.amd.com
discuss
4 days ago
skidrow
1 points
25.
▲
FP8 GEMM Optimization on AMD CDNA4 Architecture
rocm.blogs.amd.com
discuss
4 days ago
skidrow
1 points
26.
▲
Matrix Core Programming on AMD GPUs
rocm.blogs.amd.com
discuss
7 months ago
salykova
1 points
27.
▲
Matrix Core Programming on AMD CDNA3 and CDNA4 Architecture
rocm.blogs.amd.com
discuss
7 months ago
salykova
1 points
28.
▲
AMD MI300X for LLM Serving Disaggregating Prefill and Decode with SGLang
rocm.blogs.amd.com
discuss
10 months ago
latchkey
1 points
29.
▲
AMD MI300 compute and memory partitioning
rocm.blogs.amd.com
discuss
a year ago
ankitg12
1 points
30.
▲
Overview of Architectural Improvements in vLLM V1
rocm.blogs.amd.com
discuss
a year ago
simonpure
1 points
More