HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
91.
▲
Train 1T parameter LLM with 8 GPUs?
1 comment
a month ago
kendy1992
3 points
92.
▲
Rvidia-exporter – Prometheus metrics exporter for Nvidia GPUs
github.com/neo-airouter
1 comment
2 months ago
sacrelege
3 points
93.
▲
AMD ROCm: 40x slower at linear algebra than older Nvidia GPUs
github.com/ROCm
1 comment
2 months ago
PhilipVinc
3 points
94.
▲
Show HN: AudioGhost AI – Run Meta's Sam-Audio on Consumer GPUs (4GB-6GB VRAM)
github.com/0x0funky
1 comment
6 months ago
0x0funky
3 points
95.
▲
Shimmy v1.7.0: Running 42B Moe Models on Consumer GPUs with 99.9% VRAM Reduction
github.com/Michael-A-Kuykendall
1 comment
8 months ago
MKuykendall
3 points
96.
▲
LLM inference load balancer optimized for AMD Radeon VII GPUs
github.com/janit
discuss
3 months ago
velmu
3 points
97.
▲
Show HN: Local video search with Qwen3-VL: no API, runs on Apple Silicon, GPUs
github.com/ssrajadh
discuss
3 months ago
sohamrj
3 points
98.
▲
Java Running Directly on Apple Silicon GPUs with TornadoVM Metal Codegen
github.com/beehive-lab
discuss
4 months ago
mikepapadim
3 points
99.
▲
Show HN: UHOP – An Open Hardware Optimization Platform for GPUs
github.com/sevenloops
discuss
8 months ago
danielbisina
3 points
100.
▲
ZLUDA - CUDA on Non-Nvidia GPUs
github.com/vosen
discuss
a year ago
danboarder
3 points
101.
▲
AdaptiveCpp: Implementation of SYCL and C++ CPUs and GPUs
github.com/AdaptiveCpp
discuss
2 years ago
kristianp
3 points
102.
▲
Show HN: Python Monitoring for AI: LLMs, OpenAI, Inference, GPUs
github.com/graphsignal
discuss
3 years ago
npgraph
3 points
103.
▲
Show HN: Run and fine-tune 175B+ LMs in Colab using a P2P network of GPUs
github.com/bigscience-workshop
discuss
4 years ago
borzunov
3 points
104.
▲
Show HN: DreamBooth Models on Serverless GPUs
github.com/mystic-ai
discuss
4 years ago
paul-nai
3 points
105.
▲
KataGo: AlphaZero-like training with only 47 GPUs
github.com/lightvector
discuss
6 years ago
gslin
3 points
106.
▲
Build and run Docker containers leveraging Nvidia GPUs
github.com/NVIDIA
discuss
11 years ago
jonbaer
3 points
107.
▲
Show HN: I built a 2nd-order PyTorch optimizer for LLMs that runs on 16GB GPUs
4 comments
2 months ago
dnosoz
2 points
108.
▲
Show HN: Velda – Run jobs with serverless GPUs, without container images
velda.io
2 comments
a month ago
eagleonhill
2 points
109.
▲
Show HN: QingMing – Exact vector search on consumer GPUs (no index)
github.com/uulong950
1 comment
5 months ago
uulong
2 points
110.
▲
Show HN: Picomon, a minimal TUI monitor for AMD GPUs
github.com/omarkamali
1 comment
7 months ago
omneity
2 points
111.
▲
Show HN: KV Marketplace – share LLM attention caches across GPUs like memcached
github.com/neelsomani
1 comment
7 months ago
nsomani
2 points
112.
▲
NumPy-First AI: Persona-Aware Semantic Models Without GPUs
github.com/farukalpay
1 comment
8 months ago
HenryAI
2 points
113.
▲
Show HN: Loft CLI – Fine-tune and run LLMs (1–3B) on 8 GB MacBook Air, no GPUs
1 comment
a year ago
dips2umar
2 points
114.
▲
Show HN: AI Infra for non-Nvidia GPUs
github.com/felafax
1 comment
2 years ago
shadowfax92
2 points
115.
▲
Kyanite: NN inference library, in/for Rust, using CPU or Nvidia GPUs
github.com/KarelPeeters
1 comment
3 years ago
homarp
2 points
116.
▲
Show HN: cuSBF – faster Bloom filter on GPUs for DNA sequences
github.com/tdortman
discuss
8 days ago
tdortman
2 points
117.
▲
Show HN: Profine – Profile and rewrite your ML training loop on real GPUs
github.com/ProfineAI
discuss
a month ago
aisinghal
2 points
118.
▲
Show HN: Inferential – Multi-robot inference scheduling on shared GPUs
github.com/nalinraut
discuss
3 months ago
nalinraut
2 points
119.
▲
Show HN: Run autoresearch on a gaming PC (Windows and RTX GPUs fork)
github.com/jsegov
discuss
3 months ago
segov
2 points
120.
▲
Ask HN: Why does single-node DDP sometimes get slower with more GPUs?
discuss
4 months ago
traceopt-ai
2 points
More