HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Doing cheap PyTorch inference with Modal.com
fikisipi.substack.com
discuss
3 years ago
pyentropy
6 points
2.
▲
No Local GPU? No Problem Running Andrej Karpathy’s NanoGPT on Modal.com
martincapodici.com
discuss
3 years ago
mcapodici
3 points
3.
▲
Fast, lazy container loading in modal.com [video]
youtube.com
discuss
2 years ago
eatonphil
1 points
4.
▲
Deploying AI-powered Django apps to Modal.com
tolkunov.dev
discuss
2 years ago
todsacerdoti
1 points
5.
▲
Show HN: AI that generates 3blue1brown-style explainer videos
tma.live
46 comments
a year ago
zan2434
93 points
6.
▲
Show HN: I Built an Open Source API with Insanely Fast Whisper and Fly GPUs
github.com/JigsawStack
1 comment
2 years ago
yoeven
13 points
7.
▲
Show HN: isometric.nyc/snow
isometric.nyc
2 comments
4 months ago
cannoneyed
10 points
8.
▲
Ask HN: Which serverless GPU websites have you tried and which one was best?
1 comment
3 years ago
tikkun
9 points
9.
▲
Ask HN: Best cloud GPU solution for inference API?
2 comments
3 years ago
topoftheforts
4 points
10.
▲
Show HN: An unstructured data workspace for data transformations with LLM
usefolio.ai
discuss
3 months ago
nibab
4 points
11.
▲
Show HN: Tarot Card Generator
youtarot.app
discuss
2 years ago
peab
3 points
12.
▲
Show HN: Stable Diffusion Pokémon Cards
modal-labs-example-text-to-pokemon-fastapi-app.modal.run
discuss
3 years ago
thundergolfer
3 points
13.
▲
Show HN: Tokiwi – An online tokenizer for any Hugging Face model
tokiwi.dev
discuss
a year ago
TweedBeetle
2 points
14.
▲
Ask HN: Looking for empirical advices on "serverless" ML hosting
discuss
2 years ago
tonyabracadabra
2 points
15.
▲
Show HN: I hate paying for GPUs while developing – this is how I solved it
adithyask.medium.com
discuss
9 months ago
Adithya-Kolavi
1 points
16.
▲
Ask HN: What Inference Server do you use to host TTS Models?
discuss
a year ago
samagra14
1 points
17.
▲
Instructor with Jason Liu
discuss
2 years ago
CShorten
1 points
18.
▲
DoppelBot: Replace Your CEO with an LLM
modal.com
117 comments
a year ago
gk1
232 points
19.
▲
The Missing Nvidia GPU Glossary
modal.com
70 comments
a year ago
birdculture
230 points
20.
▲
'I paid for the whole GPU, I am going to use the whole GPU'
modal.com
45 comments
a year ago
mooreds
154 points
21.
▲
Keeping 20k GPUs healthy
modal.com
62 comments
5 months ago
jxmorris12
134 points
22.
▲
We reverse-engineered Flash Attention 4
modal.com
48 comments
9 months ago
birdculture
134 points
23.
▲
Lambda on hard mode: serverless HTTP in Rust
modal.com
51 comments
2 years ago
pierremenard
131 points
24.
▲
Static IPs for Serverless Containers
modal.com
66 comments
2 years ago
ekzhang
125 points
25.
▲
We built a cloud GPU notebook that boots in seconds
modal.com
34 comments
8 months ago
birdculture
91 points
26.
▲
Cutting inference cold starts by 40x with LP, FUSE, C/R, and CUDA-checkpoint
modal.com
18 comments
a month ago
charles_irl
91 points
27.
▲
Three types of LLM workloads and how to serve them
modal.com
5 comments
5 months ago
charles_irl
75 points
28.
▲
Linear Programming for Fun and Profit
modal.com
15 comments
a year ago
hmac1282
62 points
29.
▲
GPU memory snapshots: sub-second startup (2025)
modal.com
13 comments
5 months ago
jxmorris12
27 points
30.
▲
Boosting multimodal inference performance by >10% with a single Python dict
modal.com
discuss
2 months ago
jxmorris12
16 points
More