HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
961.
▲
Show HN: KTransformers–236B Model and 1M Context LLM Inference on Local Machines
github.com/kvcache-ai
3 comments
2 years ago
sssummer
20 points
962.
▲
Show HN: Demon – open-source real-time music diffusion engine, 25Hz local GPU
daydreamlive.github.io
13 comments
a month ago
ryanontheinside
17 points
963.
▲
Show HN: Finetune Llama-3.1 2x faster in a Colab
colab.research.google.com
2 comments
2 years ago
danielhanchen
16 points
964.
▲
Show HN: Salad, a distributed cloud for AI (like Airbnb for GPUs)
4 comments
2 years ago
bobjmiles
15 points
965.
▲
Ping.gg monitoring engine now open source (Go)
1 comment
11 years ago
vruiz
15 points
966.
▲
Show HN: KTransformers:671B DeepSeek-R1 on a Single Machine-286 tokens/s Prefill
github.com/kvcache-ai
discuss
a year ago
sssummer
14 points
967.
▲
Show HN: Willow Inference Server: Optimized ASR/TTS/LLM for Willow/WebRTC/REST
github.com/toverainc
13 comments
3 years ago
kkielhofner
13 points
968.
▲
Show HN: Lightweight Llama3 Inference Engine – CUDA C
github.com/abhisheknair10
discuss
a year ago
abhisheknair10
12 points
969.
▲
Show HN: Automatic 1111, but as a Python Package
github.com/saketh12
discuss
2 years ago
saketh105
11 points
970.
▲
Show HN: Coderive – Iterating through 1 Quintillion Inside a Loop in just 50ms
github.com/DanexCodr
13 comments
6 months ago
DanexCodr
8 points
971.
▲
Show HN: Groupon Scraper in Python
8 comments
15 years ago
svrocks
8 points
972.
▲
Show HN: onprem unstructured data extraction with 4 lines of code
github.com/NanoNets
discuss
a year ago
souvik3333
8 points
973.
▲
Show HN: A Decentralized NFT Based Gaming Platform (Dragons vs. Tigers)
github.com/SachPlayZ
discuss
2 years ago
grobat79
8 points
974.
▲
Show HN: Local GLaDOS
old.reddit.com
discuss
2 years ago
dnhkng
8 points
975.
▲
Show HN: Libredesk – self-hosted, single binary Intercom/Zendesk alternative
libredesk.io
4 comments
2 months ago
avr5500
7 points
976.
▲
Show HN: I/Claude reverse-engineered Figma's binary WebSocket protocol
github.com/allan-simon
3 comments
3 months ago
allan_s
7 points
977.
▲
Show HN: WaveletLM – wavelet-based, attention-free model with O(n log n) scaling
github.com/ramongougis
1 comment
2 months ago
anarmorarm
7 points
978.
▲
Show HN: Serve 100 Large AI models on a single GPU with low impact to TTFT
github.com/leoheuler
1 comment
8 months ago
leonheuler
7 points
979.
▲
Show HN: Federation of robots collaboratively train an object manipulation model
github.com/adap
discuss
a year ago
jafermarq
7 points
980.
▲
Show HN: OS Megakernel that match M5 Max Tok/w at 2x the Throughput on RTX 3090
github.com/Luce-Org
1 comment
3 months ago
GreenGames
6 points
981.
▲
Show HN: Blink-Edit – Cursor-style next-edit predictions for Neovim (local LLMs)
github.com/BlinkResearchLabs
discuss
5 months ago
atemyipod
6 points
982.
▲
Show HN: Revolutionizing Blockchain Gaming: AI-Driven NFT Battleground
github.com/SachPlayZ
discuss
2 years ago
grobat79
6 points
983.
▲
Show HN: AI Council – multi-model deliberation that runs in the browser
github.com/prijak
1 comment
4 months ago
prijak
5 points
984.
▲
Show HN: I built an AI movie making and design engine in Rust
github.com/storytold
1 comment
5 months ago
echelon
5 points
985.
▲
Show HN: ClawMem – Open-source agent memory with SOTA local GPU retrieval
github.com/yoloshii
discuss
3 months ago
yoloshii
5 points
986.
▲
TinyTTS: Ultra-light English TTS (9M params, 20MB), 8x CPU, 67x GPU
discuss
4 months ago
letrghieu
5 points
987.
▲
Show HN: Clawbernetes – Replace kubectl with conversation (Rust)
github.com/clawbernetes
discuss
4 months ago
redclaw
5 points
988.
▲
Show HN: Open-source fine-tuning in a Colab notebook
colab.research.google.com
discuss
2 years ago
danielhanchen
5 points
989.
▲
Show HN: Self-hosted RAG with MCP support for OpenClaw
github.com/2dogsandanerd
2 comments
5 months ago
2dogsanerd
4 points
990.
▲
Help Us Create SONOFA (Smaller Object Notation Offering Frequent Advantages)
discuss
11 years ago
guilt
4 points