HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
91.
▲
Show HN: Chonkie Cloud – No-nonsense chunking now on the the cloud
cloud.chonkie.ai
5 comments
a year ago
snyy
6 points
92.
▲
Show HN: OS Megakernel that match M5 Max Tok/w at 2x the Throughput on RTX 3090
github.com/Luce-Org
1 comment
2 months ago
GreenGames
6 points
93.
▲
Show HN: Blink-Edit – Cursor-style next-edit predictions for Neovim (local LLMs)
github.com/BlinkResearchLabs
discuss
5 months ago
atemyipod
6 points
94.
▲
Show HN: I'm tired of my LLM bullshitting. So I fixed it
9 comments
5 months ago
BobbyLLM
5 points
95.
▲
Show HN: AI Council – multi-model deliberation that runs in the browser
github.com/prijak
1 comment
4 months ago
prijak
5 points
96.
▲
Show HN: I built an AI movie making and design engine in Rust
github.com/storytold
1 comment
5 months ago
echelon
5 points
97.
▲
Show HN: ClawMem – Open-source agent memory with SOTA local GPU retrieval
github.com/yoloshii
discuss
3 months ago
yoloshii
5 points
98.
▲
TinyTTS: Ultra-light English TTS (9M params, 20MB), 8x CPU, 67x GPU
discuss
4 months ago
letrghieu
5 points
99.
▲
Show HN: Clawbernetes – Replace kubectl with conversation (Rust)
github.com/clawbernetes
discuss
4 months ago
redclaw
5 points
100.
▲
Show HN: Open-source fine-tuning in a Colab notebook
colab.research.google.com
discuss
2 years ago
danielhanchen
5 points
101.
▲
Show HN: Self-hosted RAG with MCP support for OpenClaw
github.com/2dogsandanerd
2 comments
5 months ago
2dogsanerd
4 points
102.
▲
Show HN: NSED is public – Mixture-of-Models to Hit SOTA using self-hosted AI
github.com/peeramid-labs
discuss
4 months ago
t_peersky
4 points
103.
▲
Show HN: ArtCraft AI crafting engine, written in Rust
github.com/storytold
discuss
5 months ago
echelon
4 points
104.
▲
Show HN: HORenderer3: A C++ software renderer implementing OpenGL 3.3 pipeline
github.com/Hobanghann
discuss
5 months ago
zghdls
4 points
105.
▲
Run 35B LLMs on Dual Pascal GPUs with QLoRA
discuss
9 months ago
rickesh_tn
4 points
106.
▲
Show HN: A reasoning model that infers over whole tasks in 1ms in latent space
github.com/OrderOneAI
6 comments
a year ago
orderone_ai
3 points
107.
▲
Show HN: Turn any ComfyUI workflow into a web app or API
6 comments
a year ago
jjdelannoy
3 points
108.
▲
Show HN: OctoFlow – A GPU-native programming language
4 comments
4 months ago
mr_octopus
3 points
109.
▲
Seeking Advice on Improving OCR for Watermarked PDFs in My RAG Pipeline
3 comments
4 months ago
hundredtrillion
3 points
110.
▲
Tq-KV – Rust implementation of TurboQuant that works on GGUF models
discuss
3 months ago
onurgokyildiz
3 points
111.
▲
Security Layer 4.0 – First semantic firewall blocks malicious intent"
discuss
7 months ago
jaspertvdm
3 points
112.
▲
Show HN: I built a 2nd-order PyTorch optimizer for LLMs that runs on 16GB GPUs
4 comments
2 months ago
dnosoz
2 points
113.
▲
Why stop at 1M tokens when you can have 10M?
4 comments
8 months ago
Zen_Sherbert
2 points
114.
▲
Show HN: ArtCraft AI crafting engine, written in Rust
github.com/storytold
2 comments
5 months ago
echelon
2 points
115.
▲
Show HN: Loqi, a "local-first" translation tool using Ollama/llama.cpp
github.com/danterolle
discuss
7 hours ago
danterolle
2 points
116.
▲
Show HN: Glq LLM quantization using E8 lattice
github.com/cnygaard
discuss
21 days ago
acd
2 points
117.
▲
Show HN: MaximusLLM – Train 262k-vocab LLMs on a single 16GB GPU
github.com/yousef-rafat
discuss
3 months ago
yousef_g
2 points
118.
▲
Show HN: From Claude Code to OpenCode – My Evolution in Vibe AI Engineering
discuss
3 months ago
denis4inet
2 points
119.
▲
Show HN: Kiln – WebGPU-native out-of-core volume rendering for multi-GB datasets
github.com/MPanknin
discuss
4 months ago
m_panknin
2 points
120.
▲
Show HN: Dia-Jax – A Jax port of the Dia text-to-speech dialogue model
github.com/jaco-bro
discuss
a year ago
jaco-bro
2 points
More