HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
31.
▲
Show HN: KTransformers–236B Model and 1M Context LLM Inference on Local Machines
github.com/kvcache-ai
3 comments
2 years ago
sssummer
20 points
32.
▲
Show HN: KTransformers:671B DeepSeek-R1 on a Single Machine-286 tokens/s Prefill
github.com/kvcache-ai
discuss
a year ago
sssummer
14 points
33.
▲
Show HN: Bonsai 1.7B ternary model at 442T/s on M4 Max
agents2agents.ai
3 comments
2 months ago
hhuytho
13 points
34.
▲
Show HN: RoundtableJS – Open-source programmatic survey library
github.com/roundtableAI
1 comment
2 years ago
timshell
13 points
35.
▲
Show HN: Off Grid: On-device AI-web browsing, tools vision,image,voice–3x faster
5 comments
4 months ago
ali_chherawalla
12 points
36.
▲
Show HN: OneUptime (New Update) – Open-Source Datadog Alternative
6 comments
2 years ago
devneelpatel
8 points
37.
▲
Show HN: Quickwit – OSS Alternative to Datadog, Elasticsearch
github.com/quickwit-oss
2 comments
2 years ago
francoismassot
8 points
38.
▲
Tq-KV – Rust implementation of TurboQuant that works on GGUF models
discuss
3 months ago
onurgokyildiz
3 points
39.
▲
Show HN: Configurable Open Source Audio Spectrum Analyzer
github.com/sylwekkominek
discuss
10 months ago
sylwekkominek
3 points
40.
▲
Show HN: iceoryx2 v0.3.0 released – zero-copy IPC middleware in Rust
github.com/eclipse-iceoryx
discuss
2 years ago
elfenpiff
3 points
41.
▲
Show HN: Open dataset of real-world LLM performance on Apple Silicon
devpadapp.com
4 comments
4 months ago
uncSoft
2 points
42.
▲
Ask HN: How would you design an interface where artworks come alive with story?
2 comments
8 months ago
dejicarr
2 points
43.
▲
Show HN: Loft CLI – Fine-tune and run LLMs (1–3B) on 8 GB MacBook Air, no GPUs
1 comment
a year ago
dips2umar
2 points
44.
▲
Show HN: NeuG – High-performance Embedded graph DB, one line to serve
discuss
2 months ago
robeenly
2 points
45.
▲
Sumi – Open-source voice-to-text with local AI polishing
discuss
3 months ago
alkd
2 points
46.
▲
Show HN: I wrote an LLM inference engine in pure Go – 48 tok/s zero dependencies
github.com/computerex
discuss
4 months ago
computerex
2 points
47.
▲
Show HN: OctoFlow v1.0.0 – GPU VM where the GPU runs autonomously, CPU is BIOS
discuss
4 months ago
mr_octopus
2 points
48.
▲
Show HN: I maintain Valkey GLIDE – built a Node queue doing 48k jobs/s
github.com/avifenesh
discuss
4 months ago
anotherCodder
2 points
49.
▲
Show HN: A private, PQ-secure, infinitely scalable blockchain[fully open-source]
github.com/nerv-bit
discuss
5 months ago
Nerv_b
2 points
50.
▲
Ask HN: Is there an open-source Git-backed multi-tenant wiki?
discuss
6 years ago
ponsfrilus
2 points
51.
▲
Seeking grant proposals $250k total budget for privacy blockchain tech projects
discuss
8 years ago
exolymph
2 points
52.
▲
A small tool I made for local LLMs: LLM-neofetch-plus
2 comments
4 months ago
HFerrahoglu
1 points
53.
▲
Ollama and Bifrost –> Qwen3 in Claude Code
2 comments
10 months ago
all2
1 points
54.
▲
Ask HN: Best LLM model for a RAG-based Android app across all smartphones?
1 comment
3 months ago
swaminarayan
1 points
55.
▲
Off Grid: On-device AI-web browsing, tools, vision, image gen, voice – 3x faster
1 comment
4 months ago
ali_chherawalla
1 points
56.
▲
Ask HN: GitHub-based spam/scam emails
discuss
13 years ago
munchor
1 points
57.
▲
Show HN: WayInfer – Native GGUF engine that runs models larger than your RAM
discuss
3 months ago
ahmedm24
1 points
58.
▲
Show HN: Go LLM inference with a Vulkan GPU back end that beats Ollama's CUDA
github.com/computerex
discuss
4 months ago
computerex
1 points
59.
▲
Show HN: Voxtral Mini 4B Realtime running in the browser
github.com/TrevorS
discuss
4 months ago
adefa
1 points
60.
▲
Show HN: Open-source multi-agent subtitle translator (self-hosted)
github.com/subtitlesdog
discuss
5 months ago
mrqjr
1 points
More