HK
Heykuki News
61.
Show HN: Kalosm – a local first AI meta-framework in Rust
github.com/floneum
discuss
3 years ago
Evan-Almloff
4 points
62.
Show HN: Cachey, a Read-Through Cache for S3
github.com/s2-streamstore
2 comments
9 months ago
shikhar
3 points
63.
Show HN: Run Unsloth Dynamic GGUFs using Docker model runner
github.com/docker
1 comment
7 months ago
ericcurtin
3 points
64.
Show HN: Awesome Flux – Open Source repo for FLUX model resources
github.com/Eris2025
1 comment
2 years ago
Gene05
3 points
65.
Show HN: Collider – the platform for local LLM debug and inference at warp speed
github.com/gotzmann
1 comment
3 years ago
Ambix
3 points
66.
Fixed a llama.cpp bug silently disabling Vulkan GPU on all 32-bit ARM devices
discuss
3 months ago
perinban
3 points
67.
Tq-KV – Rust implementation of TurboQuant that works on GGUF models
discuss
3 months ago
onurgokyildiz
3 points
68.
Show HN: I built a 2nd-order PyTorch optimizer for LLMs that runs on 16GB GPUs
4 comments
2 months ago
dnosoz
2 points
69.
Show HN: Loft CLI – Fine-tune and run LLMs (1–3B) on 8 GB MacBook Air, no GPUs
1 comment
a year ago
dips2umar
2 points
70.
Show HN: LlamaFarm – Working on binary AI Project deployment – (early preview)
github.com/llama-farm
1 comment
a year ago
rgthelen
2 points
71.
Sumi – Open-source voice-to-text with local AI polishing
discuss
3 months ago
alkd
2 points
72.
Show HN: Reduction Blockprint Planner/Simulator
reduction-planner.hirson.xyz
discuss
4 months ago
gh5000
2 points
73.
Show HN: OctoFlow v1.0.0 – GPU VM where the GPU runs autonomously, CPU is BIOS
discuss
4 months ago
mr_octopus
2 points
74.
Show HN: Local Voice Assistant
discuss
4 months ago
armcat
2 points
75.
Show HN: Promptscout – a local prompt enricher for Claude Code
github.com/obsfx
discuss
4 months ago
obsfx
2 points
76.
Show HN: TrendScope – Real-time financial sentiment analysis on a cheap VPS
trendscope.akamaar.dev
discuss
5 months ago
mohammede
2 points
77.
Show HN: SpeedyEDA – One-line exploratory data analysis
discuss
5 months ago
dawitworku
2 points
78.
Running a 270M LLM on Android (architecture and benchmarks)
discuss
7 months ago
ayushranjan99
2 points
79.
Show HN: ONNX optimized SigLIP and related foundation models
github.com/rhysdg
discuss
2 years ago
rhysdg
2 points
80.
DSPTools: Open Source DSP simulator for iOS devices
discuss
14 years ago
medius
2 points
81.
Show HN: Python Bindings for llama.cpp with some CLIs
github.com/thomasantony
discuss
3 years ago
tantony
2 points
82.
Show HN: Localvoxtral – Local real-time dictation on macOS with streaming STT
github.com/T0mSIlver
2 comments
4 months ago
T0mSIlver
1 point
83.
Day 1 of trying to fit a Chatbot into a QR Code
2 comments
a year ago
kuberwastaken
1 point
84.
Off Grid: On-device AI-web browsing, tools, vision, image gen, voice – 3x faster
1 comment
4 months ago
ali_chherawalla
1 point
85.
Show HN: Mixture of Voices – Open source goal-based AI router – uses BGE transformer
1 comment
9 months ago
KylieM
1 point
86.
Show HN: WayInfer – Native GGUF engine that runs models larger than your RAM
discuss
3 months ago
ahmedm24
1 point
87.
I built two Loihi-parity neuromorphic processors from scratch
discuss
4 months ago
catalyst-neuro
1 point
88.
Show HN: Running an LLM Inside Scratch
github.com/Broyojo
discuss
4 months ago
broyojo
1 point
89.
Show HN: ARIA – P2P distributed inference protocol for 1-bit LLMs on CPU
github.com/spmfrance-cloud
discuss
5 months ago
anthonymu
1 point
90.
Show HN: Loclean – Local semantic data cleaning with LLMs and Pydantic
github.com/nxank4
discuss
5 months ago
nxank4
1 point