HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
601.
▲
High frequency performance measurements for Linux
github.com/uber-common
discuss
10 years ago
WestCoastJustin
1 points
602.
▲
Show HN: Port of OpenAI's Whisper model in C/C++
github.com/ggerganov
87 comments
4 years ago
ggerganov
399 points
603.
▲
Show HN: Project S.A.T.U.R.D.A.Y. – open-source, self hosted, J.A.R.V.I.S.
github.com/GRVYDEV
30 comments
3 years ago
GRVYDEV
121 points
604.
▲
Show HN: A modern C++20 AI SDK (GPT‑4o, Claude 3.5, tool‑calling)
6 comments
a year ago
cauchyk
56 points
605.
▲
Show HN: The Tao of tmux, available for free on the web, has been newly edited
5 comments
9 years ago
tony
48 points
606.
▲
Show HN: Windows port of OpenAI's Whisper automatic speech recognition model
github.com/Const-me
20 comments
3 years ago
Const-me
43 points
607.
▲
Show HN: TurboPilot: Copilot clone runs code completion LLM on your CPU
github.com/ravenscroftj
4 comments
3 years ago
DrRavenstein
37 points
608.
▲
Show HN: Grammar Generator App for Llama.cpp
grammar.intrinsiclabs.ai
6 comments
3 years ago
aduffy
19 points
609.
▲
Show HN: Micron: a high performance C++23 (re)implementation of Libc and the STL
github.com/rfgplk
3 comments
20 days ago
rfgplk
6 points
610.
▲
Show HN: OpenTheo – Transcribed and searchable Bible teaching with Whisper.cpp
opentheo.com
discuss
3 years ago
dinoleif
5 points
611.
▲
Show HN: I built BakLLaVA and llama.cpp demo and it went viral on X
1 comment
3 years ago
Obertr
4 points
612.
▲
Fixed a llama.cpp bug silently disabling Vulkan GPU on all 32-bit ARM devices
discuss
3 months ago
perinban
3 points
613.
▲
Apple predicted the rise of local LLMs, hence the M2 Ultra
3 comments
3 years ago
behnamoh
2 points
614.
▲
Show HN: Galene-stt: automatic captioning for the Galene videconferencing system
github.com/jech
discuss
2 years ago
jech
2 points
615.
▲
Show HN: bigWav.app – web based transcription powered by Whipser and WASM
bigwav.app
discuss
3 years ago
emadda
2 points
616.
▲
Show HN: Running LLM on smartwatch – found llama.cpp loading model twice in RAM
discuss
3 months ago
perinban
1 points
617.
▲
Llama.cpp 30B runs with only 6GB of RAM now
github.com/ggerganov
414 comments
3 years ago
msoad
1311 points
618.
▲
Llama.cpp: Port of Facebook's LLaMA model in C/C++, with Apple Silicon support
github.com/ggerganov
284 comments
3 years ago
mrtksn
989 points
619.
▲
Llama.cpp: Full CUDA GPU Acceleration
github.com/ggerganov
310 comments
3 years ago
gzer0
728 points
620.
▲
Show HN: Alpaca.cpp – Run an Instruction-Tuned Chat-Style LLM on a MacBook
github.com/antimatter15
283 comments
3 years ago
antimatter15
673 points
621.
▲
Modern C++ Programming Course
github.com/federico-busato
194 comments
3 years ago
asicsp
493 points
622.
▲
Talk-Llama
github.com/ggerganov
140 comments
3 years ago
plurby
474 points
623.
▲
Gemma.cpp: lightweight, standalone C++ inference engine for Gemma models
github.com/google
130 comments
2 years ago
mfiguiere
422 points
624.
▲
Llama: Add grammar-based sampling
github.com/ggerganov
105 comments
3 years ago
davepeck
417 points
625.
▲
New exponent functions that make SiLU and SoftMax 2x faster, at full accuracy
github.com/ggerganov
72 comments
2 years ago
weinzierl
382 points
626.
▲
LLama.cpp now has a web interface
github.com/ggerganov
49 comments
3 years ago
xal
328 points
627.
▲
M2 Ultra can run 128 streams of Llama 2 7B in parallel
github.com/ggerganov
173 comments
3 years ago
behnamoh
268 points
628.
▲
Meta's Segment Anything written with C++ / GGML
github.com/YavorGIvanov
31 comments
3 years ago
ariym
233 points
629.
▲
Talk = GPT-2 and Whisper and WASM
github.com/ggerganov
50 comments
4 years ago
tomthe
189 points
630.
▲
Whisper.cpp v1.4.0
github.com/ggerganov
45 comments
3 years ago
tosh
162 points
More