HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
661.
▲
DeepSeek-R1 speeds up llama.cpp code by x2
github.com/ggerganov
3 comments
a year ago
roboboffin
6 points
662.
▲
Show HN: SAM3-CPU – Run Segment Anything on CPU with memory-aware video chunking
github.com/rhubarb-ai
1 comment
3 months ago
judlaw
6 points
663.
▲
llama.cpp now supports StarCoder model series
github.com/ggerganov
1 comment
3 years ago
wsxiaoys
6 points
664.
▲
LLaMA 7B model running on 4GB RAM Raspberry Pi 4
github.com/ggerganov
discuss
3 years ago
amrrs
6 points
665.
▲
Show HN: Spirit of C++
github.com/legends2k
4 comments
7 years ago
legends2k
5 points
666.
▲
Bloomz.cpp: Run multilingual BLOOM model with C++
github.com/NouamaneTazi
2 comments
3 years ago
osanseviero
5 points
667.
▲
I implemented CLIP inference in plain C/C++
github.com/monatis
1 comment
3 years ago
monatis
5 points
668.
▲
AWS is laying the groundwork for nested virtualization on EC2
github.com/aws
discuss
4 months ago
acj
5 points
669.
▲
Show HN: AI-SDK-Cpp – Unified C++ SDK for OpenAI, Anthropic, and More
github.com/iskakaushik
discuss
a year ago
cauchyk
5 points
670.
▲
Safetensors.cpp – Zero Dependency Safetensors Loading and Storing in C++
github.com/carsonpo
discuss
2 years ago
carsonpoole
5 points
671.
▲
Llama.cpp: SOTA 2-bit quants
github.com/ggerganov
discuss
2 years ago
tosh
5 points
672.
▲
Whisper.cpp: Port of OpenAI's Whisper model in C/C++
github.com/ggerganov
discuss
4 years ago
lnyan
5 points
673.
▲
Show HN: 2048.cpp – Play 2048 in your terminal
github.com/plibither8
discuss
8 years ago
plibither8
5 points
674.
▲
Llama.cpp speculative sampling: 2x faster inference for large models
github.com/ggerganov
1 comment
3 years ago
bobivl
4 points
675.
▲
Prima.cpp – run 70B-Scale LLMs on low-powered home clusters
github.com/Lizonghang
discuss
a year ago
oleg_tarasov
4 points
676.
▲
Llama.cpp PR with 99% of code written by DeepSeek-R1
github.com/ggerganov
discuss
a year ago
zelag
4 points
677.
▲
Bark.cpp: Port of Suno AI's Bark in C/C++ for fast inference
github.com/PABannier
discuss
2 years ago
siraben
4 points
678.
▲
Source code of Google Gemma model in C++
github.com/google
discuss
2 years ago
yu3zhou4
4 points
679.
▲
Wchess
github.com/ggerganov
discuss
3 years ago
tosh
4 points
680.
▲
Whisper.wasm
github.com/ggerganov
discuss
3 years ago
tosh
4 points
681.
▲
AMD ROCm Support Added to Llama.cpp
github.com/ggerganov
discuss
3 years ago
irusensei
4 points
682.
▲
Full GPU Inference of LLaMA on Apple Silicon Using Metal
github.com/ggerganov
discuss
3 years ago
behnamoh
4 points
683.
▲
Inference at the Edge
github.com/ggerganov
discuss
3 years ago
Mizza
4 points
684.
▲
Whisper: performant port of OpenAI's Whisper spech recognition model in C/C++
github.com/ggerganov
discuss
4 years ago
nateb2022
4 points
685.
▲
Show HN: Bark.cpp, fast TTS model for multilingual realistic audio generation
github.com/PABannier
3 comments
2 years ago
el_pa_b
3 points
686.
▲
Alpaca 7B running on Google Pixel 7 Pro
github.com/rupeshs
2 comments
3 years ago
oneinfiniteloop
3 points
687.
▲
Llama.cpp now supports tool calling (OpenAI-compatible)
github.com/ggerganov
1 comment
a year ago
ochafik
3 points
688.
▲
GGML Flash Attention support merged into llama.cpp
github.com/ggerganov
1 comment
2 years ago
smcleod
3 points
689.
▲
Kubernetes In-Place Pod Resource Resize in Action: Kube Startup CPU Boost
github.com/google
1 comment
2 years ago
mikowhy
3 points
690.
▲
A C++ AirPlay 2 sender: the encrypted RAOP/RTSP recipe, written down
github.com/akustikrausch
discuss
3 days ago
akustikrausch
3 points
More