HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
91.
Llama.cpp supports distributed inference across machines on a local network
github.com/ggerganov
discuss
2 years ago
behnamoh
3 points
92.
CUDA: Faster Mixtral Prompt Processing
github.com/ggerganov
discuss
3 years ago
tosh
3 points
93.
Llama.cpp: Support for Phi-2
github.com/ggerganov
discuss
3 years ago
tosh
3 points
94.
QMoE Support for Mixtral
github.com/ggerganov
discuss
3 years ago
tosh
3 points
95.
Llama 2: poc for running 70B on CPU
github.com/ggerganov
discuss
3 years ago
tosh
3 points
96.
Inference at the edge: Efficient transformer model inference on-device
github.com/ggerganov
discuss
3 years ago
lioeters
3 points
97.
K-Quants
github.com/ggerganov
discuss
3 years ago
tosh
3 points
98.
StableLM already being ported to ggml
github.com/ggerganov
discuss
3 years ago
theolivenbaum
3 points
99.
Llama.cpp: Add GPU support to ggml
github.com/ggerganov
discuss
3 years ago
mromanuk
3 points
100.
Tweet2Doom: A Twitter bot that plays Doom
github.com/ggerganov
discuss
5 years ago
ggerganov
3 points
101.
Kbd-audio – Tools for capturing and analysing keyboard input paired with
github.com/ggerganov
discuss
8 years ago
pplonski86
3 points
102.
LLM quantization severely damages model quality and perplexity
github.com/ggerganov
3 comments
3 years ago
behnamoh
2 points
103.
Show HN: r2t2 – Transmit data with the PC speaker
github.com/ggerganov
1 comment
5 years ago
ggerganov
2 points
104.
Show HN: Using talking buttons and data-over-sound to control devices
github.com/ggerganov
1 comment
5 years ago
ggerganov
2 points
105.
Show HN: Waver – Messaging Through Sound
github.com/ggerganov
1 comment
5 years ago
ggerganov
2 points
106.
Rust macro to generate AI code at compile-time
github.com/germangb
discuss
6 months ago
michidk
2 points
107.
A Transformer-based model predicting the articles of German nouns
github.com/dominik3141
discuss
a year ago
jimmy76615
2 points
108.
Llama.vim: Plugin for Neovim
github.com/ggerganov
discuss
2 years ago
mariuz
2 points
109.
Llama.vim: Plugin for Neovim
github.com/ggerganov
discuss
2 years ago
ibobev
2 points
110.
Attention and final logit soft-capping, update scaling factor to Gemma2
github.com/ggerganov
discuss
2 years ago
tosh
2 points
111.
ggml: Add Flash Attention
github.com/ggerganov
discuss
2 years ago
tosh
2 points
112.
llama.cpp bfloat16 support
github.com/ggerganov
discuss
2 years ago
indigodaddy
2 points
113.
Llama.cpp: Mac Prebuilds
github.com/ggerganov
discuss
2 years ago
tosh
2 points
114.
DigesterBot: A Telegram bot to help you study
github.com/german94
discuss
2 years ago
gpinzon94
2 points
115.
Llama.cpp incoming backends: Vulkan, Kompute, SYCL
github.com/ggerganov
discuss
2 years ago
irusensei
2 points
116.
Llama.cpp: Self-Extend Support
github.com/ggerganov
discuss
2 years ago
tosh
2 points
117.
GGUF File Format
github.com/ggerganov
discuss
2 years ago
warkanlock
2 points
118.
K-Quants
github.com/ggerganov
discuss
2 years ago
tosh
2 points
119.
Show HN: Modern C++ implementations of a words counter with benchmarks
github.com/germandiagogomez
discuss
3 years ago
germandiago
2 points
120.
Llama: Add Mixtral Support
github.com/ggerganov
discuss
3 years ago
tosh
2 points