HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
91.
▲
QMoE Support for Mixtral
github.com/ggerganov
discuss
3 years ago
tosh
3 points
92.
▲
Llama 2: poc for running 70B on CPU
github.com/ggerganov
discuss
3 years ago
tosh
3 points
93.
▲
Inference at the edge: Efficient transformer model inference on-device
github.com/ggerganov
discuss
3 years ago
lioeters
3 points
94.
▲
K-Quants
github.com/ggerganov
discuss
3 years ago
tosh
3 points
95.
▲
StableLM already being ported to ggml
github.com/ggerganov
discuss
3 years ago
theolivenbaum
3 points
96.
▲
Llama.cpp: Add GPU support to ggml
github.com/ggerganov
discuss
3 years ago
mromanuk
3 points
97.
▲
Tweet2Doom: A Twitter bot that plays Doom
github.com/ggerganov
discuss
5 years ago
ggerganov
3 points
98.
▲
Kbd-audio – Tools for capturing and analysing keyboard input paired with
github.com/ggerganov
discuss
8 years ago
pplonski86
3 points
99.
▲
LLM quantization severely damages model quality and perplexity
github.com/ggerganov
3 comments
3 years ago
behnamoh
2 points
100.
▲
Show HN: r2t2 – Transmit data with the PC speaker
github.com/ggerganov
1 comment
5 years ago
ggerganov
2 points
101.
▲
Show HN: Using talking buttons and data-over-sound to control devices
github.com/ggerganov
1 comment
5 years ago
ggerganov
2 points
102.
▲
Show HN: Waver – Messaging Through Sound
github.com/ggerganov
1 comment
5 years ago
ggerganov
2 points
103.
▲
Llama.vim: Plugin for Neovim
github.com/ggerganov
discuss
2 years ago
mariuz
2 points
104.
▲
Llama.vim: Plugin for Neovim
github.com/ggerganov
discuss
2 years ago
ibobev
2 points
105.
▲
Attention and final logit soft-capping, update scaling factor to Gemma2
github.com/ggerganov
discuss
2 years ago
tosh
2 points
106.
▲
ggml: Add Flash Attention
github.com/ggerganov
discuss
2 years ago
tosh
2 points
107.
▲
llama.cpp bfloat16 support
github.com/ggerganov
discuss
2 years ago
indigodaddy
2 points
108.
▲
Llama.cpp: Mac Prebuilds
github.com/ggerganov
discuss
2 years ago
tosh
2 points
109.
▲
Llama.cpp incoming backends: Vulkan, Kompute, SYCL
github.com/ggerganov
discuss
2 years ago
irusensei
2 points
110.
▲
Llama.cpp: Self-Extend Support
github.com/ggerganov
discuss
2 years ago
tosh
2 points
111.
▲
GGUF File Format
github.com/ggerganov
discuss
2 years ago
warkanlock
2 points
112.
▲
K-Quants
github.com/ggerganov
discuss
2 years ago
tosh
2 points
113.
▲
Llama: Add Mixtral Support
github.com/ggerganov
discuss
3 years ago
tosh
2 points
114.
▲
Performance of Llama.cpp on Apple Silicon
github.com/ggerganov
discuss
3 years ago
tosh
2 points
115.
▲
(2) Apple Silicon Performance · ggerganov/llama.cpp · Discussion #4167
github.com/ggerganov
discuss
3 years ago
gavi
2 points
116.
▲
Llama.cpp Was Hacked in an Evening
github.com/ggerganov
discuss
3 years ago
behnamoh
2 points
117.
▲
Llama.cpp Supports Falcon Now
github.com/ggerganov
discuss
3 years ago
gslin
2 points
118.
▲
New llama.cpp format GGUF now merged
github.com/ggerganov
discuss
3 years ago
mchiang
2 points
119.
▲
GPU Support to Ggml
github.com/ggerganov
discuss
3 years ago
melenaboija
2 points
120.
▲
Ggwave: Tiny Data-over-Sound Library
github.com/ggerganov
discuss
3 years ago
lachlan_gray
2 points