HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
61.
▲
Show HN: GPT-J inference on the CPU using C/C++
github.com/ggerganov
2 comments
4 years ago
ggerganov
5 points
62.
▲
Llama.cpp: SOTA 2-bit quants
github.com/ggerganov
discuss
2 years ago
tosh
5 points
63.
▲
Falcon 40B Working on Ggml
github.com/ggerganov
discuss
3 years ago
__anon-2023__
5 points
64.
▲
Whisper.cpp: Port of OpenAI's Whisper model in C/C++
github.com/ggerganov
discuss
4 years ago
lnyan
5 points
65.
▲
HNTERM – Browse Hacker News interactively in your terminal
github.com/ggerganov
discuss
4 years ago
graderjs
5 points
66.
▲
ImTui: Immediate Mode Text-Based User Interface C++ Library
github.com/ggerganov
discuss
4 years ago
seansh
5 points
67.
▲
Real-Time Capturing Exact Keystrokes Using Sound
github.com/ggerganov
discuss
8 years ago
foobaw
5 points
68.
▲
Llama.cpp speculative sampling: 2x faster inference for large models
github.com/ggerganov
1 comment
3 years ago
bobivl
4 points
69.
▲
Show HN: GGWave – Data over Sound for Microcontrollers
github.com/ggerganov
1 comment
4 years ago
ggerganov
4 points
70.
▲
ImTui: Immediate Mode Text-Based User Interface C++ Library
github.com/ggerganov
1 comment
5 years ago
signa11
4 points
71.
▲
Llama.cpp PR with 99% of code written by DeepSeek-R1
github.com/ggerganov
discuss
a year ago
zelag
4 points
72.
▲
Wchess
github.com/ggerganov
discuss
3 years ago
tosh
4 points
73.
▲
Whisper.wasm
github.com/ggerganov
discuss
3 years ago
tosh
4 points
74.
▲
AMD ROCm Support Added to Llama.cpp
github.com/ggerganov
discuss
3 years ago
irusensei
4 points
75.
▲
Full GPU Inference of LLaMA on Apple Silicon Using Metal
github.com/ggerganov
discuss
3 years ago
behnamoh
4 points
76.
▲
Inference at the Edge
github.com/ggerganov
discuss
3 years ago
Mizza
4 points
77.
▲
Whisper: performant port of OpenAI's Whisper spech recognition model in C/C++
github.com/ggerganov
discuss
4 years ago
nateb2022
4 points
78.
▲
ImTui: Immediate Mode Text-Based User Interface Library for C++
github.com/ggerganov
discuss
7 years ago
pcr910303
4 points
79.
▲
ImTui: Immediate mode text-based user interface library
github.com/ggerganov
discuss
7 years ago
ingve
4 points
80.
▲
Llama.cpp now supports tool calling (OpenAI-compatible)
github.com/ggerganov
1 comment
a year ago
ochafik
3 points
81.
▲
GGML Flash Attention support merged into llama.cpp
github.com/ggerganov
1 comment
2 years ago
smcleod
3 points
82.
▲
Show HN: GPT-2 inference on the CPU using C/C++
github.com/ggerganov
1 comment
4 years ago
ggerganov
3 points
83.
▲
Show HN: Tweet2Doom – A Twitter bot that plays Doom
github.com/ggerganov
1 comment
5 years ago
ggerganov
3 points
84.
▲
Show HN: ggwave – tiny data-over-sound library
github.com/ggerganov
1 comment
5 years ago
ggerganov
3 points
85.
▲
Whisper.cpp: Looking for Maintainers
github.com/ggerganov
discuss
a year ago
tech234a
3 points
86.
▲
Distributed LLM Inference with Llama.cpp
github.com/ggerganov
discuss
2 years ago
tosh
3 points
87.
▲
Control Vectors have been added to llama.cpp
github.com/ggerganov
discuss
2 years ago
Der_Einzige
3 points
88.
▲
Llama.cpp supports distributed inference across machines on a local network
github.com/ggerganov
discuss
2 years ago
behnamoh
3 points
89.
▲
CUDA: Faster Mixtral Prompt Processing
github.com/ggerganov
discuss
3 years ago
tosh
3 points
90.
▲
Llama.cpp: Support for Phi-2
github.com/ggerganov
discuss
3 years ago
tosh
3 points
More