Search: github.com/ggerganov | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

61.

Show HN: GPT-J inference on the CPU using C/C++

github.com/ggerganov

4 years ago

5 points

62.

Llama.cpp: SOTA 2-bit quants

github.com/ggerganov

2 years ago

5 points

63.

Falcon 40B Working on Ggml

github.com/ggerganov

3 years ago

5 points

64.

Whisper.cpp: Port of OpenAI's Whisper model in C/C++

github.com/ggerganov

4 years ago

5 points

65.

HNTERM – Browse Hacker News interactively in your terminal

github.com/ggerganov

4 years ago

5 points

66.

ImTui: Immediate Mode Text-Based User Interface C++ Library

github.com/ggerganov

4 years ago

5 points

67.

Real-Time Capturing Exact Keystrokes Using Sound

github.com/ggerganov

8 years ago

5 points

68.

Llama.cpp speculative sampling: 2x faster inference for large models

github.com/ggerganov

3 years ago

4 points

69.

Show HN: GGWave – Data over Sound for Microcontrollers

github.com/ggerganov

4 years ago

4 points

70.

ImTui: Immediate Mode Text-Based User Interface C++ Library

github.com/ggerganov

5 years ago

4 points

71.

Llama.cpp PR with 99% of code written by DeepSeek-R1

github.com/ggerganov

a year ago

4 points

72.

github.com/ggerganov

3 years ago

4 points

73.

github.com/ggerganov

3 years ago

4 points

74.

AMD ROCm Support Added to Llama.cpp

github.com/ggerganov

3 years ago

4 points

75.

Full GPU Inference of LLaMA on Apple Silicon Using Metal

github.com/ggerganov

3 years ago

4 points

76.

Inference at the Edge

github.com/ggerganov

3 years ago

4 points

77.

Whisper: performant port of OpenAI's Whisper spech recognition model in C/C++

github.com/ggerganov

4 years ago

4 points

78.

ImTui: Immediate Mode Text-Based User Interface Library for C++

github.com/ggerganov

7 years ago

4 points

79.

ImTui: Immediate mode text-based user interface library

github.com/ggerganov

7 years ago

4 points

80.

Llama.cpp now supports tool calling (OpenAI-compatible)

github.com/ggerganov

a year ago

3 points

81.

GGML Flash Attention support merged into llama.cpp

github.com/ggerganov

2 years ago

3 points

82.

Show HN: GPT-2 inference on the CPU using C/C++

github.com/ggerganov

4 years ago

3 points

83.

Show HN: Tweet2Doom – A Twitter bot that plays Doom

github.com/ggerganov

5 years ago

3 points

84.

Show HN: ggwave – tiny data-over-sound library

github.com/ggerganov

5 years ago

3 points

85.

Whisper.cpp: Looking for Maintainers

github.com/ggerganov

a year ago

3 points

86.

Distributed LLM Inference with Llama.cpp

github.com/ggerganov

2 years ago

3 points

87.

Control Vectors have been added to llama.cpp

github.com/ggerganov

2 years ago

3 points

88.

Llama.cpp supports distributed inference across machines on a local network

github.com/ggerganov

2 years ago

3 points

89.

CUDA: Faster Mixtral Prompt Processing

github.com/ggerganov

3 years ago

3 points

90.

Llama.cpp: Support for Phi-2

github.com/ggerganov

3 years ago

3 points