Search: github.com/cpq | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

661.

DeepSeek-R1 speeds up llama.cpp code by x2

github.com/ggerganov

a year ago

6 points

662.

Show HN: SAM3-CPU – Run Segment Anything on CPU with memory-aware video chunking

github.com/rhubarb-ai

3 months ago

6 points

663.

llama.cpp now supports StarCoder model series

github.com/ggerganov

3 years ago

6 points

664.

LLaMA 7B model running on 4GB RAM Raspberry Pi 4

github.com/ggerganov

3 years ago

6 points

665.

Show HN: Spirit of C++

github.com/legends2k

7 years ago

5 points

666.

Bloomz.cpp: Run multilingual BLOOM model with C++

github.com/NouamaneTazi

3 years ago

5 points

667.

I implemented CLIP inference in plain C/C++

github.com/monatis

3 years ago

5 points

668.

AWS is laying the groundwork for nested virtualization on EC2

4 months ago

5 points

669.

Show HN: AI-SDK-Cpp – Unified C++ SDK for OpenAI, Anthropic, and More

github.com/iskakaushik

a year ago

5 points

670.

Safetensors.cpp – Zero Dependency Safetensors Loading and Storing in C++

github.com/carsonpo

2 years ago

5 points

671.

Llama.cpp: SOTA 2-bit quants

github.com/ggerganov

2 years ago

5 points

672.

Whisper.cpp: Port of OpenAI's Whisper model in C/C++

github.com/ggerganov

4 years ago

5 points

673.

Show HN: 2048.cpp – Play 2048 in your terminal

github.com/plibither8

8 years ago

5 points

674.

Llama.cpp speculative sampling: 2x faster inference for large models

github.com/ggerganov

3 years ago

4 points

675.

Prima.cpp – run 70B-Scale LLMs on low-powered home clusters

github.com/Lizonghang

a year ago

4 points

676.

Llama.cpp PR with 99% of code written by DeepSeek-R1

github.com/ggerganov

a year ago

4 points

677.

Bark.cpp: Port of Suno AI's Bark in C/C++ for fast inference

github.com/PABannier

2 years ago

4 points

678.

Source code of Google Gemma model in C++

github.com/google

2 years ago

4 points

679.

github.com/ggerganov

3 years ago

4 points

680.

github.com/ggerganov

3 years ago

4 points

681.

AMD ROCm Support Added to Llama.cpp

github.com/ggerganov

3 years ago

4 points

682.

Full GPU Inference of LLaMA on Apple Silicon Using Metal

github.com/ggerganov

3 years ago

4 points

683.

Inference at the Edge

github.com/ggerganov

3 years ago

4 points

684.

Whisper: performant port of OpenAI's Whisper spech recognition model in C/C++

github.com/ggerganov

4 years ago

4 points

685.

Show HN: Bark.cpp, fast TTS model for multilingual realistic audio generation

github.com/PABannier

2 years ago

3 points

686.

Alpaca 7B running on Google Pixel 7 Pro

github.com/rupeshs

3 years ago

oneinfiniteloop

3 points

687.

Llama.cpp now supports tool calling (OpenAI-compatible)

github.com/ggerganov

a year ago

3 points

688.

GGML Flash Attention support merged into llama.cpp

github.com/ggerganov

2 years ago

3 points

689.

Kubernetes In-Place Pod Resource Resize in Action: Kube Startup CPU Boost

github.com/google

2 years ago

3 points

690.

A C++ AirPlay 2 sender: the encrypted RAOP/RTSP recipe, written down

github.com/akustikrausch

3 days ago

3 points