HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
811.
Llama.cpp speculative sampling: 2x faster inference for large models
github.com/ggerganov
1 comment
3 years ago
bobivl
4 points
812.
Meta DMCAs llama-dl Repository
github.com/shawwn
1 comment
3 years ago
ronsor
4 points
813.
LLaMA models licence change to Apache 2.0 approved
github.com/facebookresearch
1 comment
3 years ago
matjet
4 points
814.
LlamaLib: A cross-platform C++/C# library for local LLMs based on llama.cpp
github.com/undreamai
discuss
5 months ago
benuix
4 points
815.
Llama.cpp PR with 99% of code written by DeepSeek-R1
github.com/ggerganov
discuss
a year ago
zelag
4 points
816.
Llama-Stack
github.com/meta-llama
discuss
2 years ago
ushakov
4 points
817.
Llama2 in Mojo is 15-20% faster than llama2.c
github.com/tairov
discuss
3 years ago
yoquan
4 points
818.
AMD ROCm Support Added to Llama.cpp
github.com/ggerganov
discuss
3 years ago
irusensei
4 points
819.
Show HN: Llama2 Embeddings FastAPI Service
github.com/Dicklesworthstone
discuss
3 years ago
eigenvalue
4 points
820.
Llama – A CLI for outsourcing computation to AWS Lambda
github.com/nelhage
discuss
3 years ago
pavanyara
4 points
821.
Full GPU Inference of LLaMA on Apple Silicon Using Metal
github.com/ggerganov
discuss
3 years ago
behnamoh
4 points
822.
Inference at the Edge
github.com/ggerganov
discuss
3 years ago
Mizza
4 points
823.
Llama – A Terminal File Manager
github.com/antonmedv
discuss
4 years ago
marban
4 points
824.
Show HN: Fine-tune llama3 model to support function calling
github.com/michaelnny
5 comments
2 years ago
michaelnny
3 points
825.
Llama-zip: a command-line utility for lossless text compression
github.com/AlexBuz
2 comments
2 years ago
ukuina
3 points
826.
Show HN: Llamaphone – Single-file Front end for Llamafile
github.com/KerbalNo15
2 comments
3 years ago
KerbalNo15
3 points
827.
Llama2.zig
github.com/donge
2 comments
3 years ago
donge
3 points
828.
Llama.cpp now supports tool calling (OpenAI-compatible)
github.com/ggerganov
1 comment
a year ago
ochafik
3 points
829.
GGML Flash Attention support merged into llama.cpp
github.com/ggerganov
1 comment
2 years ago
smcleod
3 points
830.
LlamaStash – Zero-overhead, terminal-native llama.cpp launcher
github.com/llamastash
discuss
23 days ago
deepu105
3 points
831.
ParseBench: Document Parsing Benchmark for AI Agents
github.com/run-llama
discuss
2 months ago
firasd
3 points
832.
To Use Snapdragon NPU, HTP Ops Libraries Must Be Signed with Trusted Certs
github.com/qualcomm
discuss
5 months ago
WhereIsTheTruth
3 points
833.
Disclaimer: I am not a webdev, this PR was vibe coded
github.com/olegshulyakov
discuss
10 months ago
WhereIsTheTruth
3 points
834.
Show HN: Workflows.py, the best way to build agents
github.com/run-llama
discuss
a year ago
pierre
3 points
835.
A tool for migrating and optimizing prompts from other LLMs to Llama
github.com/meta-llama
discuss
a year ago
yawnxyz
3 points
836.
Open source Claude Artifacts – built with Llama 3.1 405B
github.com/Nutlope
discuss
2 years ago
sabrina_ramonov
3 points
837.
Distributed LLM Inference with Llama.cpp
github.com/ggerganov
discuss
2 years ago
tosh
3 points
838.
Practical Llama 3 inference implemented in a single Java file
github.com/mukel
discuss
2 years ago
simonpure
3 points
839.
Meta Llama 3 GitHub
github.com/meta-llama
discuss
2 years ago
adif_sgaid
3 points
840.
LlamaIndex is a data framework for your LLM applications
github.com/run-llama
discuss
2 years ago
Brajeshwar
3 points