HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
721.
Llama.cpp 30B runs with only 6GB of RAM now
github.com/ggerganov
414 comments
3 years ago
msoad
1311 points
722.
Llama3 implemented from scratch
github.com/naklecha
269 comments
2 years ago
Hadi7546
1041 points
723.
Llama.cpp: Port of Facebook's LLaMA model in C/C++, with Apple Silicon support
github.com/ggerganov
284 comments
3 years ago
mrtksn
989 points
724.
Facebook LLAMA is being openly distributed via torrents
github.com/facebookresearch
693 comments
3 years ago
micro_charm
909 points
725.
Llama.cpp: Full CUDA GPU Acceleration
github.com/ggerganov
310 comments
3 years ago
gzer0
728 points
726.
Llama2.c: Inference llama 2 in one file of pure C
github.com/karpathy
165 comments
3 years ago
anjneymidha
707 points
727.
Show HN: Llama 3.2 Interpretability with Sparse Autoencoders
github.com/PaulPauls
99 comments
2 years ago
PaulPauls
579 points
728.
Llama: Add grammar-based sampling
github.com/ggerganov
105 comments
3 years ago
davepeck
417 points
729.
New exponent functions that make SiLU and SoftMax 2x faster, at full accuracy
github.com/ggerganov
72 comments
2 years ago
weinzierl
382 points
730.
Show HN: Llama-dl – high-speed download of LLaMA, Facebook's 65B GPT model
github.com/shawwn
130 comments
3 years ago
sillysaurusx
343 points
731.
LLama.cpp now has a web interface
github.com/ggerganov
49 comments
3 years ago
xal
328 points
732.
NotebookLlama: An open source version of NotebookLM
github.com/meta-llama
72 comments
2 years ago
bibinmohan
322 points
733.
Llama 2 Everywhere (L2E): Standalone, Binary Portable, Bootable Llama 2
github.com/trholding
55 comments
3 years ago
jjwiseman
320 points
734.
Llama 3.1 Omni Model
github.com/ictnlp
41 comments
2 years ago
taikon
304 points
735.
M2 Ultra can run 128 streams of Llama 2 7B in parallel
github.com/ggerganov
173 comments
3 years ago
behnamoh
268 points
736.
Fork of Facebook’s LLaMa model to run on CPU
github.com/markasoftware
170 comments
3 years ago
__anon-2023__
246 points
737.
Show HN: LlamaGym – fine-tune LLM agents with online reinforcement learning
github.com/KhoomeiK
28 comments
2 years ago
KhoomeiK
239 points
738.
Show HN: LlamaGPT – Self-hosted, offline, private AI chatbot, powered by Llama 2
github.com/getumbrel
75 comments
3 years ago
mayankchhabra
225 points
739.
Llama-Scan: Convert PDFs to Text W Local LLMs
github.com/ngafar
83 comments
10 months ago
nawazgafar
221 points
740.
llama-fs: A self-organizing file system with llama 3
github.com/iyaja
62 comments
2 years ago
archb
221 points
741.
Llama 3.1 in C
github.com/trholding
36 comments
2 years ago
AMICABoard
212 points
742.
Llama.rs – Rust port of llama.cpp for fast LLaMA inference on CPU
github.com/setzer22
24 comments
3 years ago
rrampage
202 points
743.
Llama 2 on ONNX runs locally
github.com/microsoft
76 comments
3 years ago
tmoneyy
190 points
744.
Show HN: Llama2 Embeddings FastAPI Server
github.com/Dicklesworthstone
31 comments
3 years ago
eigenvalue
178 points
745.
Llama.cpp Now Supports Qwen2-VL (Vision Language Model)
github.com/ggerganov
50 comments
2 years ago
BUFU
155 points
746.
Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs
github.com/hiyouga
19 comments
9 months ago
jinqueeny
132 points
747.
Llama-agents: an async-first framework for building production ready agents
github.com/run-llama
28 comments
2 years ago
pierre
116 points
748.
Show HN: LLaMA tokenizer that runs in browser
github.com/belladoreai
23 comments
3 years ago
belladoreai
115 points
749.
LlamaAcademy: Teach GPTs to understand API documentation with LoRA
github.com/danielgross
11 comments
3 years ago
danicgross
104 points
750.
Performance of llama.cpp on Apple Silicon A-series
github.com/ggerganov
41 comments
3 years ago
mobilio
100 points