HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
841.
▲
Control Vectors have been added to llama.cpp
github.com/ggerganov
discuss
2 years ago
Der_Einzige
3 points
842.
▲
Llama.cpp supports distributed inference across machines on a local network
github.com/ggerganov
discuss
2 years ago
behnamoh
3 points
843.
▲
Llama-Terminal-Completion
github.com/adammpkins
discuss
3 years ago
tosh
3 points
844.
▲
CUDA: Faster Mixtral Prompt Processing
github.com/ggerganov
discuss
3 years ago
tosh
3 points
845.
▲
Llama.cpp: Support for Phi-2
github.com/ggerganov
discuss
3 years ago
tosh
3 points
846.
▲
QMoE Support for Mixtral
github.com/ggerganov
discuss
3 years ago
tosh
3 points
847.
▲
Karpathy removes llama licence from llama2.c
github.com/karpathy
discuss
3 years ago
orwellg1984
3 points
848.
▲
A Clojure Wrapper for Llama.cpp
github.com/phronmophobic
discuss
3 years ago
simonpure
3 points
849.
▲
Llama Recipes
github.com/facebookresearch
discuss
3 years ago
atg_abhishek
3 points
850.
▲
Llama 2: poc for running 70B on CPU
github.com/ggerganov
discuss
3 years ago
tosh
3 points
851.
▲
Inference at the edge: Efficient transformer model inference on-device
github.com/ggerganov
discuss
3 years ago
lioeters
3 points
852.
▲
K-Quants
github.com/ggerganov
discuss
3 years ago
tosh
3 points
853.
▲
Suddenly 403 Forbidden (LLaMA)
github.com/facebookresearch
discuss
3 years ago
grae_QED
3 points
854.
▲
Connect your LLM with external data
github.com/jerryjliu
discuss
3 years ago
snork_alt
3 points
855.
▲
Llama.cpp: Add GPU support to ggml
github.com/ggerganov
discuss
3 years ago
mromanuk
3 points
856.
▲
LLaMA-Adapter: Efficient Fine-Tuning of LLaMA
github.com/ZrrSkywalker
discuss
3 years ago
GaggiX
3 points
857.
▲
Show HN: LlamaBot – Turn any Rails app into an autonomous AI agent in 2 minutes
github.com/KodyKendall
3 comments
a year ago
kody_06
2 points
858.
▲
LLM quantization severely damages model quality and perplexity
github.com/ggerganov
3 comments
3 years ago
behnamoh
2 points
859.
▲
Llama.cpp with CUDA Support on Original Jetson Nano (4GB)
github.com/kreier
2 comments
3 months ago
Abishek_Muthian
2 points
860.
▲
LLaMA Terminal Completion, a local virtual assistant for the terminal
github.com/adammpkins
2 comments
3 years ago
adammpkins
2 points
861.
▲
Show HN: Liteparse, an OSS universal fast document parser by LlamaParse team
github.com/run-llama
1 comment
3 months ago
pierre
2 points
862.
▲
How to verify that a snippet of Python code doesn't access protected members
github.com/run-llama
1 comment
2 years ago
tslmy
2 points
863.
▲
Finetune LLaMa2 for Any Language
github.com/UnderstandLingBV
1 comment
3 years ago
UnderstandLing
2 points
864.
▲
Llama2.mojo - outperforms Karpathy’s llama2.c by 30% in multi-threaded inference
github.com/tairov
1 comment
3 years ago
swyx
2 points
865.
▲
Show HN: Llama2.f90 – Toy LLaMA2 model inference in Fortran
github.com/rbitr
1 comment
3 years ago
andy99
2 points
866.
▲
Python bindings (and OpenAI API compatible server) for llama.cpp
github.com/abetlen
1 comment
3 years ago
tosh
2 points
867.
▲
Llama.swift
github.com/alexrozanski
1 comment
3 years ago
alexrozanski
2 points
868.
▲
Show HN: Llamactl – Self-hosted LLM manager with OpenAI-compatible routing
github.com/lordmathis
discuss
3 months ago
lordmathis
2 points
869.
▲
Llama-swap: Reliable model swapping
github.com/mostlygeek
discuss
3 months ago
hbcondo714
2 points
870.
▲
Show HN: Llamada – minimalist toolkit to define functions with prompts
github.com/blaesus
discuss
8 months ago
blaesus
2 points
More