Search: github.com/ollama | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

811.

Llama.cpp speculative sampling: 2x faster inference for large models

github.com/ggerganov

3 years ago

4 points

812.

Meta DMCAs llama-dl Repository

github.com/shawwn

3 years ago

4 points

813.

LLaMA models licence change to Apache 2.0 approved

github.com/facebookresearch

3 years ago

4 points

814.

LlamaLib: A cross-platform C++/C# library for local LLMs based on llama.cpp

github.com/undreamai

5 months ago

4 points

815.

Llama.cpp PR with 99% of code written by DeepSeek-R1

github.com/ggerganov

a year ago

4 points

816.

github.com/meta-llama

2 years ago

4 points

817.

Llama2 in Mojo is 15-20% faster than llama2.c

github.com/tairov

3 years ago

4 points

818.

AMD ROCm Support Added to Llama.cpp

github.com/ggerganov

3 years ago

4 points

819.

Show HN: Llama2 Embeddings FastAPI Service

github.com/Dicklesworthstone

3 years ago

4 points

820.

Llama – A CLI for outsourcing computation to AWS Lambda

github.com/nelhage

3 years ago

4 points

821.

Full GPU Inference of LLaMA on Apple Silicon Using Metal

github.com/ggerganov

3 years ago

4 points

822.

Inference at the Edge

github.com/ggerganov

3 years ago

4 points

823.

Llama – A Terminal File Manager

github.com/antonmedv

4 years ago

4 points

824.

Show HN: Fine-tune llama3 model to support function calling

github.com/michaelnny

2 years ago

3 points

825.

Llama-zip: a command-line utility for lossless text compression

github.com/AlexBuz

2 years ago

3 points

826.

Show HN: Llamaphone- Single-file Front end for Llamafile

github.com/KerbalNo15

3 years ago

3 points

827.

github.com/donge

3 years ago

3 points

828.

Llama.cpp now supports tool calling (OpenAI-compatible)

github.com/ggerganov

a year ago

3 points

829.

GGML Flash Attention support merged into llama.cpp

github.com/ggerganov

2 years ago

3 points

830.

LlamaStash – Zero-overhead, terminal-native llama.cpp launcher

github.com/llamastash

23 days ago

3 points

831.

ParseBench: Document Parsing Benchmark for AI Agents

github.com/run-llama

2 months ago

3 points

832.

To Use Snapdragon NPU, HTP Ops Libraries Must Be Signed with Trusted Certs

github.com/qualcomm

5 months ago

WhereIsTheTruth

3 points

833.

Disclaimer: I am not a webdev, this PR was vibe coded

github.com/olegshulyakov

10 months ago

WhereIsTheTruth

3 points

834.

Show HN: Worflows.py, the best way to build agents

github.com/run-llama

a year ago

3 points

835.

A tool for migrating and optimizing prompts from other LLMs to Llama

github.com/meta-llama

a year ago

3 points

836.

Open source Claude Artifacts – built with Llama 3.1 405B

github.com/Nutlope

2 years ago

sabrina_ramonov

3 points

837.

Distributed LLM Inference with Llama.cpp

github.com/ggerganov

2 years ago

3 points

838.

Practical Llama 3 inference implemented in a single Java file

github.com/mukel

2 years ago

3 points

839.

Meta Llama 3 GitHub

github.com/meta-llama

2 years ago

3 points

840.

LlamaIndex is a data framework for your LLM applications

github.com/run-llama

2 years ago

3 points