Search: github.com/vllm-project | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

1.

Ask HN: Are you training and running custom LLMs and how are you doing it?

3 years ago

15 points

2.

vLLM introduces memory optimizations for long-context inference

github.com/vllm-project

3 months ago

5 points

3.

vLLM IR: A Functional Intermediate Representation for vLLM

github.com/vllm-project

2 months ago

4 points

4.

vLLM: High-throughput and memory-efficient inference and serving engine for LLMs

github.com/vllm-project

3 years ago

3 points

5.

github.com/vllm-project

3 years ago

3 points

6.

vLLM-Omni: A framework for efficient model inference with Omni-modality models

github.com/vllm-project

7 months ago

2 points

7.

vLLM (high-throughput LLM serving engine)

github.com/vllm-project

4 months ago

roody_wurlitzer

2 points

8.

Easy, fast, and cheap LLM serving for everyone

github.com/vllm-project

3 years ago

2 points

9.

Official PR Reveals the Inference Code for Mixtral 8x7B

github.com/vllm-project

3 years ago

2 points

10.

github.com/vllm-project

3 years ago

2 points

11.

LLM compressor: compress models for efficient deployment

github.com/vllm-project

2 years ago

1 point

12.

vLLM multi-turn conversations design

github.com/vllm-project

5 months ago

1 point

13.

Cost-efficient and pluggable Infrastructure components for GenAI inference

github.com/vllm-project

a year ago

1 point

14.

Cost-efficient and pluggable Infrastructure components for GenAI inference

github.com/vllm-project

a year ago

1 point

15.

vLLM Sacrifices Accuracy for Speed

github.com/vllm-project

2 years ago

1 point

16.

github.com/vllm-project

3 years ago

1 point

17.

Mixtral Expert Parallelism

github.com/vllm-project

3 years ago

1 point

18.

I made a GitHub repo for (beginner) Python devs using LangChain for LLM projects

github.com/lypsoty112

2 years ago

1 point

19.

Feedback on an open source Ruby – LLM project

github.com/pcarolan

7 months ago

7 points

20.

Memex: Rust powered “memory” (doc store and semantic search) for LLM projects

github.com/spyglass-search

3 years ago

2 points

21.

Show HN: Νοῦς – A Customizable LLM Project

github.com/Albertlungu

6 months ago

1 point

22.

Show HN: Laminar – Open-Source DataDog + PostHog for LLM Apps, Built in Rust

github.com/lmnr-ai

2 years ago

203 points

23.

Show HN: OpenLIT – Open-Source LLM Observability with OpenTelemetry

github.com/openlit

2 years ago

62 points

24.

Show HN: OxyJen – Java framework to orchestrate LLMs in a graph-style execution

4 months ago

2 points

25.

Show HN: Tokuin – CLI load tester and token estimator for LLM APIs

github.com/nooscraft

7 months ago

2 points

26.

OpenLIT – Open-Source LLM Observability with OpenTelemetry

2 years ago

2 points

27.

Show HN: Hnsqlite: hnswlib and SQLite integrated for text embedding search

github.com/jiggy-ai

3 years ago

2 points

28.

Show HN: LLM AuthZ Audit – find auth gaps and prompt injection in LLM apps

github.com/aiauthz

4 months ago

1 point

29.

Show HN: Ask AI Paul Graham

pocket-pg-851564657364.us-east1.run.app

a year ago

1 point

30.

Show HN: OpenLIT – Open-Source LLM Observability with OpenTelemetry

github.com/openlit

2 years ago

1 point