HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Ask HN: Are you training and running custom LLMs and how are you doing it?
2 comments
3 years ago
kordlessagain
15 points
2.
▲
vLLM introduces memory optimizations for long-context inference
github.com/vllm-project
discuss
3 months ago
addisud
5 points
3.
▲
vLLM IR: A Functional Intermediate Representation for vLLM
github.com/vllm-project
discuss
2 months ago
matt_d
4 points
4.
▲
Vllm: High-throughput and memory-efficient inference and serving engine for LLMs
github.com/vllm-project
discuss
3 years ago
tosh
3 points
5.
▲
Vllm
github.com/vllm-project
discuss
3 years ago
kordlessagain
3 points
6.
▲
VLLM-Omni: A framework for efficient model inference with Omni-modality models
github.com/vllm-project
1 comment
7 months ago
zyh888
2 points
7.
▲
vLLM (high-throughput LLM serving engine)
github.com/vllm-project
discuss
4 months ago
roody_wurlitzer
2 points
8.
▲
Easy, fast, and cheap LLM serving for everyone
github.com/vllm-project
discuss
3 years ago
vincent_s
2 points
9.
▲
Official PR Reveals the Inference Code for Mixtral 8x7B
github.com/vllm-project
discuss
3 years ago
georgehill
2 points
10.
▲
VLLM
github.com/vllm-project
discuss
3 years ago
sherlockxu
2 points
11.
▲
LLM compressor: compress models for efficient deployment
github.com/vllm-project
1 comment
2 years ago
hajduksplit
1 points
12.
▲
vLLM multi-turn conversations design
github.com/vllm-project
discuss
5 months ago
CCs
1 points
13.
▲
Cost-efficient and pluggable Infrastructure components for GenAI inference
github.com/vllm-project
discuss
a year ago
rrampage
1 points
14.
▲
Cost-efficient and pluggable Infrastructure components for GenAI inference
github.com/vllm-project
discuss
a year ago
delduca
1 points
15.
▲
VLLM Sacrifices Accuracy for Speed
github.com/vllm-project
discuss
2 years ago
behnamoh
1 points
16.
▲
vllm
github.com/vllm-project
discuss
3 years ago
tosh
1 points
17.
▲
Mixtral Expert Parallelism
github.com/vllm-project
discuss
3 years ago
tosh
1 points
18.
▲
I made a GitHub repo for (beginner) Python devs using LangChain for LLM projects
github.com/lypsoty112
1 comment
2 years ago
MaartenBoon
1 points
19.
▲
Feedback on an open source Ruby – LLM project
github.com/pcarolan
1 comment
7 months ago
pcarolan
7 points
20.
▲
Memex: Rust powered “memory” (doc store and semantic search) for LLM projects
github.com/spyglass-search
discuss
3 years ago
homarp
2 points
21.
▲
Show HN: Νοῦς – A Customizable LLM Project
github.com/Albertlungu
discuss
6 months ago
albertlungu
1 points
22.
▲
Show HN: Laminar – Open-Source DataDog + PostHog for LLM Apps, Built in Rust
github.com/lmnr-ai
45 comments
2 years ago
skull8888888
203 points
23.
▲
Show HN: OpenLIT – Open-Source LLM Observability with OpenTelemetry
github.com/openlit
22 comments
2 years ago
aman_041
62 points
24.
▲
Show HN: OxyJen – Java framework to orchestrate LLMs in a graph-style execution
discuss
4 months ago
bdivyansh11
2 points
25.
▲
Show HN: Tokuin – CLI load tester and token estimator for LLM APIs
github.com/nooscraft
discuss
7 months ago
oshadha89
2 points
26.
▲
OpenLIT – Open-Source LLM Observability with OpenTelemetry
discuss
2 years ago
aman_041
2 points
27.
▲
Show HN: Hnsqlite: hnswlib and SQLite integrated for text embedding search
github.com/jiggy-ai
discuss
3 years ago
wskish
2 points
28.
▲
Show HN: LLM AuthZ Audit – find auth gaps and prompt injection in LLM apps
github.com/aiauthz
discuss
4 months ago
iamspathan
1 points
29.
▲
Show HN: Ask AI Paul Graham
pocket-pg-851564657364.us-east1.run.app
discuss
a year ago
zh2408
1 points
30.
▲
Show HN: OpenLIT – Open-Source LLM Observability with OpenTelemetry
github.com/openlit
discuss
2 years ago
patcher99
1 points
More