HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
481.
▲
Psychec: ML-style type inference for C
github.com/ltcmelo
1 comment
6 months ago
fanf2
2 points
482.
▲
VLLM-Omni: A framework for efficient model inference with Omni-modality models
github.com/vllm-project
1 comment
7 months ago
zyh888
2 points
483.
▲
HAIF – Hyperswarm-RPC AI Inference Framework
github.com/Branexai
1 comment
9 months ago
alvaropaco
2 points
484.
▲
Checkpoint-engine: A middleware to update model weights in LLM inference engines
github.com/MoonshotAI
1 comment
9 months ago
jasonjmcghee
2 points
485.
▲
WIP: Nvidia Parakeet ASR mode inference in GGML
github.com/jason-ni
1 comment
10 months ago
jasonni
2 points
486.
▲
Show HN: Local LLM Inference in Godot and Unity
github.com/nobodywho-ooo
1 comment
a year ago
nobodywho
2 points
487.
▲
MLX-based LLM inference engine for macOS with native Swift implementation
github.com/Trans-N-ai
1 comment
a year ago
jovezhong
2 points
488.
▲
Inference Llama models in one file of pure C for Win98
github.com/exo-explore
1 comment
a year ago
mastar2323
2 points
489.
▲
Optillm: An Optimizing Inference Proxy with Plugins
github.com/codelion
1 comment
2 years ago
codelion
2 points
490.
▲
Show HN: ChainFactory – Run Structured LLM Inference with Easy Parallelism
github.com/pankajgarkoti
1 comment
2 years ago
garkotipankaj
2 points
491.
▲
Llama2.mojo - outperforms Karpathy’s llama2.c by 30% in multi-threaded inference
github.com/tairov
1 comment
3 years ago
swyx
2 points
492.
▲
Stable Fast: Lightweight Inference Optimization Library for Stable Diffusion
github.com/chengzeyi
1 comment
3 years ago
chengzeyi
2 points
493.
▲
Kyanite: NN inference library, in/for Rust, using CPU or Nvidia GPUs
github.com/KarelPeeters
1 comment
3 years ago
homarp
2 points
494.
▲
Show HN: Llama2.f90 – Toy LLaMA2 model inference in Fortran
github.com/rbitr
1 comment
3 years ago
andy99
2 points
495.
▲
CTranslate2: An efficient inference engine for Transformer models
github.com/OpenNMT
1 comment
3 years ago
wsxiaoys
2 points
496.
▲
Nebullvm open-source accelerator of AI inference. Feedback?
1 comment
4 years ago
emilec___
2 points
497.
▲
Gluon: A static, type inferred and embeddable language written in Rust
github.com/gluon-lang
1 comment
5 years ago
fish45
2 points
498.
▲
Schema – Infer, Translate Between GraphQL, JSON, YAML, TOML, XML
github.com/Confbase
1 comment
6 years ago
confbase
2 points
499.
▲
Monero Binaries on getmonero.org Infected
github.com/monero-project
1 comment
7 years ago
rocqua
2 points
500.
▲
Show HN: MinLlama – Llama 3.2 inference in ~100 lines of NumPy
github.com/timothygao8710
discuss
a day ago
timothygao
2 points
501.
▲
Beast: Inference Economy Inversion in Agentic Coding Systems
github.com/Byron2306
discuss
3 days ago
Byron230686
2 points
502.
▲
Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon
github.com/rayanht
discuss
4 days ago
rayanht
2 points
503.
▲
Ax-engine: Native Apple Silicon ML inference runtime with a fast Rust core
github.com/defai-digital
discuss
15 days ago
automatosx
2 points
504.
▲
Show HN: We built an LLM inference engine in pure Python – no PyTorch, no Triton
github.com/Zyora-Dev
discuss
22 days ago
zyoraclub
2 points
505.
▲
Tuning CPU-only Qwen3-30B inference with an IBM Quantum sampling loop
github.com/Shack870
discuss
a month ago
Royce-CMR
2 points
506.
▲
SIE: Unified Inference Engine for Embeddings, Reranking, and Extraction
github.com/superlinked
discuss
a month ago
modinfo
2 points
507.
▲
Show HN: YieldOS-Lite – A simulator for LLM inference control-plane governance
github.com/nikitph
discuss
a month ago
loaderchips
2 points
508.
▲
KinetiX: An intra-inference hardware interlock for LLMs
github.com/johndoerch-eng
discuss
a month ago
kinetix_system
2 points
509.
▲
Show HN: AI/ML benchmark for local LLM inference and XGBoost training on GPU/CPU
github.com/albedan
discuss
a month ago
albedan
2 points
510.
▲
Arknet – decentralized AI inference, fair launch, one binary
github.com/st-hannibal
discuss
2 months ago
st-hannibal
2 points