HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
601.
▲
Shard – A Distributed P2P AI Network for Shared Inference
github.com/TrentPierce
4 comments
4 months ago
tpierce89
1 points
602.
▲
Gen-Selective Pseudo Labeling, Based on Datasets and Serverless Inference API
github.com/louisbrulenaudet
2 comments
2 years ago
brulenaudet
1 points
603.
▲
The AI Privacy Problem Isn't What You Tell AI – It's What AI Infers
github.com/cnaebadi
1 comment
7 days ago
sewyed
1 points
604.
▲
Show HN: Piqc – An open-source GPU waste scanner for LLM inference clusters
github.com/paralleliq
1 comment
20 days ago
samhoss93
1 points
605.
▲
Show HN: GPT-2 inference in pure C#, 0 bytes allocated per token
github.com/DevOnBike
1 comment
a month ago
dev-on-bike
1 points
606.
▲
Show HN: Self-healing data pipeline for F1 telemetry (Python and Type Inference)
1 comment
5 months ago
tarekclarke
1 points
607.
▲
Show HN: vLLM Studio – Web UI to manage vLLM/SGLang inference servers at home
github.com/0xsero
1 comment
5 months ago
week7820
1 points
608.
▲
Show HN: Nvidia's CUDA libraries are generic and not optimized for LLM inference
github.com/Venkat2811
1 comment
5 months ago
venkat_2811
1 points
609.
▲
General plug-and-play inference lib for RLMs
github.com/alexzhang13
1 comment
6 months ago
larodi
1 points
610.
▲
Inferal Workspace Architecture: How We Work at Inferal
gist.github.com
1 comment
6 months ago
yrashk
1 points
611.
▲
Llmedge an on device LLM, vision, and speech inference library for Android
github.com/Aatricks
1 comment
6 months ago
aatricks
1 points
612.
▲
[OPEN-SOURCE] Whisper finetuning, inference, auto GPU upscale, proxy and co
github.com
1 comment
7 months ago
amarcel
1 points
613.
▲
Show HN: GPU-Based Kubernetes HPA for Triton Inference Server
github.com/uzunenes
1 comment
7 months ago
uzunenes
1 points
614.
▲
Sample Forge – Research tool for deterministic inference in LLM's
github.com/manfrom83
1 comment
9 months ago
nowittyusername
1 points
615.
▲
A proposal for clean cloud-free AI inference network
github.com/franzkruhm
1 comment
10 months ago
zanfr
1 points
616.
▲
ParaAttention: Speed Up Flux and Mochi Inference with Multiple GPUs
github.com/chengzeyi
1 comment
2 years ago
chengzeyi
1 points
617.
▲
Llama Deck:CLI for running multiple language implementations of LLM inference
github.com/xxxbf0222
1 comment
2 years ago
mikepapadim
1 points
618.
▲
Benchmarked Llama2 and mistral across popular inference engines and precisions
github.com/premAI-io
1 comment
2 years ago
anindya2002
1 points
619.
▲
Curated List of 50 Open-Source LLM Inference Tools: Seeking Contributions
github.com/vince-lam
1 comment
2 years ago
vincelam
1 points
620.
▲
Show HN: Fortran inference code for the Mamba state space language model
github.com/rbitr
1 comment
3 years ago
andy99
1 points
621.
▲
GPT-Fast: Simple and efficient GPT inference in <1000 LOC of Python
github.com/pytorch-labs
1 comment
3 years ago
Palmik
1 points
622.
▲
Generate Nix packages from URLs with hash prefetching and dependency inference
github.com/nix-community
1 comment
3 years ago
figsoda
1 points
623.
▲
Show HN: Kylo – Simple FAQ Bot Built with Facebook's Infersent
github.com/avinassh
1 comment
7 years ago
avinassh
1 points
624.
▲
Clevr-Iep: Inferring and Executing Programs for Visual Reasoning
github.com/facebookresearch
1 comment
9 years ago
runesoerensen
1 points
625.
▲
XcodeGhost infectd Apps List
github.com/zengyun-programmer
1 comment
11 years ago
dengjh
1 points
626.
▲
Configurable zombie infection simulation
github.com/Ellzord
discuss
11 years ago
javinpaul
1 points
627.
▲
Native Inference Engine for macOS 14 or newer
github.com/tictacguy
discuss
8 days ago
tomolomolo
1 points
628.
▲
Why Gemma-4 26B MoE works in HuggingFace but breaks in prod inference engines
github.com/maeddesg
discuss
a month ago
maeddesg
1 points
629.
▲
Show HN: Gosd: High-performance Stable Diffusion inference in pure Go(no CGO)
github.com/l8bloom
discuss
2 months ago
krakato
1 points
630.
▲
WebLLM is a high-performance in-browser LLM inference engine
github.com/mlc-ai
discuss
2 months ago
doener
1 points
More