HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
361.
Show HN: Our command line tool to transpile AI Inference from Python to C++
github.com/muna-ai
discuss
5 months ago
olokobayusuf
4 points
362.
Show HN: I wrote inference for Qwen3 0.6B in C/CUDA
github.com/asdf93074
discuss
9 months ago
mk93074
4 points
363.
Show HN: Klartraum, a neural rendering inference engine
github.com/fortmeier
discuss
a year ago
fortmeier
4 points
364.
Show HN: Furnace – Rust and Burn inference server, zero Python, single binary
discuss
a year ago
gilfeather
4 points
365.
Fenic: The dataframe (re)built for LLM inference
github.com/typedef-ai
discuss
a year ago
asiramdas
4 points
366.
Zorokee/ArtificialCast: Type-safe transformation powered by inference
github.com/Zorokee
discuss
a year ago
cratermoon
4 points
367.
Bark.cpp: Port of Suno AI's Bark in C/C++ for fast inference
github.com/PABannier
discuss
2 years ago
siraben
4 points
368.
Jetstream: New LLM Inference Engine
github.com/google
discuss
2 years ago
gfortaine
4 points
369.
LLM Inference Endpoint Performance Benchmarking Tool
github.com/ray-project
discuss
3 years ago
richardliaw
4 points
370.
Accelerating Inferencing Services with Kontain
github.com/kontainapp
discuss
3 years ago
gnode1
4 points
371.
LLM-J: A pure Java implementation of a LLM inference engine
github.com/tjake
discuss
3 years ago
mfiguiere
4 points
372.
Full GPU Inference of LLaMA on Apple Silicon Using Metal
github.com/ggerganov
discuss
3 years ago
behnamoh
4 points
373.
Show HN: Deterministic objective Bayesian inference for spatial models [pdf]
buildingblock.ai
discuss
3 years ago
rnburn
4 points
374.
Inference at the Edge
github.com/ggerganov
discuss
3 years ago
Mizza
4 points
375.
Show HN: TypeScript query builder with full type inference
edgedb.com
discuss
4 years ago
colinmcd
4 points
376.
What's new in Scala 2.8: type constructor inference
adriaanm.github.com
discuss
15 years ago
DanielRibeiro
4 points
377.
Linux.Midrashim: x64 ELF infector virus
github.com/guitmz
discuss
6 years ago
guitmz
4 points
378.
Show HN: Larq – Binarized Neural Network Inference with MLIR and TFLite
github.com/larq
discuss
6 years ago
lgeiger
4 points
379.
Fast In-Browser Inference with ONNX.js, WebAssembly and WebGL
github.com/Microsoft
discuss
7 years ago
0101111101
4 points
380.
Infer Clojure specs from sample data. Inspired by F#'s type providers
github.com/stathissideris
discuss
9 years ago
tosh
4 points
381.
Show HN: oLLM – LLM Inference for large-context tasks on consumer GPUs
github.com/Mega4alik
7 comments
10 months ago
anuarsh
3 points
382.
Show HN: A reasoning model that infers over whole tasks in 1ms in latent space
github.com/OrderOneAI
6 comments
a year ago
orderone_ai
3 points
383.
Show HN: Standalone TurboQuant KV Cache Inference
github.com/g023
4 comments
3 months ago
g023
3 points
384.
Ternative – C++/CUDA inference engine for ternary LLMs with runtime LoRA
github.com/michelangeloromerochisco
1 comment
a month ago
michelangeloro
3 points
385.
Xinity Runtime: Apache 2.0 LLM inference engine for on-premise deployment
github.com/xinity-ai
1 comment
3 months ago
xinity
3 points
386.
Show HN: Dendrite – O(1) KV cache forking for tree-structured LLM inference
github.com/BioInfo
1 comment
3 months ago
RyeCatcher
3 points
387.
Show HN: Kremis – Rust graph DB; every answer is fact, inference, or unknown
github.com/TyKolt
1 comment
3 months ago
TyKolt
3 points
388.
vLLM-mlx – 65 tok/s LLM inference on Mac with tool calling and prompt caching
github.com/raullenchai
1 comment
4 months ago
raullen
3 points
389.
A Distributed Inference Framework Enabling Running Models Exceeding Total Memory
github.com/firstbatchxyz
1 comment
7 months ago
driaforall
3 points
390.
Metaphysical Priming reduces Gemini 3.0 Pro inference latency by 60%
github.com/Cactus-mp4
1 comment
7 months ago
cactus-jpg
3 points