HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
301.
▲
Show HN: Bhumi–OSS Python Library w Rust Underhead for 2.5x Faster LLM Inference
bhumi.trilok.ai
discuss
a year ago
rachpradhan
8 points
302.
▲
Show HN: ChainFactory – Run Structured LLM Inference with Easy Parallelism
github.com/pankajgarkoti
discuss
2 years ago
garkotipankaj
8 points
303.
▲
gg: "M2 Ultra is the absolute best personal LLM inference node you can buy."
github.com/ggerganov
discuss
3 years ago
behnamoh
8 points
304.
▲
LLaMA-rs: a Rust port of llama.cpp for fast LLaMA inference on CPU
github.com/setzer22
discuss
3 years ago
darthdeus
8 points
305.
▲
Sahi: A Vision library for sliced inference on large images/small objects
github.com/obss
discuss
5 years ago
yagizdegirmenci
8 points
306.
▲
Show HN: Docker Model Runner Integrates vLLM for High-Throughput Inference
github.com/docker
1 comment
7 months ago
ericcurtin
7 points
307.
▲
Show HN: Jlama – A fast Java inference engine for GPT and Llama models
github.com/tjake
1 comment
3 years ago
tjake
7 points
308.
▲
Show HN: Llamero – A GUI app to easily download, install and infer LLaMA models
github.com/mpociot
1 comment
3 years ago
mpociot
7 points
309.
▲
Alpa: Auto-parallelizing large model training and inference (by UC Berkeley)
github.com/alpa-projects
1 comment
4 years ago
zhisbug
7 points
310.
▲
Show HN: Secure XGBoost training and inference on encrypted data
github.com/mc2-project
1 comment
6 years ago
chesterl
7 points
311.
▲
Show HN: Composable middleware for LLM inference Optimization Passes
github.com/liquidos-ai
discuss
4 months ago
human_hack3r
7 points
312.
▲
Distributed LLama3 Inference
github.com/evilsocket
discuss
2 years ago
345765476586
7 points
313.
▲
Stable Diffusion Inference on iOS
github.com/madebyollin
discuss
4 years ago
pizza
7 points
314.
▲
Cligen: A Native API-Inferred Command-Line Interface Generator for Nim
github.com/c-blake
3 comments
a year ago
TheWiggles
6 points
315.
▲
RxInferServer – Remote Bayesian Inference from Python via Julia
3 comments
a year ago
bvdmitri
6 points
316.
▲
Show HN: Larq – Binarized Neural Network Inference with MLIR and TFLite
github.com/larq
1 comment
6 years ago
khelwegen
6 points
317.
▲
OpenUMA – bring Apple-style unified memory to x86 AI inference (Rust, Linux)
github.com/hamtun24
discuss
3 months ago
hamtun24
6 points
318.
▲
Llama.cpp: Deterministic Inference Mode (CUDA): RMSNorm, MatMul, Attention
github.com/ggml-org
discuss
9 months ago
diwank
6 points
319.
▲
Llama Inference in 150 Lines
gist.github.com
discuss
2 years ago
kevmo314
6 points
320.
▲
Show HN: Launch StableStudio local inference in one commmand
github.com/brycedrennan
discuss
3 years ago
bryced
6 points
321.
▲
Rust+OpenCL+AVX2 implementation of LLaMA inference code
github.com/Noeda
discuss
3 years ago
myers
6 points
322.
▲
ncnn: High-performance neural network inference framework optimized for mobile
github.com/Tencent
discuss
4 years ago
davikr
6 points
323.
▲
Wase – WebAssembly made easy. Strongly typed infered low-level language for WASM
github.com/area9innovation
discuss
4 years ago
asgeralstrup
6 points
324.
▲
Statistical Inference Considered Harmful
github.com/frankmcsherry
discuss
10 years ago
rargulati
6 points
325.
▲
Ask HN: What is the best tool to infer data type of tabular data?
7 comments
5 years ago
mahalel
5 points
326.
▲
Show HN: Zod – TypeScript-first validation library with static type inference
github.com/vriad
3 comments
6 years ago
vriad
5 points
327.
▲
Show HN: GPT-J inference on the CPU using C/C++
github.com/ggerganov
2 comments
4 years ago
ggerganov
5 points
328.
▲
I implemented CLIP inference in plain C/C++
github.com/monatis
1 comment
3 years ago
monatis
5 points
329.
▲
GeosPy: Geolocation Inference Made Easy
github.com/tylfin
1 comment
10 years ago
tylfin
5 points
330.
▲
Show HN: CUDA Profiler for Production Inference
github.com/graphsignal
discuss
26 minutes ago
npgraph
5 points
More