HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
421.
▲
OpenVINO – open-source toolkit for optimizing and deploying AI inference
github.com/openvinotoolkit
discuss
6 months ago
peter_d_sherman
3 points
422.
▲
Show HN: Distributed Storage System to 8x LLM Inference, GPU Training Efficiency
github.com/blackbird-io
discuss
8 months ago
hackerpanda123
3 points
423.
▲
LLM-optimizer: Benchmark and optimize LLM inference across frameworks with ease
github.com/bentoml
discuss
9 months ago
djhu9
3 points
424.
▲
LLM Inference in pure Java with a GPU acceleration enabled
github.com/beehive-lab
discuss
a year ago
mikepapadim
3 points
425.
▲
Show HN: I made TypeScript's type inference more strict (and smarter)
github.com/kakasoo
discuss
a year ago
kakasoo
3 points
426.
▲
The Path to Open-Sourcing the DeepSeek Inference Engine
github.com/deepseek-ai
discuss
a year ago
vitorgrs
3 points
427.
▲
Deepseek CPP for CPU only inference
github.com/andrewkchan
discuss
a year ago
amrrs
3 points
428.
▲
AntiSlop Sampler for LLM Inference
github.com/sam-paech
discuss
a year ago
rahimnathwani
3 points
429.
▲
Jet.jl: static type checker with type inference for Julia
github.com/aviatesk
discuss
2 years ago
fanf2
3 points
430.
▲
Show HN: Bayesian Neural Networks and Uncertainty for Inferring Unseen Classes
github.com/MNoorFawi
discuss
2 years ago
mnoorfawi
3 points
431.
▲
Real-Time Streaming Apps with Nvidia Open Source Triton Inference
github.com/nickaggarwal
discuss
2 years ago
agcat
3 points
432.
▲
Distributed LLM Inference with Llama.cpp
github.com/ggerganov
discuss
2 years ago
tosh
3 points
433.
▲
Practical Llama 3 inference implemented in a single Java file
github.com/mukel
discuss
2 years ago
simonpure
3 points
434.
▲
Gemma.cpp: lightweight, standalone C++ inference engine for Gemma models
github.com/google
discuss
2 years ago
ot
3 points
435.
▲
Llama.cpp supports distributed inference across machines on a local network
github.com/ggerganov
discuss
2 years ago
behnamoh
3 points
436.
▲
RCE in Nvidia Triton Inference Server
github.com/protectai
discuss
2 years ago
byt3bl33d3r
3 points
437.
▲
Show HN: Inference-only implementation of Mamba optimized for CPU
github.com/flawedmatrix
discuss
2 years ago
flawedmatrix
3 points
438.
▲
Show HN: NOS – A fast, and ergonomic PyTorch inference server
github.com/autonomi-ai
discuss
3 years ago
EarlyOom
3 points
439.
▲
Training and inference code for audio generation models
github.com/Stability-AI
discuss
3 years ago
treesciencebot
3 points
440.
▲
Vllm: High-throughput and memory-efficient inference and serving engine for LLMs
github.com/vllm-project
discuss
3 years ago
tosh
3 points
441.
▲
Small inference runtime for deep neural networks
github.com/maekawatoshiki
discuss
3 years ago
uint256_t
3 points
442.
▲
Inference at the edge: Efficient transformer model inference on-device
github.com/ggerganov
discuss
3 years ago
lioeters
3 points
443.
▲
WebGPU ONNX inference runtime written in Rust
github.com/webonnx
discuss
3 years ago
f_devd
3 points
444.
▲
Show HN: Python Monitoring for AI: LLMs, OpenAI, Inference, GPUs
github.com/graphsignal
discuss
3 years ago
npgraph
3 points
445.
▲
Show HN: Nix-init – Generate Nix packages from URLs with dependency inference
github.com/nix-community
discuss
3 years ago
figsoda
3 points
446.
▲
Fast type inference library for Common Lisp
github.com/marcoheisig
discuss
4 years ago
medo-bear
3 points
447.
▲
Using OpenAI Codex's “DaVinci-Edit” Model for Gradual Type Inference
github.com/GammaTauAI
discuss
4 years ago
elleven
3 points
448.
▲
Show HN: Spartan Schema - Ultra-minimal JSON schemas with Typescript inference
github.com/ar-nelson
discuss
4 years ago
ar-nelson
3 points
449.
▲
Type inference for the database access layer in PHP
discuss
4 years ago
markusstaab
3 points
450.
▲
The exhaustive Pattern Matching library for TypeScript with smart type inference
github.com/gvergnaud
discuss
4 years ago
itstaken
3 points
More