HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
661.
▲
Open Retrieval-Based Inference Toolkit
github.com/schmitech
discuss
10 months ago
schmitech
1 points
662.
▲
Pydantic/GenAI-prices – Calculate prices for calling LLM inference APIs
github.com/pydantic
discuss
a year ago
alexmorley
1 points
663.
▲
Show HN: Pure CUDA C Inference for Qwen3 0.6B in One File, No Dependencies
github.com/gigit0000
discuss
a year ago
yb0000
1 points
664.
▲
Confidential AI Inference with Attestation: Run LLMs and Agents on TEEs
github.com/nearai
discuss
a year ago
transpute
1 points
665.
▲
Ask HN: What Inference Server do you use to host TTS Models?
discuss
a year ago
samagra14
1 points
666.
▲
ArtificialCast: Type-safe transformation powered by inference
github.com/Zorokee
discuss
a year ago
mpweiher
1 points
667.
▲
A collection of reproducible LLM inference engine benchmarks: SGLang vs. vLLM
github.com/Michaelvll
discuss
a year ago
zhwu
1 points
668.
▲
The Path to Open-Sourcing the DeepSeek Inference Engine
github.com/deepseek-ai
discuss
a year ago
xnhbx
1 points
669.
▲
Show HN: SQL-based inference for Gradient Boosting Models
github.com/mattismegevand
discuss
a year ago
mattismegevand
1 points
670.
▲
Show HN: Acord – A Daemon for AI Inference
github.com/alpaca-core
discuss
a year ago
bstanimirov
1 points
671.
▲
Cost-efficient and pluggable Infrastructure components for GenAI inference
github.com/vllm-project
discuss
a year ago
rrampage
1 points
672.
▲
Cost-efficient and pluggable Infrastructure components for GenAI inference
github.com/vllm-project
discuss
a year ago
delduca
1 points
673.
▲
Show HN: TokenFlow – Visualize LLM inference speed
dave.ly
discuss
a year ago
davely
1 points
674.
▲
Show HN: Bodhi App – Local LLM Inference
getbodhi.app
discuss
a year ago
anagri
1 points
675.
▲
CUDA/Metal accelerated language model inference
github.com/zeux
discuss
a year ago
mooreds
1 points
676.
▲
Computer vision models inference directly on mobile
github.com/software-mansion
discuss
a year ago
mrys
1 points
677.
▲
Show HN: Rust Powered Inference, Ingestion and Indexing with EmbedAnything
github.com/StarlightSearch
discuss
2 years ago
Sonam_AI
1 points
678.
▲
KubeAI – AI Inference Operator for Kubernetes
github.com/substratusai
discuss
2 years ago
dunwaldo
1 points
679.
▲
Bitnet.js: Node.js Implementation of Microsoft's BitNet.CPP Inference Framework
github.com/stackblogger
discuss
2 years ago
stackblogger
1 points
680.
▲
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
github.com/facebookresearch
discuss
2 years ago
zerojames
1 points
681.
▲
Llama Stack by Meta – Inference, Safety, Memory, Agentic System, Evaluation
github.com/meta-llama
discuss
2 years ago
vikrantrathore
1 points
682.
▲
Beartype Introduces Infer_hint()
github.com/beartype
discuss
2 years ago
diwank
1 points
683.
▲
Hindley-Milner inferencer written in Python (2017)
github.com/ethe
discuss
2 years ago
autohime
1 points
684.
▲
Flux: Official inference repo for FLUX.1 models
github.com/black-forest-labs
discuss
2 years ago
tosh
1 points
685.
▲
Llama3 Inference in Pure LuaJIT
github.com/CapsAdmin
discuss
2 years ago
CapsAdmin
1 points
686.
▲
SiLLM – Silicon LLM Training and Inference Toolkit
github.com/armbues
discuss
2 years ago
tosh
1 points
687.
▲
Show HN: ai("question", data) infers the ML model and answers with Python type
github.com/jmaczan
discuss
2 years ago
yu3zhou4
1 points
688.
▲
Show HN: Text-to-ML: like HuggingGPT + LangChain + type inference
github.com/jmaczan
discuss
2 years ago
yu3zhou4
1 points
689.
▲
Show HN: Inference Llama2 with High-Level C++
github.com/frost-beta
discuss
2 years ago
zcbenz
1 points
690.
▲
Show HN: Geniusrise – open-source inference endpoints for text, vision, audio
github.com
discuss
2 years ago
ixaxaar
1 points