Search: github.com/tnfe | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

511.

Openpi-flash: Real-time inference engine for openpi

github.com/Hebbian-Robotics

2 months ago

2 points

512.

Show HN: Stateful Inference with 99% Token Savings

github.com/umbecanessa

2 months ago

2 points

513.

Rcarmo/go-AI: A mildly sane inference API library for go

github.com/rcarmo

2 months ago

2 points

514.

Show HN: Mimikos – Zero-config mock server that infers API behavior from OpenAPI

2 months ago

2 points

515.

Kubernetes operator for deploying, serving, and improving LLM inference engines

github.com/cliver-project

2 months ago

2 points

516.

Living Memory Inference

github.com/alash3al

2 months ago

2 points

517.

Swift package AI inference engine generated from Rust crate

github.com/ondeinference

3 months ago

2 points

518.

Open-source ZK proofs for ML inference – verify AI decisions cryptographically

github.com/OE-GOD

3 months ago

2 points

519.

AirLLM optimizes inference memory usage

github.com/lyogavin

4 months ago

2 points

520.

Show HN: I wrote an LLM inference engine in pure Go – 48 tok/s zero dependencies

github.com/computerex

4 months ago

2 points

521.

Show HN: Name-classifier – infers attributes about a person from a name

github.com/douglas-larocca

4 months ago

2 points

522.

C inference for Qwen3-ASR 0.6B and 1.7B transcription models

github.com/antirez

4 months ago

2 points

523.

Show HN: I built a unified inference layer for Document Processing Models

github.com/adithya-s-k

4 months ago

2 points

524.

Show HN: Evolved x86 AVX-512 kernels for NF4 LLM inference

github.com/Anuar81

4 months ago

2 points

525.

OMLX – Ollama for MLX (LLM Inference Server for Apple Silicon)

github.com/jundot

4 months ago

2 points

526.

Show HN: Omni-NLI – A multi-interface server for natural language inference

5 months ago

2 points

527.

Show HN: EmbodIOS – AI Operating System with Kernel-Level Inference

github.com/dddimcha

5 months ago

2 points

528.

Rig: Distributed LLM inference across machines in Rust

github.com/buyukakyuz

5 months ago

2 points

529.

Tract: Self-contained, TensorFlow and ONNX inference

github.com/sonos

5 months ago

2 points

530.

EmbodIOS - AI inference as the operating system (3.5s cold start)

github.com/dddimcha

5 months ago

2 points

531.

HF-mem: CLI to estimate inference memory requirements for Hugging Face models

github.com/alvarobartt

6 months ago

2 points

532.

Mini-SGLang: A lightweight yet high-performance inference framework for LLM

github.com/sgl-project

6 months ago

2 points

533.

Go apps can directly integrate llama.cpp for HW accelerated local inference

github.com/hybridgroup

7 months ago

2 points

534.

Show HN: Olla – Lightweight LLM Proxy for Homelab and OnPrem AI Inference

10 months ago

thushanfernando

2 points

535.

WebAssembly binding for llama.cpp – Enabling on-browser LLM inference

github.com/ngxson

a year ago

2 points

536.

Show HN: Dwani.ai – multimodal inference API for Indian languages

a year ago

2 points

537.

GPT4Free: "educational project" for free LLM inference from various services

github.com/xtekky

a year ago

2 points

538.

OmniPainter: Training-Free Stylized Text-to-Image Generation with Fast Inference

github.com/maxin-cn

a year ago

2 points

539.

GPU-enabled Llama 3 inference in Java from scratch

github.com/beehive-lab

a year ago

2 points

540.

BitNet 1.58bit GPU Inference Kernel

github.com/microsoft

a year ago

2 points