Search: github.com/tnfe | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

361.

Show HN: Our command line tool to transpile AI Inference from Python to C++

github.com/muna-ai

5 months ago

4 points

362.

Show HN: I wrote inference for Qwen3 0.6B in C/CUDA

github.com/asdf93074

9 months ago

4 points

363.

Show HN: Klartraum, a neural rendering inference engine

github.com/fortmeier

a year ago

4 points

364.

Show HN: Furnace – Rust and Burn inference server, zero Python, single binary

a year ago

4 points

365.

Fenic: The dataframe (re)built for LLM inference

github.com/typedef-ai

a year ago

4 points

366.

Zorokee/ArtificialCast: Type-safe transformation powered by inference

github.com/Zorokee

a year ago

4 points

367.

Bark.cpp: Port of Suno AI's Bark in C/C++ for fast inference

github.com/PABannier

2 years ago

4 points

368.

Jetstream: New LLM Inference Engine

github.com/google

2 years ago

4 points

369.

LLM Inference Endpoint Performance Benchmarking Tool

github.com/ray-project

3 years ago

4 points

370.

Accelerating Inferencing Services with Kontain

github.com/kontainapp

3 years ago

4 points

371.

LLM-J: A pure Java implementation of a LLM inference engine

github.com/tjake

3 years ago

4 points

372.

Full GPU Inference of LLaMA on Apple Silicon Using Metal

github.com/ggerganov

3 years ago

4 points

373.

Show HN: Deterministic objective Bayesian inference for spatial models [pdf]

buildingblock.ai

3 years ago

4 points

374.

Inference at the Edge

github.com/ggerganov

3 years ago

4 points

375.

Show HN: TypeScript query builder with full type inference

4 years ago

4 points

376.

Whats new in Scala 2.8: type constructor inference

adriaanm.github.com

15 years ago

4 points

377.

Linux.Midrashim: x64 ELF infector virus

github.com/guitmz

6 years ago

4 points

378.

Show HN: Larq – Binarized Neural Network Inference with MLIR and TFLite

github.com/larq

6 years ago

4 points

379.

Fast In-Browser Inference with ONNX.js, WebAssembly and WebGL

github.com/Microsoft

7 years ago

4 points

380.

Infer Clojure specs from sample data. Inspired by F#'s type providers

github.com/stathissideris

9 years ago

4 points

381.

Show HN: oLLM – LLM Inference for large-context tasks on consumer GPUs

github.com/Mega4alik

10 months ago

3 points

382.

Show HN: A reasoning model that infers over whole tasks in 1ms in latent space

github.com/OrderOneAI

a year ago

3 points

383.

Show HN: Standalone TurboQuant KV Cache Inference

github.com/g023

3 months ago

3 points

384.

Ternative – C++/CUDA inference engine for ternary LLMs with runtime LoRA

github.com/michelangeloromerochisco

a month ago

3 points

385.

Xinity Runtime: Apache 2.0 LLM inference engine for on-premise deployment

github.com/xinity-ai

3 months ago

3 points

386.

Show HN: Dendrite – O(1) KV cache forking for tree-structured LLM inference

github.com/BioInfo

3 months ago

3 points

387.

Show HN: Kremis – Rust graph DB; every answer is fact, inference, or unknown

github.com/TyKolt

3 months ago

3 points

388.

vLLM-mlx – 65 tok/s LLM inference on Mac with tool calling and prompt caching

github.com/raullenchai

4 months ago

3 points

389.

A Distributed Inference Framework Enabling Running Models Exceeding Total Memory

github.com/firstbatchxyz

7 months ago

3 points

390.

Metaphysical Priming reduces Gemini 3.0 Pro inference latency by 60%

github.com/Cactus-mp4

7 months ago

3 points