Search: github.com/tnfe | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

421.

OpenVINO – open-source toolkit for optimizing and deploying AI inference

github.com/openvinotoolkit

6 months ago

peter_d_sherman

3 points

422.

Show HN: Distributed Storage System to 8x LLM Inference, GPU Training Efficiency

github.com/blackbird-io

8 months ago

3 points

423.

LLM-optimizer: Benchmark and optimize LLM inference across frameworks with ease

github.com/bentoml

9 months ago

3 points

424.

LLM Inference in pure Java with a GPU acceleration enabled

github.com/beehive-lab

a year ago

3 points

425.

Show HN: I made TypeScript's type inference more strict (and smarter)

github.com/kakasoo

a year ago

3 points

426.

The Path to Open-Sourcing the DeepSeek Inference Engine

github.com/deepseek-ai

a year ago

3 points

427.

Deepseek CPP for CPU only inference

github.com/andrewkchan

a year ago

3 points

428.

AntiSlop Sampler for LLM Inference

github.com/sam-paech

a year ago

3 points

429.

Jet.jl: static type checker with type inference for Julia

github.com/aviatesk

2 years ago

3 points

430.

Show HN: Bayesian Neural Networks and Uncertainty for Inferring Unseen Classes

github.com/MNoorFawi

2 years ago

3 points

431.

Real-Time Streaming Apps with Nvidia Open Source Triton Inference

github.com/nickaggarwal

2 years ago

3 points

432.

Distributed LLM Inference with Llama.cpp

github.com/ggerganov

2 years ago

3 points

433.

Practical Llama 3 inference implemented in a single Java file

github.com/mukel

2 years ago

3 points

434.

Gemma.cpp: lightweight, standalone C++ inference engine for Gemma models

github.com/google

2 years ago

3 points

435.

Llama.cpp supports distributed inference across machines on a local network

github.com/ggerganov

2 years ago

3 points

436.

RCE in Nvidia Triton Inference Server

github.com/protectai

2 years ago

3 points

437.

Show HN: Inference-only implementation of Mamba optimized for CPU

github.com/flawedmatrix

2 years ago

3 points

438.

Show HN: NOS – A fast, and ergonomic PyTorch inference server

github.com/autonomi-ai

3 years ago

3 points

439.

Training and inference code for audio generation models

github.com/Stability-AI

3 years ago

3 points

440.

Vllm: High-throughput and memory-efficient inference and serving engine for LLMs

github.com/vllm-project

3 years ago

3 points

441.

Small inference runtime for deep neural networks

github.com/maekawatoshiki

3 years ago

3 points

442.

Inference at the edge: Efficient transformer model inference on-device

github.com/ggerganov

3 years ago

3 points

443.

WebGPU ONNX inference runtime written in Rust

github.com/webonnx

3 years ago

3 points

444.

Show HN: Python Monitoring for AI: LLMs, OpenAI, Inference, GPUs

github.com/graphsignal

3 years ago

3 points

445.

Show HN: Nix-init – Generate Nix packages from URLs with dependency inference

github.com/nix-community

3 years ago

3 points

446.

Fast type inference library for Common Lisp

github.com/marcoheisig

4 years ago

3 points

447.

Using OpenAI Codex's “DaVinci-Edit” Model for Gradual Type Inference

github.com/GammaTauAI

4 years ago

3 points

448.

Show HN: Spartan Schema - Ultra-minimal JSON schemas with Typescript inference

github.com/ar-nelson

4 years ago

3 points

449.

Type inference for the database access layer in PHP

4 years ago

3 points

450.

The exhaustive Pattern Matching library for TypeScript with smart type inference

github.com/gvergnaud

4 years ago

3 points