Search: github.com/tritongp | Heykuki News

Heykuki News

Top New Best Ask Show Jobs

31.

Show HN: Autogenerate efficient backward kernels for Triton

github.com/IaroslavElistratov

7 months ago

1 point

32.

Show HN: Efficient `Torch.cdist` Using Triton

github.com/jinensetpal

a year ago

1 point

33.

Automatic Warp Specialization in Triton

github.com/triton-lang

a year ago

1 point

34.

OpenAI Triton: language and compiler for highly efficient Deep-Learning

github.com/openai

2 years ago

1 point

35.

Triton: Runtime for highly efficient custom Deep-Learning primitives

github.com/openai

3 years ago

1 point

36.

Show HN: Digital watermarking by motion vector of H.264

github.com/truongpt

6 years ago

1 point

37.

Triton Kubernetes, a multi-cloud Kubernetes solution

github.com/joyent

8 years ago

1 point

38.

Triton-Augment: GPU Kernel Fusion for 5-73x Faster Image/Video Augmentation

github.com/yuhezhang-ai

7 months ago

3 points

39.

Real-Time Streaming Apps with Nvidia Open Source Triton Inference

github.com/nickaggarwal

2 years ago

3 points

40.

Ask HN: What Inference Server do you use to host TTS Models?

a year ago

1 point

41.

RCE in Nvidia Triton Inference Server

github.com/protectai

2 years ago

3 points

42.

Liger-Kernel: Efficient Triton kernels for LLM training

github.com/linkedin

2 years ago

15 points

43.

Show HN: Attorch – PyTorch's nn module written in Python using OpenAI's Triton

github.com/BobMcDear

2 years ago

4 points

44.

Bounty for Optimized Triton Kernels for full fine tunes

github.com/OpenAccess-AI-Collective

2 years ago

3 points

45.

Show HN: We built an LLM inference engine in pure Python – no PyTorch, no Triton

github.com/Zyora-Dev

20 days ago

2 points

46.

Solving an Obfuscated Crackme with BinaryNinja and Triton

github.com/jeffli678

6 years ago

2 points

47.

Show HN: Iris – Distributed GPU Programming with RMA in Pure Python/Triton

github.com/ROCm

9 months ago

1 point

48.

PyTorch 2.3: User-Defined Triton Kernels, Tensor Parallelism in Distributed

github.com/pytorch

2 years ago

1 point

49.

Show HN: Tabby – A self-hosted GitHub Copilot

github.com/TabbyML

3 years ago

627 points

50.

Show HN: 80% faster, 50% less memory, 0% loss of accuracy Llama finetuning

github.com/unslothai

3 years ago

385 points

51.

Show HN: Metashade – a Pythonic GPU shading/compute EDSL

github.com/ppenenko

2 years ago

47 points

52.

Show HN: Sleuth, open source workspace search in natural language

3 years ago

31 points

53.

Show HN: Finetune Llama-3.1 2x faster in a Colab

colab.research.google.com

2 years ago

16 points

54.

Show HN: Dbg – One CLI debugger for every language (AI-agent ready)

redknightlois.github.io

2 months ago

7 points

55.

Show HN: Living Memory Dynamics – "living" episodic memory embedding space

github.com/mordiaky

6 months ago

6 points

56.

Show HN: LLGTRT: TensorRT-LLM+Rust server w/ OpenAI-compat and Structured Output

github.com/guidance-ai

2 years ago

6 points

57.

Show HN: Open-source fine-tuning in a Colab notebook

colab.research.google.com

2 years ago

5 points

58.

Show HN: UHOP – Escaping Nvidia Lock-In with an Open Hardware Optimization Layer

8 months ago

3 points

59.

Show HN: UHOP – An Open Hardware Optimization Platform for GPUs

github.com/sevenloops

8 months ago

3 points

60.

Why stop at 1M tokens when you can have 10M?

8 months ago

2 points