HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
31.
▲
Show HN: Autogenerate efficient backward kernels for Triton
github.com/IaroslavElistratov
discuss
7 months ago
iaroo
1 points
32.
▲
Show HN: Efficient `Torch.cdist` Using Triton
github.com/jinensetpal
discuss
a year ago
codeinassembly
1 points
33.
▲
Automatic Warp Specialization in Triton
github.com/triton-lang
discuss
a year ago
subharmonicon
1 points
34.
▲
OpenAI Triton: language and compiler for highly efficient Deep-Learning
github.com/openai
discuss
2 years ago
tosh
1 points
35.
▲
Triton: Runtime for highly efficient custom Deep-Learning primitives
github.com/openai
discuss
3 years ago
nateb2022
1 points
36.
▲
Show HN: Digital watermarking by motion vector of H.264
github.com/truongpt
discuss
6 years ago
truongpt
1 points
37.
▲
Triton Kubernetes, a multi-cloud Kubernetes solution
github.com/joyent
discuss
8 years ago
merqurio
1 points
38.
▲
Triton-Augment: GPU Kernel Fusion for 5-73x Faster Image/Video Augmentation
github.com/yuhezhang-ai
2 comments
7 months ago
seedlingfl
3 points
39.
▲
Real-Time Streaming Apps with Nvidia Open Source Triton Inference
github.com/nickaggarwal
discuss
2 years ago
agcat
3 points
40.
▲
Ask HN: What Inference Server do you use to host TTS Models?
discuss
a year ago
samagra14
1 points
41.
▲
RCE in Nvidia Triton Inference Server
github.com/protectai
discuss
2 years ago
byt3bl33d3r
3 points
42.
▲
Liger-Kernel: Efficient Triton kernels for LLM training
github.com/linkedin
2 comments
2 years ago
letmehandle
15 points
43.
▲
Show HN: Attorch – PyTorch's nn module written in Python using OpenAI's Triton
github.com/BobMcDear
discuss
2 years ago
bornaahz
4 points
44.
▲
Bounty for Optimized Triton Kernels for full fine tunes
github.com/OpenAccess-AI-Collective
discuss
2 years ago
bratao
3 points
45.
▲
Show HN: We built an LLM inference engine in pure Python – no PyTorch, no Triton
github.com/Zyora-Dev
discuss
20 days ago
zyoraclub
2 points
46.
▲
Solving an Obfuscated Crackme with BinaryNinja and Triton
github.com/jeffli678
discuss
6 years ago
ingve
2 points
47.
▲
Show HN: Iris – Distributed GPU Programming with RMA in Pure Python/Triton
github.com/ROCm
1 comment
9 months ago
mawad
1 points
48.
▲
PyTorch 2.3: User-Defined Triton Kernels, Tensor Parallelism in Distributed
github.com/pytorch
discuss
2 years ago
lnyan
1 points
49.
▲
Show HN: Tabby – A self-hosted GitHub Copilot
github.com/TabbyML
126 comments
3 years ago
wsxiaoys
627 points
50.
▲
Show HN: 80% faster, 50% less memory, 0% loss of accuracy Llama finetuning
github.com/unslothai
119 comments
3 years ago
danielhanchen
385 points
51.
▲
Show HN: Metashade – a Pythonic GPU shading/compute EDSL
github.com/ppenenko
8 comments
2 years ago
ppenenko
47 points
52.
▲
Show HN: Sleuth, open source workspace search in natural language
getsleuth.xyz
8 comments
3 years ago
ayanb9440
31 points
53.
▲
Show HN: Finetune Llama-3.1 2x faster in a Colab
colab.research.google.com
2 comments
2 years ago
danielhanchen
16 points
54.
▲
Show HN: Dbg – One CLI debugger for every language (AI-agent ready)
redknightlois.github.io
discuss
2 months ago
redknight666
7 points
55.
▲
Show HN: Living Memory Dynamics – "living" episodic memory embedding space
github.com/mordiaky
discuss
6 months ago
Mordiaky
6 points
56.
▲
Show HN: LLGTRT: TensorRT-LLM+Rust server w/ OpenAI-compat and Structured Output
github.com/guidance-ai
discuss
2 years ago
mmoskal
6 points
57.
▲
Show HN: Open-source fine-tuning in a Colab notebook
colab.research.google.com
discuss
2 years ago
danielhanchen
5 points
58.
▲
Show HN: UHOP – Escaping Nvidia Lock-In with an Open Hardware Optimization Layer
uhop.dev
discuss
8 months ago
danielbisina
3 points
59.
▲
Show HN: UHOP – An Open Hardware Optimization Platform for GPUs
github.com/sevenloops
discuss
8 months ago
danielbisina
3 points
60.
▲
Why stop at 1M tokens when you can have 10M?
4 comments
8 months ago
Zen_Sherbert
2 points
More