HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
31.
▲
I fixed a segfault in Triton that broke every RTX 5070/5080/5090
github.com/triton-lang
discuss
3 months ago
pat90000
1 point
32.
▲
Triton CUDA Tile IR Back End
github.com/triton-lang
discuss
5 months ago
my123
1 point
33.
▲
Show HN: Autogenerate efficient backward kernels for Triton
github.com/IaroslavElistratov
discuss
7 months ago
iaroo
1 point
34.
▲
Show HN: Efficient `Torch.cdist` Using Triton
github.com/jinensetpal
discuss
a year ago
codeinassembly
1 point
35.
▲
Automatic Warp Specialization in Triton
github.com/triton-lang
discuss
a year ago
subharmonicon
1 point
36.
▲
OpenAI Triton: language and compiler for highly efficient Deep-Learning
github.com/openai
discuss
2 years ago
tosh
1 point
37.
▲
Triton: Runtime for highly efficient custom Deep-Learning primitives
github.com/openai
discuss
3 years ago
nateb2022
1 point
38.
▲
Triton Kubernetes, a multi-cloud Kubernetes solution
github.com/joyent
discuss
8 years ago
merqurio
1 point
39.
▲
Triton-Augment: GPU Kernel Fusion for 5-73x Faster Image/Video Augmentation
github.com/yuhezhang-ai
2 comments
7 months ago
seedlingfl
3 points
40.
▲
Real-Time Streaming Apps with Nvidia Open Source Triton Inference
github.com/nickaggarwal
discuss
2 years ago
agcat
3 points
41.
▲
Ask HN: What Inference Server do you use to host TTS Models?
discuss
a year ago
samagra14
1 point
42.
▲
Show HN: Friction – A trilogy of archival fiction told via GitHub Markdown
github.com/andreas-breidenthal
1 comment
6 months ago
a-breidenthal
3 points
43.
▲
RCE in Nvidia Triton Inference Server
github.com/protectai
discuss
2 years ago
byt3bl33d3r
3 points
44.
▲
Liger-Kernel: Efficient Triton kernels for LLM training
github.com/linkedin
2 comments
2 years ago
letmehandle
15 points
45.
▲
Show HN: Attorch – PyTorch's nn module written in Python using OpenAI's Triton
github.com/BobMcDear
discuss
2 years ago
bornaahz
4 points
46.
▲
Show HN: PreQL/Trilogy – A Higher-Level, Composable SQL
github.com/preqldata
5 comments
2 years ago
efromvt
3 points
47.
▲
Bounty for Optimized Triton Kernels for full fine tunes
github.com/OpenAccess-AI-Collective
discuss
2 years ago
bratao
3 points
48.
▲
The pi type trilogy (Rust RFC)
github.com/rust-lang
discuss
9 years ago
miqkt
3 points
49.
▲
Show HN: We built an LLM inference engine in pure Python – no PyTorch, no Triton
github.com/Zyora-Dev
discuss
20 days ago
zyoraclub
2 points
50.
▲
Solving an Obfuscated Crackme with BinaryNinja and Triton
github.com/jeffli678
discuss
6 years ago
ingve
2 points
51.
▲
List of Parallels Between the Original Trilogy and Ep. VII TFA
gist.github.com
discuss
10 years ago
galori
2 points
52.
▲
Show HN: Iris – Distributed GPU Programming with RMA in Pure Python/Triton
github.com/ROCm
1 comment
9 months ago
mawad
1 point
53.
▲
PyTorch 2.3: User-Defined Triton Kernels, Tensor Parallelism in Distributed
github.com/pytorch
discuss
2 years ago
lnyan
1 point
54.
▲
Show HN: Tabby – A self-hosted GitHub Copilot
github.com/TabbyML
126 comments
3 years ago
wsxiaoys
627 points
55.
▲
Show HN: 80% faster, 50% less memory, 0% loss of accuracy Llama finetuning
github.com/unslothai
119 comments
3 years ago
danielhanchen
385 points
56.
▲
Show HN: Metashade – a Pythonic GPU shading/compute EDSL
github.com/ppenenko
8 comments
2 years ago
ppenenko
47 points
57.
▲
Show HN: Sleuth, open source workspace search in natural language
getsleuth.xyz
8 comments
3 years ago
ayanb9440
31 points
58.
▲
Show HN: Finetune Llama-3.1 2x faster in a Colab
colab.research.google.com
2 comments
2 years ago
danielhanchen
16 points
59.
▲
Show HN: Bhumi–OSS Python Library w Rust Underhead for 2.5x Faster LLM Inference
bhumi.trilok.ai
discuss
a year ago
rachpradhan
8 points
60.
▲
Show HN: Dbg – One CLI debugger for every language (AI-agent ready)
redknightlois.github.io
discuss
2 months ago
redknight666
7 points