HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
91.
▲
Show HN: A Python tool for text-based AI training and generation using GPT-2
github.com/minimaxir
41 comments
6 years ago
minimaxir
174 points
92.
▲
Show HN: OCR pipeline for ML training (tables, diagrams, math, multilingual)
github.com/ses4255
38 comments
a year ago
ses425500000
170 points
93.
▲
Understanding R1-Zero-Like Training: A Critical Perspective
github.com/sail-sg
21 comments
a year ago
pama
160 points
94.
▲
Paper Tape Is All You Need – Training a Transformer on a 1976 Minicomputer
github.com/dbrll
26 comments
3 months ago
rahen
145 points
95.
▲
Open-Llama: Complete training pipeline for building large language models
github.com/s-JoL
12 comments
3 years ago
bayes-song
141 points
96.
▲
Diffusion training from scratch on a micro-budget
github.com/SonyResearch
23 comments
a year ago
lnyan
135 points
97.
▲
YaFSDP: a sharded data parallelism framework, faster for pre-training LLMs
github.com/yandex
16 comments
2 years ago
wiradikusuma
135 points
98.
▲
TScale – Distributed training on consumer GPUs
github.com/Foreseerr
27 comments
a year ago
zX41ZdbW
130 points
99.
▲
TensorFlow Code for Google Research's BERT: Pre-Training Method for NLP Tasks
github.com/google-research
13 comments
8 years ago
ArtWomb
129 points
100.
▲
Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL
github.com/Danau5tin
12 comments
a year ago
Danau5tin
125 points
101.
▲
Show HN: ART – a new open-source RL framework for training agents
github.com/OpenPipe
12 comments
a year ago
kcorbitt
116 points
102.
▲
Training open-source LLMs on ChatGPT output is a really bad idea.
gist.github.com
76 comments
3 years ago
laprise
114 points
103.
▲
NanoGPT: The simplest, fastest repository for training medium-sized GPTs
github.com/karpathy
21 comments
2 years ago
ulrischa
114 points
104.
▲
Driving dataset for car autopilot AI training
github.com/commaai
44 comments
10 years ago
EvgeniyZh
100 points
105.
▲
Horovod: Distributed Training Framework for TensorFlow, Keras, and PyTorch
github.com/uber
9 comments
8 years ago
axiomdata316
100 points
106.
▲
Nvidia's DG-Net: Dress up people with different clothes/use as training data
github.com/NVlabs
34 comments
7 years ago
zhedong
87 points
107.
▲
QUIK is a method for quantizing LLM post-training weights to 4 bit precision
github.com/IST-DASLab
24 comments
3 years ago
anigbrowl
85 points
108.
▲
Show HN: Bounding-box labeler tool to generate the training data for YOLO v2
github.com/Cartucho
16 comments
8 years ago
cartucho
64 points
109.
▲
Determined: Deep Learning Training Platform
github.com/determined-ai
6 comments
3 years ago
petemir
59 points
110.
▲
Automate deep learning training with Kubernetes GPU-cluster
github.com/Langhalsdino
14 comments
9 years ago
Langhalsdino
57 points
111.
▲
Node.js training exercises, in CoffeeScript
gist.github.com
5 comments
15 years ago
bergie
57 points
112.
▲
PyTorch elastic training
github.com/pytorch
8 comments
7 years ago
jonbaer
46 points
113.
▲
Aim: Record, search and compare ML training runs
github.com/aimhubio
26 comments
5 years ago
polm23
45 points
114.
▲
Fmllm: 4mb training data, 100mb model, Fibonacci embeddings, near-coherent. WTF?
github.com/henrygabriels
27 comments
10 months ago
gabriel666smith
37 points
115.
▲
Show HN: Graphsignal – Machine learning profiler for training and inference
graphsignal.com
8 comments
4 years ago
dmitrim
35 points
116.
▲
Show HN: BrowseBrawl – What if browser agents battled to generate training data?
browser-brawl.com
18 comments
4 months ago
HrubyOnRails
30 points
117.
▲
Show HN: Next-Gen AI Training: LLM-RLHF-Tuning with PPO and DPO
github.com/raghavc
9 comments
2 years ago
rags1
30 points
118.
▲
Microsoft has open sourced their Front end Bootcamp training materials
github.com/Microsoft
6 comments
7 years ago
saranshk
29 points
119.
▲
PyTorch 1.7 Released with CUDA 11, New APIs for FFTs, Win Distributed Training
github.com/pytorch
1 comment
6 years ago
DreamFlasher
29 points
120.
▲
Show HN: Stable Diffusion training, inpainting, classifier guidance and upscale
github.com/Jack000
discuss
4 years ago
Jack000
29 points
More