HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
61.
▲
9x MobileNet V2 size reduction with Quantization aware training
github.com/dakshjain-1616
2 comments
4 months ago
gauravvij137
2 points
62.
▲
Show HN: Mistral-7B training using pyspark,DeepSpeed
github.com/genji970
discuss
10 months ago
gituser123
2 points
63.
▲
Sacrificial Training
github.com/jmward01
discuss
a year ago
thunderbong
2 points
64.
▲
Training LLMs on 1080 Tis without shadow weights
github.com/batteryphil
1 comment
4 months ago
batteryphil
1 points
65.
▲
Free Ruby AI Training Materials
github.com/thedayisntgray
discuss
a year ago
thedayisntgray
1 points
66.
▲
A Tutorial on Training Self-Play Agents
github.com/hardmaru
discuss
6 years ago
hardmaru
1 points
67.
▲
Open-Source LaMDA Model
discuss
4 years ago
EnricoShippole
27 points
68.
▲
Show HN: Fine-tuning an LLM on your code for better code completions
prvn.sh
discuss
a year ago
prvnsmpth
4 points
69.
▲
Nebulgym, a new open-source that accelerates AI training (~1.5-2x)
1 comment
4 years ago
emilec___
3 points
70.
▲
Show HN: TorchSubmit – Painless multi-node training with PyTorch (no SLURM/K8s)
github.com/dream3d-ai
discuss
2 years ago
tony_francis
3 points
71.
▲
Security of BIOS/UEFI System Firmware from Attacker and Defender Perspectives
github.com/advanced-threat-research
3 comments
9 years ago
adulau
57 points
72.
▲
I have trained StyleGAN2 from scratch with a dataset of female portraits
github.com/l4rz
20 comments
5 years ago
EvgeniyZh
20 points
73.
▲
Building a Simple (Android) User Interface (using JRuby / Ruboto)
github.com/KCErb
discuss
12 years ago
MrBra
2 points
74.
▲
ml-engineering/training/performance
github.com/stas00
discuss
a year ago
lordswork
2 points
75.
▲
Radare2 from a to Z (extended edition) [reverse engineering]
github.com/radareorg
discuss
10 years ago
j_s
2 points
76.
▲
MNIST Training in C# – Deep Learning
github.com/deepakkumar1984
discuss
9 years ago
siadroid
1 points
77.
▲
Show HN: Syna – Minimal ML and RL Framework Built from Scratch with NumPy
github.com/sql-hkr
discuss
8 months ago
sql-hkr
8 points
78.
▲
How Are My Hyperparameters Affecting My Training Time?
github.com/sigopt
discuss
10 years ago
alexcmu
2 points
79.
▲
Hands-on workshops and training sessions at Universe
github.com/blog
discuss
10 years ago
dwaxe
1 points
80.
▲
Llm.c – LLM training in simple, pure C/CUDA
github.com/karpathy
168 comments
2 years ago
tosh
1050 points
81.
▲
DeepSeek open source DeepEP – library for MoE training and Inference
github.com/deepseek-ai
71 comments
a year ago
helloericsf
536 points
82.
▲
CoreNet: A library for training deep neural networks
github.com/apple
131 comments
2 years ago
rocauc
494 points
83.
▲
SmolGPT: A minimal PyTorch implementation for training a small LLM from scratch
github.com/Om-Alve
55 comments
a year ago
amrrs
434 points
84.
▲
Show HN: Every Breath You Take – Heart Rate Variability Training
github.com/kbre93
118 comments
3 years ago
kbre93
348 points
85.
▲
Databricks Releases 15K Record Training Corpus for Instruction Tuning LLMs
github.com/databrickslabs
89 comments
3 years ago
xatalytic
347 points
86.
▲
Show HN: Duplicate 3 layers in a 24B LLM, logical deduction .22→.76. No training
github.com/alainnothere
80 comments
3 months ago
xlayn
265 points
87.
▲
Full LLM training and evaluation toolkit
github.com/huggingface
6 comments
2 years ago
testerui
249 points
88.
▲
DeepSpeed Chat: Easy, fast and affordable RLHF training of ChatGPT-like models
github.com/microsoft
55 comments
3 years ago
quantisan
240 points
89.
▲
LLMs can see and hear without any training
github.com/facebookresearch
66 comments
a year ago
T-A
210 points
90.
▲
Autoresearch: Agents researching on single-GPU nanochat training automatically
github.com/karpathy
58 comments
4 months ago
simonpure
208 points
More