HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Building LLMs from the Ground Up: A 3-Hour Coding Workshop
magazine.sebastianraschka.com
136 comments
2 years ago
mdp2021
970 points
2.
▲
GPT-OSS vs. Qwen3 and a detailed look how things evolved since GPT-2
magazine.sebastianraschka.com
97 comments
10 months ago
ModelForge
490 points
3.
▲
Understanding Reasoning LLMs
magazine.sebastianraschka.com
183 comments
a year ago
sebg
473 points
4.
▲
LLM architecture comparison
magazine.sebastianraschka.com
24 comments
a year ago
mdp2021
418 points
5.
▲
Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation)
magazine.sebastianraschka.com
27 comments
3 years ago
rasbt
342 points
6.
▲
Understanding large language models: A cross-section of the relevant literature
magazine.sebastianraschka.com
31 comments
3 years ago
headalgorithm
307 points
7.
▲
Components of a Coding Agent
magazine.sebastianraschka.com
90 comments
3 months ago
MindGods
300 points
8.
▲
Why the original transformer figure is wrong, and some other tidbits about LLMs
magazine.sebastianraschka.com
49 comments
3 years ago
rasbt
237 points
9.
▲
Finetuning Large Language Models
magazine.sebastianraschka.com
70 comments
3 years ago
headalgorithm
223 points
10.
▲
Understanding Llama 2 and the New Code Llama LLMs
magazine.sebastianraschka.com
34 comments
3 years ago
rasbt
170 points
11.
▲
Coding Self-Attention, Multi-Head Attention, Cross-Attention, Causal-Attention
magazine.sebastianraschka.com
11 comments
2 years ago
rasbt
142 points
12.
▲
Ten Noteworthy AI Research Papers of 2023
magazine.sebastianraschka.com
19 comments
2 years ago
danboarder
128 points
13.
▲
AI and Open Source in 2023
magazine.sebastianraschka.com
67 comments
3 years ago
belter
123 points
14.
▲
Training and aligning LLMs with RLHF and RLHF alternatives
magazine.sebastianraschka.com
14 comments
3 years ago
rasbt
102 points
15.
▲
Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch
magazine.sebastianraschka.com
10 comments
2 years ago
rasbt
96 points
16.
▲
KV Sharing, MHC, and Compressed Attention
magazine.sebastianraschka.com
3 comments
a month ago
gmays
35 points
17.
▲
A Visual Guide to Attention Variants in Modern LLMs
magazine.sebastianraschka.com
1 comment
3 months ago
Anon84
23 points
18.
▲
A Technical Tour of the DeepSeek Models from V3 to v3.2
magazine.sebastianraschka.com
1 comment
7 months ago
ibobev
23 points
19.
▲
AI Research Papers in Jan 2024: Model Merging, Mixtures of Experts, Smaller LLMs
magazine.sebastianraschka.com
discuss
2 years ago
rasbt
20 points
20.
▲
A Visual Guide to Attention Variants in Modern LLMs
magazine.sebastianraschka.com
discuss
3 months ago
Brajeshwar
9 points
21.
▲
The State of LLMs 2025: Progress, Progress, and Predictions
magazine.sebastianraschka.com
discuss
6 months ago
vismit2000
9 points
22.
▲
Ten Noteworthy AI Research Papers of 2023
magazine.sebastianraschka.com
discuss
2 years ago
lucasus
9 points
23.
▲
A Technical Tour of the DeepSeek Models from V3 to v3.2
magazine.sebastianraschka.com
discuss
7 months ago
giuliomagnifico
8 points
24.
▲
Why would a famous former university ML professor make his posts paywalled?
magazine.sebastianraschka.com
1 comment
3 years ago
behnamoh
7 points
25.
▲
A Technical Tour of the DeepSeek Models from V3 to v3.2
magazine.sebastianraschka.com
1 comment
7 months ago
mzl
5 points
26.
▲
LLM Research Papers: The 2026 List (January to May)
magazine.sebastianraschka.com
discuss
14 days ago
ibobev
5 points
27.
▲
LLM Research Papers: The 2024 List
magazine.sebastianraschka.com
discuss
2 years ago
ModelForge
5 points
28.
▲
New LLM Pre-Training and Post-Training Paradigms: How Modern LLMs Are Trained
magazine.sebastianraschka.com
discuss
2 years ago
sbbq
5 points
29.
▲
The latest major open LLM releases: Mixtral, Llama 3, Phi-3, and OpenELM
magazine.sebastianraschka.com
discuss
2 years ago
rasbt
5 points
30.
▲
AI Research Papers in November 2023: hallucinations and reasoning capabilities
magazine.sebastianraschka.com
discuss
3 years ago
rasbt
5 points
More