Search: magazine.sebastianraschka.com | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

1.

Building LLMs from the Ground Up: A 3-Hour Coding Workshop

magazine.sebastianraschka.com

2 years ago

970 points

2.

GPT-OSS vs. Qwen3 and a detailed look how things evolved since GPT-2

magazine.sebastianraschka.com

10 months ago

490 points

3.

Understanding Reasoning LLMs

magazine.sebastianraschka.com

a year ago

473 points

4.

LLM architecture comparison

magazine.sebastianraschka.com

a year ago

418 points

5.

Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation)

magazine.sebastianraschka.com

3 years ago

342 points

6.

Understanding large language models: A cross-section of the relevant literature

magazine.sebastianraschka.com

3 years ago

307 points

7.

Components of a Coding Agent

magazine.sebastianraschka.com

3 months ago

300 points

8.

Why the original transformer figure is wrong, and some other tidbits about LLMs

magazine.sebastianraschka.com

3 years ago

237 points

9.

Finetuning Large Language Models

magazine.sebastianraschka.com

3 years ago

223 points

10.

Understanding Llama 2 and the New Code Llama LLMs

magazine.sebastianraschka.com

3 years ago

170 points

11.

Coding Self-Attention, Multi-Head Attention, Cross-Attention, Causal-Attention

magazine.sebastianraschka.com

2 years ago

142 points

12.

Ten Noteworthy AI Research Papers of 2023

magazine.sebastianraschka.com

2 years ago

128 points

13.

AI and Open Source in 2023

magazine.sebastianraschka.com

3 years ago

123 points

14.

Training and aligning LLMs with RLHF and RLHF alternatives

magazine.sebastianraschka.com

3 years ago

102 points

15.

Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch

magazine.sebastianraschka.com

2 years ago

96 points

16.

KV Sharing, MHC, and Compressed Attention

magazine.sebastianraschka.com

a month ago

35 points

17.

A Visual Guide to Attention Variants in Modern LLMs

magazine.sebastianraschka.com

3 months ago

23 points

18.

A Technical Tour of the DeepSeek Models from V3 to v3.2

magazine.sebastianraschka.com

7 months ago

23 points

19.

AI Research Papers in Jan 2024: Model Merging, Mixtures of Experts, Smaller LLMs

magazine.sebastianraschka.com

2 years ago

20 points

20.

A Visual Guide to Attention Variants in Modern LLMs

magazine.sebastianraschka.com

3 months ago

9 points

21.

The State of LLMs 2025: Progress, Progress, and Predictions

magazine.sebastianraschka.com

6 months ago

9 points

22.

Ten Noteworthy AI Research Papers of 2023

magazine.sebastianraschka.com

2 years ago

9 points

23.

A Technical Tour of the DeepSeek Models from V3 to v3.2

magazine.sebastianraschka.com

7 months ago

giuliomagnifico

8 points

24.

Why would a famous former university ML professor make his posts paywalled?

magazine.sebastianraschka.com

3 years ago

7 points

25.

A Technical Tour of the DeepSeek Models from V3 to v3.2

magazine.sebastianraschka.com

7 months ago

5 points

26.

LLM Research Papers: The 2026 List (January to May)

magazine.sebastianraschka.com

14 days ago

5 points

27.

LLM Research Papers: The 2024 List

magazine.sebastianraschka.com

2 years ago

5 points

28.

New LLM Pre-Training and Post-Training Paradigms: How Modern LLMs Are Trained

magazine.sebastianraschka.com

2 years ago

5 points

29.

The latest major open LLM releases: Mixtral, Llama 3, Phi-3, and OpenELM

magazine.sebastianraschka.com

2 years ago

5 points

30.

AI Research Papers in November 2023: hallucinations and reasoning capabilities

magazine.sebastianraschka.com

3 years ago

5 points