HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
31.
▲
AI Research Papers (October 2023)
magazine.sebastianraschka.com
discuss
3 years ago
rasbt
5 points
32.
▲
AI chips, acquisitions, new "small" open-source LLMs, and new LoRA techniques
magazine.sebastianraschka.com
discuss
3 years ago
rasbt
5 points
33.
▲
Understanding Encoder and Decoder LLMs
magazine.sebastianraschka.com
discuss
3 years ago
rasbt
5 points
34.
▲
Developments in LLM Architectures: KV Sharing, MHC, and Compressed Attention
magazine.sebastianraschka.com
discuss
a month ago
ibobev
4 points
35.
▲
My Workflow for Understanding LLM Architectures
magazine.sebastianraschka.com
discuss
2 months ago
ibobev
4 points
36.
▲
A Round Up and Comparison of 10 Open-Weight LLM Releases in Spring 2026
magazine.sebastianraschka.com
discuss
4 months ago
MindGods
4 points
37.
▲
The State of LLMs 2025: Progress, Progress, and Predictions
magazine.sebastianraschka.com
discuss
6 months ago
ibobev
4 points
38.
▲
LLM Evaluation from Scratch: Multiple Choice, Verifiers, Leaderboards, LLM Judge
magazine.sebastianraschka.com
discuss
9 months ago
ModelForge
4 points
39.
▲
The State of Reinforcement Learning for LLM Reasoning
magazine.sebastianraschka.com
discuss
a year ago
mdp2021
4 points
40.
▲
The State of Reasoning Models
magazine.sebastianraschka.com
discuss
a year ago
sbbq
4 points
41.
▲
Recent Developments in LLM Architectures: KV Sharing, MHC, Compressed Attention
magazine.sebastianraschka.com
discuss
a month ago
pretext
3 points
42.
▲
The State of LLMs 2025: Progress, Problems, and Predictions
magazine.sebastianraschka.com
discuss
6 months ago
ModelForge
3 points
43.
▲
From GPT-2 to GPT-OSS: Analyzing the Architectural Advances
magazine.sebastianraschka.com
discuss
10 months ago
mdp2021
3 points
44.
▲
The Big LLM Architecture Comparison
magazine.sebastianraschka.com
discuss
a year ago
Quizzical4230
3 points
45.
▲
Understanding the LLM Development Cycle: Building, Training, Finetuning
magazine.sebastianraschka.com
discuss
2 years ago
rasbt
3 points
46.
▲
Noteworthy AI Research Papers of 2023
magazine.sebastianraschka.com
discuss
2 years ago
rasbt
3 points
47.
▲
AI research papers summaries and highlights (Aug to Sep)
magazine.sebastianraschka.com
discuss
3 years ago
rasbt
3 points
48.
▲
Does it beat LLMs? NN+Gzip method reimplemented and explained step-by-step
magazine.sebastianraschka.com
discuss
3 years ago
rasbt
3 points
49.
▲
Large Language Models 3.0
magazine.sebastianraschka.com
discuss
3 years ago
headalgorithm
3 points
50.
▲
New LLM Pre-Training and Post-Training Paradigms
magazine.sebastianraschka.com
1 comment
6 months ago
lr0
2 points
51.
▲
Recent Developments in LLM Architectures: KV Sharing, MHC, Compressed Attention
magazine.sebastianraschka.com
discuss
a month ago
vismit2000
2 points
52.
▲
A Researcher's Field Guide to Non-Standard LLM Architectures
magazine.sebastianraschka.com
discuss
8 months ago
ModelForge
2 points
53.
▲
Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)
magazine.sebastianraschka.com
discuss
8 months ago
ibobev
2 points
54.
▲
Understanding and Coding the KV Cache in LLMs from Scratch
magazine.sebastianraschka.com
discuss
a year ago
tosh
2 points
55.
▲
Coding LLMs from the Ground Up: A Complete Course
magazine.sebastianraschka.com
discuss
a year ago
mdp2021
2 points
56.
▲
The State of LLM Reasoning Models
magazine.sebastianraschka.com
discuss
a year ago
Philpax
2 points
57.
▲
Understanding Multimodal LLMs
magazine.sebastianraschka.com
discuss
2 years ago
lapnect
2 points
58.
▲
Building a GPT-Style LLM Classifier from Scratch
magazine.sebastianraschka.com
discuss
2 years ago
mdp2021
2 points
59.
▲
Tips for LLM Pretraining and Evaluating Reward Models
magazine.sebastianraschka.com
discuss
2 years ago
sbbq
2 points
60.
▲
Tips for LLM Pretraining and Evaluating Reward Models
magazine.sebastianraschka.com
discuss
2 years ago
tosh
2 points
More