HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Request
61.
New LLM Pre-Training and Post-Training Paradigms: How Modern LLMs Are Trained
magazine.sebastianraschka.com
discuss
2 years ago
sbbq
5 points
62.
The latest major open LLM releases: Mixtral, Llama 3, Phi-3, and OpenELM
magazine.sebastianraschka.com
discuss
2 years ago
rasbt
5 points
63.
AI Research Papers in November 2023: hallucinations and reasoning capabilities
magazine.sebastianraschka.com
discuss
3 years ago
rasbt
5 points
64.
AI Research Papers (October 2023)
magazine.sebastianraschka.com
discuss
3 years ago
rasbt
5 points
65.
AI chips, acquisitions, new "small" open-source LLMs, and new LoRA techniques
magazine.sebastianraschka.com
discuss
3 years ago
rasbt
5 points
66.
Understanding Encoder and Decoder LLMs
magazine.sebastianraschka.com
discuss
3 years ago
rasbt
5 points
67.
Naive Bayes and Text Classification
sebastianraschka.com
discuss
11 years ago
betolink
5 points
68.
Turn Your Twitter Timeline into a Word Cloud Using Python
sebastianraschka.com
discuss
12 years ago
rasbt
4 points
69.
Implementing a Principal Component Analysis (PCA) in Python step by step
sebastianraschka.com
discuss
12 years ago
rasbt
4 points
70.
Preparing data for Machine Learning tasks in python
sebastianraschka.com
discuss
12 years ago
mathattack
4 points
71.
Diving deep into Python – the not-so-obvious language parts
sebastianraschka.com
discuss
12 years ago
signa11
4 points
72.
Developments in LLM Architectures: KV Sharing, MHC, and Compressed Attention
magazine.sebastianraschka.com
discuss
a month ago
ibobev
4 points
73.
My Workflow for Understanding LLM Architectures
magazine.sebastianraschka.com
discuss
2 months ago
ibobev
4 points
74.
A Round Up and Comparison of 10 Open-Weight LLM Releases in Spring 2026
magazine.sebastianraschka.com
discuss
4 months ago
MindGods
4 points
75.
The State of LLMs 2025: Progress, Problems, and Predictions
magazine.sebastianraschka.com
discuss
6 months ago
ibobev
4 points
76.
Getting the Most Out of a Technical Book
sebastianraschka.com
discuss
7 months ago
quietlearning
4 points
77.
Popular Attention Alternatives: GQA, MLA, SWA
sebastianraschka.com
discuss
8 months ago
ModelForge
4 points
78.
Multi-Head Latent Attention
sebastianraschka.com
discuss
8 months ago
ModelForge
4 points
79.
LLM Evaluation from Scratch: Multiple Choice, Verifiers, Leaderboards, LLM Judge
magazine.sebastianraschka.com
discuss
9 months ago
ModelForge
4 points
80.
PyTorch in One Hour: From Tensors to Training Neural Networks on Multiple GPUs
sebastianraschka.com
discuss
a year ago
mariuz
4 points
81.
PyTorch in One Hour: From Tensors to Training Neural Networks on Multiple GPUs
sebastianraschka.com
discuss
a year ago
sbbq
4 points
82.
Coding LLMs from the Ground Up: A Complete Course
sebastianraschka.com
discuss
a year ago
sbbq
4 points
83.
The State of Reinforcement Learning for LLM Reasoning
magazine.sebastianraschka.com
discuss
a year ago
mdp2021
4 points
84.
The State of Reasoning Models
magazine.sebastianraschka.com
discuss
a year ago
sbbq
4 points
85.
Understanding Reasoning LLMs
sebastianraschka.com
discuss
a year ago
sbbq
4 points
86.
Implementing a Byte Pair Encoding (BPE) Tokenizer from Scratch
sebastianraschka.com
discuss
a year ago
sbbq
4 points
87.
Collection of 1k LLM Research Papers of 2024
sebastianraschka.com
discuss
a year ago
sbbq
4 points
88.
Understanding Multimodal LLMs: The Main Techniques and Latest Models
sebastianraschka.com
discuss
2 years ago
sbbq
4 points
89.
Numerical matrix manipulation – cheat sheet for Matlab, NumPy, R, and Julia
sebastianraschka.com
discuss
9 years ago
PascLeRasc
4 points
90.
Answers to Frequently Asked Questions in Machine Learning
sebastianraschka.com
discuss
10 years ago
rasbt
4 points