HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
181.
▲
Moe-LLaVA: Mixture of Experts for Large Vision-Language Models
github.com/PKU-YuanGroup
discuss
2 years ago
GaggiX
2 points
182.
▲
Hydra – Model of Experts
github.com/SkunkworksAI
discuss
3 years ago
tosh
2 points
183.
▲
Pruning GPT-OSS 4.8B to 20B (232 models)
github.com/AmanPriyanshu
1 comment
10 months ago
privacyhateai
3 points
184.
▲
Rails MVC VS Sproutcore MVC
gmoeck.github.com
8 comments
15 years ago
gmoeck
44 points
185.
▲
Show HN: A different interface for reading Hacker News
moeffju.github.com
19 comments
15 years ago
moeffju
36 points
186.
▲
Don't Make Your Code "More Testable"
gmoeck.github.com
discuss
14 years ago
michaelfairley
4 points
187.
▲
Why you should care about encapsulation
gmoeck.github.com
discuss
15 years ago
mapleoin
1 points
188.
▲
Löb and möb: strange loops in Haskell (2015)
github.com/quchen
60 comments
3 years ago
hjnkk
153 points
189.
▲
Löb and Möb: Loops in Haskell (2013)
github.com/quchen
16 comments
8 months ago
fanf2
91 points
190.
▲
Löb and möb: strange loops in Haskell (2013)
github.com/quchen
10 comments
8 years ago
improv32
86 points
191.
▲
Löb and möb: strange loops in Haskell
github.com/quchen
discuss
13 years ago
lelf
4 points
192.
▲
Löb and möb: strange loops in Haskell
github.com/quchen
discuss
4 years ago
isaac21259
4 points
193.
▲
Löb and möb: strange loops in Haskell
github.com/quchen
discuss
11 years ago
wz1000
2 points
194.
▲
DeepSeek open source DeepEP – library for MoE training and Inference
github.com/deepseek-ai
71 comments
a year ago
helloericsf
536 points
195.
▲
Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model
github.com/MoonshotAI
2 comments
a year ago
ConteMascetti71
352 points
196.
▲
Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model
twitter.com
179 comments
a year ago
c4pt0r
348 points
197.
▲
LPLB: An early research stage MoE load balancer based on linear programming
github.com/deepseek-ai
discuss
7 months ago
simonpure
43 points
198.
▲
DeepSeek-VL2: MoE Vision-Language Models for Advanced Multimodal Understanding
github.com/deepseek-ai
7 comments
a year ago
selvan
36 points
199.
▲
DeepSeek-V2: A Strong, Economical, and Efficient Moe Language Model
github.com/deepseek-ai
3 comments
2 years ago
jasondavies
14 points
200.
▲
A Library to build MoE from HF models
6 comments
2 years ago
zmy999
9 points
201.
▲
Show HN: Phase Router – capacity-aware routing for MoE
github.com/TSltd
1 comment
2 months ago
TSltd
5 points
202.
▲
Running a 35B MoE model on a 2017 AMD RX 580 8GB via Vulkan (no ROCm/CUDA)
github.com/aivisionslab-studios
discuss
2 days ago
aivisionslab
4 points
203.
▲
LongCat-Flash, a language model with 560B total parameters, MoE architecture
github.com/meituan-longcat
discuss
10 months ago
jinqueeny
4 points
204.
▲
HuggingFace: Support for the Mixtral Moe
github.com/huggingface
discuss
3 years ago
tosh
4 points
205.
▲
Slicing an 80B MoE LLM into 40B domain specialists
github.com/JThomas-CoE
1 comment
3 months ago
JThomas-CoE
3 points
206.
▲
Show HN: A 6.9B Moe LLM in Rust, Go, and Python
github.com/fumi-engineer
1 comment
5 months ago
NightBlossom
3 points
207.
▲
Shimmy v1.7.0: Running 42B Moe Models on Consumer GPUs with 99.9% VRAM Reduction
github.com/Michael-A-Kuykendall
1 comment
8 months ago
MKuykendall
3 points
208.
▲
Huawei's Pangu Pro MoE model is likely derived from Qwen model
github.com/HonestAGI
1 comment
a year ago
delifue
3 points
209.
▲
Show HN: Modernizing my old PhD work in an evening with little Qwen3.6 MoE
github.com/verdverm
discuss
a month ago
verdverm
3 points
210.
▲
QMoE Support for Mixtral
github.com/ggerganov
discuss
3 years ago
tosh
3 points
More