HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
211.
▲
OpenMoE – A family of open-sourced Mixture-of-Experts (MoE) LLMs
github.com/XueFuzhao
discuss
3 years ago
tim_sw
3 points
212.
▲
Show HN: Tok/s on a 35B MoE model using a $100 AMD crypto APU and Vulkan
github.com/akandr
1 comment
3 months ago
akandr
2 points
213.
▲
Mistral: Light-weight library for mixture-of-experts (MoE) training
github.com/mistralai
1 comment
3 years ago
georgehill
2 points
214.
▲
Show HN: Ported Cerebras REAP to MLX – Prune MoE Experts on a MacBook
github.com/egesabanci
discuss
21 days ago
egesabanci
2 points
215.
▲
Live 204-node MoE visualization reveals emergent cognitive stratification
github.com/eriirfos-eng
discuss
a month ago
rfi-irfos
2 points
216.
▲
Show HN: 35B MoE LLM and other models locally on an old AMD crypto APU (BC250)
github.com/akandr
discuss
3 months ago
akandr
2 points
217.
▲
Dots.llm1: open-source MoE LLM with 142B total and 14B active parameters
github.com/rednote-hilab
discuss
a year ago
simonpure
2 points
218.
▲
Every Flop Counts: Scaling 300B Moe LLMs Without Premium GPUs [pdf]
github.com/inclusionAI
discuss
a year ago
mountainview
2 points
219.
▲
Lamini Memory Tuning: near-perfect fact recall via 1M-way MoE [pdf]
github.com/lamini-ai
discuss
2 years ago
Bluestein
2 points
220.
▲
Show HN: SwiftLM – Qwen Chat on iPhone, 100B+ Moe on M5 Pro 64GB (Native Swift)
github.com/SharpAI
2 comments
3 months ago
aegis_camera
1 points
221.
▲
DirectStorage LLM Weight Streaming: 4x faster loading, MoE expert streaming
github.com/kibbyd
1 comment
4 months ago
kibbyd1985
1 points
222.
▲
Micro-Expert-Router: Running Mixtral-Class Moe Models on NVMe SSDs Without a GPU
github.com/randyap8-wq
discuss
a month ago
randyap8
1 points
223.
▲
Why Gemma-4 26B MoE works in HuggingFace but breaks in prod inference engines
github.com/maeddesg
discuss
a month ago
maeddesg
1 points
224.
▲
Has anyone else hit expert homogeneity collapse in small MoE models?
github.com/eriirfos-eng
discuss
a month ago
rfi-irfos
1 points
225.
▲
ARCHE3-7B – Sparse Moe with SmartRouter and Foundation Curriculum Training
discuss
3 months ago
OpenSynapseLabs
1 points
226.
▲
QuantumLeap: 2.3× faster MoE inference with intelligent expert caching
github.com/MartinCrespoC
discuss
3 months ago
ikharoz
1 points
227.
▲
Show HN: Adaptive-K – Cut MoE inference costs 30-50% with entropy-guided routing
github.com/Gabrobals
discuss
5 months ago
Gabrielebalsamo
1 points
228.
▲
Show HN: LLM Inference Performance Analytic Tool for Moe Models (DeepSeek/etc.)
github.com/kevinyuan
discuss
7 months ago
kevin-2025
1 points
229.
▲
DeepSeek-VL2: Moe Vision-Language Models for Advanced Multimodal Understanding [pdf]
github.com/deepseek-ai
discuss
2 years ago
limoce
1 points
230.
▲
Aria: Open Multimodal Native Moe
github.com/rhymes-ai
discuss
2 years ago
simonpure
1 points
231.
▲
Yosoro – Moe Style Markdown NoteBook
github.com/IceEnd
discuss
8 years ago
tvvocold
1 points
232.
▲
Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL
github.com/Danau5tin
12 comments
a year ago
Danau5tin
125 points
233.
▲
Show HN: Pica – Rust-based agentic AI infrastructure (open-source)
picaos.com
44 comments
a year ago
moekatib
63 points
234.
▲
Launch HN: General Instinct (YC P26) – Frontier models on edge devices
16 comments
18 days ago
guanming0717
63 points
235.
▲
Show HN: LeanRL: Fast PyTorch RL with Torch.compile and CUDA Graphs
github.com/pytorch-labs
5 comments
2 years ago
vmoens
53 points
236.
▲
Show HN: KTransformers–236B Model and 1M Context LLM Inference on Local Machines
github.com/kvcache-ai
3 comments
2 years ago
sssummer
20 points
237.
▲
Show HN: Run 500B+ Parameter LLMs Locally on a Mac Mini
github.com/opengraviton
10 comments
3 months ago
fatihturker
17 points
238.
▲
Show HN: Lemonade: Run LLMs Locally with GPU and NPU Acceleration
github.com/lemonade-sdk
discuss
10 months ago
ramkrishna2910
15 points
239.
▲
Show HN: KTransformers:671B DeepSeek-R1 on a Single Machine-286 tokens/s Prefill
github.com/kvcache-ai
discuss
a year ago
sssummer
14 points
240.
▲
Show HN: OpenGraviton – Run 500B+ parameter models on a consumer Mac Mini
opengraviton.github.io
5 comments
4 months ago
fatihturker
13 points
More