HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Implementing DeepSeek R1's GRPO algorithm from scratch
github.com/policy-gradient
3 comments
a year ago
xcodevn
192 points
2.
▲
A minimal hackable implementation of policy gradients (GRPO, PPO, REINFORCE)
github.com/zafstojano
discuss
5 months ago
starzmustdie
1 points
3.
▲
Experimenting with policy gradient methods in Jax
github.com/elliotvilhelm
discuss
a year ago
monadicmonad
2 points
4.
▲
OpenAi Gym: Policy Gradient
github.com/Mortiniera
discuss
7 years ago
mortinie
2 points
5.
▲
Multi-Agent Deep Deterministic Policy Gradient
github.com/openai
discuss
8 years ago
stablemap
2 points
6.
▲
Controlling a unicycle with Policy Gradients
github.com/pauli-space
discuss
8 years ago
aidanrocke
1 points
7.
▲
AI and Games
discuss
10 months ago
shehabyasser
3 points
8.
▲
Show HN: Qantify – GPU-Accelerated Trading Library with Advanced Math and AutoML
github.com/Alradyin
discuss
7 months ago
Alradyin
1 points