HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
601.
▲
Corroded – Ruining Rust
github.com/buyukakyuz
discuss
6 months ago
ravenical
2 points
602.
▲
Wii Reinforcement Learning
github.com/VIPTankz
discuss
6 months ago
arvindh-manian
2 points
603.
▲
Reinforcement Learning Infrastructure for LLM Agents
github.com/NVIDIA-NeMo
discuss
6 months ago
bakigul
2 points
604.
▲
Show HN: I built an OpenTelemetry extension that shows traces as rain in the sky
github.com/theletterf
discuss
6 months ago
theletterf
2 points
605.
▲
Verifiers: Environments for LLM Reinforcement Learning
github.com/PrimeIntellect-ai
discuss
9 months ago
dominik-space
2 points
606.
▲
Show HN: AI coding assistant that helps to build things, not to ruin them
github.com/volotat
discuss
10 months ago
volotat
2 points
607.
▲
AReaL, Distributed Reinforcement Learning System for LLM Reasoning
github.com/inclusionAI
discuss
a year ago
jinqueeny
2 points
608.
▲
AReaL: Distributed Reinforcement Learning System for LLM Reasoning
github.com/inclusionAI
discuss
a year ago
jinqueeny
2 points
609.
▲
In-Context Reinforcement Learning
github.com/dunnolab
discuss
2 years ago
vokneruk
2 points
610.
▲
Tetris Gymnasium: A customizable reinforcement learning environment for Tetris
github.com/Max-We
discuss
2 years ago
mw00
2 points
611.
▲
Kolmogorov-Arnold Network for Reinforcement Leaning, Initial Experiments
github.com/riiswa
discuss
2 years ago
riiswa
2 points
612.
▲
Matrix digital rain implemented in Bash
github.com/wick3dr0se
discuss
2 years ago
thunderbong
2 points
613.
▲
Pydantic v2 ruined the elegance of Pydantic v1
github.com/pydantic
discuss
2 years ago
behnamoh
2 points
614.
▲
Ask HN: Rinf copies flutter_rust_bridge, says bridge bad, claims rinf ultimate
discuss
2 years ago
fzyzcjy
2 points
615.
▲
Pearl – A Production-Ready Reinforcement Learning AI Agent Library by Meta
github.com/facebookresearch
discuss
3 years ago
jcater
2 points
616.
▲
Friend: An extensible authentication and authorization library for Clojure Ring
github.com/cemerick
discuss
14 years ago
nickik
2 points
617.
▲
Why we need Reinforcement Learning for Language Model training
gist.github.com
discuss
3 years ago
yamrzou
2 points
618.
▲
Melting Pot: A suite of test scenarios for multi-agent reinforcement learning
github.com/deepmind
discuss
5 years ago
lnyan
2 points
619.
▲
Inverse Reinforcement Learning on Acrobot-v1
github.com/Vrroom
discuss
5 years ago
matroid
2 points
620.
▲
DeepMimic: Motion imitation with deep reinforcement learning
github.com/xbpeng
discuss
5 years ago
homarp
2 points
621.
▲
Jupylet: A Jupyter extension for Reinforcement Learning experiments
github.com/nir
discuss
5 years ago
cool-RR
2 points
622.
▲
FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading
github.com/AI4Finance-LLC
discuss
6 years ago
T-A
2 points
623.
▲
Carl (Car Game for Reinforcement Learning)
github.com/MatthiasSchinzel
discuss
6 years ago
MadS123
2 points
624.
▲
Show HN: Decentralized Reinforcement Learning with Societal Decision-Making
github.com/mbchang
discuss
6 years ago
bbdaph
2 points
625.
▲
Julia Reinforcement Learning Implementations
github.com/fabio-4
discuss
6 years ago
fabio-4
2 points
626.
▲
Making rxi's lite my main text editor
github.com/a327ex
discuss
6 years ago
dsego
2 points
627.
▲
Minimal implementations of Reinforcement Learning algorithms
github.com/seungeunrho
discuss
6 years ago
ag8
2 points
628.
▲
People's Reinforcement Learning (PRL)
github.com/opium-sh
discuss
6 years ago
jonbaer
2 points
629.
▲
Deep-tic-tac-toe: deep reinforcement learning to play tic-tac-toe
github.com/ZackAkil
discuss
6 years ago
sebg
2 points
630.
▲
Show HN: Lock Free MRMW Ring in Go
github.com/mitghi
discuss
7 years ago
mitghi
2 points
More