HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
481.
▲
A Github course in reinforcement learning in the wild
github.com/yandexdataschool
discuss
9 years ago
sklearnman
3 points
482.
▲
Show HN: A tool to re-key static AWS access keys
github.com/vaijab
discuss
10 years ago
vaijab
3 points
483.
▲
Rlclaw autonomous ML research companion
github.com/photon-cat
1 comment
3 months ago
photoncat
2 points
484.
▲
Linguistic RL: 3B Models Exceed 100B Performance (86% vs. 81%)
github.com/DRawson5570
1 comment
7 months ago
drawson5570
2 points
485.
▲
Raksha by Google Research
github.com/google-research
1 comment
5 years ago
tiziano88
2 points
486.
▲
Wii Reinforcement Learning
github.com/VIPTankz
discuss
6 months ago
arvindh-manian
2 points
487.
▲
Show HN: Linguistic RL – A 7B model discovers Occam's Razor through reflection
github.com/DRawson5570
discuss
8 months ago
drawson5570
2 points
488.
▲
Recent cross-research on LLM and RL on ArXiv
github.com/WindyLab
discuss
10 months ago
Anon84
2 points
489.
▲
Abusing Roku APIs
github.com/RoseSecurity
discuss
3 years ago
notmysql_
2 points
490.
▲
Show HN: I've just ported the RWKV LLM to Fortran
github.com/FortAI-Hub
discuss
3 years ago
matteogrella
2 points
491.
▲
Rkflashtool: Tools for Flashing Rockchip Devices
github.com/linux-rockchip
discuss
5 years ago
pabs3
2 points
492.
▲
Show HN: Decentralized Reinforcement Learning with Societal Decision-Making
github.com/mbchang
discuss
6 years ago
bbdaph
2 points
493.
▲
A GitHub course in reinforcement learning in the wild
github.com/yandexdataschool
discuss
8 years ago
justheuristic
2 points
494.
▲
A git-course on reinforcement learning in the wild
github.com/yandexdataschool
discuss
9 years ago
sklearnman
2 points
495.
▲
Show HN: A Node.js Implementation of the Rapid Automatic Keyword Extraction Algo
github.com/waseem18
discuss
9 years ago
wasim_thabraze
2 points
496.
▲
Show HN: Hands-on course for building RL environments for LLMs
github.com/anakin87
1 comment
2 months ago
anakin87
1 points
497.
▲
Show HN: Framework for Transferring AI Capabilities (Students Surpass Teachers)
github.com/DRawson5570
1 comment
7 months ago
drawson5570
1 points
498.
▲
The open-source embodied intelligence simulation platform
github.com/loongOpen
1 comment
a year ago
OpenLoong
1 points
499.
▲
MicroSafe-RL – Deterministic $1.18 \mu s$ safety layer for Edge AI on MCUs
github.com/Kretski
discuss
3 months ago
DREDREG
1 points
500.
▲
MicroSafe-RL v1.0 – Sub-microsecond safety for Edge AI
github.com/Kretski
discuss
3 months ago
DREDREG
1 points
501.
▲
Show HN: Modeled healthcare de-identification as longitudinal RL control problem
github.com/azithteja91
discuss
4 months ago
vkatganti
1 points
502.
▲
Rkgk UI — A low latency Digital Art software on the browser
github.com/michael-0acf4
discuss
6 months ago
michael-0acf4
1 points
503.
▲
Practical RL (Yandex Data School)
github.com/yandexdataschool
discuss
a year ago
xianshou
1 points
504.
▲
Logic R1: Reproduce DeepSeek R1 Zero on 2K Logic Puzzle Dataset
github.com/Unakar
discuss
a year ago
limoce
1 points
505.
▲
Suika Reinforcement Learning Environment
github.com/edwhu
discuss
2 years ago
edhu2017
1 points
506.
▲
Awesome-Rl-for-Cybersecurity
github.com/Limmen
discuss
5 years ago
limmen
1 points
507.
▲
Fixing a deadlock in a Common Lisp library for Kafka
github.com/SahilKang
discuss
6 years ago
sahil-kang
1 points
508.
▲
Numerical Integration: RK4
github.com/felipetavares
discuss
7 years ago
felipetavares
1 points
509.
▲
Show HN: Jeevan-rakht
github.com/UdacityFrontEndScholarship
discuss
8 years ago
skywalker212
1 points
510.
▲
Controlling a unicycle with Policy Gradients
github.com/pauli-space
discuss
8 years ago
aidanrocke
1 points
More