HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
511.
▲
GitHub course of practical reinforcement learning
github.com/yandexdataschool
discuss
9 years ago
sshb
1 points
512.
▲
Asyncronous RL in Tensorflow and Keras and OpenAI's Gym
github.com/coreylynch
discuss
10 years ago
mau
1 points
513.
▲
Schelling's dynamic model of segregation simulated in Racket
github.com/jmoy
discuss
11 years ago
yomritoyj
1 points
514.
▲
Show HN: CodeRLM – Tree-sitter-backed code indexing for LLM agents
github.com/JaredStewart
37 comments
4 months ago
jared_stewart
81 points
515.
▲
LLM generated parsers and compliance checkers for Sparrow DSL
discuss
a month ago
melezhik
3 points
516.
▲
Show HN: Drone Swarm Control with RL in AirSim and SB3
github.com/Lauqz
discuss
a year ago
Lauqz
2 points
517.
▲
Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL
github.com/Danau5tin
12 comments
a year ago
Danau5tin
125 points
518.
▲
Show HN: Reinforcement Learning from Scratch with TypeScript
github.com/desi-ivanov
discuss
3 years ago
evolveyourmind
7 points
519.
▲
TallMountain – Stoic Virtue Ethics for an LLM Agent
github.com/seamus-brady
6 comments
9 months ago
s_brady
3 points
520.
▲
Show HN: RL from Scratch
github.com/desi-ivanov
discuss
3 years ago
evolveyourmind
3 points
521.
▲
Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench
github.com/Danau5tin
1 comment
8 months ago
Danau5tin
2 points
522.
▲
MFEK GPT-3 Policy – on GPT-3 aided computer code appearing in our FOSS libraries
github.com/MFEK
1 comment
3 years ago
kopipe
2 points
523.
▲
Show HN: Recursive Language Model for Querying Human Action by Ludwig von Mises
github.com/mateolafalce
discuss
5 months ago
lafalce
2 points
524.
▲
Sparse Predictive Hierarchies, an alternative to deep learning [pdf]
github.com/ogmacorp
discuss
7 years ago
craigjb
2 points
525.
▲
Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting
github.com/ChenRocks
discuss
8 years ago
blopeur
2 points
526.
▲
Open source remake of Lode Runner for Roku box/tv
github.com/lvcabral
discuss
10 years ago
lvcabral
2 points
527.
▲
Ask HN: Why is this Racket code so fast?
27 comments
4 years ago
exdsq
30 points
528.
▲
Show HN: CLI to Test Supabase RLS Policies
github.com/Rodrigotari1
4 comments
8 months ago
rodrigotarca
4 points
529.
▲
Show HN: Tape/Z – a toolkit for analysing z/OS assembler (HLASM) code
github.com/avishek-sen-gupta
discuss
a year ago
armorer
2 points
530.
▲
Deep Reinforcement Learning in Depth in 60 Days
github.com/andri27-ts
19 comments
8 years ago
andri27
189 points
531.
▲
Show HN: COBOL-REKT, a toolkit for analysing and reverse-engineering COBOL
github.com/avishek-sen-gupta
49 comments
2 years ago
armorer
91 points
532.
▲
Car Reinforcement Learning Training
github.com/leesweqq
1 comment
a year ago
kyleliiii
4 points
533.
▲
Master Deep Reinforcement Learning – Week 3
github.com/andri27-ts
discuss
8 years ago
andri27
4 points
534.
▲
Prince of Persia Port for Roku Box and TVs
github.com/lvcabral
discuss
10 years ago
lvcabral
3 points
535.
▲
In-Context Reinforcement Learning
github.com/dunnolab
discuss
2 years ago
vokneruk
2 points
536.
▲
Typesetting.js
rlemon.github.com
discuss
14 years ago
jrgifford
2 points
537.
▲
Show HN: Retro 3000: 80s-style CLI API but with modern capabilities and easy API
github.com/sdegutis
discuss
7 years ago
sdegutis
2 points
538.
▲
Deep Reinforcement Learning in Depth Week 5 – TRPO and PPO
github.com/andri27-ts
discuss
8 years ago
andri27
1 points
539.
▲
Show HN: Minimalist self-hosted CI server written on Raku
1 comment
2 years ago
melezhik
6 points
540.
▲
Show HN: TextPolicy – reinforcement learning for text generation on a MacBook
github.com/teilomillet
discuss
10 months ago
teilom
4 points
More