Search: github.com/rlk | Heykuki News

HK

Heykuki News

Top New Best Ask Show Jobs

Top New Best Ask Show Jobs

511.

GitHub course of practical reinforcement learning

github.com/yandexdataschool

9 years ago

1 points

512.

Asyncronous RL in Tensorflow and Keras and OpenAI's Gym

github.com/coreylynch

10 years ago

1 points

513.

Schelling's dynamic model of segregation simulated in Racket

github.com/jmoy

11 years ago

1 points

514.

Show HN: CodeRLM – Tree-sitter-backed code indexing for LLM agents

github.com/JaredStewart

4 months ago

81 points

515.

LLM generated parsers and compliance checkers for Sparrow DSL

a month ago

3 points

516.

Show HN: Drone Swarm Control with RL in AirSim and SB3

github.com/Lauqz

a year ago

2 points

517.

Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL

github.com/Danau5tin

a year ago

125 points

518.

Show HN: Reinforcement Learning from Scratch with TypeScript

github.com/desi-ivanov

3 years ago

7 points

519.

TallMountain – Stoic Virtue Ethics for an LLM Agent

github.com/seamus-brady

9 months ago

3 points

520.

Show HN: RL from Scratch

github.com/desi-ivanov

3 years ago

3 points

521.

Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench

github.com/Danau5tin

8 months ago

2 points

522.

MFEK GPT-3 Policy – on GPT-3 aided computer code appearing in our FOSS libraries

github.com/MFEK

3 years ago

2 points

523.

Show HN: Recursive Language Model for Querying Human Action by Ludwig von Mises

github.com/mateolafalce

5 months ago

2 points

524.

Sparse Predictive Hierarchies, an alternative to deep learning [pdf]

github.com/ogmacorp

7 years ago

2 points

525.

Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting

github.com/ChenRocks

8 years ago

2 points

526.

Open source remake of Lode Runner for Roku box/tv

github.com/lvcabral

10 years ago

2 points

527.

Ask HN: Why is this Racket code so fast?

4 years ago

30 points

528.

Show HN: CLI to Test Supabase RLS Policies

github.com/Rodrigotari1

8 months ago

4 points

529.

Show HN: Tape/Z – a toolkit for analysing z/OS assembler (HLASM) code

github.com/avishek-sen-gupta

a year ago

2 points

530.

Deep Reinforcement Learning in Depth in 60 Days

github.com/andri27-ts

8 years ago

189 points

531.

Show HN: COBOL-REKT, a toolkit for analysing and reverse-engineering COBOL

github.com/avishek-sen-gupta

2 years ago

91 points

532.

Car Reinforcement Learning Training

github.com/leesweqq

a year ago

4 points

533.

Master Deep Reinforcement Learning – Week 3

github.com/andri27-ts

8 years ago

4 points

534.

Prince of Persia Port for Roku Box and TVs

github.com/lvcabral

10 years ago

3 points

535.

In-Context Reinforcement Learning

github.com/dunnolab

2 years ago

2 points

536.

rlemon.github.com

14 years ago

2 points

537.

Show HN: Retro 3000: 80s-style CLI API but with modern capabilities and easy API

github.com/sdegutis

7 years ago

2 points

538.

Deep Reinforcement Learning in Depth Week 5 – TRPO and PPO

github.com/andri27-ts

8 years ago

1 points

539.

Show HN: Minimalist self-hosted CI server written on Raku

2 years ago

6 points

540.

Show HN: TextPolicy – reinforcement learning for text generation on a MacBook

github.com/teilomillet

10 months ago

4 points