HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
1.
▲
Show HN: ART – a new open-source RL framework for training agents
github.com/OpenPipe
12 comments
a year ago
kcorbitt
116 points
2.
▲
Show HN: RULER – Easily apply RL to any agent
openpipe.ai
11 comments
a year ago
kcorbitt
81 points
3.
▲
Show HN: Automatically convert your GPT-3.5 prompt to Llama 2
2 comments
3 years ago
kcorbitt
13 points
4.
▲
Is AI the next crypto? Insights from HN comments
openpipe.ai
367 comments
3 years ago
kcorbitt
237 points
5.
▲
Mistral 7B Fine-Tune Optimized
openpipe.ai
103 comments
3 years ago
tosh
234 points
6.
▲
Using reinforcement learning and $4.80 of GPU time to find the best HN post
openpipe.ai
95 comments
2 years ago
kcorbitt
217 points
7.
▲
Using GRPO to Beat o1, o3-mini and R1 at “Temporal Clue”
openpipe.ai
55 comments
a year ago
kcorbitt
199 points
8.
▲
OpenPipe Mixture of Agents: Outperform GPT-4 at 1/25th the Cost
openpipe.ai
2 comments
2 years ago
kcorbitt
13 points
9.
▲
Serverless RL: Faster, Cheaper and More Flexible RL Training
openpipe.ai
3 comments
8 months ago
slewis
9 points
10.
▲
PII-Redact – SOTA PII Redaction on Your Laptop
openpipe.ai
1 comment
a year ago
Arctic_fly
6 points
11.
▲
Analyzing OpenAI's Reinforcement Fine-Tuning: Less Data, Better Results
openpipe.ai
discuss
a year ago
kcorbitt
4 points
12.
▲
ART·E: how we built an email research agent that beats o3
openpipe.ai
2 comments
a year ago
kcorbitt
3 points
13.
▲
Everything I know about reward hacking
openpipe.ai
discuss
a year ago
kcorbitt
3 points
14.
▲
Fine-Tuning Best Practices Series Introduction and Chapter 1: Training Data
openpipe.ai
discuss
2 years ago
sebg
3 points
15.
▲
What we've learned in 3 days of Llama 3
openpipe.ai
discuss
2 years ago
kcorbitt
3 points
16.
▲
LLM Fine-Tuning Best Practices: Base Models Proprietary/Open Source, Large/Small
openpipe.ai
1 comment
2 years ago
billmalarky
2 points
17.
▲
Open Deep Research Tutorial – Train a deep research agent to exceed SOTA
art.openpipe.ai
discuss
10 months ago
rahimnathwani
2 points
18.
▲
Fine-Tuning Best Practices: Models
openpipe.ai
discuss
2 years ago
gk1
2 points
19.
▲
Fine-Tuning for Production Apps
openpipe.ai
discuss
2 years ago
ijidak
2 points
20.
▲
LLM Fine-Tuning Best Practices for Training Data Curation
openpipe.ai
2 comments
2 years ago
billmalarky
1 points
21.
▲
Summary-RL
openpipe.ai
discuss
a year ago
s16h
1 points
22.
▲
DPO fine-tuning outperforms SFT
openpipe.ai
discuss
2 years ago
kcorbitt
1 points
23.
▲
OpenPipe
openpipe.ai
discuss
2 years ago
handfuloflight
1 points
24.
▲
Mixtral Curious? Comparing Mistral 7B and Mixtral for fine-tuning
openpipe.ai
discuss
2 years ago
kcorbitt
1 points
25.
▲
S-LoRA: Serving Thousands of Models from One GPU for Fun and Profit
openpipe.ai
discuss
2 years ago
kcorbitt
1 points
26.
▲
Show HN: OpenPaper – Understand Papers Using AI (Open-Source)
openpaper.ai
discuss
a year ago
sabaimran
3 points