rafaepta

Born on January 22, 2021•419 Karma

2 days ago

•on: Ask HN: How do you separate intentional test boile...

Yeah, that is pretty much what it does already: it tries to recognize test files and skip them. Dupehound is available for 12 languages Today.

Some languages like RUst you mentioned, have a clear tag that says "this is a test," but others do not, so the tool has to guess from file names and ends up missing some and skipping too much.

Also as I mentioned on the answer below, sometimes you actually do want to see the repeats inside tests, or normal code repeats on purpose too. So I am leaning toward letting users wave off one specific case by hand instead of skipping everything blindly.

rafaepta•

2 days ago

•on: Ask HN: What tools are you using for AI-assisted c...

Using dupehound for identifying duplicated code.

What I use for: I use for identifying duplicated code. It is deterministic, doesn't use AI, offline, runs from CLI and is super fast (and free).

What I dislike: I won't say it I dislike, but it is not a tool that does all the jobs of a code review. For instance, it doesn't flag security issues. It is superfocused on code duplication (it performs better than Sonar for this use case) and is specifically useful for large codebases. Disclaimer: I am one of the collaborators, so take it with a grain of salt https://github.com/Rafaelpta/dupehound

rafaepta•

5 days ago

•on: Terminal UIs Are an Abomination. AI Needs Better U...

agreed, but imo terminal and any conversational interface is not a silver bullet. Example: signup up for a ai assistant that lives in the email. Can't cancelled it bcs the founder assumed that since you ask anything in natural language every problem the user might have is solved. This is just pushing problems to users.

rafaepta•

7 days ago

•on: Not everyone is using AI for everything

So true, just built a deterministic system to identify duplicated code. It's offline and doesn't use AI on purpose, since a gate that blocks your CI has to give the exact same answer every time, and finding dupes means comparing every function against every other (that's index work). It does NOT use AI. But ironically, I used AI to build it (https://github.com/Rafaelpta/dupehound )

rpdillon•

7 days ago

> But ironically, I used AI to build it

This is a pattern I encourage - the AI might not be reliable, but with coaching, it can produce reliable tools. `colordiff` was causing issues with `less` when I was looking at diffs (character encoding issues I think), and when I asked Kimi K2.6 what to do, it built me a rust command-line diff tool in one shot that I've been using ever since (it even downloaded rust, wrote the tool, and compiled it).

NathanaelRea•

7 days ago

Have you seen jscpd? What does your tool do differently?

rafaepta•

9 days ago

•on: Finding code duplicated by AI without AI

I think "deterministic" is going to become a feature label the way "offline" and "no tracking" did.

rafaepta•

3 months ago

•on: Files are the interface humans and agents interact...

Great read. Thanks for sharing

rafaepta•

5 months ago

•on: Ask HN: What's the oldest piece of code still runn...

watches age well

rafaepta•

11 months ago

•on: The AI Replaces Services Myth

This article misses the point. It is not about AI replacing workers, but about AI bringing more ROI. Can an AI convert twice as many customers as a $4k salesperson? It is reasonable to say that in a B2C setting, YES. I've seen that. Better SLA, fast responses during weekends, better adherence to existing playbooks, mapping out objections that are not in the playbook, and suggesting updates for the same prompt. In one week, the playbook evolved, and today we are converting more customers than the sales team. Does it capture the value of the $4k usd sales person ? If the ROI is superior, yes. Will I pay for it? That is a different story (we developed this ourselves).

rafaepta•

11 months ago

•on: Hiring for a job that doesn't exist yet

Thanks for flagging this. I do need a better way to explain what we do.

scarface_74•

11 months ago

It doesn’t matter. You are using LLMs to create AI slop and calling it “content”.

rafaepta•

11 months ago

Not really, that is not what we do.

scarface_74•

11 months ago

“Instead of relying on generic GPT blog posts tools, external agencies, or interns, we provide a proprietary content engine that learns from client’s data, builds a data-driven content strategy, and publishes high-quality content automatically bringing organic traffic from Google and AI search.”

What is an AI powered content engine then?

rafaepta•

11 months ago

•on: The Anatomy of Quick Wins

Systems always win. If you don't have anything in place, quick wins are just a distraction.