HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
601.
▲
Scraping and Extracting the Cablegate HTML in Python
github.com/typecode
discuss
16 years ago
amahon
8 points
602.
▲
Tadpole the Language for Scraping 0.2.0 – Complex Control Flow, Stealth and More
2 comments
4 months ago
zachperkitny
7 points
603.
▲
Domharvest: Semantic web scraping that survives DOM changes"
github.com/domharvest
2 comments
5 months ago
DomHarvest
7 points
604.
▲
Show HN: estela, a modern elastic web scraping cluster
github.com/bitmakerla
discuss
4 years ago
breno
7 points
605.
▲
Show HN: A Node.js script powered by Puppeteer for undetectable web scraping
github.com/darkotodoric
2 comments
2 years ago
darkotodoric
6 points
606.
▲
Crawl4AI: Open-Source Web Crawler for Seamless AI Data Scraping
github.com/unclecode
discuss
2 years ago
ProbeCraft
6 points
607.
▲
Show HN: Tool for scraping profile pictures from various social networks
github.com/tusharmakkar08
discuss
10 years ago
tusharmakkar08
6 points
608.
▲
H4ckf0r0day/obscura: The headless browser for AI agents and web scraping
github.com/h4ckf0r0day
4 comments
2 months ago
rezaprima
5 points
609.
▲
I wrote 280 articles about web scraping. Here's their index grouped by tag
github.com/TheWebScrapingClub
discuss
a year ago
PigiVinci83
5 points
610.
▲
Show HN: Scrapping Facebook with JavaScript
github.com/victorqribeiro
discuss
8 years ago
atum47
5 points
611.
▲
Automatic programming by scraping stack overflow answers
github.com/MilesCranmer
discuss
9 years ago
dcjaxx
5 points
612.
▲
Show HN: CLI for Email Scraping from Websites
github.com/lawzava
1 comment
6 years ago
hitchnsmile
4 points
613.
▲
Show HN: Skyscraper, a Clojure library for structural scraping of whole sites
github.com/nathell
1 comment
11 years ago
nathell
4 points
614.
▲
Opting out (of public chatroom scraping for archive.matrix.org)
github.com/matrix-org
discuss
3 years ago
rapnie
4 points
615.
▲
Tell HN: Flo Hiring scraping GitHub emails for recruiting spam
discuss
4 years ago
richbell
4 points
616.
▲
Geziyor, a fast web crawling and scraping framework for Go
github.com/geziyor
discuss
7 years ago
jinqueeny
4 points
617.
▲
Show HN: Scraping recipes to get live radio metadata
github.com/dest4
discuss
9 years ago
dest
4 points
618.
▲
Real-world web scraping exercises in Python 3 for data journalists
github.com/compjour
discuss
11 years ago
danso
4 points
619.
▲
Show HN: caniscrape – Analyze anti-bot protections before scraping (CLI and Web)
github.com/ZA1815
2 comments
8 months ago
Crroak
3 points
620.
▲
Ulixee Hero - The web browser built for scraping
github.com/ulixee
2 comments
4 years ago
icyfox
3 points
621.
▲
Raspberry Pi LTE proxy for scraping and Kubernetes
github.com/paulcjh
1 comment
2 years ago
paulcjh
3 points
622.
▲
GoScrapy: Harnessing Go's power for fast web scraping
github.com/tech-engine
1 comment
2 years ago
thunderbong
3 points
623.
▲
Show HN: Automated stripping of CSS/JS to improve performance audits
github.com/AlexChesser
1 comment
5 years ago
AlexC04
3 points
624.
▲
Rvest: Simple web scraping for R
github.com/hadley
discuss
11 years ago
claywm
3 points
625.
▲
Show HN: Multi-threaded JavaScript scrapping/API tool
github.com/valarauca
discuss
12 years ago
valarauca1
3 points
626.
▲
Show HN: Squache – A self-hosted HTTPS caching proxy for web scraping
github.com/devrupt-io
discuss
6 months ago
ddtaylor
3 points
627.
▲
Show HN: Univer Clipsheet – A Powerful Chrome Extension for Web Scraping
github.com/dream-num
discuss
a year ago
alexliu518
3 points
628.
▲
Scraping ETF historical holdings data
github.com/sap215
discuss
a year ago
sampass
3 points
629.
▲
Lightweight library for scraping web-sites with LLMs
github.com/raznem
discuss
2 years ago
thelastgallon
3 points
630.
▲
TWINT: Twitter scraping tool evading most API limitations
github.com/twintproject
discuss
8 years ago
wslh
3 points
More