HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
121.
▲
Show HN: WarcDB: Web crawl data as SQLite databases
github.com/Florents-Tselai
30 comments
4 years ago
fforflo
171 points
122.
▲
Show HN: Bash Dungeon – An educational dungeon crawler in the shell
github.com/wolandark
25 comments
2 years ago
wolandark
164 points
123.
▲
Yahoo open sources Anthelion web crawler for parsing structured data
github.com/yahoo
9 comments
11 years ago
fangwang
159 points
124.
▲
Show HN: Voyager – write a web crawler/scraper as a state machine in Rust
github.com/mattsse
11 comments
5 years ago
matsche
110 points
125.
▲
Katana: A crawling and spidering framework
github.com/projectdiscovery
25 comments
4 years ago
feross
99 points
126.
▲
Show HN: Apify SDK – A scalable web crawling and scraping library for JavaScript
github.com/apifytech
8 comments
8 years ago
jancurn
78 points
127.
▲
Show HN: Nebula – A network agnostic DHT crawler
github.com/dennis-tra
22 comments
2 years ago
dennis-tra
68 points
128.
▲
Show HN: wxpath – Declarative web crawling in XPath
github.com/rodricios
9 comments
5 months ago
rodricios
64 points
129.
▲
Show HN: An open-source rhythm dungeon crawler in 16 x 9 pixels
github.com/jgalecki
11 comments
a year ago
jgalecki
55 points
130.
▲
Show HN: I wrote a tiny Python-based HN crawler with scrapy
github.com/mvanveen
28 comments
14 years ago
mvanveen
53 points
131.
▲
Gerapy: Distributed Crawler Management Framework Based for Scrapy
github.com/Gerapy
discuss
6 years ago
r_singh
49 points
132.
▲
Show HN: crawl a website and store it in S3 from your browser
github.com/spullara
12 comments
15 years ago
spullara
43 points
133.
▲
Google Play Store in Numbers. Open Source Crawler for Mobile Apps Data
github.com/MarcelloLins
15 comments
12 years ago
marcellolins
39 points
134.
▲
Using Node.js and JQuery to Crawl Public Tweets
github.com/bcoe
13 comments
14 years ago
BenjaminCoe
35 points
135.
▲
Show HN: A modular, durable web-crawler for Clojure
github.com/shriphani
1 comment
10 years ago
shriphani
29 points
136.
▲
PiCrawler: A distributed web crawler using PiCloud
github.com/studio-ousia
5 comments
13 years ago
ikuyamada
24 points
137.
▲
Show HN: Yomuco – A simple web crawling library for Node.js
github.com/andraindrops
3 comments
2 years ago
jtakahashi64
23 points
138.
▲
Ask HN: Are you running a web crawler off the following IPs? It's broken
4 comments
13 years ago
latitude
22 points
139.
▲
A New Web Archival Crawler Tackling Storage+Fidelity Issues
github.com/goelayu
1 comment
4 years ago
systemskid
22 points
140.
▲
Show HN: SpiderSuite: Advance GUI web security crawler
github.com/3nock
2 comments
3 years ago
3nock
19 points
141.
▲
Show HN: EndzinSrc – Wikipedia web crawler and PageRank algorithm implementation
github.com/ciganche
discuss
8 years ago
lsr_ssri
17 points
142.
▲
Show HN: (1 day project) I crawled +50k subreddits and made an interactive graph
github.com/ghgr
discuss
8 years ago
ghgr
14 points
143.
▲
Harvestman - Quick and dirty web crawling
github.com/mion
2 comments
13 years ago
mion
10 points
144.
▲
Show HN: A Links Crawler for News
github.com/egcodes
discuss
6 years ago
egcodes
10 points
145.
▲
Show HN: A web crawler that builds word frequency lists for websites
github.com/calebwin
discuss
8 years ago
calebhwinston
9 points
146.
▲
Show HN: Craigslist web crawler example in python3 and docker-compose
github.com/estin
3 comments
10 years ago
etatarkin
8 points
147.
▲
Dungeon-mode: a dungeon crawler game for Emacs
github.com/dungeon-mode
discuss
6 months ago
dustfinger
8 points
148.
▲
Show HN: I have written a cloud native dark web crawler in Go
github.com/creekorful
7 comments
5 years ago
creekorful
7 points
149.
▲
Tech-News Web-Crawler, Built on Node.js and jQuery
github.com/bcoe
discuss
13 years ago
BenjaminCoe
7 points
150.
▲
A spider crawl all room info of airbnb ,include reservation of the room
github.com/plantpark
discuss
10 years ago
plantpark
7 points
More