HK
Heykuki News
Top
New
Best
Ask
Show
Jobs
Toggle theme
Top
New
Best
Ask
Show
Jobs
Request
61.
▲
Data-Intensive Text Processing with MapReduce (Free Book)
lintool.github.com
1 comment
14 years ago
wicknicks
4 points
62.
▲
Show HN: Alex – Catch insensitive, inconsiderate writing
github.com/wooorm
1 comment
11 years ago
wooorm
4 points
63.
▲
Show HN: Ragctl – document ingestion CLI for RAG (OCR, chunking, Qdrant)
github.com/datallmhub
discuss
6 months ago
ahsekka
4 points
64.
▲
QuestDB is an open source time-series database for fast ingest and SQL queries
github.com/questdb
discuss
2 years ago
tosh
4 points
65.
▲
Juvia: OSS commenting server in Rails 3 ala Disqus and IntenseDebate
github.com/phusion
discuss
14 years ago
FooBarWidget
4 points
66.
▲
Show HN: RAG-Ready Extractor – Structure-aware ingestion with semantic scoring
github.com/CarlosManuelDiaz
1 comment
4 months ago
cddIT
3 points
67.
▲
FishStore: A new storage layer for fast ingestion and indexing
github.com/microsoft
1 comment
7 years ago
skyprophet
3 points
68.
▲
Story Engine – High-Intensity Strategic Simulation Test Report
gist.github.com
discuss
16 days ago
field_reader
3 points
69.
▲
Data-Intensive Text Processing with MapReduce
lintool.github.com
discuss
14 years ago
acqq
3 points
70.
▲
Treating cancer with low-intensity ultrasound (2023) [pdf]
github.com/OpenwaterHealth
discuss
a year ago
mpweiher
3 points
71.
▲
Code Repo-Prep for LLM Ingestion
github.com/jimmc414
discuss
3 years ago
homarp
3 points
72.
▲
Rust for Data-Intensive Computation
github.com/frankmcsherry
discuss
6 years ago
mark4
3 points
73.
▲
Designing Data-Intensive Applications Book's References
github.com/ept
discuss
6 years ago
patternexon
3 points
74.
▲
Brim: Open-source Electron App, adds Linux, Zeek log ingest
github.com/brimsec
discuss
6 years ago
siskojr
3 points
75.
▲
FishStore: A fast ingestion and querying layer for flexible-schema data
github.com/microsoft
discuss
7 years ago
ngaut
3 points
76.
▲
Catch insensitive, inconsiderate writing
github.com/wooorm
discuss
9 years ago
febin
3 points
77.
▲
Show HN: Legion – Prepare Messy Data for RDBMS Ingestion with Hadoop MapReduce
github.com/republicwireless-open
discuss
10 years ago
dbatten
3 points
78.
▲
Tell HN: Twitch.tv now supports WebRTC ingestion (Broadcast from your browser)
5 comments
3 years ago
Sean-Der
2 points
79.
▲
Show HN: I'm a non-coder who turns 1-line intents into full-stack blueprints
3 comments
a year ago
TulioKBR
2 points
80.
▲
Show HN: DocuFlow – open-source event-driven AI invoice ingestion pipeline
github.com/Shashank0701-byte
2 comments
5 months ago
Shashank0701
2 points
81.
▲
Show HN: Witral: Self-hosted framework to ingest WhatsApp into Markdown/Obsidian
github.com/kirlts
1 comment
5 months ago
kirlts
2 points
82.
▲
MQTT Ingest Module for Weewx
github.com/kroy-the-rabbit
1 comment
a year ago
k_roy
2 points
83.
▲
GriSeis: Frequency data from GB electrical grid and carbon intensity
github.com/JamesTwallin
1 comment
a year ago
alibarber
2 points
84.
▲
Rust intends to force unsafe blocks in unsafe functions
github.com/rust-lang
1 comment
3 years ago
Subsentient
2 points
85.
▲
Stepping – New open source framework for Data intensive projects
github.com/imperva
1 comment
4 years ago
gabrielbeyo
2 points
86.
▲
Use Python-like indents in C
github.com/zhuzhuor
discuss
13 years ago
albertzeyer
2 points
87.
▲
Show HN: TKeeper – policy-governed, signed intents for autonomous systems
github.com/tkeeper-org
discuss
12 days ago
_qnt
2 points
88.
▲
Show HN: Incremental RAG ingestion, only changed chunks get re-embedded
github.com/shamikhan005
discuss
14 days ago
shamikhan005
2 points
89.
▲
Show HN: LogsGo - an experimental log ingestion/query project I built to learn
discuss
2 months ago
SaumyaCodes
2 points
90.
▲
References for "Designing Data-Intensive Applications, 2nd Edition"
github.com/ept
discuss
7 months ago
cyndunlop
2 points
More