Astro Hacker News - Readings in Database Systems (5th Edition) (2015)

vvern |next [-]

About time for the 6th Edition, eh? What would folks include in it?

- Vector databases and hybrid search?

- Object storage for all the things? Lake houses. Parquet and beyond.

- Continuously materialized views? I'm not sure this one has made the splash but I think about Naiad (Materialize) and Noria (Readyset)

- NewSQL went mostly mainstream (Spanner wasn't included in the last one, but there's been more here with things like CockroachDB, TiDB, etc)

teleforce |root |parent |next [-]

Definitely they should include D4M and GraphQL [1],[2].

Not only D4M can cater for structured relational data, it's also suitable for non-structured and sparse data in spreadsheet, matrices and graph. It's essentially a generalization of SQL but for all things data.

There's also integration of D4M with SciDB [3].

[1] D4M: Dynamic Distributed Dimensional Data Model:

https://d4m.mit.edu/

[2] GraphQL:

https://graphql.org/

[3] D4M: Bringing associative arrays to database engines:

https://arxiv.org/abs/1508.07371

kwillets |root |parent |next |previous [-]

The object storage stuff is new, but it's mostly confirmed that the older architecture works. MPP with shared (S3) storage and everything above that on local SSD and compute delivers the best performance. Even Snowflake finally came out with "interactive" warehouses with this architecture.

Parquet, Iceberg, and other open formats seem good, but they may hit a complexity wall. There's already some inconsistency between platforms, eg with delete vectors.

Incremental view maintenance interests me as well, and I would like to see it more available on different platforms. It's ironic that people use dbt etc. to test every little edit of their manually coded delta pipelines, but don't look at IVM.

B1FF_PSUVM |root |parent |previous [-]

LLMs as DBs (if you squint hard enough)

gnabgib |next |previous [-]

(2015) Popular in:

2020 (225 points, 30 comments) https://news.ycombinator.com/item?id=15436647

2017 (247 points, 44 comments) https://news.ycombinator.com/item?id=15436647

2015 (189 points, 37 comments) https://news.ycombinator.com/item?id=10694538

WalterGR |next |previous [-]

Before spidering the site for offline reading, be aware:

“Rather than secure rights to the recommended papers, we have simply provided links to Google Scholar searches that should help the reader locate the relevant papers.”

sam_lowry_ |root |parent |next [-]

Why not to Scihub?

xpe |root |parent |next [-]

Sci-Hub rules:

    1. You do not talk about Sci-Hub.
    2. You do NOT talk about Sci-Hub.
    3. If a download says "Stop," goes limp,
       or taps out, that download is over. 
    4. Only two tries per mirror. 
    5. One download at a time. 
    6. Shirt and shoes optional. 
    7. Downloads will continue until publicly funded
       research is widely distributed. 
    8. If this is your first time at Sci-Hub, you
       have to download something interesting,
       actually read at least part of it, learn
       something, and then fight ignorance and/or
       stupidity with it.

WalterGR |root |parent |previous [-]

Then you were successfully beworn.

nine_k |root |parent |previous [-]

A perfect task for an AI agent, BTW.

rodolphoarruda |next |previous [-]

Amazing: the website's index page has the book's index in it. While this makes perfect sense, it's a kind of a feature that is becoming rare in today's tech book websites which display all sorts of marketing fluff, social confirmations etc and not the structure of the book itself.

zingar |next |previous [-]

How does this stack up in 2025/6?

ctxc |next |previous [-]

Oh well...

https://ibb.co/BVrzQRWH

testdelacc1 |root |parent |next [-]

Wonder why. Did they confuse this with Maoist literature (Little Red Book)?

ctxc |root |parent [-]

Hmm maybe, a blooper on their part

I just switched networks (wifi/mobile) and it worked, only that provider seems to block it

nrhrjrjrjtntbt |root |parent |previous [-]

Peek:

Readings in Database Systems (commonly known as the "Red Book") has offered readers an opinionated take on both classic and cutting-edge research in the field of data management since 1988. Here, we present the Fifth Edition of the Red Book — the first in over ten years. CHAPTERS Preface [HTML] [PDF] Background introduced by Michael Stonebraker [HTML] [PDF] Traditional RDBMS Systems introduced by Michael Stonebraker [HTML] [PDF] Techniques Everyone Should Know introduced by Peter Bailis [HTML] [PDF] New DBMS Architectures introduced by Michael Stonebraker [HTML] [PDF] Large-Scale Dataflow Engines introduced by Peter Bailis [HTML] [PDF] Weak Isolation and Distribution introduced by Peter Bailis [HTML] [PDF] Query Optimization introduced by Joe Hellerstein [HTML] [PDF] Interactive Analytics introduced by Joe Hellerstein [HTML] [PDF] Languages introduced by Joe Hellerstein [HTML] [PDF] Web Data introduced by Peter Bailis [HTML] [PDF] A Biased Take on a Moving Target: Complex Analytics by Michael Stonebraker [HTML] [PDF] A Biased Take on a Moving Target: Data Integration by Michael Stonebraker [HTML] [PDF] Complete Book: [HTML] [PDF] Readings Only: [HTML] [PDF] Previous Editions: [HTML]

|root |parent [-]

herodoturtle |next |previous [-]

redbook.io huh?

Some might argue the Red Book to be “NSA Trusted Networks” a.k.a the ugly red book that won't fit on the shelf.

Crash & Burn <3

layer8 |root |parent |next [-]

There are a lot of red books: https://en.wikipedia.org/wiki/Red_Book

hashhar |root |parent |previous [-]

Redbook is also the Audio CD standard. Lots of redbooks exist.