Hacker News
OpenData Vector: MIT-Licensed Vector Search on Object Storage
oliverio
Inarticulately put: how close is OpenData Vector to Turbopuffer in terms of performance today, and where are the major gaps and mountains to scale?
Really excited to keep an eye on the repos, great read!
rohanpdes
apurvamehta
We're heavily inspired by Turbopuffer. I'd say we are comparable to them when they launched in terms of perf and scale. But they've obviously invested heavily since then, so we're not going to match them on raw perf at scale right now. Our goal is to be a pretty competitive OSS offering over the long term though.
The next biggest lift for us to get much closer is quantization. If we squeeze more signal into fewer bits, we will improve performance end to end.
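To make the "more signal into fewer bits" point concrete, here is a minimal sketch of per-vector 8-bit scalar quantization in plain Python. It is not OpenData Vector's or Turbopuffer's implementation: the function names (`quantize_u8`, `dequantize_u8`) and the min/max scaling scheme are assumptions chosen purely for illustration, and production systems typically use more elaborate schemes.

```python
import random
import struct


def quantize_u8(vec):
    """Map each float to an unsigned 8-bit code using per-vector min/max scaling.

    Hypothetical helper for illustration only.
    """
    lo, hi = min(vec), max(vec)
    # Step size between adjacent codes; fall back to 1.0 for constant vectors.
    scale = (hi - lo) / 255 or 1.0
    # Clamp to 0..255 to guard against floating-point rounding at the edges.
    codes = bytes(min(255, max(0, round((x - lo) / scale))) for x in vec)
    return codes, lo, scale


def dequantize_u8(codes, lo, scale):
    """Reconstruct approximate floats from the 8-bit codes."""
    return [lo + c * scale for c in codes]


random.seed(0)
dim = 128
vector = [random.uniform(-1.0, 1.0) for _ in range(dim)]

codes, lo, scale = quantize_u8(vector)
approx = dequantize_u8(codes, lo, scale)

# Storage cost: float32 per dimension vs. one byte per dimension
# plus two float32 parameters (lo and scale).
float32_bytes = len(struct.pack(f"{dim}f", *vector))
quantized_bytes = len(codes) + 2 * 4

# Worst-case reconstruction error per component is half a quantization step.
max_error = max(abs(a - b) for a, b in zip(vector, approx))

print(f"float32: {float32_bytes} bytes, quantized: {quantized_bytes} bytes")
print(f"max per-component error: {max_error:.5f} (step = {scale:.5f})")
```

The trade-off in this sketch is roughly a 4x reduction in bytes read per vector in exchange for a bounded per-component error, which matters when every query has to pull vectors from object storage or a cache.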
Reubend
Is that not the case for DBs based on object storage because they cache data before sending it to the object storage? Or because they do some other processing on the DB server before it hits storage?