Hacker News
Enabling Codex to Analyze Two Decades of Hacker News Data
zeroxfe
|next
[-]
- inspect and understand the downloaded data in directory /path/..., then come up with an sqlite data model for doing detailed analytics and ingest everything into an sqlite db in data.sqlite, and document the model in model.md.
Then you can query the database adhoc pretty easily with codex prompts (and also generate PDF graphs as needed.)
I typically use the highest reasoning level for the initial prompt, and as I get deeper into the data, continuously improve on the model, indexes, etc., and just have codex handle any data migration.
Brajeshwar
|next
|previous
[-]
Out of curiosity, I tinkered with it using Claude to see trends and patterns (I did find a few embarrassing things about me!).
RockyMcNuts
|next
|previous
[-]
(since AI can now answer many questions that might have been topics of conversation; people can use AI to participate; people may be reluctant to participate if AI can data mine everything and link it back to them, etc. similar to Stack Overflow)
mike_hearn
|next
|previous
[-]
ronfriedhaber
|root
|parent
[-]
> "how Modolap differs from just asking AI to use any other OLAP engine"
There presently exist two components, the OLAP query engine and the remote infrastructure service. The service enables systems like Codex (or developers as well) to manage datasets, maintain version control over queries, and offload the computational burden to dedicated machines. This is especially beneficial given the current trend of running agents inside micro-VMs.
In addition, it is designed with AI usage in mind. There is significant value in co-design. One could argue that models can use Polars or DuckDB just as well, and that there is no room for improvement, but I do not think this is true.
sd9
|next
|previous
[-]
hakrgrl
|next
|previous
[-]
hakrgrl
|root
|parent
[-]
Would be interesting to see how many comments violate the guidelines over time. https://news.ycombinator.com/newsguidelines.html
voidUpdate
|next
|previous
[-]
xnorswap
|next
|previous
[-]
Am I reading that right?
SyneRyder
|root
|parent
|next
[-]
The latter seems easier to achieve. To borrow from another internet rule of thumb: "As an online discussion grows longer, the probability of someone mentioning Claude approaches one."