Hacker News
Designing Predictable LLM-Verifier Systems for Formal Method Guarantee
brantmv
|next
[-]
Did I misread or miss something?
brantmv
|root
|parent
[-]
For those reasons, the grandiose name "LLM-Verifier Convergence Theorem" does not sit well with me.
werf456
|next
|previous
[-]
mapontosevenths
|next
|previous
[-]
"We prove that for any non-zero stage success probability, the system reaches the verified state almost surely"
What's the point if its still stochastic?
jaggederest
|root
|parent
|next
[-]
So like, imagine that you had some finite list of integers, and you were picking a random number from 0 to infinity - because the domain is infinite, any finite set has 0 probability, but that doesn't mean it doesn't exist.
mapontosevenths
|root
|parent
[-]
The ability to deterministcly identify that code eventually reaches a halting state, implies that we can use these stochastic tools to generate deterministic outcomes reliably in the future doesn't it?
jaggederest
|root
|parent
[-]
MiniMax42
|root
|parent
[-]
Well, technically, no chance of failure. The chance of failure is absolute zero. Not close to zero, absolute zero. There will be no failure if the assumptions of the model are correct.
The real catch here is in the assumptions.
How long do you have before you need to have a solution? An hour, a year, a century? Too bad, almost sure convergence only provides a guarantee if you wait an infinite amount of time.
And then there's the question of the probability space you assume. (The sigma algebra.) Which things do you assume to have probability zero from the start and is that realistic?
mapontosevenths
|root
|parent
[-]
Thanks for this. I was actually just thinking "this can't actually work, it would mean P vs NP is solved." Of course, this explains why it doesn't mean that.
edwardtay
|previous
[-]
XzAeRosho
|root
|parent
|next
[-]
From what I understood, this validates the output correctness, not that the output aligns with the user goals, so there's still room for the LLM to get the user goals wrong, and this is only to validate the mathematical consistency between the output code and the formal specification (in this paper, within the ESBMC framework for C++ code).
So it's kind of tight scoped, in this case, but I think it points in the right direction for coding assistants, which usually get some language primitives wrong.
westurner
|root
|parent
|previous
[-]
/? TLA LLM https://hn.algolia.com/?dateRange=all&page=0&prefix=false&qu... : 1 submission, ~20 comments
"AI will make formal verification go mainstream" (2025-12) https://news.ycombinator.com/item?id=46294574