Hacker News
Gemini 3 Deep Think drew me a good SVG of a pelican riding a bicycle
segmondy
|next
[-]
I just asked Gemini pro to generate an SVG of an octopus dunking a basketball and it did a great job. Not even Deep Think model. Then I did "generate an svg of raccoon at a beach drinking a beer" you can go try this out yourself. Ask it to generate anything you want in SVG. use your imagination.
Rant: This is why AI is going to take over, folks are not even trying the least.
JumpCrisscross
|root
|parent
|next
[-]
Kagi Assistant remains my main way of interacting with AI. One of its benefits is you're encouraged to try different models.
The heterogeneity in competence, particular per unit in time, is growing rapidly. If I'm extrapolating image-creation capabilities from Claude, I'm going to underestimate what Gemini can do without fuckery. Likewise, if I'm using Grok all day, Gemini and Claude will seem unbelievably competent when it comes to deep research.
raincole
|root
|parent
|next
|previous
[-]
irthomasthomas
|root
|parent
|next
|previous
[-]
WarmWash
|root
|parent
|next
|previous
[-]
bayindirh
|root
|parent
|next
|previous
[-]
I don't think they "rigged" it, but might be given a bit more push on that part since it's going for a very long time now.
Another benchmark is going on at [0]. It's pretty interesting. A perfect scoring model "borks" in the next iteration, for example.
> Rant: This is why AI is going to take over, folks are not even trying the least.
It might be drawing things alright, at least some cases. I seldom use it when my hours long researches doesn't take me to the place I want, and guess what? AI can't go there, either. It hallucinates things, makes up stuff, etc. For a couple of things I asked, it managed to find a single reference, and it was the thing I was looking for, so it works rarely in my cases.
Rant: This is why people are delusional. They test the happy path and claims it knows all the paths, and then some.
colecut
|root
|parent
|next
|previous
[-]
Some people try, most people don't.
AI makes doing almost anything easier for the people that do..
Despite the prophesied near-term obliteration of white collar work, I've never felt luckier to work in software.
vessenes
|next
|previous
[-]
He originally promised to generate a bunch more animals when we got a “good” pelican. This is not a good pelican. This is an OUTSTANDING pelican, a great bicycle, and it even has a little sun ray over the ocean marked out. I’d like to see more animals please Simon!
hnuser123456
|root
|parent
|next
[-]
romanhn
|root
|parent
|next
|previous
[-]
alterom
|root
|parent
|previous
[-]
It's not. Sorry.
Go look at some real bicycles for reference.
sdenton4
|root
|parent
|next
[-]
rustyhancock
|next
|previous
[-]
bonesss
|root
|parent
|next
[-]
Which is only to say: if we HN-front-page it, they will come (generate).
stared
|root
|parent
|next
|previous
[-]
A pelican on a bike is SFW, inclusive, yet cool.
It is not a full benchmark - rather a litmus test.
oplav
|root
|parent
[-]
There’s also the foreman for video: https://youtube.com/watch?v=0cdM-7_xUXM
bayindirh
|root
|parent
|next
|previous
[-]
JumpCrisscross
|root
|parent
|next
[-]
But it's still a fair target. Unless it's hard coded into Gemini 3 DT, for which we have no evidence and decent evidence against, I'd say it's still informative.
rcarmo
|next
|previous
[-]
staticassertion
|root
|parent
[-]
WarmWash
|next
|previous
[-]
From the blog:
>The strongest argument is that they would get caught. If a model finally comes out that produces an excellent SVG of a pelican riding a bicycle you can bet I’m going to test it on all manner of creatures riding all sorts of transportation devices. If those are notably worse it’s going to be pretty obvious what happened.
He mentioned in the Deep Think thread the other day that his secret test set also was impressive.
alestainer
|next
|previous
[-]
Springtime
|next
|previous
[-]
bfung
|next
|previous
[-]
aidos
|next
|previous
[-]
stephc_int13
|next
|previous
[-]
manojlds
|next
|previous
[-]
kittbuilds
|next
|previous
[-]
What I find interesting is that Deep Think's chain-of-thought approach helps here — you can actually watch it reason about where the pedals should be relative to the wheels, which is something that trips up models that try to emit the SVG in one shot. The deliberative process maps well to compositional visual tasks.