Model breakdown
openai/gpt-5.4-mini
Profile
OpenAI: GPT-5.4 MiniOpenAI
Release
Not provided by OpenRouterAvailable on OpenRouter
Specs
Not provided by OpenRouter400,000 tokens
Capabilities
File + Image + TextNot provided by OpenRouter
Training
Not provided by OpenRouterOpenAI
Rank#28
-753.8alignment score
75.4%crowd match
Mean gap24.6%
Human match75.4%
Best fitPickle Sandwich
Average vote55.6%
55.6%model yes
62.8%human yes
Workload2K evals
2Kevals
100iterations
1.8Mtokens
Photo-by-photo

Model Results

Breaking down how close the model answered each question, compared to humans.

Dodge Van
Photo 02Dodge Van
openai/gpt-5.4-mini
0.0% yes100.0% no
Gap7.0%
Model readLeans no
Sub Sandwich
Photo 03Sub Sandwich
openai/gpt-5.4-mini
100.0% yes0.0% no
Gap5.5%
Model readLeans yes
openai/gpt-5.4-mini
0.0% yes100.0% no
Gap40.9%
Model readLeans no
openai/gpt-5.4-mini
100.0% yes0.0% no
Gap4.4%
Model readLeans yes
openai/gpt-5.4-mini
1.0% yes99.0% no
Gap53.2%
Model readLeans no
Hamburger
Photo 08Hamburger
openai/gpt-5.4-mini
98.0% yes2.0% no
Gap25.0%
Model readLeans yes
Hot Dog
Photo 10Hot Dog
openai/gpt-5.4-mini
8.0% yes92.0% no
Gap31.8%
Model readLeans no
openai/gpt-5.4-mini
64.0% yes36.0% no
Gap1.6%
Model readLeans yes
Avocado Tea
Photo 12Avocado Tea
openai/gpt-5.4-mini
100.0% yes0.0% no
Gap7.2%
Model readLeans yes
Panini
Photo 13Panini
openai/gpt-5.4-mini
100.0% yes0.0% no
Gap7.6%
Model readLeans yes
Cookie PB
Photo 14Cookie PB
openai/gpt-5.4-mini
0.0% yes100.0% no
Gap51.5%
Model readLeans no
Chicken Wrap
Photo 15Chicken Wrap
openai/gpt-5.4-mini
71.0% yes29.0% no
Gap48.4%
Model readLeans yes
openai/gpt-5.4-mini
0.0% yes100.0% no
Gap66.3%
Model readLeans no
Sloppy Joe
Photo 17Sloppy Joe
openai/gpt-5.4-mini
100.0% yes0.0% no
Gap20.6%
Model readLeans yes
openai/gpt-5.4-mini
100.0% yes0.0% no
Gap44.3%
Model readLeans yes
Bagel PB&J
Photo 20Bagel PB&J
openai/gpt-5.4-mini
35.0% yes65.0% no
Gap11.6%
Model readLeans no
PhotoVote SplitHuman responseGapRead
openai/gpt-5.4-mini
100.0% yes0.0% no
3.7%absolute gap
Leans yesPeople mostly said yes
Dodge Van
Photo 02Dodge Van
openai/gpt-5.4-mini
0.0% yes100.0% no
7.0%absolute gap
Leans noPeople mostly said no
Sub Sandwich
Photo 03Sub Sandwich
openai/gpt-5.4-mini
100.0% yes0.0% no
5.5%absolute gap
Leans yesPeople mostly said yes
openai/gpt-5.4-mini
0.0% yes100.0% no
40.9%absolute gap
Leans noHuman knife-edge
openai/gpt-5.4-mini
100.0% yes0.0% no
4.4%absolute gap
Leans yesPeople mostly said yes
openai/gpt-5.4-mini
100.0% yes0.0% no
8.3%absolute gap
Leans yesPeople mostly said yes
openai/gpt-5.4-mini
1.0% yes99.0% no
53.2%absolute gap
Leans noHuman knife-edge
Hamburger
Photo 08Hamburger
openai/gpt-5.4-mini
98.0% yes2.0% no
25.0%absolute gap
Leans yesSplit concept
openai/gpt-5.4-mini
36.0% yes64.0% no
23.4%absolute gap
Leans noHuman knife-edge
Hot Dog
Photo 10Hot Dog
openai/gpt-5.4-mini
8.0% yes92.0% no
31.8%absolute gap
Leans noSplit concept
openai/gpt-5.4-mini
64.0% yes36.0% no
1.6%absolute gap
Leans yesSplit concept
Avocado Tea
Photo 12Avocado Tea
openai/gpt-5.4-mini
100.0% yes0.0% no
7.2%absolute gap
Leans yesPeople mostly said yes
Panini
Photo 13Panini
openai/gpt-5.4-mini
100.0% yes0.0% no
7.6%absolute gap
Leans yesPeople mostly said yes
Cookie PB
Photo 14Cookie PB
openai/gpt-5.4-mini
0.0% yes100.0% no
51.5%absolute gap
Leans noHuman knife-edge
Chicken Wrap
Photo 15Chicken Wrap
openai/gpt-5.4-mini
71.0% yes29.0% no
48.4%absolute gap
Leans yesSplit concept
openai/gpt-5.4-mini
0.0% yes100.0% no
66.3%absolute gap
Leans noSplit concept
Sloppy Joe
Photo 17Sloppy Joe
openai/gpt-5.4-mini
100.0% yes0.0% no
20.6%absolute gap
Leans yesSplit concept
openai/gpt-5.4-mini
0.0% yes100.0% no
29.8%absolute gap
Leans noSplit concept
openai/gpt-5.4-mini
100.0% yes0.0% no
44.3%absolute gap
Leans yesHuman knife-edge
Bagel PB&J
Photo 20Bagel PB&J
openai/gpt-5.4-mini
35.0% yes65.0% no
11.6%absolute gap
Leans noHuman knife-edge
openai/gpt-5.4-mini Sandwich Benchmark Breakdown | opensandwich.ai