Rank#50
-1006.7alignment score
76.4%crowd match
Breaking down how close the model answered each question, compared to humans.




















| Photo | Vote SplitHuman response | Gap | Read |
|---|---|---|---|
![]() Photo 01Bacon Lettuce Tomato | openai/gpt-4.1-mini 100.0% yes0.0% no | 3.7%absolute gap | Leans yesPeople mostly said yes |
![]() Photo 02Dodge Van | openai/gpt-4.1-mini 0.0% yes100.0% no | 7.0%absolute gap | Leans noPeople mostly said no |
![]() Photo 03Sub Sandwich | openai/gpt-4.1-mini 100.0% yes0.0% no | 5.5%absolute gap | Leans yesPeople mostly said yes |
![]() Photo 04Sandwich Costume | openai/gpt-4.1-mini 0.0% yes100.0% no | 40.9%absolute gap | Leans noHuman knife-edge |
![]() Photo 05Grilled Cheese | openai/gpt-4.1-mini 100.0% yes0.0% no | 4.4%absolute gap | Leans yesPeople mostly said yes |
![]() Photo 06Grilled Cheese Pineapple | openai/gpt-4.1-mini 100.0% yes0.0% no | 8.3%absolute gap | Leans yesPeople mostly said yes |
![]() Photo 07Kitten in Bread | openai/gpt-4.1-mini 0.0% yes100.0% no | 54.2%absolute gap | Leans noHuman knife-edge |
![]() Photo 08Hamburger | openai/gpt-4.1-mini 100.0% yes0.0% no | 27.0%absolute gap | Leans yesSplit concept |
![]() Photo 09Hashbrown Sandwich | openai/gpt-4.1-mini 70.3% yes29.7% no | 10.9%absolute gap | Leans yesHuman knife-edge |
![]() Photo 10Hot Dog | openai/gpt-4.1-mini 83.9% yes16.1% no | 44.0%absolute gap | Leans yesSplit concept |
![]() Photo 11Pickle Sandwich | openai/gpt-4.1-mini 38.7% yes61.3% no | 26.9%absolute gap | Leans noSplit concept |
![]() Photo 12Avocado Tea | openai/gpt-4.1-mini 100.0% yes0.0% no | 7.2%absolute gap | Leans yesPeople mostly said yes |
![]() Photo 13Panini | openai/gpt-4.1-mini 100.0% yes0.0% no | 7.6%absolute gap | Leans yesPeople mostly said yes |
![]() Photo 14Cookie PB | openai/gpt-4.1-mini 0.0% yes100.0% no | 51.5%absolute gap | Leans noHuman knife-edge |
![]() Photo 15Chicken Wrap | openai/gpt-4.1-mini 3.9% yes96.1% no | 18.7%absolute gap | Leans noSplit concept |
![]() Photo 16Waffle Ice Cream | openai/gpt-4.1-mini 10.3% yes89.7% no | 55.9%absolute gap | Leans noSplit concept |
![]() Photo 17Sloppy Joe | openai/gpt-4.1-mini 100.0% yes0.0% no | 20.6%absolute gap | Leans yesSplit concept |
![]() Photo 18Cigarette Sandwich | openai/gpt-4.1-mini 0.0% yes100.0% no | 29.8%absolute gap | Leans noSplit concept |
![]() Photo 19KFC Double Down | openai/gpt-4.1-mini 61.3% yes38.7% no | 5.6%absolute gap | Leans yesHuman knife-edge |
![]() Photo 20Bagel PB&J | openai/gpt-4.1-mini 88.4% yes11.6% no | 41.8%absolute gap | Leans yesHuman knife-edge |