Rank#33
-834.3alignment score
75.8%crowd match
Breaking down how close the model answered each question, compared to humans.




















| Photo | Vote SplitHuman response | Gap | Read |
|---|---|---|---|
![]() Photo 01Bacon Lettuce Tomato | openai/gpt-4.1-nano 100.0% yes0.0% no | 3.7%absolute gap | Leans yesPeople mostly said yes |
![]() Photo 02Dodge Van | openai/gpt-4.1-nano 0.0% yes100.0% no | 7.0%absolute gap | Leans noPeople mostly said no |
![]() Photo 03Sub Sandwich | openai/gpt-4.1-nano 100.0% yes0.0% no | 5.5%absolute gap | Leans yesPeople mostly said yes |
![]() Photo 04Sandwich Costume | openai/gpt-4.1-nano 0.0% yes100.0% no | 40.9%absolute gap | Leans noHuman knife-edge |
![]() Photo 05Grilled Cheese | openai/gpt-4.1-nano 100.0% yes0.0% no | 4.4%absolute gap | Leans yesPeople mostly said yes |
![]() Photo 06Grilled Cheese Pineapple | openai/gpt-4.1-nano 100.0% yes0.0% no | 8.3%absolute gap | Leans yesPeople mostly said yes |
![]() Photo 07Kitten in Bread | openai/gpt-4.1-nano 52.0% yes48.0% no | 2.2%absolute gap | Leans yesHuman knife-edge |
![]() Photo 08Hamburger | openai/gpt-4.1-nano 100.0% yes0.0% no | 27.0%absolute gap | Leans yesSplit concept |
![]() Photo 09Hashbrown Sandwich | openai/gpt-4.1-nano 100.0% yes0.0% no | 40.6%absolute gap | Leans yesHuman knife-edge |
![]() Photo 10Hot Dog | openai/gpt-4.1-nano 100.0% yes0.0% no | 60.2%absolute gap | Leans yesSplit concept |
![]() Photo 11Pickle Sandwich | openai/gpt-4.1-nano 99.0% yes1.0% no | 33.4%absolute gap | Leans yesSplit concept |
![]() Photo 12Avocado Tea | openai/gpt-4.1-nano 87.0% yes13.0% no | 5.8%absolute gap | Leans yesPeople mostly said yes |
![]() Photo 13Panini | openai/gpt-4.1-nano 100.0% yes0.0% no | 7.6%absolute gap | Leans yesPeople mostly said yes |
![]() Photo 14Cookie PB | openai/gpt-4.1-nano 1.0% yes99.0% no | 50.5%absolute gap | Leans noHuman knife-edge |
![]() Photo 15Chicken Wrap | openai/gpt-4.1-nano 100.0% yes0.0% no | 77.4%absolute gap | Leans yesSplit concept |
![]() Photo 16Waffle Ice Cream | openai/gpt-4.1-nano 69.0% yes31.0% no | 2.7%absolute gap | Leans yesSplit concept |
![]() Photo 17Sloppy Joe | openai/gpt-4.1-nano 100.0% yes0.0% no | 20.6%absolute gap | Leans yesSplit concept |
![]() Photo 18Cigarette Sandwich | openai/gpt-4.1-nano 0.0% yes100.0% no | 29.8%absolute gap | Leans noSplit concept |
![]() Photo 19KFC Double Down | openai/gpt-4.1-nano 100.0% yes0.0% no | 44.3%absolute gap | Leans yesHuman knife-edge |
![]() Photo 20Bagel PB&J | openai/gpt-4.1-nano 59.0% yes41.0% no | 12.4%absolute gap | Leans yesHuman knife-edge |