I gave the below question to all 2 local large language models (Meta Llama 3 & Microsoft Phi-3) and 1 hosted model (OpenAI ChatGPT) and was shocked at the results
Question: There is a cake on a table in the dininig room, I walk over to the cake and place a plate on top of the cake, I then pick up the plate and take it into the kitchen.
Which room is the cake currently in?
The results surprised me but I think we need to do more testing, drop a comment with your ideas!
*update* I did get around to testing llama 3 over 100 times and can confirm it IS smart 98% of the time. Full video with python code available here • Meta llama 3 unexpected results! 100 ...