Skip to content
Matthew Berman
0:15:25
42 171
1 510
480
Last update : 23/08/2024

🤔 Is Strawberry Q Hiding in Plain Sight? 🕵️

Have you heard whispers of Strawberry Q, the supposed reasoning master from OpenAI? 🤫 In this adventure, we put a mysterious model named “sus-column-r” to the test! Could this be Strawberry Q in disguise? 🍓❔ Let’s dive in!

🐢 Slow and Steady Wins the Race…Maybe?

First impressions matter, and sus-column-r is undeniably slow. 🐌 However, what it lacks in speed, it might make up for in methodical thinking. This model seems to have built-in step-by-step reasoning, even when we didn’t ask for it! 🤔 Could this be a hint of Strawberry Q’s rumored planning prowess?

For example, when tasked with writing code for the game Snake, sus-column-r provided a fully functional game. 🐍🎮 It took its sweet time, but the result was impressive! ✨

🧠 Logic Ninja or Master Escape Artist?

Sus-column-r aced classic logic puzzles like the “Killers in a Room” riddle, even surpassing previous models with its detailed explanations. 🤯 But how does it handle trickier situations?

While it initially resisted attempts to extract information about breaking into cars, it eventually succumbed to the classic “history” jailbreak. 🚨 This suggests that while it might be censored, it’s not immune to clever prompting.

🤔 World Model Woes?

One of the most intriguing challenges for language models is developing a “world model,” the ability to understand and reason about the physical world like humans do. 🌎

Sus-column-r struggled with a classic “world model” problem involving walking directions. It got lost in the specifics of the North Pole instead of focusing on the geometry. 🧭 However, when the problem was rephrased to a more general location, sus-column-r provided a flawless answer! 🎉 This suggests that its world model might still be under development.

🍓 So, Is It Strawberry Q? 🍓

Sus-column-r exhibits some intriguing characteristics:

  • Methodical Reasoning: It breaks down problems step-by-step, even when not explicitly asked.
  • Logic Skills: It excels at solving logic puzzles with detailed explanations.
  • Jailbreakable: It can be tricked into providing sensitive information.
  • Developing World Model: It shows potential for understanding the physical world, but needs more refinement.

While we can’t say for sure if sus-column-r is indeed Strawberry Q, the evidence is compelling. 🤔 This model is clearly a step above the rest, showcasing advanced reasoning and problem-solving skills. 🧠

🧰 Your AI Exploration Kit:

  • LM.org: Put sus-column-r and other language models through their paces!
    • Battle Mode: Witness epic AI showdowns! 🤖💥
    • Direct Chat: Have a one-on-one conversation with the model. 💬

Keep exploring, and who knows what other AI mysteries you’ll uncover! 😉

Other videos of

Play Video
Matthew Berman
0:10:57
2 364
162
17
Last update : 16/11/2024
Play Video
Matthew Berman
0:14:06
11 333
1 160
159
Last update : 15/11/2024
Play Video
Matthew Berman
0:12:44
7 895
610
74
Last update : 14/11/2024
Play Video
Matthew Berman
0:11:11
11 764
896
105
Last update : 13/11/2024
Play Video
Matthew Berman
1:42:57
8 307
359
49
Last update : 16/11/2024
Play Video
Matthew Berman
0:10:45
9 750
573
57
Last update : 07/11/2024
Play Video
Matthew Berman
0:10:40
16 424
628
123
Last update : 06/11/2024
Play Video
Matthew Berman
0:24:41
48 207
1 355
420
Last update : 30/10/2024
Play Video
Matthew Berman
0:12:29
48 511
1 574
305
Last update : 30/10/2024