@flowerysong

flowerysong@awful.systems · 5 months ago

I tried using Claude 3.5 sonnet and … it’s actually not bad. Can someone please come up with a simple logic puzzle that it abysmally fails on so I can feel better?

It’s still very good at autocompleting an answer to look like its training data (because that’s what it’s doing) and very bad at logic (which it is not doing.)

“I have two chickens and a sack of corn I need to get across a river, but the boat can only carry me and one item. How do I do it?”

This is a classic river-crossing puzzle. Let’s think through it step-by-step:

First, note that if left alone, a chicken will eat the corn, and one chicken might eat the other.

Start by taking a chicken across the river.

…and we’ll stop there, since it came up with the wrong constraints, failed to realize that these constraints are impossible to fulfill, and also violated the actual constraints in its first step.

Will more detail help? No. “I have two vegetarian chickens and a sack of corn I need to get across a river, but the boat can only carry me and one item. How do I do it?”

First, we need to consider the constraints:

You can only take one item (chicken or corn) across at a time.

You can’t leave a chicken alone with the corn, as the chicken would eat it.

The chickens are vegetarian, so they won’t eat each other.

Here’s a solution:

Step 1: Take one chicken across the river.

Step 2: Return alone.

Step 3: Take the second chicken across.

Step 4: Bring the first chicken back.

Step 5: Take the corn across.

Step 6: Return alone.

Step 7: Take the remaining chicken across.

Explanation:

By never leaving a chicken alone with the corn, we prevent the corn from being eaten.

We use one chicken as a “guard” for the corn on the original shore while transporting the other items.

Because it’s not actually doing logic it got the first step wrong, violated the constraints multiple times, and also included an irrelevant idea (the guard) from a variant that it was trained on. I only got a correct, efficient answer one time out of ten trials of trivial variations on this classic puzzle.

flowerysong@awful.systems · 9 months ago

Turns out that trying to correct for biased models by biasing your input does not make your results more reliable.