I'm not really sure, and you can find plenty of funny examples where various models have progressed and regressed on such mundane, simple math.
As recently as August, "11.10 or 11.9, which is bigger?" got the wrong answer on ChatGPT, followed by lots of wrong justification for that wrong answer. Even the follow-up math question "what is 11.10 - 11.9" gave me the answer "11.10 - 11.9 equals 0.2".
We can quibble about what model I was using, or what edge case I hit, or how quickly they fixed it... but this is two years into the very public LLM hype wave, so at some point I expect better.
It gives me pause about asking more complex math questions whose results I cannot immediately verify. In which case, again, why would I pay for a tool to ask questions I already know the answer to?
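For what it's worth, treating those inputs as plain decimal numbers (not version strings, which is one plausible way the model could be misreading them), a couple of lines of Python show what any calculator would say:

```python
from decimal import Decimal

# As decimal numbers, 11.10 == 11.1, so 11.9 is the larger value.
a, b = Decimal("11.10"), Decimal("11.9")
print(max(a, b))   # 11.9
print(a - b)       # -0.80, not 0.2
```

Under the version-string reading, 11.10 would come after 11.9, which may be where the model's "0.2" came from, but its stated justification didn't say that.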
He did say "sometimes Einstein is on the other end, and sometimes it's a drunken child. You have no idea when you pick up the phone which way it's going to go.", so I think that's still a valid thing for him to complain about.
LLMs totally violate our expectations for computers, by being a bit forgetful and bad at maths.