If you want to see just how few clothes the emperor really has, ask an LLM a nontrivial problem in algebra. I just gave a math olympiad question (on the very easy end of the spectrum) to ChatGPT and I cringed at its response. If it were a human being I wouldn't even criticize what they wrote, I'd be too embarrassed. And this is not because the answer's bad (and wrong), but because of the very obvious effort that this hypothetical human is making to *look* like it understands what it so obviously does not understand at all.

It's as if LLMs perfectly encapsulate the superficial and vapid nature of the Silicon Valley-rooted modern culture. There's no there there.

Discussion

I only recently started playing around with it. Wrt math, I found it good for surfacing high level properties of algebraic structures, but falling way short of going much deeper with the conversation.

Am curious, can you share the question you posed?

Yep,

prove that x^4 + y^4 + z^2 >= sqrt(8)*x*y*z

Disappointed in myself for not seeing solution yet :/ Gonna give it a bit more thinkin’…

It's simple but not easy :) I didn't find it, but to be fair I didn't really try, just looked up the answer after 5 minutes 😄

tis the season, Bless the mess

Oh, wasn’t aware of the AM-GM inequality, not that I would have solved it had I known. Looks like it’s frequently used in Olympiad questions as it’s the first item listed in a study guide google found for olympiads. Met a guy who’d excelled in those, was very smart, but nothing extraordinary when it came to more general/abstract math (I’m prob jealous, still)

If you want a hint, try finding a way to apply AM-GM twice.
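
For anyone who'd rather just see it, here's a rough sketch of how the two-step AM-GM argument can go. It assumes x, y, z are real (the problem as posted doesn't say), and it's one standard route, not necessarily the official solution:

```latex
\begin{align*}
x^4 + y^4 &\ge 2\sqrt{x^4 y^4} = 2x^2y^2
  && \text{(AM-GM on } x^4,\ y^4\text{)} \\
2x^2y^2 + z^2 &\ge 2\sqrt{2x^2y^2z^2} = 2\sqrt{2}\,|xyz|
  && \text{(AM-GM on } 2x^2y^2,\ z^2\text{)} \\
x^4 + y^4 + z^2 &\ge 2x^2y^2 + z^2 \ge \sqrt{8}\,|xyz| \ge \sqrt{8}\,xyz
\end{align*}
```

Equality would need x^4 = y^4, z^2 = 2x^2y^2, and xyz >= 0.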

I read the solution already, but that’s a solid hint. Kinda glad to have learned of the AM-GM ineq, feels like something I could have used once or twice in the past

Yeah. Another one that's very useful is the Cauchy-Schwarz inequality, gets used a ton in e.g. mathematical physics

Ha, had to look it up to make sure I remembered that one.

Btw, your write up on CTs is great. Been making some reading stops along the way of going through 02bp’s paper (and related things).

have U tried others/*

A tool made in the image of its creator…

My own experience has been different. I mostly work with linear algebra, statistics, and differential equations, and it has been pretty good (but not perfect) as an assistant to help work out some thorny problems in well-established areas.

I can believe it, seeing what it's done with coding problems, a few times. What's really shocking about what I just saw is how massively it hallucinated simplifications in a short problem (by the way, this one, to give you a sense: prove that x^4 + y^4 + z^2 >= sqrt(8)*x*y*z ). I think Olympiad problems (even easier ones) are designed to require some kind of "craft", creativity, rather than only handle turning. So unsurprisingly it immediately appealed to the AM-GM inequality (bread and butter for this kind of thing), but then made 2 or 3 dreadful mistakes to pretend that the structure was simpler than it was, before confidently asserting in great detail why so and so was true, when it was patently false.

I think it can be very good at giving you hints and strategies appealing to its vast knowledge base. Doing something new or actually *thinking*, it's just absolutely dreadful.

Sure. I think it is still at the point where you need to understand a subject well enough to "fact check" its replies, and it does better as a savant-like research assistant than as an original thinker. I find myself most productive using it for problems that are hard to solve but easy to verify 😆

it reminds me of that little book, On Bullshit

You're good with the obscure book references.

Nowadays I'm stuck with basic Spanish language novels. But I have graduated from Harry Potter type stuff to Agatha Christie so it's not so bad.

“Obscure books” comes with the “got a degree in liberal arts” 😂

my high school teacher had us reading parts of Don Quixote in the original ancient Spanish and i do not recommend it 🫠

wrt LLMs i think this is the crucial point

this actually makes a lot of sense.

t-y/* makn me rethink

im frequently left with the “why, you little weasel!” sentiment when using llms, and when thinking about SV.

Man, working for SV style tech companies has driven me mad with all their AI hype. The vapidness of it is crushing my soul. I’m so sick of it.

LLMs: a fascinating tool for accomplishing vapid tasks

AI bubble will pop. Even if this gets better: LLMs only improve human flourishing if the saved time is used to produce something. Doing your work faster and “going home early” is not going to cut it.

It's an amazingly primitive & creative tool at this point.

I use it frequently to do, in moments, tasks that would take half my day.

No question. Hugely useful. But it also regularly shocks me with what it can't do, but thinks it can :)

It's a bullshitter fosho