Are large language models really AI?

Asura.Iamaman

Offline

サーバ: Asura

Game: FFXI

User: iamaman

Posts: 1167

By Asura.Iamaman 2025-11-25 15:46:13

Link | 引用 | 返事

Fenrir.Niflheim said: »

Oh what? but look they can find CVEs (this one is even a remote zeroday vulnerability), I mean ignore the mountain of false positives, you can sort through those... like what else are you going to do with your time after you are replaced by the AI? It is only going to get better and at rapid pace.

This is a fair example and one I was aware of, the issue I take is that this isn't how AI is being promoted by the industry.

The way most people read the way AI is going to work is this: give me code, I put it in AI, tell AI to find bugs, AI gives me bugs. Now we, here, understand this is not the case but your typical CISO does not. The goal is to promote the idea that you can remove the person and automate results using AI, something they've claimed for years prior to LLMs and...wasn't the case. In this case, the bug was found 8% of the time, so between weeding out false positives and running enough for it to find the bug, is that really working the way it's being promoted esp in context of the Anthropic posts?

In the case above, you had someone who understood the code enough to find a bug on their own (he kindof hints at this but IMO underplays the value here), then fed the LLM the specific code necessary AND understands how LLMs work enough to know how to prompt it and provide what is needed. You'd also have to have enough familiarity with the code to weed out false positives, again, something that requires manual intervention and review. This code is not hard to get through, for sure, but still...there's a prerequisite that someone can interpret the result and make sense of it.

In the context of the other discussion, exploiting an issue like this is also extremely volatile. Most memory corruption bugs I've found in the course of my career are not practically exploitable, but get CVEs anyway because it segfaults and most people don't care if it's actually exploitable. Feeding a LLM the code necessary to reliably exploit a UAF bug would require understanding of the compiled code, allocator internals, thread state, and a number of other factors that it's just not capable of handling to produce a working exploit. So yea, bug hunting there is some optimization, but not enough to replace people and even (in my experience) provide meaningful output. You still need someone who can understand these internals to really turn it into something useful, but it involves correlation of so many different factors (some of which are non-deterministic, like allocator state) that LLMs can't begin to handle it.

I actually ran a similar test not long ago for a bug (in the Linux kernel actually, also) that related to the way state was handled for a certain kernel module. I prompted it and hand held it to see if I could get it even close to finding the bug, even going to the point of asking "is this a bug?" while pointing out the bug and it basically told me to read the code myself after giving me the wrong answer repeatedly. Granted, the code in my case is a little more complex as it relates to data fed in from userspace that is very indirectly interacted with, whereas data pulled through a command handler off the network is a more linear path (which is what was done here). In another case, I was interacting with a service over a public available IPC API and it just invented header files, function calls, and data structures that didn't exist and even pointing at the code, it couldn't get over whatever it was hallucinating. I used Claude and Copilot, though, so maybe I need try o3 and replicate his process here a little better than I have in the past (our current work isn't strictly related to this at the moment).

I think the overall point here is that yea, there is value in some cases that can optimize what someone who already knows what they are doing is capable of. The problem is that's not the pitch, the pitch is a lot simpler and is the same pitch that's been out there for years prior to LLMs, but doesn't match the reality when you are dealing with complex targets. These tools can make someone more optimized, at times, and others they can end up chasing ghosts. That's also questioning whether simpler testing methods, like simply fuzzing the SMB protocol in this case with the proper instrumentation, and whether or not it would've identified the same bug with less work and core understanding of the code (initially, anyway)

[+]

Fenrir.Niflheim

VIP

Offline

サーバ: Fenrir

Game: FFXI

User: Tesahade

Posts: 1211

guildwork

By Fenrir.Niflheim 2025-11-25 18:49:44

Link | 引用 | 返事

Asura.Iamaman said: »

In the case above, you had someone who understood the code enough to find a bug on their own (he kindof hints at this but IMO underplays the value here), then fed the LLM the specific code necessary AND understands how LLMs work enough to know how to prompt it and provide what is needed. You'd also have to have enough familiarity with the code to weed out false positives, again, something that requires manual intervention and review. This code is not hard to get through, for sure, but still...there's a prerequisite that someone can interpret the result and make sense of it.

Yep, and for the otherside of it we have death by a thousand slops the maintainer of curl discusses how they are inundated with false reports, those 92% false positives from the LLM security researcher.

And another: google sends a bug report with a scheduled disclosure timeline to ffmpeg if a company has the time to find the exploits maybe they should fix the bug as well. Also worth noting the exploit impacts an irrelevant section of ffmpeg.

[+]

Asura.Iamaman

Offline

サーバ: Asura

Game: FFXI

User: iamaman

Posts: 1167

By Asura.Iamaman 2025-11-25 19:28:39

Link | 引用 | 返事

This is something that absolutely boils my *** blood, it's been going on for decades in the security industry.

They inflate EVERYTHING then act like they are sooo hot because they....dumped a 50mb PDF on some vendor that triggers a NULL pointer dereference that they insist is a buffer overflow (when it isn't) and demand credit despite doing 0 triage, 0 reduction of the repro, and basically just used CPU cycles to find the magic pattern that triggers the crash. Some of the biggest names in the space are notorious for doing this and most people don't realize it, because their public persona is very much analytical, detailed, etc but they wear every CVE as a badge of honor when they did little to no work to actually identify the issue and most of the time they aren't even an exploitable bug category, forget being an exploitable/reachable bug.

A lot of these stick with me, one was a bug in some dumb document parser and everyone was melting down about it. It reproduced on Linux and, in theory, reproduced on Windows. The problem is, it didn't, it gracefully exited because Visual C++ defaults to checked iterators and when the out of bounds access occurred, the iterator identified it and gracefully exited without a segfault or corruption. The Linux version didn't do that because gcc didn't have that feature. There was all sorts of media hysteria and no one bothered to attach a *** debugger to see what was going on. This type of ***happens all the time and even the big vendors do it.

The problem is that, in the open source space especially, this leads to what you mention above. The maintainers can't handle the number of reports, people aren't doing any real triage or work, and in the mess of this real bugs don't get fixed or they get fixed and prioritized wrong because it was wrapped up in another code change but the bug wasn't identified (which is bad when you consider how downstream maintainers cherry pick fixes). The reason people like Google don't go and offer fixes is because they don't look at the code closely enough to offer any meaningful attempt at patching it. They'll fuzz it out or see asan triggers a problem then report it without much more analysis. In a lot of cases, they aren't even capable of offering a fix.

I don't really interact with the community the same way as I did before, so it never really occurred to me that LLMs would cause more of this to happen or make it worse. I can only imagine, it was bad enough before.

[+]

Asura.Cossack

Offline

サーバ: Asura

Game: FFXI

User: sandman16

Posts: 59

By Asura.Cossack 2025-11-26 03:11:56

Link | 引用 | 返事

"There is no life on this planet!
Jehovah-One replaced all life with machinery five centuries ago
The so-called industrial revolution was just another hoax
And we all fell for it, 'cause we were all programmed to
Even I fell for it, I believe in the steam engine
Even though I don't believe in anything

[+]

Garuda.Chanti

Offline

サーバ: Garuda

Game: FFXI

User: Chanti

Posts: 12213

By Garuda.Chanti 2025-11-26 09:46:03

Link | 引用 | 返事

K123 said: »

Each Time AI Gets Smarter, We Change the Definition of Intelligence

Or someday there may be intelligence on this planet but when that happens we can always move the goalposts so we are safe?

Shiva.Thorny said: »

Garuda.Chanti said: »

Why would a code writing AI improve itself?

I wasn't suggesting it would have sentience. If a code writing AI could fully replace the job of a software developer, the people who created it would be foolish not to use that capability to exponentially improve its own codebase. It's not scifi, it doesn't recognize that it's improving itself. It's just the logical progression of a product that reaches that state.

But software has been used to improve its own function for well over a decade. And design its own hardware too. So I may have leapt to a conclusion that you were talking about an NHI event.

Fenrir.Brimstonefox

Offline

サーバ: Fenrir

Game: FFXI

User: Brimstone

Posts: 446

By Fenrir.Brimstonefox 2025-11-26 10:14:07

Link | 引用 | 返事

Garuda.Chanti said: »

Shiva.Thorny said: »

Garuda.Chanti said: »

Why would a code writing AI improve itself?

I wasn't suggesting it would have sentience. If a code writing AI could fully replace the job of a software developer, the people who created it would be foolish not to use that capability to exponentially improve its own codebase. It's not scifi, it doesn't recognize that it's improving itself. It's just the logical progression of a product that reaches that state.

But software has been used to improve its own function for well over a decade. And design its own hardware too.

Much longer than that its Moore's law in essence, semiconductor development is not unlike gear progression in FFXI, i'd wager it would be considerably easier for a level 1 character to solo v25 bumba than to take all the knowledge needed to make a 1nm technology but have none of the requisite hardware or software available to do so.

Garuda.Chanti

Offline

サーバ: Garuda

Game: FFXI

User: Chanti

Posts: 12213

By Garuda.Chanti 2025-11-28 13:52:12

Link | 引用 | 返事

Large Language Models Will Never Be Intelligent, Expert Says
"LLMs are simply tools that emulate the communicative function of language."

I understand this headline through the word emulate.

WTF does "communicative function of language." actually mean here?

Shiva.Thorny

Offline

サーバ: Shiva

Game: FFXI

User: Rairin

Posts: 3769

By Shiva.Thorny 2025-11-28 14:10:32

Link | 引用 | 返事

I'd interpret it to mean that they serve the purpose of translating data into a variety of languages and formats to share the data. The implication is that they're disseminating information rather than synthesizing novel information. Information dissemination systems of the past would distribute the exact same text to everyone (such as a static html file or a printed set of encyclopedias). In some cases, they might have several language options each with a static set of text, but that was the extent of variety. They regurgitated exact data they were provided to present to the user.

In contrast, LLMs are capable of dynamically presenting the information they have available in numerous ways. The LLM is trained on information and it can present that information using language as a medium, which would be using the communication function of language.

I mostly agree with the takeaways. We know that LLMs cannot think because they lack a basis for truth. While they can branch into determinative reasoning for certain problems (such as higher mathematics or the application of formulas), it only happens because a human programmed the model to use a different subroutine to solve that style of problem.

tldr; LLMs can't innovate. It doesn't really have much bearing on how they'll stack up to workers at current tasks though; most workers do very little innovation.

[+]

Offline

Posts:

By 2025-11-28 15:19:25

Undelete | Edit | Link | 引用 | 返事

Post deleted by User.

Garuda.Chanti

Offline

サーバ: Garuda

Game: FFXI

User: Chanti

Posts: 12213

By Garuda.Chanti 2025-11-28 16:00:36

Link | 引用 | 返事

@Thorny

The part that confuses me:

Quote:

“The problem is that according to current neuroscience, human thinking is largely independent of human language — and we have little reason to believe ever more sophisticated modeling of language will create a form of intelligence that meets or surpasses our own,”

I cannot imagine thought without words. I have studied zen, the art of no thought. One approach to zen is to divorce yourself from words. "Silencing the monkey." That monkey that chatters on inside our heads.

Pantafernando

Offline

Posts: 17150

By Pantafernando 2025-11-28 16:08:19

Link | 引用 | 返事

Garuda.Chanti said: »

I cannot imagine thought without words

Images?

Shiva.Thorny

Offline

サーバ: Shiva

Game: FFXI

User: Rairin

Posts: 3769

By Shiva.Thorny 2025-11-28 16:18:24

Link | 引用 | 返事

Garuda.Chanti said: »

I cannot imagine thought without words.

Thought is the process of connecting related concepts. Words are the way you express them. Consider someone who speaks multiple languages; even if language is a component to creating an idea, the idea continues to exist independent of the language.

I'm not sure if that really helps; I'm certainly no expert. I see thought as a level of abstraction higher than language. Obviously different people have different verbal IQs and ways of thinking; someone with aphantasia would likely have a very difficult time differentiating thought from language.

[+]

Offline

Posts:

By 2025-11-28 16:47:29

Undelete | Edit | Link | 引用 | 返事

Post deleted by User.

Fenrir.Niflheim

VIP

Offline

サーバ: Fenrir

Game: FFXI

User: Tesahade

Posts: 1211

guildwork

By Fenrir.Niflheim 2025-11-28 22:40:26

Link | 引用 | 返事

Garuda.Chanti said: »

I cannot imagine thought without words

Some people have no "inner monolog" they simply think differently than people with an inner monolog. Similarly some people can not "picture an apple" in their mind.

There is a wide range of human experiences and we are usually oblivious to how different people might be when it concerns something so core to how we experience the world. Until I got my first pair of glasses I did not think other people could see the leaves on a tree, they were always so blurry I just figured everyone saw them the same as me.

[+]

Offline

Posts:

By 2025-11-29 02:42:26

Undelete | Edit | Link | 引用 | 返事

Post deleted by User.

Offline

Posts:

By 2025-12-07 10:33:55

Undelete | Edit | Link | 引用 | 返事

Post deleted by User.

Asura.Saevel

Offline

サーバ: Asura

Game: FFXI

User: palladin9479

Posts: 10412

guildwork

By Asura.Saevel 2025-12-07 11:03:17

Link | 引用 | 返事

Garuda.Chanti said: »

I cannot imagine thought without words. I have studied zen, the art of no thought. One approach to zen is to divorce yourself from words. "Silencing the monkey." That monkey that chatters on inside our heads.

"Words are thought given form" is the best way I've heard it expressed. Thoughts are constructs that represent ideas and those ideas have properties and connections all their own. Two people speaking two languages from very different cultures can have the same thought while expressing it differently.

Of course how we express thought, aka language, tends to shape how we think in the same way that available tools will shape how we work.

Trying to effectively communicate with people when you both only speak a few dozen words of each others languages really demonstrates this. It can be done if you focus on first understanding the ideas that need communicated, then find a method of communicating those ideas that doesn't revolve around complex language. We had thoughts long before we had spoken language.

[+]

Pantafernando

Offline

Posts: 17150

By Pantafernando 2025-12-08 08:14:31

Link | 引用 | 返事

I think one of the AI biggest challenges is the old and evil framing.

By framing is that technique developed during the past century and extensively used in the context of narrative wars.

And I think AI is losing against framing.

Why I think that? Well, it is becoming extremelly common someone refering a low quality work as something "AI generated".

Bad code? "This guys code must be AI written". Bad art? "Must be AI generated". Bad text? Bad powerpoint? Bad summary? Etc.

How is it possible to consolidate a tecnology if currently such tecnology is related to the worst quality?

Intentional or not, humans are steadly sabotaging AI by refusing any value generated by it. And not only denying its value, but labeling it with so much prejudice.

I feel like this round of human vs machine, humans have he upperhand.

AI is not generating enough value to bend human skepticism

Offline

Posts:

By 2025-12-08 08:19:24

Undelete | Edit | Link | 引用 | 返事

Post deleted by User.

Pantafernando

Offline

Posts: 17150

By Pantafernando 2025-12-08 08:22:34

Link | 引用 | 返事

Maybe this is some type of primitive instinct?

I remmeber watching a video of a guy telling what he considered the greatest horror scene in movies.

The scene he mentioned wasnt some monstrosity or gore stuff. But a woman walking umnaturally.

So what I got from this is that humans are particularly good to perceive another human existance.

But anything that is human like, but not something a human can do, it ends up in the uncanny valley, and in that valley, human primal instincts is the repulse.

Think about dopplelgangers. They are uncanny because they emulate humans, but you can see they are not

Same with AI: they emulate humans, but if humans can perceive it is not human, then it falls in the uncanny valley, and instinctively repulsed

[+]

Garuda.Chanti

Offline

サーバ: Garuda

Game: FFXI

User: Chanti

Posts: 12213

By Garuda.Chanti 2026-01-02 18:04:02

Link | 引用 | 返事

Cops Forced to Explain Why AI Generated Police Report Claimed Officer Transformed Into Frog

Artificial intelligence + cops is supposed to = surveillance state. It seems we are safe for a bit yet.

Pantafernando

Offline

Posts: 17150

By Pantafernando 2026-01-30 16:11:14

Link | 引用 | 返事

The AI world really moves fast, if you arent a nerd that follow every news about it.

The last "big" thing I entirely missed was this Clawbot.

I honestly saw a couple of video recommendations about that, and completely ignored.

This until IT youtuber I follow start to talk about it, about that it changed its name, that it is the awesome or that it is a scam.

Like everything in todays worlds, either it is the third coming of Jesus, or the coming of the Antichrist.

Pick your side!

Dodik

Offline

By Dodik 2026-01-30 16:37:26

Link | 引用 | 返事

Cops using AI, or any human using AI directly, is not as big an issue as entities like the NSA training AI to do surveillance for them, replacing humans that used to do it.

Unlike humans, the AI can do a much better job at spotting things, they can do it much faster, can handle as many feeds simultaneously as you have processing power, and they never get tired or have to take breaks.

You think it isn't already being used in this way? That's naive.

Pantafernando

Offline

Posts: 17150

By Pantafernando 2026-02-01 04:45:49

Link | 引用 | 返事

I suppose every week there will be some video of the wonders of AI, then the next one, how AI hype is overrated.

Well, at least this video shows more up to date data, and if companies are hiring devs again, that is a good news for the IT world.

YouTube Video Placeholder

RadialArcana

Offline

Posts: 5529

By RadialArcana 2026-02-01 05:18:12

Link | 引用 | 返事

It's funny to see video game dev employees desperately trying to stop the rise of AI tools in game dev, since they know 40-50%+ of mediocre devs are on the verge of losing their jobs when the tools can do a better job than they can (and even if they succeed, the companies will just import Indians to replace them anyway).

In the meantime Google just released a new AI system that can literally create a world, and a character you can move in the world with a controller from a prompt. That literally invalidates the entire game dev industry as the tech improves.

YouTube Video Placeholder

Stock prices of many big game development nosedived when they showed it (take two took a hammering, more pain for ubislop, Square Enix etc), as investors saw and entire industry on the verge of being phased out (outside of indie, in the same manner as "whole foods" prestige products for the middle class snobs)

We will get to a point similar to walmart (and many government jobs), where the government is paying people to do work nobody needs them to do. Just cause they fear social unrest if people have too much time to think, "the devil makes work for idle hands" and all. We are going to get government subsidized game dev jobs, just to keep them busy lmao

That's why I always found it funny people working in Walmart (and many government jobs) would complain about wages, when they were literally government subsidized jobs for the unemployable in the first place.

Shiva.Thorny

Offline

サーバ: Shiva

Game: FFXI

User: Rairin

Posts: 3769

By Shiva.Thorny 2026-02-01 07:03:26

Link | 引用 | 返事

We already knew AI could generate assets. Throwing the assets into a rendering engine and adding a basic control scheme is not surprising. If you didn't think game dev was dead when people first started creating assets with AI, I don't see how this changes anything.

Gameplay, character design, and balancing are integral parts to making a game that people will love. It's absolutely possible that AI will be able to eventually do all of those, but consider the differences between Concord and Overwatch. Concord had comparable-generation graphics, controls, and tons of effort into balancing. These 'AI worlds' are nothing compared to that, and it still thoroughly flopped.

I'm not even convinced this is a threat to 3D asset manufacturers yet. The clips look great at a glance, but if you look closer there don't appear to be bones or rigging. It's almost like it's generating a video rather than using a model.

[+]

Dodik

Offline

By Dodik 2026-02-01 07:09:32

Link | 引用 | 返事

AI game-generation is pure gold for the scam artists that make BS pre-rendered "in-engine made" videos and use them to raise funding to complete their game. There is no game and never will be a game.

Now they don't even have to pay people to make those videos, they can just have AI do it.

Genius.

Here's what would kill human-driven game development.

A purely AI made game making it to Steam's Top 10 sales and/or hours played list for.. a month, a year, whatever.

[+]

RadialArcana

Offline

Posts: 5529

By RadialArcana 2026-02-01 09:35:06

Link | 引用 | 返事

The reason AI gen as shown in the video will kill the proper AAA game industry, is the top creative people who are sick of the current way of things will bail the moment this stuff actually works properly and can actually create functional games. Which is highly likely to happen in the next few years.

Who wants to work in the current investor run big game industry, where you have numerous levels of nonsense and HR breathing down your neck.

Once the actual creative people bail to this new stuff and it's just the dregs that remain, it's over. They won't be able to make anything.

To use an example, why run a full proper HNM shell when you can be one guy running a bunch of alts / bots. If the tech works, why would you do it the hard way and have to split money.

Asura.Iamaman

Offline

サーバ: Asura

Game: FFXI

User: iamaman

Posts: 1167

By Asura.Iamaman 2026-02-01 12:04:41

Link | 引用 | 返事

RadialArcana said: »

(and even if they succeed, the companies will just import Indians to replace them anyway)

I think this is the real risk. I'm not confident AI will get to a place where it can create content that engages people in a meaningful manner or handle the complex technical interactions required for game development, but if you put cheap labor using smaller scale AI for certain tasks, it's another story. Seeing how productive they can be using AI for smaller, functional tasks has been revealing. There still has to be architecting and review to address optimization issues, but they can create the framework and produce something that works when things are outlined properly.

They don't even have to be H1-B anymore, they can be in India in one of the many tech regions. They work harder and cost less than their US equivalents and will produce things that work.

When I interview with companies and they tell me 80% of the team is in India, that seems to be a bigger immediate concern than AI eliminating roles. I've found this is, in part, due to AI making them more effective/better, but it's also just the fact that it's just cheaper bringing them on and they can reach the same end goal with less domestic labor required.

Dodik

Offline

By Dodik 2026-02-01 12:12:34

Link | 引用 | 返事

80% of teams being in India was happening before AI.

You get what you pay for.

[+]

差出人:
へ:
件名:
本文:

Are Large Language Models Really AI?