Square Enix says it wants generative AI to be doing 70% of its QA and debugging by the end of 2027 | VGC
www.videogameschronicle.com
external-link
The publisher is researching “Game QA Automation Technology” with the University of Tokyo…
@[email protected]
link
fedilink
English
31h

Not buying their next game it will be a nightmare

ssillyssadass
link
fedilink
English
31h

A year from now they’ll be wondering why their games are so buggy.

@[email protected]
link
fedilink
English
143h

Realistic goal considering they already do so little QA.

@[email protected]
link
fedilink
English
24h

Lol. Good luck!

@[email protected]
link
fedilink
English
209h

QA annnnd Debugging?

LLMs have a much better chance at succesfuly replacing whoever said that.

@[email protected]
link
fedilink
English
89h

And I thought I had no more disappointment left to allocate

@[email protected]
link
fedilink
English
23h

It’s just that you’ve reached your free quota, further disappointments will be charged 0.0937 emotional stability per hour

@[email protected]
link
fedilink
English
510h

So… no more SE games for me. Not a huge loss to be honest.

@[email protected]
link
fedilink
English
3014h

Well, good luck with that. Software development is a shit show already anyway. You can find me in my Gardening business in 2027.

@[email protected]
link
fedilink
English
9
edit-2
12h

Good Luck. When the economy finally bottoms out the first budget to go is always the gardening budget.

You can find me in my plumbing business in 2028.

I deal with shit daily so it’s what we in biz call a horizontal promotion.

@[email protected]
link
fedilink
English
410h

Market gardening isn’t so bad, people gotta eat. But yeah, if you’re cutting lawns you’re going to suffer when the economy shits the bed.

@[email protected]
link
fedilink
English
3815h

So Square Enix is demanding OpenAI stop using their content, but is 100% okay using AI built off stolen content to make more money themselves

As a developer, it bothers me that my code is being used to train AI that Square Enix is using while trying to deny anyone else the ability to use their work

I could go either way on whether or not AI should be able to train on available data, but no one should get to have it both ways

@[email protected]
link
fedilink
English
21
edit-2
15h

Square Enix actually has a pretty sick automated QA already. There’s a cool talk about how they did that for FFVII remake in GDC vault, and I highly recommend watching it, if you’re at all interested in QA.

It has nothing to do with AI, it’s just plain old automation, but they solve most of the issues you get with making automated tests in non-discrete 3D playspace and they do that in a pretty solid way. It’s definitely something I’d love to have implemented in the games I’m working on, as someone who worked in QA and now works in development. Being able to have mostly reliable way how to smoke-test levels for basic gameplay without having to torture QA to run the test-case again is good, and allows QA to focus on something else - but the tools also need oversight, so it’s not really a job lost. In summary - I think the talk is cool tech and worth the watch.

However, I don’t think AI will help in this regard, and something as unreliable and random as AI models are not a good fit for this job. You want to have deterministic testcases that you can quanitfy, and if something doesn’t match have an actual human to look at why. AI also probably won’t be able to find clever corner-cases and bugs that need human ingenuity.

Fuck AI, I kind of hope this is just a marketing talk and they are actually just improving the (deterministic) tools they already have (which actually are AI by definition, since they also do level exploration on top of recorded inputs), and they are calling it an “AI” to satisfy investors/management without actually slapping a glorified chat-bot into the tech for no reason.

LostWanderer
link
fedilink
15721h

Ew, sounds like a great reason to not buy any Square Enix games…

@[email protected]
link
fedilink
English
10820h

Not even from an ethically standpoint. Color me shocked if these games are like, playable

LostWanderer
link
fedilink
3720h

Exactly, as I don’t expect QA done by something that can’t think or feel to know what actually needs to be fixed. AI is a hallucination engine that just agrees rather than points out issues, in some cases it might call attention to non-issues and let critical bugs slip by. The ethical issues are still significant and play into the reason why I would refuse to buy any more Square Enix games going forward. I don’t trust them to walk this back, they are high on the AI lie. Human made games with humans handling the QA are the only games that I want.

@[email protected]
link
fedilink
English
319h

Exactly, as I don’t expect QA done by something that can’t think or feel to know what actually needs to be fixed

That is a very small part of QA’s responsibility. Mostly it is about testing and identifying bugs that get triaged by management. The person running the tests is NOT responsible for deciding what can and can’t ship.

And, in that regard… this is actually a REALLY good use of “AI” (not so much generative). Imagine something like the old “A star algorithm plays mario” where it is about finding different paths to accomplish the same goal (e.g. a quest) and immediately having a lot of exactly what steps led to the anomaly for the purposes of building a reproducer.

Which actually DOES feel like a really good use case… at the cost of massive computational costs (so… “AI”).

That said: it also has all of the usual labor implications. But from a purely technical “make the best games” standpoint? Managers overseeing a rack that is running through the games 24/7 for bugs that they can then review and prioritize seems like a REALLY good move.

@[email protected]
link
fedilink
English
1520h

I would initially tap the breaks on this, if for no other reason than “AI doing Q&A” reads more like corporate buzzwords than material policy. Big software developers should already have much of their Q&A automated, at least at the base layer. Further automating Q&A is generally a better business practice, as it helps catch more bugs in the Dev/Test cycle sooner.

Then consider that Q&A work by end users is historically a miserable and soul-sucking job. Converting those roles to debuggers and active devs does a lot for both the business and the workforce. When compared to “AI is doing the art” this is night-and-day, the very definition of the “Getting rid of the jobs people hate so they can do the work they love” that AI was supposed to deliver.

Finally, I’m forced to drag out the old “95% of AI implementations fail” statistic. Far more worried that they’re going to implement a model that costs a fortune and delivers mediocre results than that they’ll implement an AI driven round of end-user testing.

Turning Q&A over to the Roomba AI to find corners of the setting that snag the user would be Gud Aktuly.

Nate Cox
link
fedilink
English
1420h

Converting those roles to debuggers and active devs does a lot for both the business and the workforce.

Hahahahaha… on wait you’re serious. Let me laugh even harder.

They’re just gonna lay them off.

@[email protected]
link
fedilink
English
-118h

The thing about QA is the work is truly endless.

If they can do their work more efficiently, they don’t get laid off.

It just means a better % of edge cases can get covered, even if you made QAs operate at 100x efficiency, they’d still have edge cases not getting covered.

@[email protected]
link
fedilink
English
-219h

They’re just gonna lay them off.

And hire other people with the excess budget. Hell, depending on how badly these systems are implemented, you can end up with more staff supporting the testing system than you had doing the testing.

@[email protected]
link
fedilink
English
518h

I was going to say, this is one job that actually makes sense to automate. I don’t know any QA testers personally, but I’ve heard plenty of accounts of them absolutely hating their jobs and getting laid off after the time crunch anyway.

@[email protected]
link
fedilink
English
620h

The repetition of “Q&A” reads like this comment was also outsourced to AI.

@[email protected]
link
fedilink
English
115h

They already have a really cool solution for that, which they talked about in their GDC talk.. I don’t think there’s any need to slap a glorified chatbot into this, it already seems to work well and have just the right amount of human input to be reliable, while also leaving the “testcase replay gruntwork” to a script instead of a human.

@[email protected]
link
fedilink
English
419h

What does Q&A stand for?

@[email protected]
link
fedilink
English
219h

Quality and assurance

@[email protected]
link
fedilink
English
-219h

Ugh. QA. Quality Assurance. Reflexively jamming that & because I am a bad AI.

Regardless, digital simulated users are going to be able to test faster, more exhaustively, and with more detailed diagnostics, than manual end users.

@[email protected]
link
fedilink
English
3217h

@[email protected]
link
fedilink
English
6419h

That’s a stupid idea. You’re not supposed to QA or debug games. You just release it, customers report bugs, and then you promise to fix the bugs in the next patch (but don’t).

@[email protected]
link
fedilink
English
12h

Or do the Bethesda thing and let people playtest their slop and fix it for free.

Rhaedas
link
fedilink
717h

No better testing than in production.

/home/pineapplelover
link
fedilink
English
110h

Sure

@[email protected]
link
fedilink
English
4720h

Literally not how any of this works. You don’t let AI check your work, at best you use AI and check it’s work, and at worst you have to do everything by hand anyway.

@[email protected]
link
fedilink
English
20
edit-2
19h

You don’t let AI check your work

From a game dev perspective, user Q&A QA is often annoying and repetitive labor. Endlessly criss-crossing terran hitting different buttons to make sure you don’t snag a corner or click objects in a sequence that triggers a state freeze. Hooking a PS controller to Roomba logic and having a digital tool rapidly rerun routes and explore button combos over and over, looking for failed states, is significantly better for you than hoping an overworked team of dummy players can recreate the failed state by tripping into it manually.

subignition
link
fedilink
1419h

There’s plenty of room for sophisticated automation without any need to involve AI.

@[email protected]
link
fedilink
English
619h

I mean, as a branding exercise, every form of sophisticated automation is getting the “AI” label.

Past that, advanced pathing algorithms are what Q&A systems need to validate all possible actions within a space. That’s the bread-and-butter of AI. Its also generally how you’d describe simulated end-users on a test system.

subignition
link
fedilink
210h

I mean, as a branding exercise, every form of sophisticated automation is getting the “AI” label.

The article is specifically talking about generative AI. I think we need to find new terminology to describe the kind of automation that was colloquially referred to as AI before chatgpt et al. came into existence.

The important distinction, I think, is that these things are still purpose-built and (mostly) explainable. When you have a bunch of nails, you design a hammer. An “AI bot” QA tester the way Booty describes in the article isn’t going to be an advanced algorithm that carries out specific tests. That exists already and has for years. He’s asking for something that will figure out specific tests that are worth doing when given a vague or nonexistent test plan, most likely. You need a human, or an actual AGI, for something on that level, not generative AI.

And explicitly with generative AI, as pertains to Square Enix’s initiative in the article, there are the typical huge risks of verifiability and hallucination. However unpleasant you may think a QA worker’s job is now, I guarantee you it will be even more unpleasant when the job consists of fact-checking AI bug reports all day instead of actually doing the testing.

@[email protected]
link
fedilink
English
619h

Not all AI is generative.

@[email protected]
link
fedilink
English
-2
edit-2
15h

If it does the job better, who the fuck cares. No one actually cares about how you feel about the tech. Cry me a river.

@[email protected]
link
fedilink
English
413h

The problem is that if it doesn’t do a better job, no one left in charge will even know enough to give a shit, so quality will go down.

@[email protected]
link
fedilink
English
419h

its *

Ironically, that’s definitely something AI could check for.

@[email protected]
link
fedilink
English
219h

Spell check? Yeah fair enough. The misspelling has historical value now though so I have to keep it in :P

@[email protected]
link
fedilink
English
216h

Ask it for many R’s there are in strawberry

Create a post

Welcome to the largest gaming community on Lemmy! Discussion for all kinds of games. Video games, tabletop games, card games etc.

Rules

1. Submissions have to be related to games

Video games, tabletop, or otherwise. Posts not related to games will be deleted.

This community is focused on games, of all kinds. Any news item or discussion should be related to gaming in some way.

2. No bigotry or harassment, be civil

No bigotry, hardline stance. Try not to get too heated when entering into a discussion or debate.

We are here to talk and discuss about one of our passions, not fight or be exposed to hate. Posts or responses that are hateful will be deleted to keep the atmosphere good. If repeatedly violated, not only will the comment be deleted but a ban will be handed out as well. We judge each case individually.

3. No excessive self-promotion

Try to keep it to 10% self-promotion / 90% other stuff in your post history.

This is to prevent people from posting for the sole purpose of promoting their own website or social media account.

4. Stay on-topic; no memes, funny videos, giveaways, reposts, or low-effort posts

This community is mostly for discussion and news. Remember to search for the thing you’re submitting before posting to see if it’s already been posted.

We want to keep the quality of posts high. Therefore, memes, funny videos, low-effort posts and reposts are not allowed. We prohibit giveaways because we cannot be sure that the person holding the giveaway will actually do what they promise.

5. Mark Spoilers and NSFW

Make sure to mark your stuff or it may be removed.

No one wants to be spoiled. Therefore, always mark spoilers. Similarly mark NSFW, in case anyone is browsing in a public space or at work.

6. No linking to piracy

Don’t share it here, there are other places to find it. Discussion of piracy is fine.

We don’t want us moderators or the admins of lemmy.world to get in trouble for linking to piracy. Therefore, any link to piracy will be removed. Discussion of it is of course allowed.

Authorized Regular Threads

Related communities

PM a mod to add your own

Video games

Generic

Help and suggestions

By platform
By type
By games
Language specific
  • 1 user online
  • 287 users / day
  • 736 users / week
  • 1.87K users / month
  • 6.66K users / 6 months
  • 1 subscriber
  • 7.7K Posts
  • 158K Comments
  • Modlog