Scrubbles

Okay, so I had a meltdown last year. I was staring down a startup that was circling the drain, I knew my time there was limited, and I was being bombarded daily with news of layoffs and friends who couldn’t find work, all while hearing constantly that I was going to be left behind because of AI. (Of course, the layoffs were happening because tech CEOs heard “AI” and started frothing at the idea of getting rid of some of their most expensive staff.)

So I took it upon myself to learn AI. I figured, well, if it’s coming for my job I might as well learn how it works. And oh lord, did I learn a lot. I’m now running several LLMs at home in k3s across multiple servers, and I have built several apps to interact with them. I’ve fine-tuned LLMs, I’ve played with image generation and voices; I dove in headfirst. Eventually I did lose that job, and that gave me a couple of months to focus even more before finding my current one.

My biggest learnings, which I’m sure many of you know:

  • AI is a very neat technology with several real applications, but those applications are severely constrained by what the models can’t reliably do.
  • LLMs and AI are incredibly hard to control. You can’t just say if(nsfw) dont(). You have to spend a lot of time forcing the LLM not to give weight to wherever the user tries to steer it, to the point that it hardly seems worth it.
  • Companies love the idea of LLMs for their chatbots, but given the above, it’s incredibly hard to prevent the chatbot from doing its own thing. You can tell it “We only have a return policy of 14 days”, but if someone works at it hard enough and convinces the LLM that it can still perform the return because they say so… is it really that useful? LLMs have no hard rules.
  • LLMs have very real hardware restrictions. At home, it’s a single GPU. In the cloud they have some clever tricks to share memory, but overall it’s still mostly limited by the GPU. We’ll see what shenanigans NVIDIA comes up with going forward, but LLMs and AI are essentially a brute-force approach, and we can see that in how much power they soak up. You can see ChatGPT slow down once your conversation goes on long enough; it’s running low on memory.
  • AI is not new. It’s still ML under the hood; it’s just finding new ways to use it. Again with the brute force: I didn’t realize that for every token (a word, for simplicity), your entire conversation is passed into the model, which spits out one more word. Repeat for every word. That’s all it is. It’s just predicting the next word. Image generation is just predicting the next pixels, and then looping around again until it finishes. There is no consciousness, there’s no real nuance to it, that’s it. A predictive engine wrapped in a while loop.
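
To make the “predictive engine wrapped in a while loop” idea concrete, here is a minimal toy sketch in Python. It is not how a real LLM is built: the “model” is just a table of word-pair counts, and the names train_bigram, predict_next, and generate are made up for illustration. What it does show is the shape of the loop: pass in everything so far, get one more word, append it, repeat.

```python
from collections import defaultdict


def train_bigram(text):
    """Count which word follows which. A toy stand-in for a trained model."""
    counts = defaultdict(lambda: defaultdict(int))
    words = text.split()
    for prev, nxt in zip(words, words[1:]):
        counts[prev][nxt] += 1
    return counts


def predict_next(counts, context):
    """Receive the whole context and return one next word (or None).

    This toy only looks at the last word; a real LLM attends to the
    entire context, but the call shape is the same.
    """
    if not context:
        return None
    followers = counts.get(context[-1])
    if not followers:
        return None
    # Greedy choice: the most frequent follower (first seen wins ties).
    return max(followers, key=followers.get)


def generate(counts, prompt, max_new_tokens=5):
    """The while loop: feed everything so far, append one word, repeat."""
    tokens = prompt.split()
    produced = 0
    while produced < max_new_tokens:
        nxt = predict_next(counts, tokens)  # entire conversation goes in
        if nxt is None:
            break  # the toy model has nothing to predict
        tokens.append(nxt)  # exactly one more word comes out
        produced += 1
    return " ".join(tokens)


counts = train_bigram("the cat sat on the mat and the cat slept")
print(generate(counts, "the", max_new_tokens=4))  # the cat sat on the
```

Swap the count table for a neural network and the greedy pick for sampling, and the outer loop stays essentially the same.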

There’s more, but this is too long already. It’s neat, it’s useful, but the hype was just as intense as blockchain’s. We’re going to see some really great uses come out of it; integration with something like Word or a browser to summarize things is honestly a good idea. But there are so, so, so many pitfalls.

For coding? I think it’s a great place to get started, or to get an idea. I would never trust it in production. It will take a very long time for us to get to the point where I could say “Go build this feature” and blindly trust what it generated.

Thanks for this write-up. It’s appreciated, because sometimes (like on LinkedIn, I know, don’t ask) it feels like everyone is an AI guru talking hype, hype, hype.

It’s like the AI Scientist experiment, run jointly by Sakana AI (Japan), the University of Oxford, and the University of British Columbia.

It was recently tasked with running experiments under strict time limits. So it tried to rewrite its own code to give itself more time to work on the experiment.

slazer2au

Replace? Never. Lower the workload? Yes, maybe for boilerplate code.

In order to replace programmers, product managers need to express clearly what they want the program to do.

@[email protected]

Yeah, you need a way to specify what you want with a high degree of both flexibility and specificity. We have a term for that in the industry, it’s called “writing code”.

comador

TL;DR: The answer is a resounding NO.
