You don’t need an LLM for this. You just need an FM that you fine-tune, and you’d be surprised at how little computing power is actually required.
For our use case (which is similar to what OP wants), running OCR on the documents our AI works with takes longer than SageMaker takes to do its thing on a rather small instance.
And devs would just be implementing API calls, so it wouldn’t be a big deal to make the switch.
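To make the "just API calls" point concrete, here is a rough sketch, not our actual code. It assumes a client shaped like boto3's `bedrock-runtime` client, whose `invoke_model` takes `modelId`, `contentType`, `accept`, and `body`. The `classify_document` helper, the `{"inputText": ...}` request body, and the `FakeClient` stand-in are all hypothetical; real request and response shapes vary by model.

```python
import io
import json


def classify_document(client, model_id, text):
    """Send text to a hosted model and return the decoded JSON response.

    `client` is assumed to expose invoke_model() the way boto3's
    bedrock-runtime client does. The body shape below is an assumption;
    each model defines its own request schema.
    """
    response = client.invoke_model(
        modelId=model_id,
        contentType="application/json",
        accept="application/json",
        body=json.dumps({"inputText": text}),  # hypothetical body shape
    )
    # The response body is a file-like stream of JSON bytes.
    return json.loads(response["body"].read())


class FakeClient:
    """Offline stand-in for the real client so the sketch runs anywhere."""

    def invoke_model(self, modelId, contentType, accept, body):
        payload = json.loads(body)
        reply = {"model": modelId, "echo": payload["inputText"]}
        return {"body": io.BytesIO(json.dumps(reply).encode("utf-8"))}
```

Switching models then comes down to passing a different `model_id` (and adjusting the body if the new model expects a different schema), which is why the calling code barely changes.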
Buddy, I have actual training in AI/ML from some of the leading engineers in the field, and my job leverages AI/ML very successfully to do a task really similar to what OP is looking for.
Maybe the versions available to the public to play with aren’t up to the task, but using Amazon Bedrock you can absolutely get the kind of results OP wants.
It’s a foundation model. Basically it’s a base model that has already been pretrained on broad data, which you then adapt with your own data. LLMs are one kind of FM, trained on an enormous amount of text, but they aren’t necessary for every application, especially if you only need the AI/ML to perform a specific task.
Fine-tuning an FM is just feeding it your own examples so that it adapts to your specific task.
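In practice, "your own data" usually means a file of input/output pairs. A minimal sketch, assuming the JSON Lines layout with `prompt` and `completion` fields that Bedrock documents for fine-tuning text models (check the docs for the model you pick, since formats differ). The `write_training_file` helper and the sample documents are made up for illustration.

```python
import json
from pathlib import Path


def write_training_file(examples, path):
    """Write (input_text, expected_output) pairs as JSON Lines.

    Each line is one training record. The "prompt"/"completion" field
    names are an assumption about the target service's format.
    """
    path = Path(path)
    with path.open("w", encoding="utf-8") as f:
        for text, label in examples:
            record = {"prompt": text, "completion": label}
            f.write(json.dumps(record, ensure_ascii=False) + "\n")
    return path


# Made-up examples: OCR'd document text paired with the label we want back.
examples = [
    ("Invoice 1042 from Acme Corp, total due 310.00 USD", "invoice"),
    ("Dear team, please find the meeting notes attached", "correspondence"),
]
```

Calling `write_training_file(examples, "train.jsonl")` produces the file you would upload for a fine-tuning job; the job itself is configured on the service side.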