I’ve tried coding and every one I’ve tried fails unless really, really basic small functions like what you learn as a newbie compared to say 4o mini that can spit out more sensible stuff that works.
I’ve tried explanations and they just regurgitate sentences that can be irrelevant, wrong, or get stuck in a loop.
So. what can I actually use a small LLM for? Which ones? I ask because I have an old laptop and the GPU can’t really handle anything above 4B in a timely manner. 8B is about 1 t/s!
I installed Llama. I’ve not found any use for it. I mean, I’ve asked it for a recipe because recipe websites suck, but that’s about it.
you can do a lot with it.
I heated my office with it this past winter.
I’ve used smollm2:135m for projects in DBeaver building larger queries. The box it runs on is Intel HD 530 graphics with an old i5-6500T processor. Doesn’t seem to really stress the CPU.
UPDATE: I apologize to the downvoter for not masochistically wanting to build a 1000 line bulk insert statement by hand.
How, exactly, do you have Intel HD graphics, found on Intel APUs, on a Ryzen AMD system?
Sorry, I was trying to find parts for my daughter’s machine while doing this (cheap Minecraft build). I corrected my comment.
Sorry, I am just gonne dump you some links from my bookmarks that were related and interesting to read, cause I am traveling and have to get up in a minute, but I’ve been interested in this topic for a while. All of the links discuss at least some usecases. For some reason microsoft is really into tiny models and made big breakthroughs there.
https://reddit.com/r/LocalLLaMA/comments/1cdrw7p/what_are_the_potential_uses_of_small_less_than_3b/
https://github.com/microsoft/BitNet
https://www.microsoft.com/en-us/research/blog/phi-2-the-surprising-power-of-small-language-models/
https://news.microsoft.com/source/features/ai/the-phi-3-small-language-models-with-big-potential/
I have it roleplay scenarios with me and sometimes I verbally abuse it for fun.
Weirdly I’m polite to all LLMs, but Gemini sets me off and I end up yelling at it.
it’s just so pushy and hard to remove. it’s asking for abuse.
It’ll work for quick bash scripts and one-off things like that. But there’s not usually enough context window unless you’re using a 24G GPU or such.