@fasterthanlime How thoroughly have you tested them? I've tried maybe half a dozen smaller (sub-9 GB) models, and my conclusion is that for general knowledge they're the worst of all worlds — still sound plausible, but their odds of getting anything correct is abysmal. I suppose for writing tools or code autocomplete they can be decent, but for “conversational assistant” my hopes hit rock bottom.
The next step up seems to be ~30 GB, but I don't have the resources to run that locally atm