Coding with LLMs (Claude Code, OpenAI Codex) is often presented as the ‘killer app’ for Generative AI. But looking at data, it seems the one piece of the puzzle missing is actual cost. …
They could, but what’s the plan here, exactly? That all these for profit companies who are currently publishing models for free, like Qwen, will continue to do so in the future?
Why not?
Why Microsoft develops it’s .NET ecosystem? Why Google develops Go/Dart? It costs them lots of money and they give it for free.
The answer is: they don’t earn money on it directly, but these tools are a way to tie programmers to their cloud services. If you use .NET you’ll probably end up on Azure. If Go - probably you’ll use GCP.
So I suspect the same will be with LLMs. At some point they will say: “hey, you can use this LLM however you want, but as you are already using it, then you may want to know our platform is optimized for it”
LLM providers are SaaS providers, meaning that even if they were to give you the source of all the tools they use, there’s a fundamental limit to how much you can self host.
A better comparison would be Google giving away their indexed search data: you might be able to run an infinitesimal portion of it on your hardware, and will never ever match the results Google offers on their website, and since it’s a monopoly, you would be at a permanent disadvantage.
Same goes for all these AI companies. They are an oligopoly that give away subpar free models, compared to their cloud offerings. Self hosted LLMs will never stand a chance.
They could, but what’s the plan here, exactly? That all these for profit companies who are currently publishing models for free, like Qwen, will continue to do so in the future?
Why not? Why Microsoft develops it’s .NET ecosystem? Why Google develops Go/Dart? It costs them lots of money and they give it for free.
The answer is: they don’t earn money on it directly, but these tools are a way to tie programmers to their cloud services. If you use .NET you’ll probably end up on Azure. If Go - probably you’ll use GCP.
So I suspect the same will be with LLMs. At some point they will say: “hey, you can use this LLM however you want, but as you are already using it, then you may want to know our platform is optimized for it”
That’s not an accurate analogy.
LLM providers are SaaS providers, meaning that even if they were to give you the source of all the tools they use, there’s a fundamental limit to how much you can self host.
A better comparison would be Google giving away their indexed search data: you might be able to run an infinitesimal portion of it on your hardware, and will never ever match the results Google offers on their website, and since it’s a monopoly, you would be at a permanent disadvantage.
Same goes for all these AI companies. They are an oligopoly that give away subpar free models, compared to their cloud offerings. Self hosted LLMs will never stand a chance.