To be even more pedantic, this is only true if the LLM is run locally on the same GPU with particular optimizations disabled.