They still output words through (except for multi-modal LLMs) so that does involve next word generation.
They still output words through (except for multi-modal LLMs) so that does involve next word generation.