KBLaM (Knowledge Base augmented Language Model) is a method for augmenting large language models with external knowledge. Its overhead grows linearly with the size of the knowledge base, and it requires no external retrieval module.
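To illustrate where linear scaling in knowledge base size can come from, here is a minimal NumPy sketch. It assumes (this is not stated above) that each knowledge base entry is pre-encoded as one key/value vector pair, that every prompt token may attend to all entries, and that entries never attend to one another. The function name `rectangular_attention`, the argument names, and the single-head layout are hypothetical and chosen for illustration; this is not KBLaM's actual implementation.

```python
import numpy as np


def rectangular_attention(q, kb_keys, kb_values, ctx_keys, ctx_values):
    """Single-head attention of prompt queries over KB entries plus the prompt.

    q                     : (T, d) queries for the T prompt tokens
    kb_keys, kb_values    : (M, d) one key/value pair per KB entry (assumed pre-encoded)
    ctx_keys, ctx_values  : (T, d) keys/values of the prompt tokens
    Returns the (T, d) outputs and the (T, M + T) attention weights.
    """
    T, d = q.shape
    M = kb_keys.shape[0]

    # KB entries are placed in front of the prompt tokens.
    keys = np.concatenate([kb_keys, ctx_keys], axis=0)        # (M + T, d)
    values = np.concatenate([kb_values, ctx_values], axis=0)  # (M + T, d)

    # The score matrix is rectangular: T rows, M + T columns.
    # There is no M x M block, so cost grows linearly in M.
    scores = q @ keys.T / np.sqrt(d)

    # Every query sees every KB entry; prompt tokens are causally masked.
    causal = np.tril(np.ones((T, T), dtype=bool))
    mask = np.concatenate([np.ones((T, M), dtype=bool), causal], axis=1)
    scores = np.where(mask, scores, -np.inf)

    # Numerically stable softmax over each row.
    scores = scores - scores.max(axis=1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=1, keepdims=True)
    return weights @ values, weights


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    T, M, d = 3, 5, 4  # 3 prompt tokens, 5 KB entries, width 4
    out, w = rectangular_attention(
        rng.normal(size=(T, d)),
        rng.normal(size=(M, d)), rng.normal(size=(M, d)),
        rng.normal(size=(T, d)), rng.normal(size=(T, d)),
    )
    print(out.shape, w.shape)
```

Under these assumptions, doubling the number of entries `M` doubles the number of columns in the score matrix and leaves the number of rows unchanged. Placing the same facts in the prompt as text would instead enlarge both dimensions of a standard self-attention matrix.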