Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows