MM-StoryAgent is a multi-agent framework that leverages large language models and expert tools across text, image, and audio modalities to generate immersive narrated storybook videos.