Unlimited-length talking video generation that supports image-to-video and video-to-video generation