KeySync is a two-stage framework designed for high-resolution lip synchronization in videos, addressing temporal consistency, leakage, and occlusions. It incorporates a masking strategy to achieve robust synchronization without audio leakage.