Maskgan: Towards Diverse And Interactive Facial Image Manipulation

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)(2020)

引用 1046|浏览456
暂无评分
摘要
Facial image manipulation has achieved great progress in recent years. However; previous methods either operate on a predefined set of face attributes or leave users little freedom to interactively manipulate images. To overcome these drawbacks, we propose a novel framework termed MaskGAN, enabling diverse and interactive face manipulation. Our key insight is that semantic masks serve as a suitable intermediate representation for flexible face manipulation with fidelity preservation. MaskGAN has two main components: 1) Dense Mapping Network (DMN) and 2) Editing Behavior Simulated Training (EBST). Specifically, DMN learns style mapping between a free-form user modified mask and a target image, enabling diverse generation results. EBST models the user editing behavior on the source mask, making the overall framework more robust to various manipulated inputs. Specifically, it introduces dual-editing consistency as the auxiliary supervision signal. To facilitate extensive studies, we construct a large-scale high-resolution face dataset with fine-grained mask annotations named CelebAMask-HQ. MaskGAN is comprehensively evaluated on two challenging tasks: attribute transfer and style copy, demonstrating superior performance over other state-of-the-art methods. The code, models, and dataset are available at https://github.com/switchablenorms/CeleAMask-HQ.
更多
查看译文
关键词
target image,diverse generation results,EBST models,user editing behavior,source mask,manipulated inputs,dual-editing consistency,high-resolution face dataset,fine-grained mask annotations,MaskGAN,attribute transfer,style copy,towards diverse,interactive facial image manipulation,predefined set,face attributes,users little freedom,diverse face manipulation,interactive face manipulation,semantic masks,suitable intermediate representation,flexible face manipulation,fidelity preservation,DMN,style mapping,free-form user modified mask
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要