1 |
Karras T, Laine S, Aila T. A Style-based Generator Architecture for Generative Adversarial Networks[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2019: 4396-4405.
|
2 |
Rombach Robin, Blattmann Andreas, Lorenz Dominik, et al. High-resolution Image Synthesis with Latent Diffusion Models[C]//2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2022: 10674-10685.
|
3 |
Jang Wonjong, Ju Gwangjin, Jung Yucheol, et al. StyleCariGAN: Caricature Generation via StyleGAN Feature Map Modulation[J]. ACM Transactions on Graphics, 2021, 40(4): 116.
|
4 |
Gatys Leon A, Ecker Alexander S, Bethge Matthias. Image Style Transfer Using Convolutional Neural Networks[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2016: 2414-2423.
|
5 |
Creswell A, White Tom, Dumoulin Vincent, et al. Generative Adversarial Networks: An Overview[J]. IEEE Signal Processing Magazine, 2018, 35(1): 53-65.
|
6 |
Kim Junho, Kim Minjae, Kang Hyeonwoo, et al. U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-instance Normalization for Image-to-image Translation[EB/OL]. (2020-04-08) [2024-07-01]. .
|
7 |
Cho Hansam, Lee Jonghyun, Chang Seunggyu, et al. One-shot Structure-aware Stylized Image Synthesis[C]//2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2024: 8302-8311.
|
8 |
Liu Songhua, Lin Tianwei, He Dongliang, et al. AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer[C]//2021 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2021: 6629-6638.
|
9 |
Chong M J, Forsyth D. GANs N' Roses: Stable, Controllable, Diverse Image to Image Translation (works for videos too!)[EB/OL]. (2021-06-11) [2024-06-17]. .
|
10 |
Yang Shuai, Jiang Liming, Liu Ziwei, et al. Pastiche Master: Exemplar-based High-resolution Portrait Style Transfer[C]//2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2022: 7683-7692.
|
11 |
Zeng Wei, Ren Xiaozhe, Su Teng, et al. PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation[EB/OL]. (2021-04-26) [2024-06-17]. .
|
12 |
Radford A, Kim J W, Hallacy C, et al. Learning Transferable Visual Models from Natural Language Supervision[C]//Proceedings of the 38th International Conference on Machine Learning. Chia Laguna Resort: PMLR, 2021: 8748-8763.
|
13 |
Ramesh A, Dhariwal P, Nichol A, et al. Hierarchical Text-conditional Image Generation with CLIP Latents[EB/OL]. (2022-04-13) [2024-06-18]. .
|
14 |
Yu Tao, Feng Runseng, Feng Ruoyu, et al. Inpaint Anything: Segment Anything Meets Image Inpainting[EB/OL]. (2023-04-13) [2024-07-05]. .
|
15 |
Kirillov A, Mintun E, Ravi N, et al. Segment Anything[C]//2023 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2023: 3992-4003.
|
16 |
Suvorov Roman, Logacheva Elizaveta, Mashikhin Anton, et al. Resolution-robust Large Mask Inpainting with Fourier Convolutions[C]//2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). Piscataway: IEEE, 2022: 3172-3182.
|
17 |
Collins Edo, Bala R, Price B, et al. Editing in Style: Uncovering the Local Semantics of GANs[C]//2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2020: 5770-5779.
|
18 |
Shen Yujun, Zhou Bolei. Closed-form Factorization of Latent Semantics in GANs[C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2021: 1532-1540.
|
19 |
Kim Gwanghyun, Kwon Taesung, Chul Ye Jong. DiffusionCLIP: Text-guided Diffusion Models for Robust Image Manipulation[C]//2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2022: 2416-2425.
|
20 |
Zhang Lümin, Rao Anyi, Agrawala M. Adding Conditional Control to Text-to-image Diffusion Models[C]//2023 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2023: 3813-3824.
|
21 |
Ronneberger Olaf, Fischer Philipp, Brox Thomas. U-net: Convolutional Networks for Biomedical Image Segmentation[C]//Medical Image Computing and Computer-Assisted Intervention-MICCAI 2015. Cham: Springer International Publishing, 2015: 234-241.
|
22 |
Huang Ziqi, C K Chan Kelvin, Jiang Yuming, et al. Collaborative Diffusion for Multi-modal Face Generation and Editing[C]//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2023: 6080-6090.
|
23 |
Pinkney J N M, Adler D. Resolution Dependent GAN Interpolation for Controllable Image Synthesis Between Domains[EB/OL]. (2020-11-21) [2024-06-18]. .
|
24 |
Johnson J, Alahi A, Li Feifei. Perceptual Losses for Real-time Style Transfer and Super-resolution[C]//Computer Vision-ECCV 2016. Cham: Springer International Publishing, 2016: 694-711.
|
25 |
Deng Jiankang, Guo Jia, Xue Niannan, et al. ArcFace: Additive Angular Margin Loss for Deep Face Recognition[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Piscataway: IEEE, 2019: 4685-4694.
|
26 |
Song Shuang, Liang Yuanbang, Wu Jing, et al. Feature Proliferation-the "Cancer" in StyleGAN and Its Treatments[C]//2023 IEEE/CVF International Conference on Computer Vision (ICCV). Piscataway: IEEE, 2023: 2360-2370.
|
27 |
Kingma Diederik P, Welling Max. Auto-encoding Variational Bayes[EB/OL]. (2022-12-10) [2024-07-05]. .
|
28 |
Liu Shilong, Zeng Zhaoyang, Ren Tianhe, et al. Grounding DINO: Marrying DINO with Grounded Pre-training for Open-set Object Detection[EB/OL]. (2024-07-19) [2024-07-06]. .
|
29 |
Hu E J, Shen Yelong, Wallis P, et al. LoRA: Low-rank Adaptation of Large Language Models[EB/OL]. (2021-10-16) [2024-07-06]. .
|
30 |
Branwen Gwern, Arfafax, Presser Shawn, et al. Anime Crop Datasets: Faces, Figures, & Hands[EB/OL]. (2020-08-05) [2024-07-10]. .
|
31 |
Talebi H, Milanfar P. NIMA: Neural Image Assessment[J]. IEEE Transactions on Image Processing, 2018, 27(8): 3998-4011.
|