Yilin Wang - Research Scientist at Adobe

Yilin Wang

Research Scientist at Adobe, San Jose

I work on computer vision for Imaging products at Adobe, where I am a primary contributor to Instruction-based Image Editing, Select Subject, Object Finder, and Select People Details. I received my Ph.D. from Arizona State University, advised by Baoxin Li. My work has led to 8 tech transfers to Photoshop, Lightroom, and Stardust.

Now recruiting summer research interns in image/video editing with MLLMs. Contact me!

Highlighted Research

I strive for simple yet scalable methods in image understanding and editing. Representative works are highlighted below. Full list on Google Scholar.

X-Planner
Beyond Simple Edits: X-Planner for Complex Instruction-Based Image Editing
AAAI 2026

Chun-Hsiao Yeh, Yilin Wang, Nanxuan Zhao, Richard Zhang, Yuheng Li, Yi Ma, Krishna Kumar Singh

An MLLM-based planning system that bridges user intent with editing model capabilities.

OmniStyle
OmniStyle: Filtering High Quality Style Transfer Data at Scale
CVPR 2025

Ye Wang, Ruiqi Liu, Jiang Lin, Zili Yi, Yilin Wang, Rui Ma  co-advisor

First end-to-end style transfer framework on DiT, achieving 1K-resolution stylization with the OmniStyle-1M dataset. Supports both instruction- and image-guided stylization.

FineCaption
FINECAPTION: Compositional Image Captioning
CVPR 2025

Hang Hua, Qing Liu, Lingzhi Zhang, Jing Shi, Zhifei Zhang, Yilin Wang, Jianming Zhang, Jiebo Luo, Zhe Lin

A unified VLM for free-form mask grounding and compositional captioning.

UniReal
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
ICLR 2025 (Highlight)

Xi Chen, Zhifei Zhang, He Zhang, Yuqian Zhou, Soo Ye Kim, Qing Liu, Yijun Li, Jianming Zhang, Nanxuan Zhao, Yilin Wang, Hui Ding, Zhe Lin, Hengshuang Zhao

A foundational multimodal generative model: a universal framework for image generation and editing that treats input and output images as video frames and learns real-world dynamics from video supervision.

SigStyle
SigStyle: Signature Style Transfer via Personalized Text-to-Image Models
AAAI 2025

Ye Wang, Tongyuan Bai, Xuping Xie, Zili Yi, Yilin Wang, Rui Ma  co-advisor

Style-preserving transfer via a personalized subject-editing diffusion model.

SwapAnything
SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing
ECCV 2024

Jing Gu, Nanxuan Zhao, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, HyunJoon Jung, Yilin Wang, Xin Eric Wang  co-advisor

Personalized subject-driven image editing with arbitrary object swapping.

UniHuman
UniHuman: A Unified Model for Editing Human Images in the Wild
CVPR 2024

Nannan Li, Qing Liu, Krishna Kumar Singh, Yilin Wang, Jianming Zhang, Bryan A. Plummer, Zhe Lin

Human editing via diffusion.

Amodal Segmentation
Amodal Scene Analysis via Holistic Occlusion Relation Inference and Generative Mask Completion
AAAI 2024 (oral)

Bowen Zhang, Qing Liu, Jianming Zhang, Yilin Wang, Akide Liu, Zhe Lin, Yifan Liu

Amodal segmentation with mutual occlusion reasoning.

PhotoSwap
LightPainter
Interactive Portrait Harmonization
Lite Vision Transformer
Lite Vision Transformer with Enhanced Self-Attention
CVPR 2022

Chenglin Yang, Yilin Wang, Jianming Zhang, He Zhang, Zijun Wei, Zhe Lin, Alan Yuille

Lightweight vision transformer models for vision tasks.

SSH
SSH: A Self-Supervised Framework for Image Harmonization
ICCV 2021

Yifan Jiang, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Kalyan Sunkavalli, Simon Chen, Sohrab Amirghodsi, Sarah Kong, Zhangyang Wang

Image harmonization based on self-supervised learning.

Mask Guided Matting
Multimodal Contrastive Training
Multimodal Contrastive Training for Visual Representation Learning
CVPR 2021

Xin Yuan, Zhe Lin, Jason Kuen, Jianming Zhang, Yilin Wang, Michael Maire, Ajinkya Kale, Baldo Faieta

Intra- and inter-modal similarity preservation for multimodal representation learning.

RAL
Shape Adaptor
Multimodal Style Transfer

PhD Research

2018

  • Generalizing Graph Matching beyond Quadratic Assignment Model
    Tianshu Yu, Junchi Yan, Yilin Wang, Wei Liu, Baoxin Li
    NeurIPS 2018
  • Weakly Supervised Facial Attribute Manipulation via Deep Adversarial Network
    Yilin Wang, Suhang Wang, Guojun Qi, Jiliang Tang, Baoxin Li
    WACV 2018 [paper]
  • CrossFire: Cross Media Joint Friend and Item Recommendations
    Kai Shu, Suhang Wang, Jiliang Tang, Yilin Wang, Huan Liu
    WSDM 2018 spotlight [paper]
  • Understanding and Predicting Delay in Reciprocal Relations
    Jundong Li, Jiliang Tang, Yilin Wang, Yali Wan, Yi Chang, Huan Liu
    WWW 2018 Research Track [arXiv]
  • Exploring Hierarchical Structures for Recommender Systems
    Suhang Wang, Jiliang Tang, Yilin Wang, Huan Liu
    IEEE TKDE

2017

  • CLARE: A Joint Approach to Label Classification and Tag Recommendation
    Yilin Wang, Suhang Wang, Jiliang Tang, Guojun Qi, Huan Liu, Baoxin Li
    AAAI 2017 oral [paper] [code]
  • Understanding and Discovering Deliberate Self-harm Content in Social Media
    Yilin Wang, Jiliang Tang, Jundong Li, Baoxin Li, Yali Wan, Clayton Mellina, Neil O'Hare, Yi Chang
    WWW 2017 Research Track [paper] [slides]
  • Exploiting Hierarchical Structures for Unsupervised Feature Selection
    Suhang Wang, Yilin Wang, Jiliang Tang, Charu Aggarwal, Suhas Ranganath, Huan Liu
    SDM 2017 [paper]
  • What Your Images Reveal: Exploiting Visual Contents for Point-of-Interest Recommendation
    Suhang Wang, Yilin Wang, Jiliang Tang, Kai Shu, Suhas Ranganath, Huan Liu
    WWW 2017 Research Track [paper]

2016

  • PPP: Joint Pointwise and Pairwise Image Label Prediction
    Yilin Wang, Suhang Wang, Jiliang Tang, Huan Liu, Baoxin Li
    CVPR 2016 [paper]
  • Efficient Unsupervised Abnormal Crowd Activity Detection Based on a Spatiotemporal Saliency Detector
    Yilin Wang, Qiang Zhang, Baoxin Li
    WACV 2016 [paper] [code]
  • Scale Adaptive Eigen Eye for Fast Eye Detection in Wild Web Images
    Xu Zhou, Yilin Wang, Peng Zhang, Baoxin Li
    ICIP 2016

2015

  • Sentiment Analysis for Social Media Images
    Yilin Wang, Baoxin Li
    ICDM PhD Forum 2015
  • Real Time Vehicle Back-up Warning System with Single Camera
    Yilin Wang, Jun Cao, Baoxin Li
    ICIP 2015 [paper]
  • Unsupervised Sentiment Analysis for Social Media Images
    Yilin Wang, Suhang Wang, Jiliang Tang, Huan Liu, Baoxin Li
    IJCAI 2015 [paper] [project]
  • Inferring Sentiment from Web Images with Joint Inference on Visual and Social Cues: A Regulated Matrix Factorization Approach
    Yilin Wang, Yuheng Hu, Subbarao Kambhampati, Baoxin Li
    ICWSM 2015 oral [paper]
  • Structure Preserving Image Quality Assessment
    Yilin Wang, Qiang Zhang, Baoxin Li
    ICME 2015 oral [paper]
  • Exploring Implicit Hierarchical Structure for Recommender Systems
    Suhang Wang, Jiliang Tang, Yilin Wang, Huan Liu
    IJCAI 2015 [paper]
  • Improving Vision-based Self-positioning in Intelligent Transportation Systems via Integrated Lane and Vehicle Detection
    Parag S. Chandakkar, Yilin Wang, Baoxin Li
    WACV 2015 [paper]

2014

  • Image Co-segmentation via Multi-task Learning
    Qiang Zhang, Jiayu Zhou, Yilin Wang, Jieping Ye, Baoxin Li
    BMVC 2014 [paper]

Service & Interns