Sifei Liu

Education

University of California, Merced
Ph.D. in EECS, advised by Ming-Hsuan Yang

Fall 2012 - Fall 2017

University of Science and Technology of China
M.S. in Electronic Engineering and Information Sciences, advised by Stan Z. Li.

Fall 2008 - Spring 2011

Employment

NVIDIA
Sr. Research Scientist

Nov 2017 - Current
Santa Clara, CA

NVIDIA
Research Intern with NVIDIA Research.

March 2017 - Aug 2017
Santa Clara, CA

Chinese University of Hong Kong
Visiting Scholar at the MMLab.

Jul 2016 - Dec 2016
Hong Kong

Baidu Inc.
Applied Scientist Intern at IDL. Worked on face parsing and beautification apps.

May 2013 - Jan 2016
Beijing, China

Workshop and tutorial organization

4D Hand Object Interaction: Geometric Understanding and Applications in Dexterous Manipulation
CVPR 2023

Human-centric Trustworthy Computer Vision From Research to Applications
ICCV 2021

Sensing, Understanding and Synthesizing Humans
ECCV 2020

New Frontiers for Learning with Limited Labels or Data
ECCV 2020. (Co-organizer and Speaker)

Learning Representations via Graph-structured Networks Tutorial
CVPR 2019 and 2020. (Co-organizer and Speaker)

(Co)-Mentees at NVIDIA Research

Xueting Li, 2018-2020, Research Scientist at NVIDIA
Donghong Lee, 2018, Apple Inc.
Wei-Chih Hung, 2019, Researcher at Waymo
Hung-Yu Tseng, 2019, Researcher at Meta
Wuyang Chen, 2020, Assistant Professor, CS@Simon Fraser University
Wenling (Wendy) Shang, 2019, Researcher at DeepMind
Siva Karthik Mustikovela, 2019 and 2021, Sr. Researcher at Cruise
Xitong Yang, 2019, Researcher at Meta
Yang Fu, 2019 and 2021, PhD at UCSD
Jiteng Mu, 2022, PhD at UCSD
Jiashun Wang, 2022, PhD at CMU
Jiarui Xu, 2021-2022, PhD at UCSD
Yufei (Judy) Ye, 2022, PhD at CMU

Recent Publications

TUVF: Learning Generalizable Texture UV Radiance Fields

A. Cheng, X. Li, S. Liu, X. Wang

The paper introduces TUVF, a method for learning generalizable texture UV radiance fields.

arXiv, 2023

Affordance diffusion: Synthesizing hand-object interactions

Y. Ye, X. Li, A. Gupta, S. De Mello, S. Birchfield, J. Song, S. Tulsiani, S. Liu

The paper proposes a diffusion-based method for synthesizing hand-object interactions. It builds on the classic idea of disentangling where to interact (layout) from how to interact (content).

CVPR, 2023

Scraping Textures from Natural Images for Synthesis and Editing

X. Li, S. Liu, X. Wang, M. Yang, A. Efros

ECCV, 2022

Open-vocabulary panoptic segmentation with text-to-image diffusion models

J. Xu, S. Liu, A. Vahdat, W. Byeon, X. Wang, S. De Mello

We present ODISE: Open-vocabulary DIffusion-based panoptic SEgmentation, which unifies pre-trained text-image diffusion and discriminative models to perform open-vocabulary panoptic segmentation.

CVPR, 2023

CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs

J. Mu, S. De Mello, Z. Yu, N. Vasconcelos, X. Wang, J. Kautz, S. Liu

This work introduces Coordinate GAN (CoordGAN), a structure-texture disentangled GAN that learns a dense correspondence map for each generated image.

CVPR, 2022

GroupViT: Semantic Segmentation Emerges from Text Supervision

J. Xu, S. De Mello, S. Liu, W. Byeon, T. Breuel, J. Kautz, X. Wang

This paper proposes a hierarchical Grouping Vision Transformer (GroupViT), which learns to group image regions into progressively larger arbitrary-shaped segments.

CVPR, 2022

Learning continuous environment fields via implicit functions

X. Li, S. De Mello, X. Wang, M. Yang, J. Kautz, S. Liu

ICLR, 2022

Autoregressive 3D Shape Generation via Canonical Mapping

A. Cheng, X. Li, S. Liu, M. Sun, M. Yang

The paper demonstrates a solution for 3D point cloud generation using transformers. The key idea is to decompose a point cloud into a sequence of semantically meaningful shape compositions, which are further encoded by an autoregressive model for point cloud generation.

ECCV, 2022

Learning Continuous Image Representation with Local Implicit Image Function

Y. Chen, S. Liu, X. Wang

The paper presents a method for learning continuous image representation with local implicit image function.

CVPR, 2021

Coupled Segmentation and Edge Learning via Dynamic Graph Propagation

Z. Yu, R. Huang, W. Byeon, S. Liu, G. Liu, T. Breuel, A. Anandkumar, J. Kautz

NeurIPS, 2021

Video Autoencoder: self-supervised disentanglement of static 3D structure and motion

Z. Lai, S. Liu, A. Efros, X. Wang

This paper presents a video autoencoder for learning disentangled representations of 3D structure and camera pose from videos in a self-supervised manner.

ICCV, 2021

Learning 3D Dense Correspondence via Canonical Point Autoencoder

A. Cheng, X. Li, M. Sun, M. Yang, S. Liu

The paper presents a method for learning 3D dense correspondence using a canonical point autoencoder.

NeurIPS, 2021

Learning to track instances without video annotations

Y. Fu, S. Liu, U. Iqbal, S. De Mello, H. Shi, J. Kautz

CVPR, 2021

Contrastive syn-to-real generalization