Publications

All Publications

2026

OctoNav: Towards Generalist Embodied Navigation

Chen Gao, Liankai Jin, Xingyu Peng, Jiazhao Zhang, Yue Deng, Annan Li, He Wang, Si Liu

Embodied AI Navigation

CVPR 2026

Parse, Search, and Confirmation: Training-Free Aerial Vision-and-Dialog Navigation with Chain-of-Thought Reasoning and Structured Spatial Memory

Yu Qi, Hongyu Li, Shaofei Huang, Tianrui Hui, Yaxiong Wang, Lechao Cheng, Zhun Zhong, Si Liu, Meng Wang

UAV Embodied AI Vision-Language

CVPR 2026

LookasideVLN: Direction-Aware Aerial Vision-and-Language Navigation

Yuwei Ning, Ganlong Zhao, Yipeng Qin, Si Liu, Yang Liu, Liang Lin, Guanbin Li

UAV Embodied AI Vision-Language

CVPR 2026

VGGT-Segmentor: Geometry-Enhanced Cross-View Segmentation

Yulu Gao, Bohao Zhang, Zongheng Tang, Jitong Liao, Wenjun Wu, Si Liu

3D Vision Segmentation Cross-View

CVPR 2026

ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models

Linqing Zhong, Yi Liu, Yifei Wei, Ziyu Xiong, Maoqing Yao, Si Liu, Guanghui Ren

Multimodal Learning Embodied AI Robotics

CVPR 2026

Geometry-Guided 3D Visual Token Pruning for Video-Language Models

Han Li, Zehao Huang, Jiahui Fu, Naiyan Wang, Si Liu

Multimodal Learning Video Understanding Model Compression

CVPR 2026

2025

RoboCerebra: A Large-scale Benchmark for Long-horizon Robotic Manipulation Evaluation

Songhao Han#, Boxiang Qiu#, Yue Liao#, Siyuan Huang, Chen Gao, Shuicheng Yan*, Si Liu*

Embodied AI Robotics Dataset

NeurIPS 2025

GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance

Jingqiu Zhou#, Lue Fan#, Xuesong Chen, Linjiang Huang*, Si Liu, Hongsheng Li

3D Vision Neural Rendering

AAAI 2025

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

Shaofei Huang#, Rui Ling#, Hongyu Li#, Tianrui Hui, Zongheng Tang, Xiaoming Wei, Jizhong Han, Si Liu*

Video Understanding Multimodal Learning

AAAI 2025

Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology

Xiangyu Wang#, Donglin Yang#, Ziqin Wang#, Hohin Kwan, Jinyu Chen, Wenjun Wu, Hongsheng Li, Yue Liao*, Si Liu*

UAV Embodied AI Vision-Language

ICLR 2025

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

Fangxun Shu#, Yue Liao#, Le Zhuo, Chenning Xu, Guanghao Zhang, Haonan Shi, Long Chen, Tao Zhong, Wanggui He, Siming Fu, Haoyuan Li, Bolin Li, Zhelun Yu, Si Liu*, Hongsheng Li*, Hao Jiang*

Multimodal Learning Model Compression

ICLR 2025

Point Cluster: A Compact Message Unit for Communication-Efficient Collaborative Perception

Zihan Ding, Jiahui Fu, Si Liu*, Hongyu Li, Siheng Chen, Hongsheng Li, Shifeng Zhang, Xu Zhou

Autonomous Driving Collaborative Perception

ICLR 2025

MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More

Wei Huang#, Yue Liao#, Jianhui Liu, Ruifei He, Haoru Tan, Shiming Zhang*, Hongsheng Li, Si Liu*, Xiaojuan Qi*

Large Language Model Model Compression

ICLR 2025

Generative Map Priors for Collaborative BEV Semantic Segmentation

Jiahui Fu, Yue Gong, Luting Wang, Shifeng Zhang, Xu Zhou, Si Liu*

Autonomous Driving 3D Vision

CVPR 2025

FlexDrive: Toward Trajectory Flexibility in Driving Scene Reconstruction and Rendering

Jingqiu Zhou, Lue Fan, Linjiang Huang*, Zhaoxiang Zhang, Xiaoyu Shi, Si Liu, Hongsheng Li*

Autonomous Driving Neural Rendering

CVPR 2025

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Songhao Han, Wei Huang, Hairong Shi, Le Zhuo, Xiu Su, Shifeng Zhang, Xu Zhou, Xiaojuan Qi, Yue Liao*, Si Liu*

Video Understanding Dataset

CVPR 2025

LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding

Hongyu Li#, Jinyu Chen#, Ziyu Wei#, Shaofei Huang, Tianrui Hui, Jialin Gao*, Xiaoming Wei, Si Liu*

Multimodal Learning Video Understanding

CVPR 2025

Revisiting Audio-Visual Segmentation with Vision-Centric Transformer

Shaofei Huang#, Rui Ling, Tianrui Hui*, Hongyu Li, Xu Zhou, Shifeng Zhang, Si Liu*, Richang Hong, Meng Wang

Multimodal Learning Segmentation

CVPR 2025

Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs

Zitian Wang, Yue Liao*, Kang Rong, Fengyun Rao, Yibo Yang*, Si Liu

Multimodal Learning Alignment

ICCV 2025

Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization

Hao Ju#, Shaofei Huang#, Si Liu, Zhedong Zheng*

UAV 3D Vision

ICCV 2025

CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective

Zongheng Tang, Yi Liu, Yifan Sun, Yulu Gao, Jinyu Chen, Runsheng Xu, Si Liu*

Autonomous Driving Collaborative Perception

ICCV 2025

CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation

Yi Liu, Shengqian Li, Zuzeng Lin, Feng Wang, Si Liu*

Image Generation Generative Model

ICCV 2025

RATopo: Improving Lane Topology Reasoning via Redundancy Assignment

Han Li, Shaofei Huang, Longfei Xu, Yulu Gao, Beipeng Mu, Si Liu*

Autonomous Driving Scene Understanding

ACM MM 2025

DOMR: Establishing Cross-View Segmentation via Dense Object Matching

Jitong Liao#, Yulu Gao#, Shaofei Huang, Jialin Gao, Jie Lei, Ronghua Liang, Si Liu*

3D Vision Segmentation

ACM MM 2025

AeroDuo: Aerial Duo for UAV-based Vision and Language Navigation

Ruipu Wu#, Yige Zhang#, Jinyu Chen, Linjiang Huang*, Shifeng Zhang, Xu Zhou, Liang Wang, Si Liu*

UAV Embodied AI Vision-Language

ACM MM 2025

"Hi AirStar, Guide Me to the Badminton Court."

Ziqin Wang#, Jinyu Chen#, Xiangyi Zheng, Qinan Liao, Linjiang Huang*, Si Liu*

UAV Embodied AI Vision-Language

ACM MM 2025 (Demo)

UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning

Xiangyu Wang#, Donglin Yang#, Yue Liao#, Wenhao Zheng, Wenjun Wu, Bin Dai, Hongsheng Li, Si Liu*

UAV Embodied AI Dataset

NeurIPS 2025

Towards Realistic Earth-Observation Constellation Scheduling: Benchmark and Methodology

Luting Wang#, Yinghao Xiang#, Hongliang Huang, Dongjun Li, Chen Gao*, Si Liu*

Satellite Systems Dataset

NeurIPS 2025

FACT: Mitigating Inconsistent Hallucinations in LLMs via Fact-Driven Alternating Code-Text Training

Xinxin You, Qixin Sun, Xien Liu, Chenwei Yan, Xiao Zhang, Chen Ning, Xiangling Fu, Si Liu, Shijin Wang, Guoping Hu, Ji Wu*

Large Language Model

NeurIPS 2025

M2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection

Zitian Wang, Zehao Huang, Yulu Gao, Naiyan Wang, Si Liu*

Autonomous Driving 3D Vision

TPAMI 2025

2024

Multi-Person Pose Regression with Distribution-Aware Single-Stage Models

Leyan Zhu#, Zitian Wang#, Si Liu*, Xuecheng Nie, Luoqi Liu, Bo Li

Pose Estimation

TPAMI 2024

Data Augmentation in Human-Centric Vision

Wentao Jiang, Yige Zhang, Shaozhong Zheng, Si Liu*, Shuicheng Yan

Data Augmentation Human-Centric Vision

Vicinagearth (Springer Nature) 2024

FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation

Rongyao Fang, Peng Gao, Aojun Zhou, Yingjie Cai, Si Liu, Jifeng Dai, Hongsheng Li*

Detection Data Augmentation

TPAMI 2024

PPDM++: Parallel Point Detection and Matching for Fast and Accurate HOI Detection

Yue Liao, Si Liu*, Yulu Gao, Aixi Zhang, Zhimin Li, Fei Wang, Bo Li

Detection HOI

TPAMI 2024

MAC: Masked Contrastive Pre-Training for Efficient Video-Text Retrieval

Fangxun Shu, Biaolong Chen, Yue Liao, Jinqiao Wang, Si Liu

Multimodal Learning Video Understanding Retrieval

TMM 2024

RGB-T Tracking with Template-Bridged Search Interaction and Target-Preserved Template Updating

Bo Li, Fengguang Peng, Tianrui Hui, Xiaoming Wei, Xiaolin Wei, Lijun Zhang, Hang Shi, Si Liu*

Multimodal Learning Video Understanding Tracking

TPAMI 2024

Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression

Shaofei Huang, Zhenwei Shen, Zehao Huang, Yue Liao, Jizhong Han, Naiyan Wang, Si Liu*

Autonomous Driving 3D Vision Detection

TPAMI 2024

ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation

Bo Zhang#, Xinyu Cai#*, Jiakang Yuan, Donglin Yang, Jianfei Guo, Xiangchao Yan, Renqiu Xia, Botian Shi, Min Dou, Tao Chen, Si Liu, Junchi Yan*, Yu Qiao

Autonomous Driving 3D Vision Domain Adaptation

ICLR 2024

Octavius: Mitigating Task Interference in MLLMs via MoE

Ziqin Wang#, Zeren Chen#, Zhen Wang#, Huayang Liu, Zhenfei Yin, Si Liu, Lu Sheng*, Wanli Ouyang, Yu Qiao, Jing Shao*

Large Language Model

ICLR 2024

Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection

Jiahui Fu, Chen Gao, Zitian Wang, Lirong Yang, Xiaofei Wang, Beipeng Mu, Si Liu

Autonomous Driving 3D Vision Multimodal Learning

ICRA 2024

Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training

Runze He#, Shaofei Huang#, Xuecheng Nie, Tianrui Hui, Luoqi Liu, Jiao Dai, Jizhong Han, Guanbin Li*, Si Liu*

3D Vision Image Generation Scene Understanding

CVPR 2024

EASE-DETR: Easing the Competition among Object Queries

Yulu Gao, Yifan Sun, Xudong Ding, Chuyang Zhao, Si Liu

Detection

CVPR 2024

Reference Prompted Model Adaptation for Referring Camouflaged Object Detection

Xuewei Liu#, Shaofei Huang#, Ruipu Wu, Hengyuan Zhao, Duo Xu, Xiaoming Wei, Jizhong Han*, Si Liu

Detection Prompting

ICME 2024

SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection

Gang Zhang, Junnan Chen, Guohuan Gao, Jianmin Li, Si Liu, Xiaolin Hu*

3D Vision Detection

CVPR 2024

Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection

Jiaming Li, Jiacheng Zhang, Jichang Li, Ge Li, Si Liu, Liang Lin, Guanbin Li

Detection Prompting

CVPR 2024

Communication-Efficient Collaborative Perception via Information Filling with Codebook

Yue Hu, Juntong Peng, Sifei Liu, Junhao Ge, Si Liu, Siheng Chen

Autonomous Driving Collaborative Perception

CVPR 2024

Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation

Hairong Shi#, Songhao Han#, Shaofei Huang*, Yue Liao, Guanbin Li*, Xiangxing Kong, Hua Zhu, Xiaomu Wang, Si Liu

Medical Imaging Segmentation

MICCAI 2024

Realistic Rainy Weather Simulation for LiDARs in CARLA Simulator

Donglin Yang, Xinyu Cai*, Zhenfeng Liu, Wentao Jiang, Bo Zhang, Guohang Yan, Xing Gao, Si Liu, Botian Shi

Autonomous Driving 3D Vision

IROS 2024

Asynchronous Large Language Model Enhanced Planner for Autonomous Driving

Yuan Chen#, Zi-han Ding#, Ziqin Wang#, Yan Wang*, Lijun Zhang, Si Liu*

Autonomous Driving Large Language Model Navigation

ECCV 2024

LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction

Penghui Du#, Yu Wang#, Yifan Sun, Luting Wang, Yue Liao, Gang Zhang, Errui Ding, Yan Wang*, Jingdong Wang, Si Liu*

Large Language Model Detection

ECCV 2024

Controllable Navigation Instruction Generation with Chain of Thought Prompting

Xianghao Kong#, Jinyu Chen#, Wenguan Wang*, Hang Su, Xiaolin Hu, Yi Yang, Si Liu*

Image Generation Navigation Prompting

ECCV 2024

FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis

Linjiang Huang, Rongyao Fang, Aiping Zhang, Guanglu Song, Si Liu, Yu Liu, Hongsheng Li

Image Generation

ECCV 2024

GPD-VVTO: Preserving Garment Details in Video Virtual Try-On

Yuanbin Wang, Weilun Dai, Long Chan, Huanyu Zhou, Aixi Zhang, Si Liu

Video Understanding

ACM MM 2024

Collaborative Training of Tiny-Large Vision Language Models

Shichen Lu, Longteng Guo, Wenxuan Wang, Zijia Zhao, Tongtian Yue, Si Liu, Jing Liu

Multimodal Learning Large Language Model

ACM MM 2024

Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding

Hongyu Li, Tianrui Hui, Zihan Ding, Jing Zhang, Bin Ma, Xiaoming Wei, Jizhong Han, Si Liu

Multimodal Learning Segmentation Grounding

ACM MM 2024

Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT

Le Zhuo, Ruoyi Du, Han Xiao, Yangguang Li, Dongyang Liu, Rongjie Huang, Wenze Liu, Lirui Zhao, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Xiangyang Zhu, Si Liu, Xiangyu Yue, Dingning Liu, Wanli Ouyang, Ziwei Liu, Yu Qiao, Hongsheng Li, Peng

Multimodal Learning Image Generation Diffusion

NeurIPS 2024

Image Understanding Makes for A Good Tokenizer for Image Generation

Luting Wang, Yang Zhao, Zijian Zhang, Jiashi Feng, Si Liu, Bingyi Kang

Image Generation

NeurIPS 2024

CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics

Ziqin Wang*, Jiawei Gao*, Zeqi Xiao, Jingbo Wang, Tai Wang, Jinkun Cao, Xiaolin Hu, Si Liu, Jifeng Dai, Jiangmiao Pang

Video Understanding HOI Human-Centric Vision

NeurIPS 2024 (Spotlight)

2023

Language-Aware Spatial-Temporal Collaboration for Referring Video Segmentation

Tianrui Hui, Si Liu*, Zihan Ding, Shaofei Huang, Guanbin Li, Wenguan Wang, Luoqi Liu, Jizhong Han

Video Understanding Segmentation

TPAMI 2023

Room-Object Entity Prompting and Reasoning for Embodied Referring Expression

Chen Gao, Si Liu*, Jinyu Chen, Luting Wang, Qi Wu, Bo Li, Qi Tian

Embodied AI Prompting Scene Understanding

TPAMI 2023

Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe

Hongyang Li*, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu*, Huijie Wang, Jia Zeng, Zhiqi Li, Jiazhi Yang, Hanming Deng, Hao Tian, Enze Xie, Jiangwei Xie, Li Chen, Tianyu Li, Yang Li, Yulu Gao, Xiaosong Jia, Si Liu, Jianping Shi, Dahua Lin, Yu Qiao

Autonomous Driving Benchmark

TPAMI 2023

Teach-DETR: Better Training DETR with Teachers

Linjiang Huang, Kaixin Lu, Guanglu Song, Liang Wang, Si Liu, Yu Liu, Hongsheng Li*

Detection

TPAMI 2023

Region-Adaptive and Context-Complementary Cross Modulation for RGB-T Semantic Segmentation

Fengguang Peng, Zihan Ding, Ziming Chen, Gang Wang*, Tianrui Hui, Si Liu, Hang Shi

Multimodal Learning Segmentation

Pattern Recognition 2023

MI3C: Mining Intra- and Inter-Image Context for Person Search

Zongheng Tang, Yulu Gao, Tianrui Hui*, Fengguang Peng, Si Liu

Re-ID

Pattern Recognition 2023

Linker: Learning Long Short-term Associations for Robust Visual Tracking

Zizheng Xun, Shangzhe Di, Yulu Gao, Zongheng Tang, Gang Wang*, Si Liu, Bo Li

Tracking

TMM 2023

Anchor3DLane: Learning to Regress 3D Anchors for Monocular 3D Lane Detection

Shaofei Huang, Zhenwei Shen, Zehao Huang, Zihan Ding, Jiao Dai, Jizhong Han, Naiyan Wang, Si Liu

Autonomous Driving 3D Vision Detection

CVPR 2023

Bridging Search Region Interaction with Template for RGB-T Tracking

Tianrui Hui, Zizheng Xun, Fengguang Peng, Junshi Huang, Xiaoming Wei, Xiaolin Wei, Jiao Dai, Jizhong Han, Si Liu

Multimodal Learning Video Understanding Tracking

CVPR 2023

DETR with Additional Global Aggregation for Cross-domain Weakly Supervised Object Detection

Zongheng Tang, Yifan Sun, Si Liu, Yi Yang

Detection Diffusion

CVPR 2023

Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection

Luting Wang, Yi Liu, Penghui Du, Zihan Ding, Yue Liao, Qiaosong Qi, Biaolong Chen, Si Liu

Detection

CVPR 2023

Adaptive Zone-aware Hierarchical Planner for Vision-Language Navigation

Chen Gao, Xingyu Peng, Bo Yan, He Wang, Lirong Yang, Haibing Ren, Hongsheng Li, Si Liu

Multimodal Learning Navigation

CVPR 2023

Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels

Jingqiu Zhou, Liang Wang, Si Liu, Hongsheng Li, Linjiang Huang

Video Understanding Localization

CVPR 2023

Analyzing Infrastructure LiDAR Placement with Realistic LiDAR Simulation Library

Xinyu Cai, Wentao Jiang, Runsheng Xu, Wenquan Zhao, Jiaqi Ma, Si Liu, Yikang Li*

Autonomous Driving 3D Vision

ICRA 2023

LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT

Le Zhuo, Ruibin Yuan, Jiahao Pan, Yinghao Ma, Yizhi Li, Ge Zhang, Si Liu, Roger Dannenberg, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenhu Chen, Wei Xue, Yike Guo

Large Language Model Audio

ISMIR 2023

Sparse Dense Fusion for 3D Object Detection

Yulu Gao, Chonghao Sima, Shaoshuai Shi, Shangzhe Di, Si Liu, Hongyang Li

3D Vision Detection

IROS 2023

Discovering Sounding Objects by Audio Queries for Audio Visual Segmentation

Shaofei Huang, Han Li, Yuqing Wang, Hongji Zhu, Jiao Dai, Jizhong Han, Wenge Rong, Si Liu

Audio Segmentation

IJCAI 2023

Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding

Tianrui Hui, Zihan Ding, Junshi Huang, Xiaoming Wei, Xiaolin Wei, Jiao Dai, Jizhong Han, Si Liu

Segmentation Grounding

IJCAI 2023

Video Background Music Generation: Dataset, Method and Evaluation

Le Zhuo, Zhaokai Wang, Baisen Wang, Yue Liao, Stanley Peng, Chenxi Bao, Miao Lu, Xiaobo Li, Si Liu

Video Understanding Audio Image Generation

ICCV 2023

Optimizing the Placement of Roadside LiDARs for Autonomous Driving

Wentao Jiang, Hao Xiang, Xinyu Cai, Runsheng Xu, Jiaqi Ma, Yikang Li, Gim Hee Lee, Si Liu

Autonomous Driving 3D Vision

ICCV 2023

Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation

Jinyu Chen, Wenguan Wang, Si Liu, Hongsheng Li, Yi Yang

Multimodal Learning Audio Navigation

ICCV 2023

Object as Query: Lifting any 2D Object Detector to 3D Detection

Zitian Wang, Zehao Huang, Jiahui Fu, Naiyan Wang, Si Liu

3D Vision Detection

ICCV 2023

DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation

Qiaosong Qi*, Le Zhuo*, Aixi Zhang, Yue Liao, Fei Fang, Si Liu, Shuicheng Yan

Image Generation Diffusion Human-Centric Vision

ACM MM 2023

Transferring CLIP’s Knowledge into Zero-Shot Point Cloud Semantic Segmentation

Yuanbin Wang, Shaofei Huang, Yulu Gao, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Si Liu

3D Vision Segmentation Domain Adaptation

ACM MM 2023

DUSA: Decoupled Unsupervised Sim2Real Adaptation for Vehicle-to-Everything Collaborative Perception

Xianghao Kong, Wentao Jiang, Jinrang Jia, Yifeng Shi, Runsheng Xu, Si Liu

Autonomous Driving Collaborative Perception Domain Adaptation

ACM MM 2023

MARBLE: Music Audio Representation Benchmark for Universal Evaluation

Ruibin Yuan, Yinghao Ma, Yizhi Li, Ge Zhang, Xingran Chen, Hanzhi Yin, Le Zhuo, Yiqi Liu, Jiawen Huang, Zeyue Tian, Binyue Deng, Ningzhi Wang, Wenhu Chen, Gus Xia, Wei Xue, Si Liu, Shi Wang, Ruibo Liu, Yike Guo, Jie Fu

Audio Dataset Benchmark

NeurIPS 2023

2022

3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection

Junyu Luo#, Jiahui Fu#, Xianghao Kong, Chen Gao*, Haibing Ren, Hao Shen, Huaxia Xia, Si Liu

3D Vision Grounding

CVPR 2022 (Oral)

GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection

Yue Liao, Aixi Zhang, Miao Lu, Yongliang Wang, Xiaobo Li, Si Liu*

Video Understanding Detection HOI

CVPR 2022

PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding

Zihan Ding#, Zi-han Ding#, Tianrui Hui*, Junshi Huang, Xiaoming Wei, Xiaolin Wei, Si Liu

Segmentation Grounding

ACM MM 2022

Target-Driven Structured Transformer Planner for Vision-Language Navigation

Yusheng Zhao#, Jinyu Chen#, Chen Gao, Wenguan Wang*, Lirong Yang, Haibing Ren, Huaxia Xia, Si Liu

Multimodal Learning Navigation

ACM MM 2022 (Oral)

Cross-Modality Domain Adaptation for Freespace Detection: A Simple yet Effective Baseline

Yuanbin Wang, Leyan Zhu, Shaofei Huang*, Tianrui Hui, Xiaojie Li, Fei Wang, Si Liu

Multimodal Learning Detection Domain Adaptation

ACM MM 2022

PoseTrans: A Simple Yet Effective Pose Transformation Augmentation for Human Pose Estimation

Wentao Jiang, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Si Liu*

Pose Estimation Data Augmentation Human-Centric Vision

ECCV 2022

HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors

Luting Wang, Xiaojie Li, Yue Liao*, Zeren Jiang, Jianlong Wu, Fei Wang, Chen Qian, Si Liu

Detection

ECCV 2022

Fine-grained Face Editing via Personalized Spatial-aware Affine Modulation

Si Liu, Renda Bao, Defa Zhu, Shaofei Huang, Qiong Yan, Liang Lin, Chao Dong

Image Generation Face Analysis Diffusion

TMM 2022

Progressive Language-customized Visual Feature Learning for One-stage Visual Grounding

Yue Liao, Aixi Zhang, Zhiyuan Chen, Tianrui Hui, Si Liu*

Grounding

TIP 2022

Simultaneously Training and Compressing Vision-and-Language Pre-training Model

Qiaosong Qi, Aixi Zhang, Yue Liao*, Wenyu Sun, Yongliang Wang, Xiaobo Li, Si Liu

Multimodal Learning

TMM 2022

Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation

Zihan Ding, Tianrui Hui, Junshi Huang, Xiaoming Wei, Jizhong Han, Si Liu*

Video Understanding Segmentation

CVPR 2022

Distribution-Aware Single-Stage Models for Multi-Person 3D Pose Estimation

Zitian Wang, Xuecheng Nie, Xiaochao Qu, Yunpeng Chen, Si Liu*

3D Vision Pose Estimation

CVPR 2022

Reinforced Structured State-Evolution for Vision-Language Navigation

Jinyu Chen, Chen Gao, Meng Erli, Qiong Zhang, Si Liu*

Multimodal Learning Navigation

CVPR 2022

2021

Video Background Music Generation with Controllable Music Transformer

Shangzhe Di#, Zeren Jiang#, Si Liu*, Zhaokai Wang, Leyan Zhu, Zexin He, Hongming Liu, Shuicheng Yan

Video Understanding Audio Image Generation

ACM MM 2021 (Best Paper)

Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism

Wentao Jiang, Ning Xu, Jiayun Wang, Chen Gao, Jing Shi, Zhe Lin, Si Liu*

Multimodal Learning Image Generation Diffusion

ICCV 2021

Mining the Benefits of Two-stage and One-stage HOI Detection

Aixi Zhang#, Yue Liao#, Si Liu*, Miao Lu, Yongliang Wang, Chen Gao, Xiaobo Li

Detection HOI

NeurIPS 2021

Confidence-aware Non-repetitive Multimodal Transformers for TextCaps

Zhaokai Wang, Renda Bao, Qi Wu, Si Liu*

Multimodal Learning

AAAI 2021

Reformulating HOI Detection as Adaptive Set Prediction

Mingfei Chen#, Yue Liao#, Si Liu*, Zhiyuan Chen, Fei Wang, Chen Qian

Detection HOI

CVPR 2021

Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression

Chen Gao#, Jinyu Chen#, Si Liu*, Luting Wang, Qiong Zhang, Qi Wu

Embodied AI Scene Understanding

CVPR 2021 (Oral)

Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation

Tianrui Hui#, Shaofei Huang#, Si Liu*, Zihan Ding, Guanbin Li, Wenguan Wang, Jizhong Han, Fei Wang

Video Understanding Segmentation

CVPR 2021

General Instance Distillation for Object Detection

Xing Dai#, Zeren Jiang#, Zhao Wu, Yiping Bao, Zhicheng Wang, Si Liu, Erjin Zhou

Detection

CVPR 2021

Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing

Tianfei Zhou, Wenguan Wang*, Si Liu, Yi Yang, Luc Van Gool

Segmentation Human-Centric Vision

CVPR 2021 (Oral)

TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding

Dailan He#, Yusheng Zhao#, Junyu Luo#, Tianrui Hui, Shaofei Huang, Aixi Zhang, Si Liu*

3D Vision Grounding

ACM MM 2021

Attentive Excitation and Aggregation for Bilingual Referring Image Segmentation

Qianli Zhou#, Tianrui Hui#, Rong Wang*, Haimiao Hu, Si Liu* (#Equal contribution)

Segmentation

TIST 2021

Discriminative Triad Matching and Reconstruction for Weakly Referring Expression Grounding

Mingjie Sun, Jimin Xiao, Enggee Lim, Si Liu, John Yannis Goulermas

Grounding

TPAMI 2021

Human-centric Relation Segmentation: Dataset and Solution

Si Liu, Zitian Wang, Yulu Gao, Lejian Ren, Yue Liao, Guanghui Ren, Bo Li, Shuicheng Yan

Segmentation Dataset Human-Centric Vision

TPAMI 2021

Cross-Modal Progressive Comprehension for Referring Segmentation

Si Liu, Tianrui Hui, Shaofei Huang, Yunchao Wei, Bo Li, Guanbin Li*

Multimodal Learning Segmentation

TPAMI 2021

PSGAN++: Robust Detail-Preserving Makeup Transfer and Removal

Si Liu, Wentao Jiang, Chen Gao, Ran He*, Jiashi Feng, Bo Li, Shuicheng Yan

Image Generation Face Analysis Domain Adaptation

TPAMI 2021

Human-centric Spatio-Temporal Video Grounding With Visual Transformers

Zongheng Tang, Yue Liao, Si Liu*, Guanbin Li, Xiaojie Jin, Hongxu Jiang, Qian Yu, Dong Xu

Video Understanding Grounding Human-Centric Vision

TCSVT 2021

2020

Scene Graph Generation with Hierarchical Context

Guanghui Ren, Lejian Ren, Yue Liao, Si Liu*, Bo Li, Jizhong Han, Shuicheng Yan

Image Generation Graph Learning Scene Understanding

TNNLS 2020

ORDNet: Capturing Omni-Range Dependencies for Scene Parsing

Shaofei Huang, Si Liu*, Tianrui Hui, Jizhong Han, Bo Li, Jiashi Feng, Shuicheng Yan

Segmentation Scene Understanding

TIP 2020

PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer

Wentao Jiang, Si Liu*, Chen Gao, Jie Cao, Ran He, Jiashi Feng, Shuicheng Yan

Pose Estimation Image Generation Face Analysis

CVPR 2020 (Oral)

AdversarialNAS: Adversarial Neural Architecture Search for GANs

Chen Gao, Yunpeng Chen, Si Liu*, Zhenxiong Tan, Shuicheng Yan

Image Generation

CVPR 2020

A Real-Time Cross-modality Correlation Filtering Method for Referring Expression Comprehension

Yue Liao, Si Liu*, Guanbin Li, Fei Wang, Yanjie Chen, Chen Qian, Bo Li

Multimodal Learning

CVPR 2020

PPDM: Parallel Point Detection and Matching for Real-time Human-Object Interaction Detection

Yue Liao, Si Liu*, Fei Wang, Yanjie Chen, Chen Qian, Jiashi Feng

Video Understanding Detection Human-Centric Vision

CVPR 2020

Referring Image Segmentation via Cross-Modal Progressive Comprehension

Shaofei Huang#, Tianrui Hui#, Si Liu*, Guanbin Li, Yunchao Wei, Jizhong Han, Luoqi Liu, Bo Li

Multimodal Learning Segmentation

CVPR 2020

Linguistic Structure Guided Context Modeling for Referring Image Segmentation

Tianrui Hui, Si Liu*, Shaofei Huang, Guanbin Li, Sansi Yu, Faxi Zhang, Jizhong Han

Segmentation

ECCV 2020

Rule-Guided Compositional Representation Learning on Knowledge Graphs

Guanglin Niu, Yongfei Zhang, Bo Li, Peng Cui, Si Liu, Jingyang Li, Xiaowei Zhang

Graph Learning

AAAI 2020

Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video

Jie Wu, Guanbin Li, Si Liu, Liang Lin

Video Understanding Grounding

AAAI 2020

InteractGAN: Learning to Generate Human-Object Interaction

Chen Gao, Si Liu*, Defa Zhu, Quan Liu, Jie Cao, Haoqian He, Ran He, Shuicheng Yan

Video Understanding Image Generation Human-Centric Vision

ACM MM 2020 (Oral)

Cross-Modal Omni Interaction Modeling for Phrase Grounding

Tianyu Yu, Tianrui Hui, Zhihao Yu, Yue Liao, Sansi Yu, Faxi Zhang, Si Liu*

Multimodal Learning Video Understanding Grounding

ACM MM 2020

2019

Magic-wall: Visualizing Room Decoration by Enhanced Wall Segmentation

Ting Liu, Yunchao Wei, Yao Zhao, Si Liu, Shikui Wei

Segmentation Scene Understanding

TIP 2019

Accurate Facial Image Parsing at Real-Time Speed

Zhen Wei, Si Liu, Yao Sun, Hefei Ling

Segmentation

TIP 2019

RotateView: A Video Composition System for Interactive Product Display

Shan An, Si Liu*, Zhibiao Huang, Guangfu Che, Qian Bao, Zhaoqi Zhu, Yu Chen, Dennis Z. Weng

Video Understanding

TMM 2019

Fine-grained Human-centric Tracklet Segmentation with Single Frame Supervision

Si Liu, Guanghui Ren, Yao Sun, Jinqiao Wang, Changhu Wang, Bo Li, Shuicheng Yan

Segmentation Human-Centric Vision

TPAMI 2019

Building Detail-Sensitive Semantic Segmentation Networks with Polynomial Pooling

Zhen Wei, Jingyi Zhang, Fumin Shen, Li Liu, Fan Zhu, Yi Zhou, Si Liu, Yao Sun, Ling Shao

Segmentation

CVPR 2019

GPS: Group People Segmentation with Detailed Part Inference

Yue Liao, Si Liu*, Tianrui Hui, Chen Gao, Yao Sun, Hefei Ling, Bo Li

Segmentation

ICME 2019 (Oral)

Enhanced Memory Network for Video Segmentation [1st Place in Youtube-VOS 2019]

Zhishan Zhou, Lejian Ren, Pengfei Xiong, Yifei Ji, Peisen Wang, Haoqiang Fan, Si Liu

Video Understanding Segmentation

ICCV Workshop 2019

RGB-Infrared Cross-Modality Person Re-Identification via Joint Pixel and Feature Alignment

Guan’an Wang, Tianzhu Zhang, Jian Cheng, Si Liu, Yang Yang, Zengguang Hou

Multimodal Learning Re-ID

ICCV 2019

Finding Images by Dialoguing with Image

Lejian Ren, Si Liu, Han Huang, Jizhong Han, Shuicheng Yan, Bo Li

Interactive Vision

ACM MM 2019

2018

Cross-domain Human Parsing via Adversarial Feature and Label Adaptation

Si Liu, Yao Sun, Defa Zhu, Guanghui Ren, Yu Chen, Jiashi Feng, Jizhong Han

Segmentation Human-Centric Vision

AAAI 2018

Learning Adaptive Receptive Fields for Deep Image Parsing Network

Zhen Wei, Yao Sun, Junyu Lin, Si Liu

Segmentation

Computational Visual Media 2018

Ensemble Soft-Margin Softmax Loss for Image Classification

Xiaobo Wang, Shifeng Zhang, Zhen Lei, Si Liu, Xiaojie Guo, Stan Z. Li

Recognition

IJCAI 2018

BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network

Tingting Li, Ruihe Qian, Chao Dong, Si Liu, Qiong Yan, Wenwu Zhu, Liang Lin

Image Generation Face Analysis Domain Adaptation

ACM MM 2018 (Oral)

Composing Semantic Collage for Image Retargeting

Si Liu, Zhen Wei, Yao Sun, Xinyu Ou, Junyu Lin, Bin Liu, Ming-Hsuan Yang

Image Generation

TIP 2018

Correlation Particle Filter for Visual Tracking

Tianzhu Zhang, Si Liu, Changsheng Xu, Bin Liu, Ming-Hsuan Yang

Tracking

TIP 2018

Robust Target Tracking by Online Random Forests and Superpixels

Wei Wang, Chunping Wang, Si Liu, Xiaochun Cao

Tracking

TCSVT 2018

Improved Search in Hamming Space using Deep Multi-Index Hashing

Hanjiang Lai, Yan Pan, Si Liu, Zhenbin Weng, Jian Yin

Retrieval

TCSVT 2018

2017

A weakly supervised method for makeup-invariant face verification

Yao Sun, Lejian Ren, Zhen Wei, Bin Liu, Yanlong Zhai, Si Liu

Face Analysis

Pattern Recognition 2017

Adult Images and Videos Recognition by Deep Multi-Context Network and Fine-to-Coarse Strategy

Xinyu Ou, Hefei Ling, Han Yu, Si Liu

Video Understanding Recognition

TIST 2017

Objectness Region Enhancement Networks for Scene Parsing

Xinyu Ou, Ping Li, Hefei Ling, Si Liu, Tianjiang Wang, Dan Li

Segmentation Image Generation Scene Understanding

JCST 2017

Time Traveler: a real-time face aging system

Lejian Ren, Si Liu, Yao Sun, Jian Dong, Luoqi Liu, Shuicheng Yan

Face Analysis

ACM MM 2017

Learning Adaptive Receptive Fields for Deep Image Parsing Network

Zhen Wei, Yao Sun, Jinqiao Wang, Hanjiang Lai, Si Liu

Segmentation

CVPR 2017

Surveillance Video Parsing with Single Frame Supervision

Si Liu, Changhu Wang, Ruihe Qian, Han Yu, Renda Bao, Yao Sun

Video Understanding Segmentation

CVPR 2017

Face Aging with Contextual Generative Adversarial Nets

Si Liu, Yao Sun, Wei Wang, Renda Bao, Defa Zhu, Xiangbo Shu, Shuicheng Yan

Face Analysis

ACM MM 2017

Fast Deep Matting for Portrait Animation on Mobile Phone

Bingke Zhu, Yingying Chen, Si Liu, Bo Zhang, Jinqiao Wang, Ming Tang

Segmentation

ACM MM 2017

Magic-wall: Visualizing Room Decoration

Ting Liu, Yunchao Wei, Yao Zhao, Si Liu, Shikui Wei

Scene Understanding

ACM MM 2017

RSVP: A Real-Time Surveillance Video Parsing System with Single Frame Supervision

Han Yu, Guanghui Ren, Ruihe Qian, Yao Sun, Changhu Wang, Hanqing Lu, Si Liu

Video Understanding Segmentation

ACM MM 2017

2016

SketchNet: Sketch Classification with Web Images

Hua Zhang, Si Liu, Changqing Zhang, Wenqi Ren, Xiaochun Cao

Recognition Sketch Understanding

CVPR 2016

Structural Correlation Filter for Robust Visual Tracking

Si Liu, Tianzhu Zhang, Changsheng Xu, Xiaochun Cao

Tracking

CVPR 2016

Single Image Dehazing via Multi-Scale Convolutional Neural Networks

Wenqi Ren, Si Liu, Hua Zhang, Jinshan Pan, Xiaochun Cao, Ming-Hsuan Yang

Image Enhancement

ECCV 2016

Deep Multi-Context Network for Fine-Grained Visual Recognition

Xinyu Ou, Zhen Wei, Si Liu, Xiaochun Cao, Hefei Ling

Recognition

ICME 2016

Makeup like a superstar: Deep Localized Makeup Transfer Network

Si Liu, Xinyu Ou, Ruihe Qian, Wei Wang, Xiaochun Cao

Face Analysis Domain Adaptation

IJCAI 2016

Visual Attributes for Fashion Analytics

Si Liu, Lisa M. Brown, Qiang Chen, Junshi Huang, Luoqi Liu, Shuicheng Yan

Fashion Vision

Book Chapter 2016

Robust Visual Tracking via Exclusive Context Modeling

Tianzhu Zhang, Bernard Ghanem, Si Liu, Changsheng Xu, Narendra Ahuja

Tracking

IEEE Transactions on Cybernetics 2016

2015

Matching-CNN Meets KNN: Quasi-Parametric Human Parsing

Si Liu, Xiaodan Liang, Luoqi Liu, Xiaohui Shen, Jianchao Yang, Changsheng Xu, Xiaochun Cao, Shuicheng Yan

Segmentation Human-Centric Vision

CVPR 2015

Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection

Xiaodan Liang, Si Liu, Yunchao Wei, Luoqi Liu, Liang Lin, Shuicheng Yan

Detection

ICCV 2015

Deep People Counting in Extremely Dense Crowds

Chuan Wang, Hua Zhang, Yang Liang, Si Liu, Xiaochun Cao

Counting

ACM MM 2015

Diversity-induced Multiview Subspace Clustering

Xiaochun Cao, Changqing Zhang, Huazhu Fu, Si Liu

Clustering

CVPR 2015

Structural Sparse Tracking

Tianzhu Zhang, Si Liu, Changsheng Xu, Shuicheng Yan, Narendra Ahuja, Bernard Ghanem, Ming-Hsuan Yang

Tracking

CVPR 2015

Low-Rank Tensor Constrained Multiview Subspace Clustering

Changqing Zhang, Huazhu Fu, Si Liu, Guangcan Liu, Xiaochun Cao

Clustering

ICCV 2015

Human Parsing With Contextualized Convolutional Neural Network

Xiaodan Liang, Chunyan Xu, Xiaohui Shen, Jianchao Yang, Si Liu, Jinhui Tang, Liang Lin, Shuicheng Yan

Segmentation Human-Centric Vision

ICCV 2015

Deep Human Parsing with Active Template Regression

Xiaodan Liang, Si Liu, Xiaohui Shen, Jianchao Yang, Luoqi Liu, Liang Lin, Shuicheng Yan

Segmentation Human-Centric Vision

TPAMI 2015

Fashion Parsing With Video Context

Si Liu, Xiaodan Liang, Luoqi Liu, Liang Lin, Ke Lv, Xiaochun Cao, Shuicheng Yan

Video Understanding Segmentation Fashion Vision

TMM 2015

SLED: Semantic Label Embedding Dictionary Representation for Multi-label Image Annotation

Xiaochun Cao, Hua Zhang, Xiaojie Guo, Si Liu, Dan Meng

Image Annotation

TIP 2015

2014

Fashion Parsing with Video Context

Si Liu, Xiaodan Liang, Luoqi Liu, Liang Lin, Ke Lv, Shuicheng Yan

Video Understanding Segmentation Fashion Vision

ACM MM 2014

Puzzle Search: Image Retrieval and Ranking with Consistent Reconstruction of Multi-Attribute Queries

Xiaochun Cao, Hua Zhang, Xiaojie Guo, Si Liu, Xiaowu Chen

Retrieval

ECCV 2014

Clothing Attributes Assisted Person Re-identification

Annan Li, Luoqi Liu, Kang Wang, Si Liu, Shuicheng Yan

Re-ID

TCSVT 2014

Fashion Parsing with Weak Color-Category Labels

Si Liu, Jiashi Feng, Csaba Domokos, Junshi Huang, Zhenzhen Hu, Shuicheng Yan

Segmentation Fashion Vision

TMM 2014

Fashion Analysis: Current Techniques and Future Directions

Si Liu, Luoqi Liu, Shuicheng Yan

Fashion Vision

IEEE MultiMedia 2014

Snap & Play: Auto-Generate Personalized Find-the-Difference Game

Si Liu, Qiang Chen, Shuicheng Yan, Changsheng Xu, Hanqing Lu

Interactive Vision

TIST 2014

Wow! You Are So Beautiful Today!

Luoqi Liu, Junliang Xing, Si Liu, Hui Xu, Xi Zhou, Shuicheng Yan

Face Analysis

TOMCCAP 2014

PicWords: Render a Picture by Packing Keywords

Zhenzhen Hu, Si Liu, Jianguo Jiang, Richang Hong, Meng Wang, Shuicheng Yan

Image Generation

TMM 2014

Circle & Search: Attribute-aware Shoe Retrieval

Junshi Huang, Si Liu, Junliang Xing, Tao Mei, Shuicheng Yan

Retrieval

TOMCCAP 2014

Robust Visual Tracking via Consistent Low-Rank Sparse Learning

Tianzhu Zhang, Si Liu, Narendra Ahuja, Ming-Hsuan Yang, Bernard Ghanem

Tracking

IJCV 2014

2013

Towards Decrypting Attractiveness via Multi-Modality Cues

Tam V. Nguyen, Si Liu, Bingbing Ni, Jun Tan, Yong Rui, Shuicheng Yan

Face Analysis

TOMCCAP 2013

Wow! you are so beautiful today!

Luoqi Liu, Hui Xu, Junliang Xing, Si Liu, Xi Zhou, Shuicheng Yan

Face Analysis

ACM MM 2013 Best Paper

eHeritage of shadow puppetry: creation and manipulation

Min Lin, Zhenzhen Hu, Si Liu, Meng Wang, Richang Hong, Shuicheng Yan

Robotics

ACM MM 2013

Wow! you are so beautiful today!

Luoqi Liu, Hui Xu, Si Liu, Junliang Xing, Xi Zhou, Shuicheng Yan

Face Analysis

ACM MM demo 2013

Low-Rank Sparse Coding for Image Classification

Tianzhu Zhang, Bernard Ghanem, Si Liu, Changsheng Xu, Narendra Ahuja

Recognition

ICCV 2013

SYM-FISH: A Symmetry-aware Flip Invariant Sketch Histogram Shape Descriptor

Xiaochun Cao, Hua Zhang, Si Liu, Xiaojie Guo

Sketch Understanding

ICCV 2013

2012

Hi, magic closet, tell me what to wear!

Si Liu, Tam V. Nguyen, Jiashi Feng, Meng Wang, Shuicheng Yan

Fashion Vision

ACM MM 2012 Best Demo

Hi, magic closet, tell me what to wear!

Si Liu, Jiashi Feng, Zheng Song, Tianzhu Zhang, Hanqing Lu, Changsheng Xu, Shuicheng Yan

Fashion Vision

ACM MM 2012

Street-to-Shop: Cross-Scenario Clothing Retrieval via Human Part Alignment and Auxiliary Set

Si Liu, Zheng Song, Guangcan Liu, Shuicheng Yan, Changsheng Xu, Hanqing Lu

Retrieval Human-Centric Vision

CVPR 2012 Oral

Street-to-shop: cross-scenario clothing retrieval via parts alignment and auxiliary set

Si Liu, Zheng Song, Meng Wang, Changsheng Xu, Hanqing Lu, Shuicheng Yan

Retrieval

ACM MM demo 2012

Robust Visual Tracking via Structured Multi-Task Sparse Learning

Tianzhu Zhang, Bernard Ghanem, Si Liu, Narendra Ahuja

Tracking

IJCV 2012

A Generic Framework for Video Annotation via Semi-supervised Learning

Tianzhu Zhang, Changsheng Xu, Guangyu Zhu, Si Liu, Hanqing Lu

Video Understanding Image Annotation

TMM 2012

Weakly-Supervised Graph Propagation Towards Collective Image Parsing

Si Liu, Shuicheng Yan, Tianzhu Zhang, Changsheng Xu, Jing Liu, Hanqing Lu

Segmentation Graph Learning

TMM 2012

Sense beauty via face, dressing, and/or voice

Tam V. Nguyen, Si Liu, Bingbing Ni, Jun Tan, Yong Rui, Shuicheng Yan

Face Analysis

ACM MM 2012

Low-Rank Sparse Learning for Robust Visual Tracking

Tianzhu Zhang, Bernard Ghanem, Si Liu, Narendra Ahuja

Tracking

ECCV 2012

Robust Visual Tracking via Multi-Task Sparse Learning

Tianzhu Zhang, Bernard Ghanem, Si Liu, Narendra Ahuja

Tracking

CVPR 2012 Oral

2011

Boosted Exemplar Learning for Action Recognition and Annotation

Tianzhu Zhang, Jing Liu, Si Liu, Changsheng Xu, Hanqing Lu

Video Understanding Recognition Image Annotation

TCSVT 2011

Size Adaptive Selection of Most Informative Features

Si Liu, Hairong Liu, Shuicheng Yan, Longin Latecki, Changsheng Xu, Hanqing Lu

Feature Selection

AAAI 2011 Oral

Snap & Play: Auto-generate Personalized Find-the-Difference Mobile Game

Si Liu, Qiang Chen, Shuicheng Yan, Changsheng Xu, Hanqing Lu

Interactive Vision

ACM MM 2011

2010

A Generic Framework for Event Detection in Various Video Domains

Tianzhu Zhang, Changsheng Xu, Guangyu Zhu, Si Liu, Hanqing Lu

Video Understanding Detection

ACM MM 2010