发表论文
All Publications
OctoNav: Towards Generalist Embodied Navigation
Chen Gao, Liankai Jin, Xingyu Peng, Jiazhao Zhang, Yue Deng, Annan Li, He Wang, Si Liu
Embodied AI
Navigation
CVPR 2026
Parse, Search, and Confirmation: Training-Free Aerial Vision-and-Dialog Navigation with Chain-of-Thought Reasoning and Structured Spatial Memory
Yu Qi, Hongyu Li, Shaofei Huang, Tianrui Hui, Yaxiong Wang, Lechao Cheng, Zhun Zhong, Si Liu, Meng Wang
UAV
Embodied AI
Vision-Language
CVPR 2026
LookasideVLN: Direction-Aware Aerial Vision-and-Language Navigation
Yuwei Ning, Ganlong Zhao, Yipeng Qin, Si Liu, Yang Liu, Liang Lin, Guanbin Li
UAV
Embodied AI
Vision-Language
CVPR 2026
VGGT-Segmentor: Geometry-Enhanced Cross-View Segmentation
Yulu Gao, Bohao Zhang, Zongheng Tang, Jitong Liao, Wenjun Wu, Si Liu
3D Vision
Segmentation
Cross-View
CVPR 2026
ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models
Linqing Zhong, Yi Liu, Yifei Wei, Ziyu Xiong, Maoqing Yao, Si Liu, Guanghui Ren
Multimodal Learning
Embodied AI
Robotics
CVPR 2026
Geometry-Guided 3D Visual Token Pruning for Video-Language Models
Han Li, Zehao Huang, Jiahui Fu, Naiyan Wang, Si Liu
Multimodal Learning
Video Understanding
Model Compression
CVPR 2026
RoboCerebra: A Large-scale Benchmark for Long-horizon Robotic Manipulation Evaluation
Songhao Han#, Boxiang Qiu#, Yue Liao#, Siyuan Huang, Chen Gao, Shuicheng Yan*, Si Liu*
Embodied AI
Robotics
Dataset
NeurIPS 2025
GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance
Jingqiu Zhou#, Lue Fan#, Xuesong Chen, Linjiang Huang*, Si Liu, Hongsheng Li
3D Vision
Neural Rendering
AAAI 2025
Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Shaofei Huang#, Rui Ling#, Hongyu Li#, Tianrui Hui, Zongheng Tang, Xiaoming Wei, Jizhong Han, Si Liu*
Video Understanding
Multimodal Learning
AAAI 2025
Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology
Xiangyu Wang#, Donglin Yang#, Zigin Wang#, Hohin Kwan, Jinyu Chen, Wenjun Wu, Hongsheng Li, Yue Liao*, Si Liu*
UAV
Embodied AI
Vision-Language
ICLR 2025
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation
Fangxun Shu#, Yue Liao#, Le Zhuo, Chenning Xu, Guanghao Zhang, Haonan Shi, LongChen, Tao Zhong, Wanggui He, Siming Fu, Haoyuan Li, Bolin Li, Zhelun Yu, Si Liu*, Hongsheng Li*, Hao Jiang*
Multimodal Learning
Model Compression
ICLR 2025
Point Cluster: A Compact Message Unit for Communication-Efficient Collaborative Perception
Zihan Ding, Jiahui Fu, Si Liu*, Hongyu Li, Siheng Chen, Hongsheng Li, Shifeng Zhang, Xu Zhou
Autonomous Driving
Collaborative Perception
ICLR 2025
MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More
Wei Huang#, Yue Liao#, Jianhui Liu, Ruifei He, Haoru Tan, Shiming Zhang*, Hongsheng Li, Si Liu*, Xiaojuan Qi*
Large Language Model
Model Compression
ICLR 2025
Generative Map Priors for Collaborative BEV Semantic Segmentation
Jiahui Fu, Yue Gong, Luting Wang, Shifeng Zhang, Xu Zhou, Si Liu*
Autonomous Driving
3D Vision
CVPR 2025
FlexDrive: Toward Trajectory Flexibility in Driving Scene Reconstruction and Rendering
Jingqiu Zhou, Lue Fan, Linjiang Huang*, Zhaoxiang Zhang, Xiaoyu Shi, Si Liu, Hongsheng Li*
Autonomous Driving
Neural Rendering
CVPR 2025
VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
Songhao Han, Wei Huang, Hairong Shi, Le Zhuo, Xiu Su, Shifeng Zhang, Xu Zhou, Xiaojuan Qi, Yue Liao*, Si Liu*
Video Understanding
Dataset
CVPR 2025
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
Hongyu Li#, Jinyu Chen#, Ziyu Wei#, Shaofei Huang, Tianrui Hui, Jialin Gao*, Xiaoming Wei, Si Liu*
Multimodal Learning
Video Understanding
CVPR 2025
Revisiting Audio-Visual Segmentation with Vision-Centric Transformer
Shaofei Huang#, Rui Ling, Tianrui Hui*, Hongyu Li, Xu Zhou, Shifeng Zhang, Si Liu*, Richang Hong, Meng Wang
Multimodal Learning
Segmentation
CVPR 2025
Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMS
Zitian Wang, Yue Liao*, Kang Rong, Fengyun Rao, Yibo Yang*, Si Liu
Multimodal Learning
Alignment
ICCV 2025
Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization
Hao Ju#, Shaofei Huang#, Si Liu, Zhedong Zheng*
UAV
3D Vision
ICCV 2025
CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective
Zongheng Tang, Yi Liu, Yifan Sun, Yulu Gao, Jinyu Chen, Runsheng Xu, Si Liu*
Autonomous Driving
Collaborative Perception
ICCV 2025
CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation
Yi Liu, Shengqian Li, Zuzeng Lin, Feng Wang, Si Liu*
Image Generation
Generative Model
ICCV 2025
RATopo: Improving Lane Topology Reasoning via Redundancy Assignment
Han Li, Shaofei Huang, Longfei Xu, Yulu Gao, Beipeng Mu, Si Liu*
Autonomous Driving
Scene Understanding
ACM MM 2025
DOMR: Establishing Cross-View Segmentation via Dense Object Matching
Jitong Liao#, Yulu Gao#, Shaofei Huang, Jialin Gao, Jie Lei, Ronghua Liang, Si Liu*
3D Vision
Segmentation
ACM MM 2025
AeroDuo: Aerial Duo for UAV-based Vision and Language Navigation
Ruipu Wu#, Yige Zhang#, Jinyu Chen, Linjiang Huang*, Shifeng Zhang, Xu Zhou, Liang Wang, Si Liu*
UAV
Embodied AI
Vision-Language
ACM MM 2025
"Hi AirStar, Guide Me to the Badminton Court."
Zigin Wang#, Jinyu Chen#, Xiangyi Zheng, Qinan Liao, Linjiang Huang*, Si Liu*
UAV
Embodied AI
Vision-Language
ACM MM demo 2025
UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning
Xiangyu Wang#, Donglin Yang#, Yue Liao#, Wenhao Zheng, Wenjun Wu, Bin Dai, Hongsheng Li, Si Liu*
UAV
Embodied AI
Dataset
NeurIPS 2025
RoboCerebra: A Large-scale Benchmark for Long-horizon Robotic Manipulation Evaluation
Songhao Han#, Boxiang Qiu#, Yue Liao#, Siyuan Huang, Chen Gao, Shuicheng Yan*, Si Liu*
Embodied AI
Robotics
Dataset
NeurIPS 2025
Towards Realistic Earth-Observation Constellation Scheduling: Benchmark and Methodology
Luting Wang#, Yinghao Xiang#, Hongliang Huang, Dongjun Li, Chen Gao*, Si Liu*
Satellite Systems
Dataset
NeurIPS 2025
FACT: Mitigating Inconsistent Hallucinations in LLMs via Fact-Driven Alternating Code-Text Training
Xinxin You, Qixin Sun, Xien Liu, Chenwei Yan, Xiao Zhang, Chen Ning, Xiangling Fu, Si Liu, Shijin Wang, Guoping Hu, Ji Wu*
Large Language Model
NeurIPS 2025
M2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection
Zitian Wang, Zehao Huang, Yulu Gao, Naiyan Wang, Si Liu*
Autonomous Driving
3D Vision
TPAMI 2025
Multi-Person Pose Regression with Distribution-Aware Single-Stage Models
Leyan Zhu#, Zitian Wang#, Si Liu*, Xuecheng Nie, Luoqi Liu, Bo Li
Pose Estimation
TPAMI 2024
Data Augmentation in Human-Centric Vision
Wentao Jiang, Yige Zhang, Shaozhong Zheng, Si Liu*, Shuicheng Yan
Data Augmentation
Human-Centric Vision
Vicinagearth (Springer Nature) 2024
FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation
Rongyao Fang, Peng Gao, Aojun Zhou, Yingjie Cai, Si Liu, Jifeng Dai, Hongsheng Li*
Detection
Data Augmentation
TPAMI 2024
PPDM++: Parallel Point Detection and Matching for Fast and Accurate HOI Detection
Yue Liao, Si Liu*, Yulu Gao, Aixi Zhang, Zhimin Li, Fei Wang, Bo Li
Detection
HOI
TPAMI 2024
MAC: Masked Contrastive Pre-Training for Efficient Video-Text Retrieval
Fangxun Shu, Biaolong Chen, Yue Liao, Jinqiao Wang, Si Liu
Multimodal Learning
Video Understanding
Retrieval
TMM 2024
RGB-T Tracking with Template-Bridged Search Interaction and Target-Preserved Template Updating
Bo Li, Fengguang Peng, Tianrui Hui, Xiaoming Wei, Xiaolin Wei, Lijun Zhang, Hang Shi, Si Liu*
Multimodal Learning
Video Understanding
Tracking
TPAMI 2024
Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression
Shaofei Huang, Zhenwei Shen, Zehao Huang, Yue Liao, Jizhong Han, Naiyan Wang, Si Liu*
Autonomous Driving
3D Vision
Detection
TPAMI 2024
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation
Bo Zhang#, Xinyu Cai#*, Jiakang Yuan, Donglin Yang, Jianfei Guo, Xiangchao Yan, Renqiu Xia, Botian Shi, Min Dou, Tao Chen, Si Liu, Junchi Yan*, Yu Qiao
Autonomous Driving
3D Vision
Domain Adaptation
ICLR 2024
Octavius: Mitigating Task Interference in MLLMs via MoE
Ziqin Wang#, Zeren Chen#, Zhen Wang#, Huayang Liu, Zhenfei Yin, Si Liu, Lu Sheng*, Wanli Ouyang, Yu Qiao, Jing Shao*
Large Language Model
ICLR 2024
Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection
Jiahui Fu, Chen Gao, Zitian Wang, Lirong Yang, Xiaofei Wang, Beipeng Mu, Si Liu
Autonomous Driving
3D Vision
Multimodal Learning
ICRA 2024
Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
Runze He#, Shaofei Huang#, Xuecheng Nie, Tianrui Hui, Luoqi Liu, Jiao Dai, Jizhong Han, Guanbin Li*, Si Liu*
3D Vision
Image Generation
Scene Understanding
CVPR 2024
EASE-DETR: Easing the Competition among Object Queries
Yulu Gao, Yifan Sun, Xudong Ding, Chuyang Zhao, Si Liu
Detection
CVPR 2024
Reference Prompted Model Adaptation for Referring Camouflaged Object Detection
Xuewei Liu#, Shaofei Huang#, Ruipu Wu, Hengyuan Zhao, Duo Xu, Xiaoming Wei, Jizhong Han*, Si Liu
Detection
Prompting
ICME 2024
SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection
Gang Zhang, Junnan Chen, Guohuan Gao, Jianmin Li, Si Liu, Xiaolin Hu*
3D Vision
Detection
CVPR 2024
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection
Jiaming Li, Jiacheng Zhang, Jichang Li, Ge Li, Si Liu, Liang Lin, Guanbin Li
Detection
Prompting
CVPR 2024
Communication-Efficient Collaborative Perception via Information Filling with Codebook
Yue Hu, Juntong Peng, Sifei Liu, Junhao Ge, Si Liu, Siheng Chen
Autonomous Driving
Collaborative Perception
CVPR 2024
Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation
Hairong Shi#, Songhao Han#, Shaofei Huang*, Yue Liao, Guanbin Li*, Xiangxing Kong, Hua Zhu, Xiaomu Wang, Si Liu
Medical Imaging
Segmentation
MICCAI 2024
Realistic Rainy Weather Simulation for LiDARs in CARLA Simulator
Donglin Yang, Xinyu Cai∗, Zhenfeng Liu, Wentao Jiang, Bo Zhang, Guohang Yan, Xing Gao, Si Liu, Botian Shi
Autonomous Driving
3D Vision
IROS 2024
Asynchronous Large Language Model Enhanced Planner for Autonomous Driving
Yuan Chen#, Zi-han Ding#, Ziqin Wang#, Yan Wang*, Lijun Zhang, Si Liu*
Autonomous Driving
Large Language Model
Navigation
ECCV 2024
LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction
Penghui Du#, Yu Wang#, Yifan Sun, Luting Wang, Yue Liao, Gang Zhang, Errui Ding, Yan Wang*, Jingdong Wang, Si Liu*
Large Language Model
Detection
ECCV 2024
Controllable Navigation Instruction Generation with Chain of Thought Prompting
Xianghao Kong#, Jinyu Chen#, Wenguan Wang*, Hang Su, Xiaolin Hu, Yi Yang, Si Liu*
Image Generation
Navigation
Prompting
ECCV 2024
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis
Linjiang Huang, Rongyao Fang, Aiping Zhang, Guanglu Song, Si Liu, Yu Liu, Hongsheng Li
Image Generation
ECCV 2024
GPD-VVTO: Preserving Garment Details in Video Virtual Try-On
Yuanbin Wang, Weilun Dai, Long Chan, Huanyu Zhou, Aixi Zhang, Si Liu
Video Understanding
ACM MM 2024
Collaborative Training of Tiny-Large Vision Language Models
Shichen Lu, Longteng Guo, Wenxuan Wang, Zijia Zhao, Tongtian Yue, Si Liu, Jing Liu
Multimodal Learning
Large Language Model
ACM MM 2024
Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Hongyu Li, Tianrui Hui, Zihan Ding, Jing Zhang, Bin Ma, Xiaoming Wei, Jizhong Han, Si Liu
Multimodal Learning
Segmentation
Grounding
ACM MM 2024
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT
Le Zhuo, Ruoyi Du, Han Xiao, Yangguang Li, Dongyang Liu, Rongjie Huang, Wenze Liu, Lirui Zhao, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Xiangyang Zhu, Si Liu, Xiangyu Yue, Dingning Liu, Wanli Ouyang, Ziwei Liu, Yu Qiao, Hongsheng Li, Peng
Multimodal Learning
Image Generation
Diffusion
NeurIPS 2024
Image Understanding Makes for A Good Tokenizer for Image Generation
Luting Wang, Yang Zhao, Zijian Zhang, Jiashi Feng, Si Liu, Bingyi Kang
Image Generation
NeurIPS 2024
CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics
Ziqin Wang*, Jiawei Gao*, Zeqi Xiao, Jingbo Wang, Tai Wang, Jinkun Cao, Xiaolin Hu, Si Liu, Jifeng Dai, Jiangmiao Pang
Video Understanding
HOI
Human-Centric Vision
NeurIPS 2024 (Spotlight)
Language-Aware Spatial-Temporal Collaboration for Referring Video Segmentation
Tianrui Hui, Si Liu*, Zihan Ding, Shaofei Huang, Guanbin Li, Wenguan Wang, Luoqi Liu, Jizhong Han
Video Understanding
Segmentation
TPAMI 2023
Room-Object Entity Prompting and Reasoning for Embodied Referring Expression
Chen Gao, Si Liu*, Jinyu Chen, Luting Wang, Qi Wu, Bo Li, Qi Tian
Embodied AI
Prompting
Scene Understanding
TPAMI 2023
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe
Hongyang Li*, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu*, Huijie Wang, Jia Zeng, Zhiqi Li, Jiazhi Yang, Hanming Deng, Hao Tian, Enze Xie, Jiangwei Xie, Li Chen, Tianyu Li, Yang Li, Yulu Gao, Xiaosong Jia, Si Liu, Jianping Shi, Dahua Lin, Yu Qiao
Autonomous Driving
Benchmark
TPAMI 2023
Teach-DETR: Better Training DETR with Teachers
Linjiang Huang, Kaixin Lu, Guanglu Song, Liang Wang, Si Liu, Yu Liu, Hongsheng Li*
Detection
TPAMI 2023
Region-Adaptive and Context-Complementary Cross Modulation for RGB-T Semantic Segmentation
Fengguang Peng, Zihan Ding, Ziming Chen, Gang Wang*, Tianrui Hui, Si Liu, Hang Shi
Multimodal Learning
Segmentation
Pattern Recognition 2023
MI3C: Mining Intra- and Inter-Image Context for Person Search
Zongheng Tang, Yulu Gao, Tianrui Hui*, Fengguang Peng, Si Liu
Re-ID
Pattern Recognition 2023
Linker: Learning Long Short-term Associations for Robust Visual Tracking
Zizheng Xun, Shangzhe Di, Yulu Gao, Zongheng Tang, Gang Wang∗, Si Liu, Bo Li
Tracking
TMM 2023
Anchor3DLane: Learning to Regress 3D Anchors for Monocular 3D Lane Detection
Shaofei Huang, Zhenwei Shen, Zehao Huang, Zihan Ding, Jiao Dai, Jizhong Han, Naiyan Wang, Si Liu
Autonomous Driving
3D Vision
Detection
CVPR 2023
Bridging Search Region Interaction with Template for RGB-T Tracking
Tianrui Hui, Zizheng Xun, Fengguang Peng, Junshi Huang, Xiaoming Wei, Xiaolin Wei, Jiao Dai, Jizhong Han, Si Liu
Multimodal Learning
Video Understanding
Tracking
CVPR 2023
DETR with Additional Global Aggregation for Cross-domain Weakly Supervised Object Detection
Zongheng Tang, Yifan Sun, Si Liu, Yi Yang
Detection
Diffusion
CVPR 2023
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection
Luting Wang, Yi Liu, Penghui Du, Zihan Ding, Yue Liao, Qiaosong Qi, Biaolong Chen, Si Liu
Detection
CVPR 2023
Adaptive Zone-aware Hierarchical Planner for Vision-Language Navigation
Chen Gao, Xingyu Peng, Bo Yan, He Wang, Lirong Yang, Haibing Ren, Hongsheng Li, Si Liu
Multimodal Learning
Navigation
CVPR 2023
Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels
Jingqiu Zhou, Liang Wang, Si Liu, Hongsheng Li, Linjiang Huang
Video Understanding
Localization
CVPR 2023
Analyzing Infrastructure LiDAR Placement with Realistic LiDAR Simulation Library
Xinyu Cai, Wentao Jiang, Runsheng Xu, Wenquan Zhao, Jiaqi Ma, Si Liu, Yikang Li*
Autonomous Driving
3D Vision
ICRA 2023
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
Le Zhuo, Ruibin Yuan, Jiahao Pan, Yinghao Ma, Yizhi Li, Ge Zhang, Si Liu, Roger Dannenberg, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenhu Chen, Wei Xue, Yike Guo
Large Language Model
Audio
ISMIR 2023
Sparse Dense Fusion for 3D Object Detection
Yulu Gao, Chonghao Sima, Shaoshuai Shi, Shangzhe Di, Si Liu, Hongyang Li
3D Vision
Detection
IROS 2023
Discovering Sounding Objects by Audio Queries for Audio Visual Segmentation
Shaofei Huang, Han Li, Yuqing Wang, Hongji Zhu, Jiao Dai, Jizhong Han, Wenge Rong, Si Liu
Audio
Segmentation
IJCAI 2023
Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding
Tianrui Hui, Zihan Ding, Junshi Huang, Xiaoming Wei, Xiaolin Wei, Jiao Dai, Jizhong Han, Si Liu
Segmentation
Grounding
IJCAI 2023
Video Background Music Generation: Dataset, Method and Evaluation
Le Zhuo, Zhaokai Wang, Baisen Wang, Yue Liao, Stanley Peng, Chenxi Bao, Miao Lu, Xiaobo Li, Si Liu
Video Understanding
Audio
Image Generation
ICCV 2023
Optimizing the Placement of Roadside LiDARs for Autonomous Driving
Wentao Jiang, Hao Xiang, Xinyu Cai, Runsheng Xu, Jiaqi Ma, Yikang Li, Gim Hee Lee, Si Liu
Autonomous Driving
3D Vision
ICCV 2023
Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation
Jinyu Chen, Wenguan Wang, Si Liu, Hongsheng Li, Yi Yang
Multimodal Learning
Audio
Navigation
ICCV 2023
Object as Query: Lifting any 2D Object Detector to 3D Detection
Zitian Wang, Zehao Huang, Jiahui Fu, Naiyan Wang, Si Liu
3D Vision
Detection
ICCV 2023
DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation
Qiaosong Qi*, Le Zhuo*, Aixi Zhang, Yue Liao, Fei Fang, Si Liu, Shuicheng Yan
Image Generation
Diffusion
Human-Centric Vision
ACM MM 2023
Transferring CLIP’s Knowledge into Zero-Shot Point Cloud Semantic Segmentation
Yuanbin Wang, Shaofei Huang, Yulu Gao, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Si Liu
3D Vision
Segmentation
Domain Adaptation
ACM MM 2023
DUSA: Decoupled Unsupervised Sim2Real Adaptation for Vehicle-to-Everything Collaborative Perception
Xianghao Kong, Wentao Jiang, Jinrang Jia, Yifeng Shi, Runsheng Xu, Si Liu
Autonomous Driving
Collaborative Perception
Domain Adaptation
ACM MM 2023
MARBLE: Music Audio Representation Benchmark for Universal Evaluation
Ruibin Yuan, Yinghao Ma, Yizhi Li, Ge Zhang, Xingran Chen, Hanzhi Yin, Le Zhuo, Yiqi Liu, Jiawen Huang, Zeyue Tian, Binyue Deng, Ningzhi Wang, Wenhu Chen, Gus Xia, Wei Xue, Si Liu, Shi Wang, Ruibo Liu, Yike Guo, Jie Fu
Audio
Dataset
Benchmark
NeurIPS 2023
3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection
Junyu Luo#, Jiahui Fu#, Xianghao Kong, Chen Gao*, Haibing Ren, Hao Shen, Huaxia Xia, Si Liu
3D Vision
Grounding
CVPR 2022 (Oral)
GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection
Yue Liao, Aixi Zhang, Miao Lu, Yongliang Wang, Xiaobo Li, Si Liu*
Video Understanding
Detection
HOI
CVPR 2022
PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding
Zihan Ding#, Zi-han Ding#, Tianrui Hui*, Junshi Huang, Xiaoming Wei, Xiaolin Wei, Si Liu
Segmentation
Grounding
ACM MM 2022
Target-Driven Structured Transformer Planner for Vision-Language Navigation
Yusheng Zhao#, Jinyu Chen#, Chen Gao, Wenguan Wang*, Lirong Yang, Haibing Ren, Huaxia Xia, Si Liu
Multimodal Learning
Navigation
ACM MM 2022 (Oral)
Cross-Modality Domain Adaptation for Freespace Detection: A Simple yet Effective Baseline
Yuanbin Wang, Leyan Zhu, Shaofei Huang*, Tianrui Hui, Xiaojie Li, Fei Wang, Si Liu
Multimodal Learning
Detection
Domain Adaptation
ACM MM 2022
PoseTrans: A Simple Yet Effective Pose Transformation Augmentation for Human Pose Estimation
Wentao Jiang, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Si Liu*
Pose Estimation
Data Augmentation
Human-Centric Vision
ECCV 2022
HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors
Luting Wang, Xiaojie Li, Yue Liao*, Zeren Jiang, Jianlong Wu, Fei Wang, Chen Qian, Si Liu
Detection
ECCV 2022
Fine-grained Face Editing via Personalized Spatial-aware Affine Modulation
Si Liu, Renda Bao, Defa Zhu, Shaofei Huang, Qiong Yan, Liang Lin, Chao Dong
Image Generation
Face Analysis
Diffusion
TMM 2022
Progressive Language-customized Visual Feature Learning for One-stage Visual Grounding
Yue Liao, Aixi Zhang, Zhiyuan Chen, Tianrui Hui, Si Liu*
Grounding
TIP 2022
Simultaneously Training and Compressing Vision-and-Language Pre-training Model
Qiaosong Qi, Aixi Zhang, Yue Liao*, Wenyu Sun, Yongliang Wang, Xiaobo Li, Si Liu
Multimodal Learning
TMM 2022
Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Zihan Ding, Tianrui Hui, Junshi Huang, Xiaoming Wei, Jizhong Han, Si Liu*
Video Understanding
Segmentation
CVPR 2022
Distribution-Aware Single-Stage Models for Multi-Person 3D Pose Estimation
Zitian Wang, Xuecheng Nie, Xiaochao Qu, Yunpeng Chen, Si Liu*
3D Vision
Pose Estimation
CVPR 2022
Reinforced Structured State-Evolution for Vision-Language Navigation
Jinyu Chen, Chen Gao, Meng Erli, Qiong Zhang, Si Liu*
Multimodal Learning
Navigation
CVPR 2022
Video Background Music Generation with Controllable Music Transformer
Shangzhe Di#, Zeren Jiang#, Si Liu*, Zhaokai Wang, Leyan Zhu, Zexin He, Hongming Liu, Shuicheng Yan
Video Understanding
Audio
Image Generation
ACM MM 2021 Best Paper
Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism
Wentao Jiang, Ning Xu, Jiayun Wang, Chen Gao, Jing Shi, Zhe Lin, Si Liu*
Multimodal Learning
Image Generation
Diffusion
ICCV 2021
Mining the Benefits of Two-stage and One-stage HOI Detection
Aixi Zhang#, Yue Liao#, Si Liu*, Miao Lu, Yongliang Wang, Chen Gao, Xiaobo Li
Detection
HOI
NeurIPS 2021
Confidence-aware Non-repetitive Multimodal Transformers for TextCaps
Zhaokai Wang, Renda Bao, Qi Wu, Si Liu*
Multimodal Learning
AAAI 2021
Reformulating HOI Detection as Adaptive Set Prediction
Mingfei Chen#, Yue Liao#, Si Liu*, Zhiyuan Chen, Fei Wang, Chen Qian
Detection
HOI
CVPR 2021
Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression
Chen Gao#, Jinyu Chen#, Si Liu*, Luting Wang, Qiong Zhang, Qi Wu
Embodied AI
Scene Understanding
CVPR 2021 Oral
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
Tianrui Hui#, Shaofei Huang#, Si Liu*, Zihan Ding, Guanbin Li, Wenguan Wang, Jizhong Han, Fei Wang
Video Understanding
Segmentation
CVPR 2021
General Instance Distillation for Object Detection
Xing Dai#, Zeren Jiang#, Zhao Wu, Yiping Bao, Zhicheng Wang, Si Liu, Erjin Zhou
Detection
CVPR 2021
Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing
Tianfei Zhou, Wenguan Wang*, Si Liu, Yi Yang, Luc Van Gool
Segmentation
Human-Centric Vision
CVPR 2021 Oral
TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding
Dailan He#, Yusheng Zhao#, Junyu Luo#, Tianrui Hui, Shaofei Huang, Anxi Zhang, Si Liu*
3D Vision
Grounding
ACM MM 2021
Attentive Excitation and Aggregation for Bilingual Referring Image Segmentation
Qianli Zhou#, Tianrui Hui#, Rong Wang*, Haimiao Hu, Si Liu* (#Equal contribution)
Segmentation
TIST 2021
Discriminative Triad Matching and Reconstruction for Weakly Referring Expression Grounding
Mingjie Sun, Jimin Xiao, Enggee Lim, Si Liu, John Yannis Goulermas
Grounding
TPAMI 2021
Human-centric Relation Segmentation: Dataset and Solution
Si Liu, Zitian Wang, Yulu Gao, Lejian Ren, Yue Liao, Guanghui Ren, Bo Li, Shuicheng Yan
Segmentation
Dataset
Human-Centric Vision
TPAMI 2021
Cross-Modal Progressive Comprehension for Referring Segmentation
Si Liu, Tianrui Hui, Shaofei Huang, Yunchao Wei, Bo Li, Guanbin Li*
Multimodal Learning
Segmentation
TPAMI 2021
PSGAN++: Robust Detail-Preserving Makeup Transfer and Removal
Si Liu, Wentao Jiang, Chen Gao, Ran He*, Jiashi Feng, Bo Li, Shuicheng Yan
Image Generation
Face Analysis
Domain Adaptation
TPAMI 2021
Human-centric Spatio-Temporal Video Grounding With Visual Transformers
Zongheng Tang, Yue Liao, Si Liu*, Guanbin Li, Xiaojie Jin, Hongxu Jiang, Qian Yu, Dong Xu
Video Understanding
Grounding
Human-Centric Vision
TCSVT 2021
Scene Graph Generation with Hierarchical Context
Guanghui Ren, Lejian Ren, Yue Liao, Si Liu*, Bo Li, Jizhong Han, Shuicheng Yan
Image Generation
Graph Learning
Scene Understanding
TNNLS 2020
ORDNet: Capturing Omni-Range Dependencies for Scene Parsing
Shaofei Huang, Si Liu*, Tianrui Hui, Jizhong Han, Bo Li, Jiashi Feng, Shuicheng Yan
Segmentation
Scene Understanding
TIP 2020
PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer
Wentao Jiang, Si Liu*, Chen Gao, Jie Cao, Ran He, Jiashi Feng, Shuicheng Yan
Pose Estimation
Image Generation
Face Analysis
CVPR 2020 Oral
AdversarialNAS: Adversarial Neural Architecture Search for GANs
Chen Gao, Yunpeng Chen, Si Liu*, Zhenxiong Tan, Shuicheng Yan
Image Generation
CVPR 2020
A Real-Time Cross-modality Correlation Filtering Method for Referring Expression Comprehension
Yue Liao, Si Liu*, Guanbin Li, Fei Wang, Yanjie Chen, Chen Qian, Bo Li
Multimodal Learning
CVPR 2020
PPDM: Parallel Point Detection and Matching for Real-time Human-Object Interaction Detection
Yue Liao, Si Liu*, Fei Wang, Yanjie Chen, Chen Qian, Jiashi Feng
Video Understanding
Detection
Human-Centric Vision
CVPR 2020
Referring Image Segmentation via Cross-Modal Progressive Comprehension
Shaofei Huang#, Tianrui Hui#, Si Liu*, Guanbin Li, Yunchao Wei, Jizhong Han, Luoqi Liu, Bo Li
Multimodal Learning
Segmentation
CVPR 2020
Linguistic Structure Guided Context Modeling for Referring Image Segmentation
Tianrui Hui, Si Liu*, Shaofei Huang, Guanbin Li, Sansi Yu, Faxi Zhang, Jizhong Han
Segmentation
ECCV 2020
Rule-Guided Compositional Representation Learning on Knowledge Graphs
Guanglin Niu, Yongfei Zhang, Bo Li, Peng Cui, Si Liu, Jingyang Li, Xiaowei Zhang
Graph Learning
AAAI 2020
Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video
Jie Wu, Guanbin Li, Si Liu, Liang Lin
Video Understanding
Grounding
AAAI 2020
InteractGAN: Learning to Generate Human-Object Interaction
Chen Gao, Si Liu*, Defa Zhu, Quan Liu, Jie Cao, Haoqian He, Ran He, Shuicheng Yan
Video Understanding
Image Generation
Human-Centric Vision
ACM MM 2020 Oral
Cross-Modal Omni Interaction Modeling for Phrase Grounding
Tianyu Yu, Tianrui Hui, Zhihao Yu, Yue Liao, Sansi Yu, Faxi Zhang, Si Liu*
Multimodal Learning
Video Understanding
Grounding
ACM MM 2020
Magic-wall: Visualizing Room Decoration by Enhanced Wall Segmentation
Ting Liu, Yunchao Wei, Yao Zhao, Si Liu, Shikui Wei
Segmentation
Scene Understanding
TIP 2019
Accurate Facial Image Parsing at Real-Time Speed
Zhen Wei, Si Liu, Yao Sun, Hefei Ling
Segmentation
TIP 2019
RotateView: A Video Composition System for Interactive Product Display
Shan An, Si Liu*, Zhibiao Huang, Guangfu Che, Qian Bao, Zhaoqi Zhu, Yu Chen, Dennis Z. Weng
Video Understanding
TMM 2019
Fine-grained Human-centric Tracklet Segmentation with Single Frame Supervision
Si Liu, Guanghui Ren, Yao Sun, Jinqiao Wang, Changhu Wang, Bo Li, Shuicheng Yan
Segmentation
Human-Centric Vision
TPAMI 2019
Building Detail-Sensitive Semantic Segmentation Networks with Polynomial Pooling
Zhen Wei, Jingyi Zhang, Fumin Shen, Li Liu, Fan Zhu, Yi Zhou, Si Liu, Yao Sun, Ling Shao
Segmentation
CVPR 2019
GPS: Group People Segmentation with Detailed Part Inference [oral]
Yue Liao, Si Liu*, Tianrui Hui, Chen Gao, Yao Sun, Hefei Ling, Bo Li
Segmentation
ICME 2019
Enhanced Memory Network for Video Segmentation [1st Place in Youtube-VOS 2019]
Zhishan Zhou, Lejian Ren, Pengfei Xiong, Yifei Ji, Peisen Wang, Haoqiang Fan, Si Liu
Video Understanding
Segmentation
ICCV Workshop 2019
RGB-Infrared Cross-Modality Person Re-Identification via Joint Pixel and Feature Alignment
Guan’an Wang, Tianzhu Zhang, Jian Cheng, Si Liu, Yang Yang, Zengguang Hou
Multimodal Learning
Re-ID
ICCV 2019
Finding Images by Dialoguing with Image
Lejian Ren, Si Liu, Han Huang, Jizhong Han, Shuicheng Yan, Bo Li
Interactive Vision
ACM MM 2019
Cross-domain Human Parsing via Adversarial Feature and Label Adaptation
Si Liu, Yao Sun, Defa Zhu, Guanghui Ren, Yu Chen, Jiashi Feng, Jizhong Han
Segmentation
Human-Centric Vision
AAAI 2018
Learning Adaptive Receptive Fields for Deep Image Parsing Network
Zhen Wei, Yao Sun, Junyu Lin, Si Liu
Segmentation
Computational Visual Media 2018
Ensemble Soft-Margin Softmax Loss for Image Classification
Xiaobo Wang, Shifeng Zhang, Zhen Lei, Si Liu, Xiaojie Guo, Stan Z. Li
Recognition
IJCAI 2018
BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network
Tingting Li, Ruihe Qian, Chao Dong, Si Liu, Qiong Yan, Wenwu Zhu, Liang Lin
Image Generation
Face Analysis
Domain Adaptation
ACM MM 2018 Oral
Composing Semantic Collage for Image Retargeting
Si Liu, Zhen Wei, Yao Sun, Xinyu Ou, Junyu Lin, Bin Liu, Ming-Hsuan Yang
Image Generation
TIP 2018
Correlation Particle Filter for Visual Tracking
Tianzhu Zhang, Si Liu, Changsheng Xu, Bin Liu, Ming-Hsuan Yang
Tracking
TIP 2018
Robust Target Tracking by Online Random Forests and Superpixels
Wei Wang, Chunping Wang, Si Liu, Xiaochun Cao
Tracking
TCSVT 2018
Improved Search in Hamming Space using Deep Multi-Index Hashing
Hanjiang Lai, Yan Pan, Si Liu, Zhenbin Weng, Jian Yin
Retrieval
TCSVT 2018
A weakly supervised method for makeup-invariant face verification
Yao Sun, Lejian Ren, Zhen Wei, Bin Liu, Yanlong Zhai, Si Liu
Face Analysis
Pattern Recognition 2017
Adult Images and Videos Recognition by Deep Multi-Context Network and Fine-to-Coarse Strategy
Xinyu Ou, Hefei Ling, Han Yu, Si Liu
Video Understanding
Recognition
TIST 2017
Objectness Region Enhancement Networks for Scene Parsing
Xinyu Ou, Ping Li, Hefei Ling, Si Liu, Tianjiang Wang, Dan Li
Segmentation
Image Generation
Scene Understanding
JCST 2017
Time Traveler: a real-time face aging system
Lejian Ren, Si Liu, Yao Sun, JianDong Luoqi Liu, Shuicheng Yan
Face Analysis
ACM MM 2017
Learning Adaptive Receptive Fields for Deep Image Parsing Network
Zhen Wei, Yao Sun, Jinqiao Wang, Hanjiang Lai, Si Liu
Segmentation
CVPR 2017
Surveillance Video Parsing with Single Frame Supervision
Si Liu, Changhu Wang, Ruihe Qian, Han Yu, Renda Bao, Yao Sun
Video Understanding
Segmentation
CVPR 2017
Face Aging with Contextual Generative Adversarial Nets
Si Liu, Yao Sun, Wei Wang, Renda Bao, Defa Zhu, Xiangbo Zhu, and Shuicheng Yan
Face Analysis
ACM MM 2017
Fast Deep Matting for Portrait Animation on Mobile Phone
Bingke Zhu, Yingying Chen, Si Liu, Bo Zhang, Jinqiao Wang, Ming Tang
Segmentation
ACM MM 2017
Magic-wall: Visualizing Room Decoration
Ting Liu, Yunchao Wei, Yao Zhao, Si Liu, Shikui Wei
Scene Understanding
ACM MM 2017
RSVP: A Real-Time Surveillance Video Parsing System with Single Frame Supervision
Han Yu, Guanghui Ren, Ruihe Qian, Yao Sun, Changhu Wang, Hanqing Lu, Si Liu
Video Understanding
Segmentation
ACM MM 2017
SketchNet: Sketch Classification with Web Images
Hua Zhang, Si Liu, Changqing Zhang, Wenqi Ren, Xiaochun Cao
Recognition
Sketch Understanding
CVPR 2016
Structural Correlation Filter for Robust Visual Tracking
Si Liu, Tianzhu Zhang, Changsheng Xu, Xiaochun Cao
Tracking
CVPR 2016
Single Image Dehazing via Multi-Scale Convolutional Neural Networks
Wenqi Ren, Si Liu, Hua Zhang, Jianshan Pan, Xiaochun Cao, Ming-Hsuan Yang
Image Enhancement
ECCV 2016
Deep Multi-Context Network for Fine-Grained Visual Recognition
Xinyu Ou, Zhen Wei, Si Liu, Xiaochun Cao, Hefei Ling
Recognition
ICME 2016
Makeup like a superstar: Deep Localized Makeup Transfer Network
Si Liu, Xinyu Ou, Ruihe Qian, Wei Wang, Xiaochun Cao
Face Analysis
Domain Adaptation
IJCAI 2016
Visual Attributes for Fashion Analytics
Si Liu, Lisa M. Brown, Qiang Chen, Junshi Huang, Luoqi Liu, Shuicheng Yan
Fashion Vision
Book Chapter 2016
Robust Visual Tracking via Exclusive Context Modeling
Tianzhu Zhang, Bernard Ghanem, Si Liu, Changsheng Xu, Narendra Ahuja
Tracking
IEEE Cybernetics 2016
Matching-CNN Meets KNN: Quasi-Parametric Human Parsing
Si Liu, Xiaodan Liang, Luoqi Liu, Xiaohui Shen, Jianchao Yang, Changsheng Xu, Xiaochun Cao, Shuicheng Yan
Segmentation
Human-Centric Vision
CVPR 2015
Towards Computational Baby Learning: A Weakly-Supervised Approach for Object Detection
Xiaodan Liang, Si Liu, Yunchao Wei, Luoqi Liu, Liang Lin, Shuicheng Yan
Detection
ICCV 2015
Deep People Counting in Extremely Dense Crowds
Chuan Wang, Hua Zhang, Yang Liang, Si Liu, Xiaochun Cao
Counting
ACM MM 2015
Diversity-induced Multiview Subspace Clustering
Xiaochun Cao, Changqing Zhang, Huazhu Fu, Si Liu
Clustering
CVPR 2015
Structural Sparse Tracking
Tianzhu Zhang, Si Liu, Changsheng Xu, Shuicheng Yan, Narendra Ahuja, Bernard Ghanem, Ming-Hsuan Yang
Tracking
CVPR 2015
Low-Rank Tensor Constrained Multiview Subspace Clustering
Changqing Zhang, Huazhu Fu, Si Liu, Guangcan Liu, Xiaochun Cao
Clustering
ICCV 2015
Human Parsing With Contextualized Convolutional Neural Network
Xiaodan Liang, Chunyan Xu, Xiaohui Shen, Jianchao Yang, Si Liu, Jinhui Tang, Liang Lin, Shuicheng Yan
Segmentation
Human-Centric Vision
ICCV 2015
Deep Human Parsing with Active Template Regression
Xiaodan Liang, Si Liu, Xiaohui Shen, Jianchao Yang, Luoqi Liu, Liang Lin, Shuicheng Yan
Segmentation
Human-Centric Vision
TPAMI 2015
Fashion Parsing With Video Context
Si Liu, Xiaodan Liang, Luoqi Liu, Liang Lin, Ke Lv, Xiaochun Cao, Shuicheng Yan
Video Understanding
Segmentation
Fashion Vision
TMM 2015
SLED: Semantic Label Embedding Dictionary Representation for Multi-label Image Annotation
Xiaochun Cao, Hua Zhang, Xiaojie Guo, Si Liu, Dan Meng
Image Annotation
TIP 2015
Fashion Parsing with Video Context
Si Liu, Xiaodan Liang, Luoqi Liu, Liang Lin, Ke Lv, Shuicheng Yan
Video Understanding
Segmentation
Fashion Vision
ACM MM 2014
Puzzle Search: Image Retrieval and Ranking with Consistent Reconstruction of Multi-Attribute Queries
Xiaochun Cao, Hua Zhang, Xiaojie Guo, Si Liu, Xiaowu Chen
Retrieval
ECCV 2014
Clothing Attributes Assisted Person Re-identification
Annan Li, Luoqi Liu, Kang Wang, Si Liu, Shuicheng Yan
Re-ID
TCSVT 2014
Fashion Parsing with Weak Color-Category Labels
Si Liu, Jiashi Feng, Csaba Domokos, Junshi Huang, Zhenzhen Hu, Shuicheng Yan
Segmentation
Fashion Vision
TMM 2014
Fashion Analysis: Current Techniques and Future Directions
Si Liu, Luoqi Liu, Shuicheng Yan
Fashion Vision
IEEE MultiMedia 2014
Snap & Play: Auto-Generate Personalized Find-the-Difference Game
Si Liu, Qiang Chen, Shuicheng Yan, Changsheng Xu, Hanqing Lu
Interactive Vision
TIST 2014
Wow! You Are So Beautiful Today!
Luoqi Liu, Jun-liang Xing, Si Liu, Hui Xu, Xi Zhou, Shuicheng Yan
Face Analysis
TOMCCAP 2014
PicWords: Render a Picture by Packing Keywords
Zhenzhen Hu, Si Liu, Jianguo Jiang, Richang Hong, Meng Wang, Shuicheng Yan
Image Generation
TMM 2014
Circle & Search: Attribute-aware Shoe Retrieval
Junshi Huang, Si Liu, Junliang Xing, Tao Mei, Shuicheng Yan
Retrieval
TOMCCAP 2014
Robust Visual Tracking via Consistent Low-Rank Sparse Learning
Tianzhu Zhang, Si Liu, Narendra Ahuja, Ming-Hsuan Yang, Bernard Ghanem
Tracking
IJCV 2014
Towards Decrypting Attractiveness via Multi-Modality Cues
Tam V. Nguyen, Si Liu, Bingbing Ni, Jun Tan, Yong Rui, Shuicheng Yan
Face Analysis
TOMCCAP 2013
Wow! you are so beautiful today!
Luoqi Liu, Hui Xu, Junliang Xing, Si Liu, Xi Zhou, Shuicheng Yan
Face Analysis
ACM MM 2013 Best Paper
eHeritage of shadow puppetry: creation and manipulation
Min Lin, Zhenzhen Hu, Si Liu, Meng Wang, Richang Hong, Shuicheng Yan
Robotics
ACM MM 2013
Wow! you are so beautiful today!
Luoqi Liu, Hui Xu, Si Liu, Junliang Xing, Xi Zhou, Shuicheng Yan
Face Analysis
ACM MM demo 2013
Low-Rank Sparse Coding for Image Classification
Tianzhu Zhang, Bernard Ghanem, Si Liu, Changsheng Xu, Narendra Ahuja
Recognition
ICCV 2013
SYM-FISH: A Symmetry-aware Flip Invariant Sketch Histogram Shape Descriptor
Xiaochun Cao, Hua Zhang, Si Liu, Xiaojie Guo
Sketch Understanding
ICCV 2013
Hi, magic closet, tell me what to wear!
Si Liu, Tam V. Nguyen, Jiashi Feng, Meng Wang, Shuicheng Yan
Fashion Vision
ACM MM 2012 Best Demo
Hi, magic closet, tell me what to wear!
Si Liu, Jiashi Feng, Zheng Song, Tianzhu Zhang, Hanqing Lu, Changsheng Xu, Shuicheng Yan
Fashion Vision
ACM MM 2012
Street-to-Shop: Cross-Scenario Clothing Retrieval via Human Part Alignment and Auxiliary Set
Si Liu, Zheng Song, Guangcan Liu, Shuicheng Yan, Changsheng Xu, Hanqing Lu
Retrieval
Human-Centric Vision
CVPR 2012 Oral
Street-to-shop: cross-scenario clothing retrieval via parts alignment and auxiliary set
Si Liu, Zheng Song, Meng Wang, Changsheng Xu, Hanqing Lu, Shuicheng Yan
Retrieval
ACM MM demo 2012
Robust Visual Tracking via Structured Multi-Task Sparse Learning
Tianzhu Zhang, Bernard Ghanem, Si Liu, Narendra Ahuja
Tracking
IJCV 2012
A Generic Framework for Video Annotation via Semi-supervised Learning
Tianzhu Zhang, Changsheng Xu, Guangyu Zhu, Si Liu, Hanqing Lu
Video Understanding
Image Annotation
TMM 2012
Weakly-Supervised Graph Propagation Towards Collective Image Parsing
Si Liu, Shuicheng Yan, Tianzhu Zhang, Changsheng Xu, Jing Liu, Hanqing Lu
Segmentation
Graph Learning
TMM 2012
Sense beauty via face, dressing, and/or voice
Tam V. Nguyen, Si Liu, Bingbing Ni, Jun Tan, Yong Rui, Shuicheng Yan
Face Analysis
ACM MM 2012
Low-Rank Sparse Learning for Robust Visual Tracking
Tianzhu Zhang, Bernard Ghanem, Si Liu, Narendra Ahuja
Tracking
ECCV 2012
Robust Visual Tracking via Multi-Task Sparse Learning
Tianzhu Zhang, Bernard Ghanem, Si Liu, Narendra Ahuja
Tracking
CVPR 2012 Oral
Boosted Exemplar Learning for Action Recognition and Annotation
Tianzhu Zhang, Jing Liu, Si Liu, Changsheng Xu, Hanqing Lu
Video Understanding
Recognition
Image Annotation
TCSVT 2011
Size Adaptive Selection of Most Informative Features
Si Liu, Hairong Liu, Shuicheng Yan, Longin Latecki, Changsheng Xu, Hanqing Lu
Feature Selection
AAAI 2011 Oral
Snap & Play: Auto-generate Personalized Find-the-Difference Mobile Game
Si Liu, Qiang Chen, Shuicheng Yan, Changsheng Xu, Hanqing Lu
Interactive Vision
ACM MM 2011
A Generic Framework for Event Detection in Various Video Domain
Tianzhu Zhang, Changsheng Xu, Guangyu Zhu, Si Liu, Hanqing Lu
Video Understanding
Detection
ACM MM 2010