Date & Time Speakers Topic & Slides

December 2020-12-20

Yulu Gao MOT

December 2020-12-20

Zitian Wang Towards Real-Time Multi-Object Tracking

December 2020-12-20

Jiahui Fu Pre-Trained Image Processing Transformer

December 2020-12-20

Yushen Zhao Person Search

December 2020-12-20

Zhaokai Wang Transformer, BERT, GPT

December 2020-12-12

Mingfei Chen DETR Related Work in Detection

November 2020-11-22

Yanglin Pu Domain Adaptation

September 2020-09-26

Luting Wang Knowledge Graph

September 2020-09-21

Zihan Ding Video Classification with Channel-Separated Convolutional Networks

September 2020-09-21

Jiahui Fu Multimodal Dialog System: Generating Responses via Adaptive Decoders

September 2020-09-21

Luting Wang Grouped Spatial-Temporal Aggregation for Efficient Action Recognition

September 2020-09-13

Luting Wang Embodied AI

September 2020-09-13

Jingyu Chen VLN-CE

September 2020-09-06

Zihan Ding Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge

August 2020-08-30

Luting Wang Long short-term memory networks in memristor crossbar arrays

August 2020-08-30

Jiahui Fu Jiahui Fu-talk papers

August 2020-08-30

Zihan Ding CVPR2020-Learning Visual Commonsense for Robust Scene Graph Generation

August 2020-08-30

Zihan Ding news datasets

August 2020-08-23

Luting Wang Multimodal Representation Learning

August 2020-08-01

Chen Gao Are We Pretraining It Right? Digging Deeper into Visio-Linguistic Pretraining

August 2020-08-01

Jinyu Chen Contrastive BERT

August 2020-08-01

Tianyu Yu Tianyu MM2020 Work Report

July 2020-07-25

Zongheng Tang Spatial-temporal video grounding

July 2020-07-25

Yulu Gao CBR-Net: Cascade Boundary Refinement Network for Action Detection

July 2020-07-25

Jinyu Chen A Cordial Sync: Going Beyond Marginal Policies For Multi-Agent Embodied Tasks

July 2020-07-25

Tianrui Hui Graph-Structured Referring Expression Reasoning in The Wild

July 2020-07-21

Luting Wang Remote Sensing Image Segmentation

July 2020-07-18

Tianrui Hui Auto Caption on GIF

July 2020-07-18

Tianrui Hui Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation

July 2020-07-18

Zihan Ding ERNIE-ViL: Knowledge Enhanced Vision-Language Representations Through Scene Graph

July 2020-07-18

Zhaokai Wang Structured Multimodal Attentions for TextVQA

July 2020-07-11

Zitian Wang End-to-End Object Detection with Transformers

July 2020-07-04

Renda Bao Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding

July 2020-07-04

Shaofei Huang Explainable Neural Computation via Stack Neural Module Networks

July 2020-07-04

Zitian Wang Meta Module Networks

June 2020-06-06

Yixuan Qiao Emotion Reinforced Visual Storytelling

June 2020-06-06

Haolin Wang Multi-source Multi-level Attention Network for Visual Question Answering

June 2020-06-06

Chen Gao Neural Storyboard Artist

June 2020-06-06

Tianyu Yu Show, Reward, and Tell: Adversarial Visual Story Generation

June 2020-06-06

Shaofei Huang dense caption

June 2020-06-06

Renda Bao Work Report

May 2020-05-29

Yulu Gao AI Coach

May 2020-05-29

Guanghui Ren NAS-Det

May 2020-05-29

Zitian Wang Learning Rich Image Region Representation

May 2020-05-29

Zongheng Tang Learning 2D Temporal Adjacent Network for Moment Localization with Natural Language

May 2020-05-29

Jinyu Chen From Words to Sentences: A Progressive Learning Approach for Zero-resource Machine Translation with Visual Pivots

May 2020-05-29

Wentao Jiang GAN Report

May 2020-05-29

Renda Bao Exploiting hierarchical visual features for visual question answering

May 2020-05-06

Jinyu Chen Visual Commonsense R-CNN

May 2020-05-06

Wentao Xie AI CITY CHALLENGE

May 2020-05-06

Chen Gao Iterative Context-Aware Graph Inference for Visual Dialog

May 2020-05-06

Zongheng Tang Unbiased Scene Graph Generation by Biased Learning

May 2020-05-06

Renda Bao Deconfounded Image Captioning

May 2020-05-06

Zitian Wang Two Causal Principles for Improving Visual Dialog

April 2020-04-28

Zhaokai Wang Pointer Networks

April 2020-04-19

Jinyu Chen InterBERT

April 2020-04-19

Wentao Jiang ActBERT

April 2020-04-19

Tianrui Hui PixelBERT

April 2020-04-19

Zongheng Tang VL-BERT

March 2020-03-29

Zongheng Tang Recursive Visual Attention in Visual Dialog

March 2020-03-29

Zitian Wang Unbiased Scene Graph Generation from Biased Training

March 2020-03-25

Lejian Ren CS224n: Lecture 15

March 2020-03-25

Tianyu Yu Dense-Caption

March 2020-03-25

Yier Shu Siamese Box Adaptive Network for Visual Tracking

March 2020-03-25

Yulu Gao SOLOv2

March 2020-03-25

Renda Bao In Defense of Grid Features for Visual Question Answering

March 2020-03-25

Yixuan Qiao TVQA+

March 2020-03-25

Wentao Xie VIDVRD

March 2020-03-25

Zongheng Tang Visual Grounding in Video for Unsupervised Word Translation

March 2020-03-22

Guanghui Ren ABCNet

March 2020-03-15

Zongheng Tang Sentence Specified Dynamic Video Thumbnail Generation

March 2020-03-15

Wentao Jiang StackGAN-StackGAN++

March 2020-03-15

Renda Bao Survey of VQA

March 2020-03-15

Jinyu Chen REVIE

March 2020-03-15

Chen Gao Virtually Trying on New Clothing with Arbitrary Poses

March 2020-03-15

Guanghui Ren AutoML-Zero

March 2020-03-15

Shaofei Huang Temporal Segment Network

March 2020-03-15

Zongheng Tang Exploiting Temporal Relationships in Video Moment Localization with Natural Language

March 2020-03-08

Wentao Xie A Multigrid Method for Efficiently Training Video Models

March 2020-03-08

Yue Liao End-to-End Learning of Visual Representations from Uncurated Instructional Videos

March 2020-03-01

Jinyu Chen Vision-and-Language Navigation V2

February 2020-02-25

Zhaokai Wang Report on TextVQA Challenge

February 2020-02-25

Wentao Xie TSM: Temporal Shift Module for Efficient Video Understanding

February 2020-02-25

Zongheng Tang STEP: Spatio-Temporal Progressive Learning for Video Action Detection

January 2020-01-12

Wentao Jiang Embodied Question Answering

January 2020-01-12

Zitian Wang Audio-Visual Embodied Navigation

January 2020-01-12

Chen Gao Gibson Env: Real-World Perception for Embodied Agents

January 2020-01-12

Jinyu Chen Learning to Navigate Using Mid-Level Visual Priors

January 2020-01-12

Defa Zhu Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation

January 2020-01-12

Tianyu Yu Phrase Grounding

January 2020-01-10

Renda Bao Licheng Yu PhD Thesis

January 2020-01-10

Lejian Ren Bayesian Relational Memory for Semantic Visual Navigation

January 2020-01-10

Renda Bao From Two Graphs to N Questions: A VQA Dataset for Compositional Reasoning on Vision and Commonsense

January 2020-01-05

Guanghui Ren cs224_seq2seq_att

January 2020-01-05

Yixuan Qiao STEP: Spatial-Temporal Learning for Video Action Detection

January 2020-01-05

Renda Bao From Two Graphs to N Questions: A VQA Dataset for Compositional Reasoning on Vision and Commonsense

January 2020-01-05

Shaofei Huang self-supervised lvn

January 2020-01-05

Jinyu Chen Towards Learning a Generic Agent for.pptx

January 2020-01-05

Guanghui Ren Situational Fusion of Visual Representation for Visual Navigation

January 2020-01-05

Zongheng Tang Talk2Nav

January 2020-01-05

Tianrui Hui VLN

January 2020-01-03

Renda Bao up-down VQA

January 2020-01-03

Renda Bao Visual Question Answering as Reading Comprehension

January 2020-01-02

Jinyu Chen YOLACT++

January 2020-01-02

Zongheng Tang Anchor-Free Recent Work Report