ICCV2019 全部论文分类汇总,含目标检测 / 图像分割等,1008 更新中

作为计算机视觉领域三大顶会之一,ICCV2019目前已公布了所有接收论文ID(会议接收论文1077篇,总提交4303篇,25%的接收率),相关报道:1077篇!ICCV2019接收结果公布,你中了吗?

此前我们也对CVPR2019的论文做了分类汇总:CVPR2019 | 论文分类汇总,而本帖是对ICCV2019顶会论文的实时跟进和分类,欢迎点击文末关注按钮,即可获取本帖最新更新消息。

目录

目标检测

语义分割

图像分割

目标跟踪

人脸

ReID

OCR

视频

超分辨率

自动驾驶

3D、点云

GCN

GAN

其他

(以计算机视觉具体研究方向划分,如有错误欢迎指出~)

目标检测

1、ThunderNet: Towards Real-time Generic Object Detection

ThunderNet:走向实时通用目标检测

作者:Zheng Qin, Zeming Li, Zhaoning Zhang, Yiping Bao, Gang Yu, Yuxing Peng, Jian Sun

论文链接:https://arxiv.org/abs/1903.11752

论文解读:http://bbs.cvmart.net/articles/361

2、MemorizingNormality to Detect Anomaly: Memory-augmented Deep Autoencoder (MemAE) forUnsupervised Anomaly Detection

MemorizingNormality检测异常:内存增强深度自动编码器(MemAE)用于非监督异常检测

作者:Dong Gong, Lingqiao Liu, Vuong Le, Budhaditya Saha, Moussa Reda Mansour, Svetha Venkatesh, Anton van den Hengel

项目链接:https://donggong1.github.io/anomdec-memae.html

论文链接:https://arxiv.org/abs/1904.02639

GitHub:https://github.com/donggong1/memae-anomaly-detection

3、Deep Hough Voting for 3D Object Detection in Point Clouds(Oral)

深入投票进行点云中的三维目标检测

作者:Charles R. Qi, Or Litany, Kaiming He, Leonidas J. Guibas

论文链接:https://arxiv.org/abs/1904.09664

4、Multi-adversarial Faster-RCNN for Unrestricted Object Detection

用于无限制目标检测的多对抗性更快RCNN

作者:Zhenwei He, Lei Zhang

论文链接:https://arxiv.org/abs/1907.10343

5、FCOS: Fully Convolutional One-Stage Object Detection

FCOS:完全卷积一级目标检测

作者:Zhi Tian, Chunhua Shen, Hao Chen, Tong He

论文链接:https://arxiv.org/abs/1904.01355

Github链接:https://github.com/tianzhi0549/FCOS/

论文解读: https://mp.weixin.qq.com/s/N93TrVnUuvAgfcoHXevTHw

6、Simultaneous multi-view instance detection with learned geometric soft-constraints

使用学习的几何软约束同时进行多视图实例检测

作者:Ahmed Samy Nassar, Sebastien Lefevre, Jan D. Wegner

论文链接:https://arxiv.org/abs/1907.10892

7、Cap2Det: Learning to Amplify Weak Caption Supervision for Object Detection

Cap2Det:学习放大目标检测的弱字幕监控

作者:Keren Ye, Mingda Zhang, Adriana Kovashka, Wei Li, Danfeng Qin, Jesse Berent

论文链接:https://arxiv.org/abs/1907.10164

8、Towards Adversarially Robust Object Detection

对抗强大的目标检测

作者:Haichao Zhang, Jianyu Wang

论文链接:https://arxiv.org/abs/1907.10310

9、Few-shot Object Detection via Feature Reweighting

通过特征重新加权的快速物体检测

作者:Bingyi Kang, Zhuang Liu, Xin Wang, Fisher Yu, Jiashi Feng, Trevor Darrell

论文链接:https://arxiv.org/pdf/1812.01866.pdf

10、Optimizing the F-measure for Threshold-free Salient Object Detection

优化无阈值显着物体检测的F-测量

作者:Kai Zhao, Shanghua Gao, Wenguan Wang, Ming-ming Cheng

论文链接:http://data.kaizhao.net/publications/iccv2019fmeasure.pdf

Github链接:https://github.com/zeakey/iccv2019-fmeasure

项目链接:http://kaizhao.net/fmeasure

11、Depth-induced Multi-scale Recurrent Attention Network for Saliency Detection

深度诱导多尺度重复注意网络用于显著性检测

作者:Yongri Piao, Wei Ji, Jingjing Li, Miao Zhang, Huchuan Lu

Github链接:https://github.com/jiwei0921/DMRA_RGBD-SOD

10、Learning Lightweight Lane Detection CNNs by Self Attention Distillation

通过自注意蒸馏学习轻量级车道检测神经网络

作者:Yuenan Hou, Zheng Ma, Chunxiao Liu, Chen Change Loy

论文链接:https://arxiv.org/abs/1908.00821

Github链接:https://github.com/cardwing/Codes-for-Lane-Detection

11、Towards High-Resolution Salient Object Detection

实现高分辨率突出目标检测

作者:Yi Zeng, Pingping Zhang, Jianming Zhang, Zhe Lin, Huchuan Lu

论文链接:https://arxiv.org/abs/1908.07274

Github链接:https://github.com/yi94code/HRSOD

12、Teacher Supervises Students How to Learn From Partially Labeled Images for Facial Landmark Detection

教师指导学生如何从部分标记的图像中学习识别面部地标

作者:Xuanyi Dong, Yi Yang

论文链接:https://arxiv.org/abs/1908.02116

Github链接:https://github.com/D-X-Y/landmark-detection

13、Temporally-Aggregating Spatial Encoder-Decoder for Video Saliency Detection

用于视频显著性检测的时间聚合空间编解码器

Github链接:https://github.com/kylemin/TASED-Net

14、SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects

SCRDet:对小的,杂乱的和旋转的物体进行更加稳健的检测

作者:Xue Yang, Jirui Yang, Junchi Yan, Yue Zhang, Tengfei Zhang, Zhi Guo, Sun Xian, Kun Fu

论文链接:https://arxiv.org/abs/1811.07126

Github链接:https://github.com/DetectionTeamUCAS

15、Clustered Object detection in aerial images

航拍图像中的聚类物体检测

作者:Fan Yang, Heng Fan, Peng Chu, Erik Blasch, Haibin Ling

论文链接:https://arxiv.org/pdf/1904.08008

16、Relation Distillation Networks for Video Object Detection

用于视频对象检测的关联蒸馏网络

作者: Jiajun Deng, Yingwei Pan, Ting Yao, Wengang Zhou, Houqiang Li, Tao Mei

论文链接:https://arxiv.org/abs/1908.09511

17、Scaling Object Detection by Transferring Classification Weights

通过转移分类权值来缩放目标检测

作者:Jason Kuen, Federico Perazzi, Zhe Lin, Jianming Zhang, Yap-Peng Tan

论文链接:https://arxiv.org/abs/1909.06804

Github链接:https://github.com/xternalz/AE-WTN

18、WSOD^2: Learning Bottom-up and Top-down Objectness Distillation for Weakly-supervised Object Detection

WSOD^2:学习自底向上和自顶向下的对象精馏,用于弱监督对象检测

作者:Zhaoyang Zeng, Bei Liu, Jianlong Fu, Hongyang Chao, Lei Zhang

论文链接:https://arxiv.org/abs/1909.04972

图像分割

1、Incremental Class Discovery for Semantic Segmentation with RGBD Sensing

用RGBD传感进行语义分割的增量类发现

作者:Yoshikatsu Nakajima, Byeongkeun Kang, Hideo Saito, Kris Kitani

论文链接:https://arxiv.org/abs/1907.10008

2、TensorMask: A Foundation for Dense Object Segmentation

TensorMask:密集对象分割的基础

作者:Xinlei Chen, Ross Girshick, Kaiming He, and Piotr Dollár

论文链接:https://arxiv.org/abs/1903.12174

3、Orientation-aware Semantic Segmentation on Icosahedron Spheres

二十面体球面上的方向感知语义分割

作者:Chao Zhang, Stephan Liwicki, William Smith, Roberto Cipolla

论文链接:https://arxiv.org/abs/1907.12849

4、Expectation-Maximization Attention Networks for Semantic Segmentation (Oral)

语义分割的期望最大化注意网络

作者: Xia Li, Zhisheng Zhong, Jianlong Wu, Yibo Yang, Zhouchen Lin, Hong Liu

论文链接:https://arxiv.org/abs/1907.13426

5、ACE: Adapting to Changing Environments for Semantic Segmentation

ACE:适应不断变化的环境进行语义分割

作者:Zuxuan Wu, Xin Wang, Joseph E. Gonzalez, Tom Goldstein, Larry S. Davis

论文链接:https://arxiv.org/pdf/1904.06268.pdf

6、CCNet: Criss-Cross Attention for Semantic Segmentation

CCNet:交叉关注语义分割

作者:Zilong Huang, Xinggang Wang, Lichao Huang, Chang Huang, Yunchao Wei, Wenyu Liu

论文链接:https://arxiv.org/abs/1811.11721

Github链接:https://github.com/speedinghzl/CCNet

7、SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences

SemanticKITTI:一个用于激光雷达序列语义场景理解的数据集

作者:J. Behley, M. Garbade, A. Milioto, J. Quenzel, S. Behnke, C. Stachniss, and J. Gall

论文链接:https://arxiv.org/abs/1904.01416

8、DADA: Depth-Aware Domain Adaptation in Semantic Segmentation

作者:Tuan-Hung Vu, Himalaya Jain, Maxime Bucher, Matthieu Cord, Patrick Pérez

论文链接:https://arxiv.org/abs/1904.01886

9、Weakly Supervised Energy-Based Learning for Action Segmentation(Oral)

弱监督能源行动学习分割

Github链接:https://github.com/JunLi-Galios/CDFL

10、Explicit Shape Encoding for Real-Time Instance Segmentation

实时实例分割的显式形状编码

作者:Wenqiang Xu, Haiyang Wang, Fubo Qi, Cewu Lu

论文链接:https://arxiv.org/abs/1908.04067

11、ACFNet: Attentional Class Feature Network for Semantic Segmentation

用于语义分割的注意类特征网络

作者:Fan Zhang, Yanqin Chen, Zhihang Li, Zhibin Hong, Jingtuo Liu, Feifei Ma, Junyu Han, Errui Ding

论文链接:https://arxiv.org/abs/1909.09408

12、Hierarchical Point-Edge Interaction Network for Point Cloud Semantic Segmentation

层次式点-边交互网络用于点云语义分割

作者:Li Jiang, Hengshuang Zhao, Shu Liu, Xiaoyong Shen, Chi-Wing Fu, Jiaya Jia

论文链接:https://arxiv.org/abs/1909.10469

13、SSAP: Single-Shot Instance Segmentation With Affinity Pyramid

SSAP:带有亲缘金字塔的单点实例分割

作者:Naiyu Gao, Yanhu Shan, Yupei Wang, Xin Zhao, Yinan Yu, Ming Yang, Kaiqi Huang

论文链接:https://arxiv.org/abs/1909.01616

姿态估计

1、Ego-Pose Estimation and Forecasting as Real-Time PD Control

Ego-Pose估计和预测作为实时PD控制

作者:Ye Yuan, Kris Kitani

论文链接:https://arxiv.org/abs/1906.03173

2、xR-EgoPose: Egocentric 3D Human Pose from an HMD Camera

xR-EgoPose:HMD相机的以自我为中心的3D人体姿态

作者:Denis Tome, Patrick Peluse, Lourdes Agapito, Hernan Badino

论文链接:https://arxiv.org/abs/1907.10045

3、Selectivity or Invariance: Boundary-aware Salient Object Detection

选择性或不变性:边界感知的突出物体检测

作者:Jinming Su, Jia Li1, Yu Zhang, Changqun Xia and Yonghong Tian

论文链接:https://arxiv.org/pdf/1812.10066.pdf

4、Learnable Triangulation of Human Pose(Oral)

人体姿态的可验证三角测量

作者:Karim Iskakov, Egor Burkov, Victor Lempitsky, Yury Malkov

论文链接:https://arxiv.org/abs/1905.05754

Github链接:https://github.com/karfly/learnable-triangulation-pytorch

项目链接:https://saic-violet.github.io/learnable-triangulation/

5、Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image

来自单个RGB图像的3D多人姿态估计的摄像距离感知自上而下方法

作者:Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee

论文链接:https://arxiv.org/abs/1907.11346

Github链接:https://github.com/mks0601/3DMPPE_ROOTNET_RELEASE

6、Pose-aware Dynamic Attention for Human Object Interaction Detection

姿态感知动态注意用于人体目标交互检测

Github链接:https://github.com/bobwan1995/Pose-aware-Dynamic-Attention-for-Human-Object-Interaction-Detection

7、SO-HandNet: Self-Organizing Network for 3D Hand Pose Estimation with Semi-supervised Learning

基于半监督学习的三维手部姿态估计自组织网络

Github链接:https://github.com/TerenceCYJ/SO-HandNet

8、Dynamic Kernel Distillation for Efficient Pose Estimation in Videos

动态核蒸馏在视频中的有效姿态估计

作者:Xuecheng Nie, Yuncheng Li, Linjie Luo, Ning Zhang, Jiashi Feng

论文链接:https://arxiv.org/abs/1908.09216

9、Single-Stage Multi-Person Pose Machines

单级多人姿势机器

作者:Xuecheng Nie, Jianfeng Zhang, Shuicheng Yan, Jiashi Feng

论文链接:https://arxiv.org/abs/1908.09220

10、Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images

利用多视图图像进行人体姿态和形状重建

作者:Junbang Liang, Ming C. Lin

论文链接:https://arxiv.org/abs/1908.09464

11、Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense

整体++场景理解:单视图三维整体场景解析和人-物交互和物理常识下的人体姿态估计

作者:Yixin Chen, Siyuan Huang, Tao Yuan, Siyuan Qi, Yixin Zhu, Song-Chun Zhu

论文链接:https://arxiv.org/abs/1909.01507

12、Imitation Learning for Human Pose Prediction

模仿学习用于人体姿态预测

作者:Borui Wang, Ehsan Adeli, Hsu-kuang Chiu, De-An Huang, Juan Carlos Niebles

论文链接:https://arxiv.org/abs/1909.03449

目标跟踪

1、Joint Monocular 3D Detection and Tracking

联合单目3D检测和跟踪

作者:Hou-Ning Hu, Qizhi Cai, Dequan Wang, Ji Lin, Min Sun, Philipp Krähenbühl, Trevor Darrell, Fisher Yu

论文链接:https://arxiv.org/abs/1811.10742

项目链接:https://eborboihuc.github.io/Mono-3DT/?fbclid=IwAR1maTNHE5z-vEwAJKIcNEpbMWwBcjWJQ0gEHOwHB-u51w5dfeiZNCh0y-U

GitHub:https://github.com/ucbdrive/3d-vehicle-tracking

2、Deep Meta Learning for Real-Time Target-Aware Visual Tracking

用于实时目标感知视觉跟踪的深度元学习

作者:Janghoon Choi, Junseok Kwon, and Kyoung Mu Lee

论文链接:https://arxiv.org/pdf/1712.09153.pdf

3、Learning Aberrance Repressed Correlation Filters for Real-Time UAV Tracking

学习畸变抑制相关滤波器用于无人机实时跟踪

作者:Ziyuan Huang, Changhong Fu, Yiming Li, Fuling Lin, Peng Lu

论文链接:https://arxiv.org/abs/1908.02231

4、Robust Multi-Modality Multi-Object Tracking

鲁棒多模态多目标跟踪

作者:Wenwei Zhang, Hui Zhou, Shuyang Sun, Zhe Wang, Jianping Shi, Chen Change Loy

论文链接:https://arxiv.org/abs/1909.03850

Github链接:https://github.com/ZwwWayne/mmMOT

人脸

1、Video Face Clustering with Unknown Number of Clusters

视频人脸聚类,聚类个数未知

作者:M. Tapaswi, M. T. Law, and S. Fidler

Github链接:https://github.com/makarandtapaswi/BallClustering_ICCV2019

2、Probabilistic Face Embeddings

作者:Yichun Shi, Anil K. Jain

论文链接:https://arxiv.org/abs/1904.09658

Github链接:https://github.com/seasonSH/Probabilistic-Face-Embeddings

3、Photo-Realistic Facial Details Synthesis from Single Image(Oral)

从单张图像合成的真实面部细节

作者:Anpei Chen, Zhang Chen, Guli Zhang, Ziheng Zhang, Kenny Mitchell, Jingyi Yu

论文链接:https://arxiv.org/abs/1903.10873

Github链接: https://github.com/apchenstu/Facial_Details_Synthesis

ReID

1、One Shot Domain Adaptation for Person Re-Identification(Oral )

作者:Yang Fu, Yunchao Wei, Guanshuo Wang, Jiwei Li, Xi Zhou, Honghui Shi, Thomas Huang

论文链接:https://arxiv.org/abs/1811.10144

Github链接:https://github.com/OasisYang/SSG

2、ABD-Net: Attentive but Diverse Person Re-Identification

ABD-Net:专注而多元的人重新识别

作者:Tianlong Chen, Shaojin Ding, Jingyi Xie, Ye Yuan, Wuyang Chen, Yang Yang, Zhou Ren, Zhangyang Wang

论文链接:https://arxiv.org/abs/1908.01114

Github链接:https://github.com/TAMU-VITA/ABD-Net

3、A Novel Unsupervised Camera-aware Domain Adaptation Framework for Person Re-identification

一种新的无监督摄像机感知域适应框架,用于人员重新识别

作者:Lei Qi, Lei Wang, Jing Huo, Luping Zhou, Yinghuan Shi, Yang Gao

论文链接:https://arxiv.org/abs/1904.03425

4、advPattern: Physical-World Attacks on Deep Person Re-Identification via Adversarially Transformable Patterns

advPattern:物理世界通过可逆变换模式对人重新识别的攻击

作者:Zhibo Wangy, Siyan Zhengy, Mengkai Songy, Qian Wangy, Alireza Rahimpourz, Hairong Qi

论文链接:https://arxiv.org/abs/1908.09327

5、Re-ID Driven Localization Refinement for Person Search

reid驱动的人员搜索本地化细化

作者:Chuchu Han, Jiacheng Ye, Yunshan Zhong, Xin Tan, Chi Zhang, Changxin Gao, Nong Sang

论文链接:https://arxiv.org/abs/1909.08580

6、Cross-Dataset Person Re-Identification via Unsupervised Pose Disentanglement and Adaptation

跨数据集人员重新识别通过无监督的姿态解缠和适应

作者:Yu-Jhe Li, Ci-Siang Lin, Yan-Bo Lin, Yu-Chiang Frank Wang

论文链接:https://arxiv.org/abs/1909.09675

OCR

1、GA-DAN: Geometry-Aware Domain Adaptation Network for Scene Text Detection and Recognition

GA-DAN:用于场景文本检测和识别的几何感知域适应网络

作者:Fangneng Zhan, Chuhui Xue, Shijian Lu

论文链接:https://arxiv.org/abs/1907.09653

2、Cascaded Context Pyramid for Full-Resolution 3D Semantic Scene Completion(Oral )

用于全分辨率3D语义场景完成的嵌入式上下文金字塔

作者:Pingping Zhang, Wei Liu, Yinjie Lei, Huchuan Lu, Xiaoyun Yang

论文链接:https://arxiv.org/abs/1908.00382

3、Symmetry-constrained Rectification Network for Scene Text Recognition

用于场景文本识别的对称约束校正网络

作者:MingKun Yang, Yushuo Guan, Minghui Liao, Xin He, Kaigui Bian, Song Bai, Cong Yao, Xiang Bai

论文链接:https://arxiv.org/abs/1908.01957

4、Towards Unconstrained End-to-End Text Spotting(Oral )

Towards无约束的端到端文本定位

作者:Siyang Qin, Alessandro Bissacco, Michalis Raptis, Yasuhisa Fujii, Ying Xiao

论文链接:https://arxiv.org/abs/1908.09231

5、SPGNet: Semantic Prediction Guidance for Scene Parsing

SPGNet:场景分析的语义预测指南

作者:Bowen Cheng, Liang-Chieh Chen, Yunchao Wei, Yukun Zhu, Zilong Huang, Jinjun Xiong, Thomas Huang, Wen-Mei Hwu, Honghui Shi

论文链接:https://arxiv.org/abs/1908.09798

6、CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval

CAMP:用于文本-图像检索的跨模式自适应消息传递

作者:Zihao Wang, Xihui Liu, Hongsheng Li, Lu Sheng, Junjie Yan, Xiaogang Wang, Jing Shao

论文链接:https://arxiv.org/abs/1909.05506

7、Chinese Street View Text: Large-scale Chinese Text Reading with Partially Supervised Learning

中文街景文本:部分监督学习的大型中文文本阅读

作者: Yipeng Sun, Jiaming Liu, Wei Liu, Junyu Han, Errui Ding, Jingtuo Liu

论文链接:https://arxiv.org/abs/1909.07808

8、Large-scale Tag-based Font Retrieval with Generative Feature Learning

基于标签的大规模字体检索与生成特征学习

作者:Tianlang Chen, Zhaowen Wang, Ning Xu, Hailin Jin, Jiebo Luo

论文链接:https://arxiv.org/abs/1909.02072

9、Visual Semantic Reasoning for Image-Text Matching(Oral)

图像-文本匹配的视觉语义推理

作者:Kunpeng Li, Yulun Zhang, Kai Li, Yuanyuan Li, Yun Fu

论文链接:https://arxiv.org/abs/1909.02701

Github链接:https://github.com/KunpengLi1994/VSRN

10、Dynamic Context Correspondence Network for Semantic Alignment

用于语义对齐的动态上下文通信网络

作者:Shuaiyi Huang, Qiuyue Wang, Songyang Zhang, Shipeng Yan, Xuming He

论文链接:https://arxiv.org/abs/1909.03444

视频

1、HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips

HowTo100M:通过观看数以亿计的视频剪辑来学习文本视频嵌入

作者:Antoine Miech, Dimitri Zhukov, Jean-Baptiste Alayrac, Makarand Tapaswi, Ivan Laptev, Josef Sivic

论文链接:https://arxiv.org/abs/1906.03327

2、VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research

用于视频和语言研究的大规模,高质量多语言数据集

作者:Xin Wang, Jiawei Wu, Junkun Chen, Lei Li, Yuan-Fang Wang, William Yang Wang

论文链接:https://arxiv.org/abs/1904.03493

项目链接:http://vatex.org/main/index.html

论文解读:https://mp.weixin.qq.com/s/bOpKXshitpQ1YKE53WUPEw

3、BMN: Boundary-Matching Network for Temporal Action Proposal Generation

BMN:用于生成时间行动提案的边界匹配网络

作者:Tianwei Lin, Xiao Liu, Xin Li, Errui Ding, Shilei Wen

论文链接:https://arxiv.org/abs/1907.09702

4、Free-form Video Inpainting with 3D Gated Convolution and Temporal PatchGAN

使用3D门控卷积和时间PatchGAN进行自由视频修复

作者:Ya-Liang Chang, Zhe Yu Liu, Kuan-Ying Lee, Winston Hsu

论文链接:https://arxiv.org/abs/1904.10247

Github链接:https://github.com/amjltc295/Free-Form-Video-Inpainting

5、SlowFast Networks for Video Recognition(Oral)

用于视频识别的SlowFast网络

作者:Christoph Feichtenhofer, Haoqi Fan, Jitendra Malik, and Kaiming He

论文链接:https://arxiv.org/abs/1812.03982

6、Point-to-Point Video Generation

点对点视频生成

作者:Tsun-Hsuan Wang, Yen-Chi Cheng, Chieh Hubert Lin, Hwann-Tzong Chen, Min Sun

论文链接:https://arxiv.org/abs/1904.02912

项目链接:https://zswang666.github.io/P2PVG-Project-Page/?fbclid=IwAR1Cr-T54keo5zzaWLQuYNQMcPoKzXGr6-YrTDoauW6Hb5bOSwgluZQ3fIE

7、Disentangling Propagation and Generation for Video Prediction

作者:Hang Gao, Huazhe Xu, Qi-Zhi Cai, Ruth Wang, Fisher Yu, Trevor Darrell

论文链接:https://arxiv.org/pdf/1812.00452.pdf

8、Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition (Oral)

作者:Wenhao Wu, Dongliang He, Xiao Tan, Shifeng Chen, Shilei Wen

论文链接:https://arxiv.org/abs/1907.13369

9、VideoBERT: A Joint Model for Video and Language Representation Learning ( Oral )

VideoBERT:视频和语言表征学习的联合模型

作者:Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy, Cordelia Schmid

论文链接:https://arxiv.org/abs/1904.01766

10、TSM: Temporal Shift Module for Efficient Video Understanding

时间转移模块,用于高效的视频理解

作者:Ji Lin, Chuang Gan, Song Han

论文链接:https://arxiv.org/abs/1811.08383

Github链接:https://github.com/mit-han-lab/temporal-shift-module

11、Exploiting temporal consistency for real-time video depth estimation

利用时间一致性进行实时视频深度估计

Github链接:https://github.com/hkzhang91/ST-CLSTM

12、EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition

EPIC-Fusion:以自我为中心的动作识别的视听时间绑定

作者:Evangelos Kazakos, Arsha Nagrani, Andrew Zisserman, Dima Damen

Github链接:https://github.com/ekazakos/temporal-binding-network

13、Remote Heart Rate Measurement from Highly Compressed Facial Videos: an End-to-end Deep Learning Solution with Video Enhancement

远程压缩面部视频的心率测量:具有视频增强功能的端到端深度学习解决方案

作者:Zitong Yu, Wei Peng, Xiaobai Li, Xiaopeng Hong, Guoying Zhao

论文链接:https://arxiv.org/abs/1907.11921

14、Onion-Peel Networks for Deep Video Completion

用于深度视频完成的Onion-Peel网络

作者:Seoung Wug Oh, Sungho Lee, Joon-Young Lee, Seon Joo Kim

论文链接:https://arxiv.org/abs/1908.08718

15、An Internal Learning Approach to Video Inpainting

一种内部学习方法的视频Inpainting

作者:Haotian Zhang, Long Mai, Ning Xu, Zhaowen Wang, John Collomosse, Hailin Jin

论文链接:https://arxiv.org/abs/1909.07957

16、Graph Convolutional Networks for Temporal Action Localization

时间动作定位的图卷积网络

作者:Runhao Zeng, Wenbing Huang, Mingkui Tan, Yu Rong, Peilin Zhao, Junzhou Huang, Chuang Gan

论文链接:https://arxiv.org/abs/1909.03252

超分辨率

1、Deep SR-ITM: Joint Learning of Super-resolution and Inverse Tone-Mapping for 4K UHD HDR Applications(Oral)

Deep SR-ITM:4K UHD HDR应用的超分辨率和反色调映射联合学习

作者:Soo Ye Kim, Jihyong Oh, Munchurl Kim

论文链接:https://arxiv.org/abs/1904.11176

2、Toward Real-World Single Image Super-Resolution: A New Benchmark and A New Model

面向现实世界的单幅图像超分辨率:一个新的基准和一个新的模型

论文链接:https://csjcai.github.io/papers/RealSR.pdf

Github链接:https://github.com/csjcai/RealSR

自动驾驶

1、Exploring the Limitations of Behavior Cloning for Autonomous Driving

探讨自动驾驶行为克隆的局限性

作者:Felipe Codevilla, Eder Santana, Antonio M. López, Adrien Gaidon

论文链接:https://arxiv.org/abs/1904.08980

Github链接:https://github.com/felipecode/coiltraine/blob/master/docs/exploring_limitations.md

2、Scalable Place Recognition Under Appearance Change for Autonomous Driving(Oral )

可伸缩位置识别下外观变化自主驾驶

作者:Anh-Dzung Doan, Yasir Latif, Tat-Jun Chin, Yu Liu, Thanh-Toan Do, Ian Reid

论文链接:https://arxiv.org/abs/1908.00178

3D、点云

1、Deep Hough Voting for 3D Object Detection in Point Clouds(Oral)

DHV:点云中的三维物体检测

作者:Charles R. Qi, Or Litany, Kaiming He, Leonidas J. Guibas

论文链接:https://arxiv.org/abs/1904.09664

2、3D Point Cloud Learning for Large-scale Environment Analysis and Place Recognition

3D点云学习用于大规模环境分析和场所识别

作者:Zhe Liu, Shunbo Zhou, Chuanzhe Suo, Yingtian Liu, Hesheng Wang, Yun-Hui Liu

论文链接:https://arxiv.org/abs/1812.07050

3、Learning to Reconstruct 3D Manhattan Wireframes from a Single Image ( Oral )

学习从单个图像重建3D曼哈顿线框

作者:Yichao Zhou, Haozhi Qi, Yuexiang Zhai, Qi Sun, Zhili Chen, Li-Yi Wei, Yi Ma

论文链接:https://arxiv.org/abs/1905.07482

4、GarNet: A Two-stream Network for Fast and Accurate 3D Cloth Draping

GarNet:一个快速准确的3D布料覆盖双流网络

作者:Erhan Gundogdu, Victor Constantin, Amrollah Seifoddini, Minh Dang, Mathieu Salzmann, Pascal Fua

论文链接:https://arxiv.org/abs/1811.10983v2

项目链接:https://cvlab.epfl.ch/research/garment-simulation/garnet/

5、3D-RelNet: Joint Object and Relational Network for 3D Prediction

3D-RelNet:用于3D预测的联合对象和关系网络

作者:Nilesh Kulkarni, Ishan Misra, Shubham Tulsiani, Abhinav Gupta

论文链接:https://arxiv.org/pdf/1906.02729.pdf

6、PointFlow : 3D Point Cloud Generation with Continuous Normalizing Flows(Oral)

PointFlow:使用连续正常化流程生成3D点云

作者:Guandao Yang, Xun Huang, Zekun Hao, Ming-Yu Liu, Serge Belongie, Bharath Hariharan

论文链接:https://arxiv.org/abs/1906.12320

Github链接:https://github.com/stevenygd/PointFlow

项目链接:https://www.guandaoyang.com/PointFlow/

7、Multi-Angle Point Cloud-VAE: Unsupervised Feature Learning for 3D Point Clouds from Multiple Angles by Joint Self-Reconstruction and Half-to-Half Prediction

多角度点云-VAE:通过联合自我重建和半对半预测从多角度的三维点云进行无监督特征学习

作者:Zhizhong Han, Xiyang Wang, Yu-Shen Liu, Matthias Zwicker

论文链接:https://arxiv.org/abs/1907.12704

8、SceneGraphNet: Neural Message Passing for 3D Indoor Scene Augmentation

SceneGraphNet:神经消息传递3D室内场景增强

作者:Yang Zhou, Zachary While, Evangelos Kalogerakis

论文链接:https://arxiv.org/abs/1907.11308

9、HoloGAN: Unsupervised learning of 3D representations from natural images

HoloGAN:从自然图像中无监督地学习3D表示

作者:Thu Nguyen-Phuoc, Chuan Li, Lucas Theis, Christian Richardt, Yong-Liang Yang

论文链接:https://arxiv.org/abs/1904.01326

项目链接:https://www.monkeyoverflow.com/#/hologan-unsupervised-learning-of-3d-representations-from-natural-images/

10、FrameNet: Learning Local Canonical Frames of 3D Surfaces from a Single RGB Image

FrameNet:从单个RGB图像学习3D表面的局部规范框架

作者:Jingwei Huang, Yichao Zhou, Thomas Funkhouser, Leonidas Guibas

论文链接:https://arxiv.org/pdf/1903.12305.pdf

11、Face De-occlusion using 3D Morphable Model and Generative Adversarial

使用3D可变模型和生成对抗性进行面部去遮挡

作者:Xaiowei Yuan and In Kyu Park

论文链接:http://image.inha.ac.kr/paper/ICCV2019_Xaiowei.pdf

12、Multi-layer Depth and Epipolar Feature Transformers for 3D Scene Reconstruction

用于三维场景重建的多层深度和极线特征变换器

作者:Daeyun Shin, Zhile Ren, Erik B. Sudderth, Charless C. Fowlkes

论文链接:https://arxiv.org/abs/1902.06729

13、Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images

Pix2Vox:上下文感知的三维重建从单一和多视图图像

作者:Haozhe Xie, Hongxun Yao, Xiaoshuai Sun, Shangchen Zhou, Shengping Zhang

论文链接:https://arxiv.org/abs/1901.11153

Github链接:https://github.com/hzxie/Pix2Vox

14、MonoLoco: Monocular 3D Pedestrian Localization and Uncertainty Estimation

单目三维行人定位与不确定性估计

作者:Lorenzo Bertoni, Sven Kreiss, Alexandre Alahi

论文链接:https://arxiv.org/abs/1906.06059

Github链接:https://github.com/vita-epfl/monoloco

15、Moulding Humans: Non-parametric 3D Human Shape Estimation from Single Images

塑造人类:从单个图像中估计非参数三维人体形状

作者:Valentin Gabeur, Jean-Sebastien Franco, Xavier Martin, Cordelia Schmid, Gregory Rogez

论文链接:https://arxiv.org/abs/1908.00439

16、Pixel2Mesh++: Multi-View 3D Mesh Generation via Deformation

Pixel2Mesh++:通过变形生成多视图3D网格

作者:Chao Wen, Yinda Zhang, Zhuwen Li, Yanwei Fu

论文链接:https://arxiv.org/abs/1908.01491

17、View N-gram Network for 3D Object Retrieval

查看N-gram网络三维对象检索

作者:Xinwei He, Tengteng Huang, Song Bai, Xiang Bai

论文链接:https://arxiv.org/abs/1908.01958

18、GP2C: Geometric Projection Parameter Consensus for Joint 3D Pose and Focal Length Estimation in the Wild

GP2C:野外关节三维位姿和焦距估计的几何投影参数一致性

作者:Alexander Grabner, Peter M. Roth, Vincent Lepetit

论文链接:https://arxiv.org/abs/1908.02809

19、Neural 3D Morphable Models: Spiral Convolutional Networks for 3D Shape Representation Learning and Generation

用于视频显著性检测的时间聚合空间编解码器

作者:Giorgos Bouritsas, Sergiy Bokhnyak, Stylianos Ploumpis, Michael Bronstein, Stefanos Zafeiriou

论文链接:https://arxiv.org/abs/1905.02876

Github链接:https://github.com/gbouritsas/Neural3DMM

20、DUP-Net: Denoiser and Upsampler Network for 3D Adversarial Point Clouds Defense

DUP-Net:用于3D对抗点云防御的Denoiser和Upsampler网络

作者: Hang Zhou, Kejiang Chen, Weiming Zhang, Han Fang, Wenbo Zhou, Nenghai Yu

论文链接:https://arxiv.org/abs/1812.11017

21、Interpolated Convolutional Networks for 3D Point Cloud Understanding(Oral )

用于3D点云理解的插值卷积网络

作者: Jiageng Mao, Xiaogang Wang, Hongsheng Li

论文链接:https://arxiv.org/abs/1908.04512

22、Efficient Learning on Point Clouds with Basis Point Sets

基于点集的点云有效学习

作者:Sergey Prokudin, Christoph Lassner, Javier Romero

论文链接:https://arxiv.org/abs/1908.09186

23、Few-Shot Generalization for Single-Image 3D Reconstruction via Priors

基于先验的单幅图像三维重建的小镜头综合

作者:Bram Wallace, Bharath Hariharan

论文链接:https://arxiv.org/abs/1909.01205

24、DensePoint: Learning Densely Contextual Representation for Efficient Point Cloud Processing

DensePoint:学习密集的上下文表示,以实现高效的点云处理

作者:Yongcheng Liu, Bin Fan, Gaofeng Meng, Jiwen Lu, Shiming Xiang, Chunhong Pan

论文链接:https://arxiv.org/abs/1909.03669

GCN

1、Can GCNs Go as Deep as CNNs?

作者:Guohao Li, Matthias Müller, Ali Thabet, Bernard Ghanem

论文链接:https://arxiv.org/abs/1904.03751

GitHub:https://github.com/lightaime/deep_gcns

GAN

1、Controllable Artistic Text Style Transfer via Shape-Matching GAN(Oral )

通过形匹配GAN实现可控的艺术文本风格转换

作者:Shuai Yang, Zhangyang Wang, Zhaowen Wang, Ning Xu, Jiaying Liu, Zongming Guo

论文链接:https://arxiv.org/abs/1905.01354

Github链接:https://github.com/TAMU-VITA/ShapeMatchingGAN

项目链接:https://williamyang1991.github.io/projects/ICCV2019/SMGAN.html

2、Photo-Realistic Monocular Gaze Redirection Using Generative Adversarial Networks

利用生成对抗性网络,实现逼真的单目凝视重定向

作者:Zhe He, Adrian Spurr, Xucong Zhang, Otmar Hilliges

论文链接:https://arxiv.org/abs/1903.12530

Github链接:https://github.com/HzDmS/gaze_redirection

3、AutoGAN: Neural Architecture Search for Generative Adversarial Networks

神经结构搜索生成对抗性网络

Github链接:https://github.com/TAMU-VITA/AutoGAN

4、ARGAN: Attentive Recurrent Generative Adversarial Network for Shadow Detection and Removal

ARGAN:用于阴影检测和去除的细心的反复生成的对抗性网络

作者:Bin Ding, Chengjiang Long, Ling Zhang, Chunxia Xiao

论文链接:https://arxiv.org/abs/1908.01323

其他

1、Meta-Sim Learning to Generate Synthetic Datasets (Oral)

Meta-Sim学习生成合成数据集

作者:Amlan Kar, Aayush Prakash, Ming-Yu Liu, Eric Cameracci, Justin Yuan, Matt Rusiniak, David Acuna, Antonio Torralba, Sanja Fidler

项目链接:HTTPS://nv-tlabs.github.io/meta-sim/

论文链接:HTTPS://arxiv.org/abs/1904.11621

2、nocaps: novel object captioning at scale

nocaps:大规模的新颖物体字幕

作者:Harsh Agrawal, Karan Desai, Yufei Wang, Xinlei Chen, Rishabh Jain, Mark Johnson, Dhruv Batra, Devi Parikh, Stefan Lee, Peter

项目链接:https://nocaps.org

论文链接:https://arxiv.org/abs/1812.08658

3、Scene GraphPrediction with Limited Labels

作者:Vincent S. Chen, Paroma Varma, Ranjay Krishna, Michael Bernstein, Christopher Re, Li Fei-Fei

论文链接:https://arxiv.org/abs/1904.11622

4、Variational Adversarial Active Learning(Oral)

变异对抗主动学习

作者:Samarth Sinha, Sayna Ebrahimi, Trevor Darrell

论文链接:https://arxiv.org/abs/1904.00370

5、The Trajectron: Probabilistic Multi-Agent Trajectory Modeling with Dynamic Spatiotemporal Graphs

Trajectron:使用动态时空图的概率多智能体轨迹建模

作者:Boris Ivanovic, Marco Pavone

论文链接:https://arxiv.org/abs/1810.05993

6、End-to-End Learning of Representations for Asynchronous Event-BasedData

异步事件数据表示的端到端学习

作者:Daniel Gehrig, Antonio Loquercio, Konstantinos G. Derpanis, Davide Scaramuzza

论文链接:https://arxiv.org/abs/1904.08245

7、End-to-End Wireframe Parsing

端到端线框解析

作者:Yichao Zhou, Haozhi Qi, Yi Ma

论文链接:https://arxiv.org/abs/1905.03246

Github链接:https://github.com/zhou13/lcnn

8、Correlation Congruence for Knowledge Distillation

知识蒸馏的相关同余

作者:Baoyun Peng, Xiao Jin, Jiaheng Liu, Shunfeng Zhou, Yichao Wu, Yu Liu, Dongsheng Li, Zhaoning Zhang

论文链接:https://arxiv.org/abs/1904.018029

9、Equivariant Multi-View Networks (Oral)

Equivariant多视图网络

作者:Carlos Esteves, Yinshuang Xu, Christine Allen-Blanchette, Kostas Daniilidis

论文链接:https://arxiv.org/abs/1904.00993

Github链接:https://github.com/daniilidis-group/emvn

10、Episodic Training for Domain Generalization

领域泛化的模式训练

作者:Da Li, Jianshu Zhang, Yongxin Yang, Cong Liu, Yi-Zhe Song, Timothy M. Hospedales

论文链接:https://arxiv.org/abs/1902.00113

11、Few-shot Unsupervised Image-to-Image Translation

作者:Ming-Yu Liu, Xun Huang, Arun Mallya, Tero Karras, Timo Aila, Jaakko Lehtinen, Jan Kautz

论文链接:https://arxiv.org/abs/1905.01723

Github链接:https://github.com/nvlabs/FUNIT/

项目链接:http://www.cs.cornell.edu/~xhuang/publication/funit/

12、Tex2Shape: Detailed Full Human Body Geometry from a Single Image

Tex2Shape:单个图像的人体详细几何特征

作者:Thiemo Alldieck, Gerard Pons-Moll, Christian Theobalt, Marcus Magnor

论文链接:https://arxiv.org/abs/1904.08645

Github链接:https://github.com/thmoa/tex2shape

13、Semi-supervised Domain Adaptation via Minimax Entropy

通过Minimax熵进行半监督的域适应

作者:Kuniaki Saito, Donghyun Kim, Stan Sclaroff, Trevor Darrell, Kate Saenko

论文链接:https://arxiv.org/abs/1904.06487

14、Canonical Surface Mapping via Geometric Cycle Consistency

通过几何周期一致性的经典表面映射

作者:Nilesh Kulkarni, Abhinav Gupta, Shubham Tulsiani

论文链接:https://arxiv.org/abs/1907.10043

项目链接:https://nileshkulkarni.github.io/csm/

15、U4D: Unsupervised 4D Dynamic Scene Understanding

U4D:无监督的4D动态场景理解

作者:Armin Mustafa, Chris Russell, Adrian Hilton

论文链接:https://arxiv.org/abs/1907.09905

16、Scoot: A Perceptual Metric for Facial Sketches

Scoot:面部草图的感知度量

作者:Deng-Ping Fan, ShengChuan Zhang, Yu-Huan Wu, Yun Liu, Ming-Ming Cheng, Bo Ren, Paul L Rosin, Rongrong Ji

论文链接:http://dpfan.net/wp-content/uploads/FaceSketch.pdf

Github链接:http://dpfan.net/wp-content/uploads/Scoot.zip

项目链接:http://dpfan.net/Scoot/

17、Similarity-Preserving Knowledge Distillation

相似性 - 保持知识蒸馏

作者:Frederick Tung, Greg Mori

论文链接:https://arxiv.org/abs/1907.09682

18、Tell, Draw, and Repeat: Generating and modifying images based on continual linguistic instruction

Tell,Draw和Repeat:基于连续的语言指令生成和修改图像

作者:Alaaeldin El-Nouby, Shikhar Sharma, Hannes Schulz, Devon Hjelm, Layla El Asri, Samira Ebrahimi Kahou, Yoshua Bengio, Graham W. Taylor

论文链接:https://arxiv.org/pdf/1811.09845.pdf

19、Semantic Adversarial Attacks: Parametric Transformations That Fool Deep Classifiers

语义对抗性攻击:欺骗深度分类器的参数化变换

作者:Ameya Joshi, Amitangshu Mukherjee, Soumik Sarkar, Chinmay Hegde

论文链接:https://arxiv.org/abs/1904.08489

20、What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention

用滚动展开LSTM和模态注意预测自我中心行为

作者:Antonino Furnari, Giovanni Maria Farinella

项目链接:https://iplab.dmi.unict.it/rulstm/

论文链接:https://arxiv.org/pdf/1905.09035.pdf

GitHub:https://github.com/antoninofurnari/rulstm

21、Improving Adversarial Robustness via Guided Complement Entropy

通过引导补语熵提高对抗鲁棒性

作者:Hao-Yun Chen, Jhao-Hong Liang, Shih-Chieh Chang, Jia-Yu Pan, Yu-Ting Chen, Wei Wei, Da-Cheng Juan.

论文链接:https://arxiv.org/abs/1903.09799

Github链接:https://github.com/henry8527/GCE

22、6-DOF GraspNet: Variational Grasp Generation for Object Manipulation

6-DOF GraspNet:对象操作的变分抓取生成

作者:Arsalan Mousavian, Clemens Eppner, Dieter Fox

论文链接:https://arxiv.org/abs/1905.10520

23、Analyzing the Variety Loss in the Context of Probabilistic Trajectory Prediction

概率弹道预测语境中的变种损失分析

作者:Luca Anthony Thiede, Pratik Prabhanjan Brahma

论文链接:https://arxiv.org/abs/1907.10178

24、DAFL: Data-Free Learning of Student Networks

DAFL:学生网络的无数据学习

作者:Hanting Chen, Yunhe Wang, Chang Xu, Zhaohui Yang, Chuanjian Liu, Boxin Shi, Chunjing Xu, Chao Xu, Qi Tian

论文链接:https://arxiv.org/abs/1904.01186

25、Boosting Few-Shot Visual Learning with Self-Supervision

自我监督推动的少样本视觉学习

作者:Spyros Gidaris, Andrei Bursuc, Nikos Komodakis, Patrick Pérez, Matthieu Cord

论文链接:https://arxiv.org/abs/1906.05186

26、A Quaternion-based Certifiably Optimal Solution to the Wahba Problem with Outliers

Wahba问题异常值的基于四元数的可证明最优解

作者:Heng Yang, Luca Carlone

论文链接:https://arxiv.org/abs/1905.12536

27、Embodied Visual Recognition

人体视觉识别

作者:Jianwei Yang,Zhile Ren,Mingze Xu,Xinlei Chen,David Crandall,Devi Parikh,Dhruv Batra

项目链接:https://www.cc.gatech.edu/~jyang375/evr.html

28、Learning Implicit Generative Models by Matching Perceptual Features(Oral)

作者:Cicero Nogueira dos Santos, Youssef Mroueh, Inkit Padhi, Pierre Dognin

论文链接:https://arxiv.org/abs/1904.02762v1

29、Rethinking ImageNet Pre-training

作者:Kaiming He, Ross Girshick, and Piotr Dollár

论文链接:https://arxiv.org/abs/1811.08883

30、COCO-GAN: Generation by Parts via Conditional Coordinating(Oral)

COCO-GAN:通过条件协调按部件生成

作者:Chieh Hubert Lin, Chia-Che Chang, Yu-Sheng Chen, Da-Cheng Juan, Wei Wei, Hwann-Tzong Chen

论文链接:https://arxiv.org/abs/1904.00284

Github链接:https://github.com/hubert0527/COCO-GAN

项目链接:https://hubert0527.github.io/COCO-GAN/

31、Model Vulnerability to Distributional Shifts over Image Transformation Sets

作者:Riccardo Volpi, Vittorio Murino

论文链接:https://arxiv.org/abs/1903.11900

Github链接:https://github.com/ricvolpi/domain-shift-robustness

32、Exploring Randomly Wired Neural Networks for Image Recognition(Oral)

探索随机有线神经网络进行图像识别

作者:Saining Xie, Alexander Kirillov, Ross Girshick, and Kaiming He

论文链接:https://arxiv.org/abs/1904.01569

33、Temporal Attentive Alignment for Large-Scale Video Domain Adaptation

作者:Min-Hung Chen, Zsolt Kira, Ghassan AlRegib, Jaekwon Woo, Ruxin Chen, Jian Zheng

论文链接:https://arxiv.org/abs/1907.12743

Github链接:http://github.com/cmhungsteve/TA3N

34、Creativity Inspired Zero-Shot Learning

启发式的零样本学习

作者:Mohamed Elhoseiny, Mohamed Elfeki

论文链接:https://arxiv.org/abs/1904.01109

35、Model Vulnerability to Distributional Shifts over Image Transformation Sets

作者:Riccardo Volpi, Vittorio Murino

论文链接:https://arxiv.org/abs/1903.11900

Github链接:https://github.com/ricvolpi/domain-shift-robustness

36、Coherent Semantic Attention for Image Inpainting

图像修复的连贯语义注意力机制

作者:Hongyu Liu, Bin Jiang, Yi Xiao, Chao Yang

论文链接:https://arxiv.org/abs/1905.12384

37、Learning to Paint with Model-based Deep Reinforcement Learning

基于模型的深层强化学习学习绘画

作者:Zhewei Huang, Wen Heng, Shuchang Zhou

论文链接:https://arxiv.org/abs/1903.04411

Github链接:https://github.com/hzwer/ICCV2019-LearningToPaint

38、LayoutVAE: Stochastic Scene Layout Generation from a Label Set

LayoutVAE:从标签集生成随机场景布局

作者:Akash Abdu Jyothi, Thibaut Durand, Jiawei He, Leonid Sigal, Greg Mori

论文链接:https://arxiv.org/abs/1907.10719

39、Co-Evolutionary Compression for Unpaired Image Translation

非成对图像翻译的共同进化压缩

作者:Han Shu, Yunhe Wang, Xu Jia, Kai Han, Hanting Chen, Chunjing Xu, Qi Tian, Chang Xu

论文链接:https://arxiv.org/abs/1907.10804

40、Enhancing Adversarial Example Transferability with an Intermediate Level Attack

通过中级攻击增强对抗性示例可转移性

作者:Qian Huang, Isay Katsman, Horace He, Zeqi Gu, Serge Belongie, Ser-Nam Lim

论文链接:https://arxiv.org/abs/1907.10823

41、Gated2Depth: Real-time Dense Lidar from Gated Images

Gated2Depth:来自门控图像的实时密集激光雷达

作者:Tobias Gruber, Frank Julca-Aguilar,Mario Bijelic,Werner Ritter,Klaus Dietmayer,Felix Heide

论文链接:https://www.cs.princeton.edu/~fheide/papers/Gated2Depth_preprint.pdf

42、Counting with Focus for Free

作者:Zenglin Shi, Pascal Mettes, Cees G. M. Snoek

论文链接:https://arxiv.org/abs/1903.12206

Github链接:https://github.com/shizenglin/Counting-with-Focus-for-Free

43、PU-GAN: a Point Cloud Upsampling Adversarial Network

作者:Ruihui Li, Xianzhi Li, Chi-Wing Fu, Daniel Cohen-Or, Pheng-Ann Heng

论文链接:https://arxiv.org/abs/1907.10844

44、Moment Matching for Multi-Source Domain Adaptation (Oral)

多源域适应的配套匹配

作者:Xingchao Peng, Qinxun Bai, Xide Xia, Zijun Huang, Kate Saenko, Bo Wang

论文链接:https://arxiv.org/abs/1812.01754

45、EMPNet: Neural Localisation and Mapping using Embedded Memory Points

EMPNet:使用嵌入式存储点的神经定位和映射

作者:Gil Avraham, Yan Zuo, Thanuja Dharmasiri, Tom Drummond

论文链接:https://arxiv.org/abs/1907.13268

46、Learning Compositional Representations for Few-Shot Recognition

少样本识别的构图表示学习

作者:Pavel Tokmakov, Yuxiong Wang, Martial Hebert

论文链接:https://sites.google.com/view/comprepr/home

47、Digging Into Self-Supervised Monocular Depth Estimation

自我监督单眼深度估计的研究

作者:Clement Godard, Oisin Mac Aodha, Michael Firman, Gabriel Brostow

论文链接:https://arxiv.org/pdf/1806.01260.pdf

48、Deep Interpretable Non-Rigid Structure from Motion

运动的深层可解释的非刚性结构

作者:Chen Kong, Simon Lucey

论文链接:https://arxiv.org/pdf/1902.10840.pdf

49、PRECOG: PREdiction Conditioned On Goals in Visual Multi-Agent Settings

PRECOG:视觉多代理设置中的目标条件

作者:Nicholas Rhinehart, Rowan McAllister, Kris Kitani, Sergey Levine

论文链接:https://arxiv.org/pdf/1905.01296.pdf

项目链接:https://sites.google.com/view/precog

50、Lifelong GAN: Continual Learning for Conditional Image Generation

终身GAN:条件图像生成的持续学习

作者:Mengyao Zhai, Lei Chen, Fred Tung, Jiawei He, Megha Nawhal, Greg Mori

论文链接:https://arxiv.org/abs/1907.10107

52、An Empirical Study of Spatial Attention Mechanisms in Deep Networks

深度网络空间注意机制的实证研究

作者:Xizhou Zhu, Dazhi Cheng, Zheng Zhang, Stephen Lin, Jifeng Dai

论文链接:https://arxiv.org/pdf/1904.05873.pdf

53、Fashion++: Minimal Edits for Outfit Improvement

Fashion ++:改进装备的最小编辑

作者:Wei-Lin Hsiao, Isay Katsman, Chao-Yuan Wu, Devi Parikh, Kristen Grauman

论文链接:https://arxiv.org/pdf/1904.09261.pdf

54、Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment

Align2Ground:由图像标题对齐引导的弱监督短语接地

作者:Samyak Datta, Karan Sikka, Anirban Roy, Karuna Ahuja, Devi Parikh, Ajay Divakaran

论文链接:https://arxiv.org/pdf/1903.11649.pdf

55、Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded

做一个提示:利用解释使视觉和语言模型更加扎实

作者:Ramprasaath R. Selvaraju, Stefan Lee, Yilin Shen, Hongxia Jin, Dhruv Batra, Devi Parikh

论文链接:https://arxiv.org/pdf/1902.03751.pdf

56、SplitNet: Sim2Sim and Task2Task Transfer for Embodied Visual Navigation

SplitNet:用于体验视觉导航的Sim2Sim和Task2Task转移

作者:Daniel Gordon, Abhishek Kadian, Devi Parikh, Judy Hoffman, Dhruv Batra

论文链接:https://arxiv.org/pdf/1905.07512.pdf

57、Habitat: A Platform for Embodied AI Research ( Oral )

Habitat:体验人工智能研究的平台

作者:Manolis Savva, Abhishek Kadian, Oleksandr Maksymets, Yili Zhao, Erik Wijmans, Bhavana Jain, Julian Straub, Jia Liu, Vladlen Koltun, Jitendra Malik, Devi Parikh, Dhruv Batra

论文链接:https://arxiv.org/abs/1904.01201

58、EM-Fusion: Dynamic Object-Level SLAM with Probabilistic Data Association

EM-Fusion:具有概率数据关联的动态对象级SLAM

作者:Michael Strecke, Jörg Stückler

论文链接:https://arxiv.org/abs/1904.11781

59、Texture Fields: Learning Texture Representations in Function Space

纹理字段:在函数空间中学习纹理表示

作者:Michael Oechsle, Lars Mescheder, Michael Niemeyer, Thilo Strauss, Andreas Geiger

论文链接:https://arxiv.org/abs/1905.07259

60、AMASS: Archive of Motion Capture as Surface Shapes

AMASS:将运动捕捉存档为表面形状

作者:Naureen Mahmood, Nima Ghorbani, Nikolaus F. Troje, Gerard Pons-Moll, Michael J. Black

论文链接:https://arxiv.org/abs/1904.03278

61、End-to-end Learning for Graph Decomposition

图形分解的端到端学习

作者:Jie Song, Bjoern Andres, Michael Black, Otmar Hilliges, Siyu Tang

论文链接:https://arxiv.org/pdf/1812.09737.pdf

62、Towards Multi-pose Guided Virtual Try-on Network

Towards多姿态引导虚拟试穿网络

作者:Haoye Dong, Xiaodan Liang, Bochao Wang, Hanjiang Lai, Jia Zhu, Jian Yin

论文链接:https://arxiv.org/abs/1902.11026

63、On the Design of Black-box Adversarial Examples by Leveraging Gradient-free Optimization and Operator Splitting Method

利用无梯度优化和算子分裂方法设计黑盒对抗实例

作者:Pu Zhao, Sijia Liu, Pin-Yu Chen, Nghia Hoang, Kaidi Xu, Bhavya Kailkhura, Xue Lin

论文链接:https://arxiv.org/abs/1907.11684

64、Goal-Driven Sequential Data Abstraction

目标驱动的顺序数据抽象

作者:Umar Riaz Muhammad, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song

论文链接:https://arxiv.org/abs/1907.12336

65、Recursive Cascaded Networks for Unsupervised Medical Image Registration

用于无监督医学图像配准的递归级联网络

作者: Shengyu Zhao, Yue Dong, Eric I-Chao Chang, Yan Xu

论文链接:https://arxiv.org/abs/1907.12353

66、Learn to Scale: Generating Multipolar Normalized Density Map for Crowd Counting

学习规模:为人群计数生成多极归一化密度图

作者:Chenfeng Xu, Kai Qiu, Jianlong Fu, Song Bai, Yongchao Xu, Xiang Bai

论文链接:https://arxiv.org/abs/1907.12428

67、MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

MetaPruning:自动神经网络通道修剪的元学习

作者:Zechun Liu, Haoyuan Mu, Xiangyu Zhang, Zichao Guo, Xin Yang, Tim Kwang-Ting Cheng, Jian Sun

论文链接:https://arxiv.org/abs/1903.10258

68、Switchable Whitening for Deep Representation Learning

作者:Xingang Pan, Xiaohang Zhan, Jianping Shi, Xiaoou Tang, Ping Luo

论文链接:https://arxiv.org/abs/1904.09739

69、Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution

作者:Yunpeng Chen, Haoqi Fan, Bing Xu, Zhicheng Yan, Yannis Kalantidis, Marcus Rohrbach, Shuicheng Yan, Jiashi Feng

论文链接:https://arxiv.org/abs/1904.05049

70、Task2Vec: Task Embedding for Meta-Learning

Task2Vec:元学习的任务嵌入

作者:Alessandro Achille, Michael Lam, Rahul Tewari, Avinash Ravichandran, Subhransu Maji, Charless Fowlkes, Stefano Soatto, Pietro Perona

论文链接:https://arxiv.org/abs/1902.03545

71、CARAFE: Content-Aware ReAssembly of FEatures ( Oral )

CARAFE:内容意识重新组装特征

作者:Jiaqi Wang, Kai Chen, Rui Xu, Ziwei Liu, Chen Change Loy, Dahua Lin

论文链接:https://arxiv.org/pdf/1905.02188.pdf

72、Domain Intersection and Domain Difference

域交和域差

Github链接:https://github.com/sagiebenaim/DomainIntersectionDifference

73、A Closed-form Solution to Universal Style Transfer

一种通用样式转换的封闭式解决方案

作者:Ming Lu, Hao Zhao, Anbang Yao, Yurong Chen, Feng Xu, Li Zhang

论文链接:https://arxiv.org/abs/1906.00668

Github链接:https://github.com/lu-m13/OptimalStyleTransfer

74、Sampling-free Epistemic Uncertainty Estimation Using Approximated Variance Propagation

基于近似方差传播的无样本认知不确定性估计

Github链接:https://github.com/janisgp/Sampling-free-Epistemic-Uncertainty

75、On the Over-Smoothing Problem of CNN Based Disparity Estimation

基于CNN的视差估计的过平滑问题

Github链接:https://github.com/chenchr/otosp

76、Metric Learning with HORDE: High-Order Regularizer for Deep Embeddings

HORDE度量学习:用于深度嵌入的高阶正则化器

论文链接:https://arxiv.org/abs/1908.02735

Github链接:https://github.com/pierre-jacob/ICCV2019-Horde

77、Mask-ShadowGAN: Learning to Remove Shadows from Unpaired Data

面具阴影甘:学习从未配对的数据中去除阴影

作者:Xiaowei Hu, Yitong Jiang, Chi-Wing Fu, and Pheng-Ann Heng

Github链接:https://github.com/xw-hu/Mask-ShadowGAN

78、Universally Slimmable Networks and Improved Training Techniques

普遍精简的网络和改进的培训技术

作者:Jiahui Yu, Thomas Huang

论文链接:https://arxiv.org/abs/1903.05134

Github链接:https://github.com/JiahuiYu/slimmable_networks

79、Domain Adaptation for Structured Output via Discriminative Patch Representations (Oral)

通过有区别的Patch表示对结构化输出进行域适应

作者:Yi-Hsuan Tsai, Kihyuk Sohn, Samuel Schulter, Manmohan Chandraker

论文链接:https://arxiv.org/abs/1901.05427

80、Deep Non-Rigid Structure from Motion(Oral)

.深非刚性的结构与运动

作者:Chen Kong, Simon Lucey

论文链接:https://arxiv.org/abs/1908.00052

81、Learning the Model Update for Siamese Trackers

学习暹罗语追踪器的模型更新

作者:Lichao Zhang, Abel Gonzalez-Garcia, Joost van de Weijer, Martin Danelljan, Fahad Shahbaz Khan

论文链接:https://arxiv.org/abs/1908.00855

82、Distilling Knowledge From a Deep Pose Regressor Network

从深层位姿回归网络中提取知识

作者:Muhamad Risqi U. Saputra, Pedro P. B. de Gusmao, Yasin Almalioglu, Andrew Markham, Niki Trigoni

论文链接:https://arxiv.org/abs/1908.00858

83、Permutation-invariant Feature Restructuring for Correlation-aware Image Set-based Recognition

基于相关感知的图像集识别的置换不变特征重构

作者:Xiaofeng Liu, Zhenhua Guo, Site Li, Lingsheng Kong, Ping Jia, Jane You, B. V. K. Kumar

论文链接:https://arxiv.org/abs/1908.01174

84、Restoration of Non-rigidly Distorted Underwater Images using a Combination of Compressive Sensing and Local Polynomial Image Representations(Oral )

恢复非刚性的扭曲的水下图像使用压缩传感和图像局部多项式表示的组合

作者: Jerin Geo James, Pranay Agrawal, Ajit Rajwade

论文链接:https://arxiv.org/abs/1908.01940

85、Semi-supervised Skin Detection by Network with Mutual Guidance

基于相互指导的网络半监督皮肤检测

作者:Yi He, Jiayuan Shi, Chuan Wang, Haibin Huang, Jiaming Liu, Guanbin Li, Risheng Liu, Jue Wang

论文链接:https://arxiv.org/abs/1908.01977

86、Consensus Maximization Tree Search Revisited(Oral)

共识最大化树搜索重新审视(口语)

作者:Zhipeng Cai, Tat-Jun Chin, Vladlen Koltun

论文链接:https://arxiv.org/abs/1908.02021

87、Deep Self-Learning From Noisy Labels

从嘈杂的标签中进行深度的自我学习

作者:Jiangfan Han, Ping Luo, Xiaogang Wang

论文链接:https://arxiv.org/abs/1908.02160

88、Symmetric Graph Convolutional Autoencoder for Unsupervised Graph Representation Learning

用于无监督图表示学习的对称图卷积自编码器

作者:Jiwoong Park, Minsik Lee, Hyung Jin Chang, Kyuewang Lee, Jin Young Choi

论文链接:https://arxiv.org/abs/1908.02441

89、Expert Sample Consensus Applied to Camera Re-Localization

将专家样本一致性应用于相机再定位

作者:Eric Brachmann, Carsten Rother

论文链接:https://arxiv.org/abs/1908.02484

90、SpatialSense: An Adversarially Crowdsourced Benchmark for Spatial Relation Recognition

空间感:一个反向众包的空间关系识别基准

作者:Kaiyu Yang, Olga Russakovsky, Jia Deng

论文链接:https://arxiv.org/abs/1908.02660

91、Bidirectional One-Shot Unsupervised Domain Mapping

双向一次无监督域映射

Github链接:https://github.com/tomercohen11/BiOST

92、CompenNet++: End-to-end Full Projector Compensation

CompenNet++:端到端全投影仪补偿

Github链接:https://github.com/BingyaoHuang/CompenNet-plusplus

93、Perspective-Guided Convolution Networks for Crowd Counting

用于人群计数的透视引导卷积网络

Github链接:https://github.com/Zhaoyi-Yan/PGCNet

94、Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation

Larger范数更可转移:无监督域自适应的自适应特征范数方法

作者:Ruijia Xu, Guanbin Li, Jihan Yang, Liang Lin

论文链接:https://arxiv.org/abs/1811.07456

95、Closed-Form Optimal Two-View Triangulation Based on Angular Errors

基于角度误差的闭式最优双视三角剖分

作者:Seong Hun Lee, Javier Civera

论文链接:https://arxiv.org/abs/1903.09115

96、Overcoming Catastrophic Forgetting with Unlabeled Data in the Wild

在荒野中克服了未标记数据的灾难性遗忘

作者:Kibok Lee, Kimin Lee, Jinwoo Shin, Honglak Lee

论文链接:https://arxiv.org/abs/1903.12648

Github链接:https://github.com/kibok90/iccv2019-inc

97、Learning Combinatorial Embedding Networks for Deep Graph Matching

用于深度图匹配的学习组合嵌入网络

作者:Runzhong Wang, Junchi Yan, Xiaokang Yang

论文链接:https://arxiv.org/abs/1904.00597

98、PR Product: A Substitute for Inner Product in Neural Networks(Oral )

PR产品:神经网络内部产品替代品(口服)

作者:Zhennan Wang, Wenbin Zou, Chen Xu

论文链接:https://arxiv.org/abs/1904.13148

Github链接:https://github.com/wzn0828/PR_Product

99、STM: SpatioTemporal and Motion Encoding for Action Recognition

STM:行动识别的SpatioTmporal和运动编码

作者:Boyuan Jiang, Mengmeng Wang, Weihao Gan, Wei Wu, Junjie Yan

论文链接:https://arxiv.org/abs/1908.02486

100、Memory-Based Neighbourhood Embedding for Visual Recognition(Oral )

基于记忆的邻域嵌入视觉识别

作者:Suichan Li, Dapeng Chen, Bin Liu, Nenghai Yu, Rui Zhao

论文链接:https://arxiv.org/abs/1908.04992

101、Few-Shot Learning with Global Class Representations

全球班级代表的快速学习

作者:Tiange Luo, Aoxue Li, Tao Xiang, Weiran Huang, Liwei Wang

论文链接:https://arxiv.org/abs/1908.05257

102、Learning Trajectory Dependencies for Human Motion PredictionOral

学习人体运动预测的轨迹依赖性

作者:Wei Mao, Miaomiao Liu, Mathieu Salzmann, Hongdong Li

论文链接:https://arxiv.org/abs/1908.05436

Github链接:https://github.com/wei-mao-2019/LearnTrajDep

103、Symmetric Cross Entropy for Robust Learning with Noisy Labels

具有噪声标签的鲁棒学习的对称交叉熵

作者:Yisen Wang, Xingjun Ma, Zaiyi Chen, Yuan Luo, Jinfeng Yi, James Bailey

论文链接:https://arxiv.org/abs/1908.06112

104、From Open Set to Closed Set: Counting Objects by Spatial Divide-and-Conquer

从打开设置到封闭设置:按空间划分和计数计算对象

作者:Haipeng Xiong, Hao Lu, Chengxin Liu, Liang Liu, Zhiguo Cao, Chunhua Shen

论文链接:https://arxiv.org/abs/1908.06473

Github链接:https://github. com/xhp-hust-2018-2011/S-DCNet

105、Human Mesh Recovery from Monocular Images via a Skeleton-disentangled Representation

通过骷髅解剖表示从人工图像中恢复人体网格

作者:Sun Yu, Ye Yun, Liu Wu, Gao Wenpeng, Fu YiLi, Mei Tao

论文链接:https://arxiv.org/abs/1908.07172

106、ViCo: Word Embeddings from Visual Co-occurrences

ViCo:来自视觉共现的词嵌入

作者:Tanmay Gupta, Alexander Schwing, Derek Hoiem

论文链接:https://arxiv.org/abs/1908.08527

项目链接:http://tanmaygupta.info/vico/

107、Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning

在多样性图像标题期间建模意图的顺序潜在空间

作者:Jyoti Aneja, Harsh Agrawal, Dhruv Batra, Alexander Schwing

论文链接:https://arxiv.org/abs/1908.08529

108、Learning Similarity Conditions Without Explicit Supervision

在没有明确监督的情况下学习相似性条件

作者:Reuben Tan, Mariya I. Vasileva, Kate Saenko, Bryan A. Plummer

论文链接:https://arxiv.org/abs/1908.08589

109、Shadow Removal via Shadow Image Decomposition

通过阴影图像分解去除阴影

作者:Hieu Le, Dimitris Samaras

论文链接:https://arxiv.org/abs/1908.08628

110、Crowd Counting with Deep Structured Scale Integration Network

深度结构化规模集成网络的计算

作者:Lingbo Liu, Zhilin Qiu, Guanbin Li, Shufan Liu, Wanli Ouyang, Liang Lin

论文链接:https://arxiv.org/abs/1908.08692

111、Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry

自我监督的深度视觉测距的顺序对抗性学习

作者:Shunkai Li, Fei Xue, Xin Wang, Zike Yan, Hongbin Zha

论文链接:https://arxiv.org/abs/1908.08704

112、Learning Filter Basis for Convolutional Neural Network Compression

卷积神经网络压缩的学习滤波器基础

作者:Yawei Li, Shuhang Gu, Luc Van Gool, Radu Timofte

论文链接:https://arxiv.org/abs/1908.08932

Github链接:https://github.com/ofsoundof/learning_filter_basis

113、Where Is My Mirror?

作者:Xin Yang, Haiyang Mei, Ke Xu, Xiaopeng Wei, Baocai Yin, Rynson W. H. Lau

论文链接:https://arxiv.org/abs/1908.09101

项目链接:https://mhaiyang.github.io/ICCV2019_MirrorNet/index.html

114、Towards Unsupervised Image Captioning with Shared Multimodal Embeddings

使用共享多模式嵌入来保护无监督的图像标题

作者:Iro Laina, Christian Rupprecht, Nassir Navab

论文链接:https://arxiv.org/abs/1908.09317

115、Object-Driven Multi-Layer Scene Decomposition From a Single Image

来自单个图像的对象驱动的多层场景分解

作者:Helisa Dhamo, Nassir Navab, Federico Tombari

论文链接:https://arxiv.org/abs/1908.09521

116、Non-local Recurrent Neural Memory for Supervised Sequence Modeling(Oral)

用于监督序列建模的非局部递归神经记忆

作者:Canmiao Fu, Wenjie Pei, Qiong Cao, Chaopeng Zhang, Yong Zhao, Xiaoyong Shen, Yu-Wing Tai

论文链接:https://arxiv.org/abs/1908.09535

117、Embarrassingly Simple Binary Representation Learning

简单的二进制表示学习

作者:Yuming Shen, Jie Qin, Jiaxin Chen, Li Liu, Fan Zhu

论文链接:https://arxiv.org/abs/1908.09573

118、Stochastic Filter Groups for Multi-Task CNNs: Learning Specialist and Generalist Convolution Kernels(Oral )

简单的二进制表示学习

作者:Felix J. S. Bragman, Ryutaro Tanno, Sebastien Ourselin, Daniel C. Alexander, M. Jorge Cardoso

论文链接:https://arxiv.org/abs/1908.09597

119、Confidence Regularized Self-Training

肯定的自我训练

作者:Yang Zou, Zhiding Yu, Xiaofeng Liu, B. V. K. Vijaya Kumar, Jinsong Wang

论文链接:https://arxiv.org/abs/1908.09822

Github链接:https://github.com/yzou2/CRST

120、SoftTriple Loss: Deep Metric Learning Without Triplet Sampling

软三重损失:没有三重抽样的深度度量学习

作者:Qi Qian, Lei Shang, Baigui Sun, Juhua Hu, Hao Li, Rong Jin

论文链接:https://arxiv.org/abs/1909.05235

121、A Camera That CNNs: Towards Embedded Neural Networks on Pixel Processor Arrays

一种CNNs相机:面向像素处理器阵列上的嵌入式神经网络

作者:Laurie Bose, Jianing Chen, Stephen J. Carey, Piotr Dudek, Walterio Mayol-Cuevas

论文链接:https://arxiv.org/abs/1909.05647

122、DeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch

深度修剪:通过可微PatchMatch学习有效的立体匹配

作者:Shivam Duggal, Shenlong Wang, Wei-Chiu Ma, Rui Hu, Raquel Urtasun

论文链接:https://arxiv.org/abs/1909.05845

123、Rethinking Zero-Shot Learning: A Conditional Visual Classification Perspective

反思零镜头学习:一个有条件的视觉分类视角

作者:Kai Li, Martin Renqiang Min, Yun Fu

论文链接:https://arxiv.org/abs/1909.05995

124、Learning Spatial Awareness to Improve Crowd Counting(Oral)

作者:Zhi-Qi Cheng, Jun-Xiu Li, Qi Dai, Xiao Wu, Alexander Hauptmann

论文链接:https://arxiv.org/abs/1909.07057

125、AdaptIS: Adaptive Instance Selection Network

AdaptIS:自适应实例选择网络

作者:Konstantin Sofiiuk, Olga Barinova, Anton Konushin

论文链接:https://arxiv.org/abs/1909.07829

Github链接:https://github.com/saic-vul/adaptis

126、Self-Supervised Monocular Depth Hints

自我监督单眼深度提示

作者:Jamie Watson, Michael Firman, Gabriel J. Brostow, Daniyar Turmukhambetov

论文链接:https://arxiv.org/abs/1909.09051

127、Making the Invisible Visible: Action Recognition Through Walls and Occlusions

使不可见变为可见:通过墙壁和遮挡的动作识别

作者:Tianhong Li, Lijie Fan, Mingmin Zhao, Yingcheng Liu, Dina Katabi

论文链接:https://arxiv.org/abs/1909.09300

128、Adversarial Learning with Margin-based Triplet Embedding Regularization

基于边值的三重嵌入正则化的对抗性学习

作者: Yaoyao Zhong, Weihong Deng

论文链接:https://arxiv.org/abs/1909.09481

129、Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation

交互式草图和填充:多级草图到图像的翻译

作者: Arnab Ghosh, Richard Zhang, Puneet K. Dokania, Oliver Wang, Alexei A. Efros, Philip H. S. Torr, Eli Shechtman

论文链接:https://arxiv.org/abs/1909.11081

项目链接:https://arnabgho.github.io/iSketchNFill/

130、Anchor Loss: Modulating Loss Scale based on Prediction Difficulty(Oral )

锚点损失:基于预测难度的调整损失量表

作者:Serim Ryou, Seong-Gyun Jeong, Pietro Perona

论文链接:https://arxiv.org/abs/1909.11155

131、Learning Propagation for Arbitrarily-structured Data

任意结构数据的学习传播

作者:Sifei Liu, Xueting Li, Varun Jampani, Shalini De Mello, Jan Kautz

论文链接:https://arxiv.org/abs/1909.11237

132、MIC: Mining Interclass Characteristics for Improved Metric Learning

MIC:挖掘类间特征以改进度量学习

作者:Karsten Roth, Biagio Brattoli, Björn Ommer

论文链接:https://arxiv.org/abs/1909.11574

133、Compact Trilinear Interaction for Visual Question Answering

紧凑的三线性互动视觉问题回答

作者:Tuong Do, Thanh-Toan Do, Huy Tran, Erman Tjiputra, Quang D. Tran

论文链接:https://arxiv.org/abs/1909.11874

134、Convex Relaxations for Consensus and Non-Minimal Problems in 3D Vision

三维视觉中一致和非最小问题的凸松弛

作者:Thomas Probst, Danda Pani Paudel, Ajad Chhatkuli, Luc Van Gool

论文链接:https://arxiv.org/abs/1909.12034

135、Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks

可区分的学习到组的通道通过可分组的卷积神经网络

作者:Zhaoyang Zhang, Jingyu Li, Wenqi Shao, Zhanglin Peng, Ruimao Zhang, Xiaogang Wang, Ping Luo

论文链接:https://arxiv.org/abs/1908.05867

Github链接:https://github.com/d-li14/dgconv.pytorch

136、HBONet: Harmonious Bottleneck on Two Orthogonal Dimensions

HBONet:两个正交维度上的和谐瓶颈

作者:Duo Li, Aojun Zhou, Anbang Yao

论文链接:https://arxiv.org/abs/1908.03888

Github链接:https://github.com/d-li14/HBONet

137、Dual Student: Breaking the Limits of the Teacher in Semi-supervised Learning

双元学生:打破教师在半监督学习中的限制

作者:Zhanghan Ke, Daoye Wang, Qiong Yan, Jimmy Ren, Rynson W. H. Lau

论文链接:https://arxiv.org/abs/1909.01804

138、

139、Program-Guided Image Manipulators

Program-Guided形象操纵者

作者:Jiayuan Mao, Xiuming Zhang, Yikai Li, William T. Freeman, Joshua B. Tenenbaum, Jiajun Wu

论文链接:https://arxiv.org/abs/1909.02116

项目链接:http://pgim.csail.mit.edu/

140、Understanding Human Gaze Communication by Spatio-Temporal Graph Reasoning

通过时空图推理来理解人类的目光交流

作者:Lifeng Fan, Wenguan Wang, Siyuan Huang, Xinyu Tang, Song-Chun Zhu

论文链接:https://arxiv.org/abs/1909.02144

141、Gravity as a Reference for Estimating a Person\'s Height from Video

重力作为一个参考,从视频估计一个人的高度

作者:Didier Bieler, Semih Günel, Pascal Fua, Helge Rhodin

论文链接:https://arxiv.org/abs/1909.02211

142、Bayes-Factor-VAE: Hierarchical Bayesian Deep Auto-Encoder Models for Factor Disentanglement

贝叶斯-因素- vae:用于因素分解的层次贝叶斯深度自动编码器模型

作者:Minyoung Kim, Yuting Wang, Pritish Sahu, Vladimir Pavlovic

论文链接:https://arxiv.org/abs/1909.02820

143、Hierarchy Parsing for Image Captioning

用于图像字幕的层次结构解析

作者: Ting Yao, Yingwei Pan, Yehao Li, Tao Mei

论文链接:https://arxiv.org/abs/1909.03918

144、Learning Object-specific Distance from a Monocular Image

.从单眼图像学习物体特定的距离

作者:Jing Zhu, Yi Fang, Husam Abu-Haimed, Kuo-Chin Lien, Dongdong Fu, Junli Gu

论文链接:https://arxiv.org/abs/1909.04182

145、Bayesian Relational Memory for Semantic Visual Navigation

用于语义视觉导航的贝叶斯关系记忆

作者:Yi Wu, Yuxin Wu, Aviv Tamar, Stuart Russell, Georgia Gkioxari, Yuandong Tian

论文链接:https://arxiv.org/abs/1909.04306

146、FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape from Single RGB Images

FreiHAND:用于从单个RGB图像捕获手部姿势和形状的无标记数据集

作者:Christian Zimmermann, Duygu Ceylan, Jimei Yang, Bryan Russell, Max Argus, Thomas Brox

论文链接:https://arxiv.org/abs/1909.04349

项目链接:https://lmb.informatik.uni-freiburg.de/projects/freihand/

147、Structured Modeling of Joint Deep Feature and Prediction Refinement for Salient Object Detection

联合深度特征的结构化建模和突出目标检测的预测精化

作者: Yingyue Xu, Dan Xu, Xiaopeng Hong, Wanli Ouyang, Rongrong Ji, Min Xu, Guoying Zhao

论文链接:https://arxiv.org/abs/1909.04366

148、FDA: Feature Disruptive Attack

特性破坏性攻击

作者: Aditya Ganeshan, B. S. Vivek, R. Venkatesh Babu

论文链接:https://arxiv.org/abs/1909.04385

Github链接:https://github.com/BardOfCodes/fda

149、Cross-X Learning for Fine-Grained Visual Categorization

用于细粒度视觉分类的交叉x学习

作者:Wei Luo, Xitong Yang, Xianjie Mo, Yuheng Lu, Larry S. Davis, Jun Li, Jian Yang, Ser-Nam Lim

论文链接:https://arxiv.org/abs/1909.04412

Github链接:https://github.com/cswluo/CrossX

150、Reasoning About Human-Object Interactions Through Dual Attention Networks

通过双重注意力网络对人-物交互进行推理

作者:Tete Xiao, Quanfu Fan, Dan Gutfreund, Mathew Monfort, Aude Oliva, Bolei Zhou

论文链接:https://arxiv.org/abs/1909.04743

151、Variable Rate Deep Image Compression With a Conditional Autoencoder

可变速率深图像压缩与条件自动编码器

作者: Yoojin Choi, Mostafa El-Khamy, Jungwon Lee

论文链接:https://arxiv.org/abs/1909.04802

152、Deep Elastic Networks with Model Selection for Multi-Task Learning

具有多任务学习模型选择的深度弹性网络

作者:Chanho Ahn, Eunwoo Kim, Songhwai Oh

论文链接:https://arxiv.org/abs/1909.04860

153、Sparse and Imperceivable Adversarial Attacks

稀疏的和不可感知的对抗攻击

作者:Francesco Croce, Matthias Hein

论文链接:https://arxiv.org/abs/1909.05040

本文章首发在 极市计算机视觉技术社区