IT数码 购物 网址 头条 软件 日历 阅读 图书馆
TxT小说阅读器
↓语音阅读,小说下载,古典文学↓
图片批量下载器
↓批量下载图片,美女图库↓
图片自动播放器
↓图片自动播放器↓
一键清除垃圾
↓轻轻一点,清除系统垃圾↓
开发: C++知识库 Java知识库 JavaScript Python PHP知识库 人工智能 区块链 大数据 移动开发 嵌入式 开发工具 数据结构与算法 开发测试 游戏开发 网络协议 系统运维
教程: HTML教程 CSS教程 JavaScript教程 Go语言教程 JQuery教程 VUE教程 VUE3教程 Bootstrap教程 SQL数据库教程 C语言教程 C++教程 Java教程 Python教程 Python3教程 C#教程
数码: 电脑 笔记本 显卡 显示器 固态硬盘 硬盘 耳机 手机 iphone vivo oppo 小米 华为 单反 装机 图拉丁
 
   -> 人工智能 -> 计算机视觉论文-2021-08-02 -> 正文阅读

[人工智能]计算机视觉论文-2021-08-02

本专栏是计算机视觉方向论文收集积累,时间:2021年8月2日,来源:paper digest

欢迎关注原创公众号?【计算机视觉联盟】,回复?【西瓜书手推笔记】?可获取我的机器学习纯手推笔记!

直达笔记地址:机器学习手推笔记(GitHub地址)

1, TITLE:?DarkLighter: Light Up The Darkness for UAV Tracking
AUTHORS: Junjie Ye ; Changhong Fu ; Guangze Zheng ; Ziang Cao ; Bowen Li
CATEGORY: cs.CV [cs.CV, cs.RO]
HIGHLIGHT: To facilitate aerial tracking in the dark through a general fashion, this work proposes a low-light image enhancer namely DarkLighter, which dedicates to alleviate the impact of poor illumination and noise iteratively.

2, TITLE:?Real-time Streaming Perception System for Autonomous Driving
AUTHORS: Yongxiang Gu ; Qianlei Wang ; Xiaolin Qin
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: To remedy this, we present the real-time steaming perception system in this paper, which is also the 2nd Place solution of Streaming Perception Challenge (Workshop on Autonomous Driving at CVPR 2021) for the detection-only track.

3, TITLE:?From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection
AUTHORS: Jiajun Deng ; Wengang Zhou ; Yanyong Zhang ; Houqiang Li
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: To this end, in this work, we regard point clouds as hollow-3D data and propose a new architecture, namely Hallucinated Hollow-3D R-CNN ($\text{H}^2$3D R-CNN), to address the problem of 3D object detection.

4, TITLE:?Exploring Low-light Object Detection Techniques
AUTHORS: Winston Chen ; Tejas Shah
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: 2)In the second phase, we explore different object detection models that can be applied to the enhanced image.

5, TITLE:?Self-Supervised Regional and Temporal Auxiliary Tasks for Facial Action Unit Recognition
AUTHORS: Jingwei Yan ; Jingjing Wang ; Qiang Li ; Chunmao Wang ; Shiliang Pu
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: Motivated by this, we take the AU properties into consideration and propose two auxiliary AU related tasks to bridge the gap between limited annotations and the model performance in a self-supervised manner via the unlabeled data.

6, TITLE:?Iterative, Deep, and Unsupervised Synthetic Aperture Sonar Image Segmentation
AUTHORS: Yung-Chen Sun ; Isaac D. Gerg ; Vishal Monga
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this work, we present a new iterative unsupervised algorithm for learning deep features for SAS image segmentation.

7, TITLE:?Product1M: Towards Weakly Supervised Instance-Level Product Retrieval Via Cross-modal Pretraining
AUTHORS: XUNLIN ZHAN et. al.
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this paper, we investigate a more realistic setting that aims to perform weakly-supervised multi-modal instance-level product retrieval among fine-grained product categories.

8, TITLE:?Relightable Neural Video Portrait
AUTHORS: YOUJIA WANG et. al.
CATEGORY: cs.CV [cs.CV, cs.GR]
HIGHLIGHT: In this paper, we present a relightable neural video portrait, a simultaneous relighting and reenactment scheme that transfers the head pose and facial expressions from a source actor to a portrait video of a target actor with arbitrary new backgrounds and lighting conditions.

9, TITLE:?Recognizing Emotions Evoked By Movies Using Multitask Learning
AUTHORS: Hassan Hayat ; Carles Ventura ; Agata Lapedriza
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this paper, we model the emotions evoked by videos in a different manner: instead of modeling the aggregated value we jointly model the emotions experienced by each viewer and the aggregated value using a multi-task learning approach.

10, TITLE:?The Minimum Edit Arborescence Problem and Its Use in Compressing Graph Collections [Extended Version]
AUTHORS: Lucas Gnecco ; Nicolas Boria ; S�bastien Bougleux ; Florian Yger ; David B. Blumenthal
CATEGORY: cs.CV [cs.CV, cs.DS]
HIGHLIGHT: We introduce a unified and generic structure called edit arborescence that relies on edit paths between data in a collection, as well as the Min Edit Arborescence Problem, which asks for an edit arborescence that minimizes the sum of costs of its inner edit paths.

11, TITLE:?OpenForensics: Large-Scale Challenging Dataset For Multi-Face Forgery Detection And Segmentation In-The-Wild
AUTHORS: Trung-Nghia Le ; Huy H. Nguyen ; Junichi Yamagishi ; Isao Echizen
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: This paper presents a comprehensive study on two new countermeasure tasks: multi-face forgery detection and segmentation in-the-wild. To promote these new tasks, we have created the first large-scale dataset posing a high level of challenges that is designed with face-wise rich annotations explicitly for face forgery detection and segmentation, namely OpenForensics.

12, TITLE:?Fourier Series Expansion Based Filter Parametrization for Equivariant Convolutions
AUTHORS: Qi Xie ; Qian Zhao ; Zongben Xu ; Deyu Meng
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: Against this issue, in this paper we modify the classical Fourier series expansion for 2D filters, and propose a new set of atomic basis functions for filter parametrization.

13, TITLE:?Towards Robust Vision By Multi-task Learning on Monkey Visual Cortex
AUTHORS: SHAHD SAFARANI et. al.
CATEGORY: cs.CV [cs.CV, cs.AI, cs.LG]
HIGHLIGHT: Here, we successfully leveraged these inductive biases with a multi-task learning approach: we jointly trained a deep network to perform image classification and to predict neural activity in macaque primary visual cortex (V1).

14, TITLE:?T-SVDNet: Exploring High-Order Prototypical Correlations for Multi-Source Domain Adaptation
AUTHORS: Ruihuang Li ; Xu Jia ; Jianzhong He ; Shuaijun Chen ; Qinghua Hu
CATEGORY: cs.CV [cs.CV, cs.LG]
HIGHLIGHT: We propose a novel approach named T-SVDNet to address the task of Multi-source Domain Adaptation (MDA), which is featured by incorporating Tensor Singular Value Decomposition (T-SVD) into a neural network's training pipeline.

15, TITLE:?Manipulating Identical Filter Redundancy for Efficient Pruning on Deep and Complicated CNN
AUTHORS: Xiaohan Ding ; Tianxiang Hao ; Jungong Han ; Yuchen Guo ; Guiguang Ding
CATEGORY: cs.CV [cs.CV, cs.LG]
HIGHLIGHT: In this paper, we propose to manipulate the redundancy during training to facilitate network pruning.

16, TITLE:?Pix2Point: Learning Outdoor 3D Using Sparse Point Clouds and Optimal Transport
AUTHORS: R�my Leroy ; Pauline Trouv�-Peloux ; Fr�d�ric Champagnat ; Bertrand Le Saux ; Marcela Carvalho
CATEGORY: cs.CV [cs.CV, I.4.5; I.2.10]
HIGHLIGHT: In this paper, we address the problem of learning outdoor 3D point cloud from monocular data using a sparse ground-truth dataset.

17, TITLE:?Real-Time Anchor-Free Single-Stage 3D Detection with IoU-Awareness
AUTHORS: RUNZHOU GE et. al.
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this report, we introduce our winning solution to the Real-time 3D Detection and also the "Most Efficient Model" in the Waymo Open Dataset Challenges at CVPR 2021.

18, TITLE:?Temporal Feature Warping for Video Shadow Detection
AUTHORS: Shilin Hu ; Hieu Le ; Dimitris Samaras
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this paper, we propose a simple but powerful method to better aggregate information temporally.

19, TITLE:?ADeLA: Automatic Dense Labeling with Attention for Viewpoint Adaptation in Semantic Segmentation
AUTHORS: YANCHAO YANG et. al.
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: We describe an unsupervised domain adaptation method for image content shift caused by viewpoint changes for a semantic segmentation task.

20, TITLE:?Automatic Vocabulary and Graph Verification for Accurate Loop Closure Detection
AUTHORS: HAOSONG YUE et. al.
CATEGORY: cs.CV [cs.CV, cs.RO]
HIGHLIGHT: To overcome these disadvantages, we propose a natural convergence criterion based on the comparison between the radii of nodes and the drifts of feature descriptors, which is then utilized to build the optimal vocabulary automatically.

21, TITLE:?SNE-RoadSeg+: Rethinking Depth-Normal Translation and Deep Supervision for Freespace Detection
AUTHORS: Hengli Wang ; Rui Fan ; Peide Cai ; Ming Liu
CATEGORY: cs.CV [cs.CV, cs.RO]
HIGHLIGHT: To address this problem, we introduce SNE-RoadSeg+, an upgraded version of SNE-RoadSeg.

22, TITLE:?Can Non-specialists Provide High Quality Gold Standard Labels in Challenging Modalities?
AUTHORS: SAMUEL BUDD et. al.
CATEGORY: cs.CV [cs.CV, cs.HC, cs.LG]
HIGHLIGHT: In this work we challenge this assumption and examine the implications of using a minimally trained novice labelling workforce to acquire annotations for a complex medical image dataset.

23, TITLE:?Seeing Poverty from Space, How Much Can It Be Tuned?
AUTHORS: Tomas Sako ; Arturo Jr M. Martinez
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this paper we demonstrate that individuals with no organizational affiliation and equipped only with common hardware, publicly available datasets and cloud-based computing services can participate in the improvement of predicting machine-learning-based approaches to predicting local poverty levels in a given agro-ecological environment.

24, TITLE:?DPT: Deformable Patch-based Transformer for Visual Recognition
AUTHORS: ZHIYANG CHEN et. al.
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: To address this problem, we propose a new Deformable Patch (DePatch) module which learns to adaptively split the images into patches with different positions and scales in a data-driven way rather than using predefined fixed patches.

25, TITLE:?Out-of-Core Surface Reconstruction Via Global $TGV$ Minimization
AUTHORS: Nikolai Poliarnyi
CATEGORY: cs.CV [cs.CV, cs.DC, cs.GR, I.4.8; I.3.5; C.2.4]
HIGHLIGHT: We present an out-of-core variational approach for surface reconstruction from a set of aligned depth maps.

26, TITLE:?Shadow Art Revisited: A Differentiable Rendering Based Approach
AUTHORS: Kaustubh Sadekar ; Ashish Tiwari ; Shanmuganathan Raman
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this work, we revisit shadow art using differentiable rendering based optimization frameworks to obtain the 3D sculpture from a set of shadow (binary) images and their corresponding projection information.

27, TITLE:?Sparse-to-dense Feature Matching: Intra and Inter Domain Cross-modal Learning in Domain Adaptation for 3D Semantic Segmentation
AUTHORS: Duo Peng ; Yinjie Lei ; Wen Li ; Pingping Zhang ; Yulan Guo
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In light of this, we propose to further leverage 2D data for 3D domain adaptation by intra and inter domain cross modal learning.

28, TITLE:?Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation
AUTHORS: Bowen Zhang ; Yifan Liu ; Zhi Tian ; Chunhua Shen
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: As each location on the encoder's output corresponds to a local patch of the semantic labels, in this work, we represent these local patches of labels with compact neural networks.

29, TITLE:?Enhancing Social Relation Inference with Concise Interaction Graph and Discriminative Scene Representation
AUTHORS: XIAOTIAN YU et. al.
CATEGORY: cs.CV [cs.CV, cs.AI]
HIGHLIGHT: In this paper, by mimicking human understanding on images, we propose an approach of \textbf{PR}actical \textbf{I}nference in \textbf{S}ocial r\textbf{E}lation (PRISE), which concisely learns interactive features of persons and discriminative features of holistic scenes. To further boost the performance in social relation inference, we collect and distribute a new large-scale dataset, which consists of about 240 thousand unlabeled images.

30, TITLE:?Instant Visual Odometry Initialization for Mobile AR
AUTHORS: Alejo Concha ; Michael Burri ; Jes�s Briales ; Christian Forster ; Luc Oth
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this paper, we present a 6-DoF monocular visual odometry that initializes instantly and without motion parallax. We release a dataset for the relative pose problem using real data to facilitate the comparison with future solutions for the relative pose problem.

31, TITLE:?Medical Instrument Segmentation in 3D US By Hybrid Constrained Semi-Supervised Learning
AUTHORS: HONGXU YANG et. al.
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this article, we propose a semi-supervised learning (SSL) framework for instrument segmentation in 3D US, which requires much less annotation effort than the existing methods.

32, TITLE:?Synth-by-Reg (SbR): Contrastive Learning for Synthesis-based Registration of Paired Images
AUTHORS: Adri� Casamitjana ; Matteo Mancini ; Juan Eugenio Iglesias
CATEGORY: cs.CV [cs.CV, cs.LG, eess.IV]
HIGHLIGHT: Here we propose a synthesis-by-registration method to convert this problem into an easier intra-modality task.

33, TITLE:?High-Resolution Depth Maps Based on TOF-Stereo Fusion
AUTHORS: Vineet Gandhi ; Jan Cech ; Radu Horaud
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: We propose a novel TOF-stereo fusion method based on an efficient seed-growing algorithm which uses the TOF data projected onto the stereo image pair as an initial set of correspondences.

34, TITLE:?PiBase: An IoT-based Security System Using Raspberry Pi and Google Firebase
AUTHORS: Venkat Margapuri ; Niketa Penumajji ; Mitchell Neilsen
CATEGORY: cs.CR [cs.CR, cs.CV, cs.LO]
HIGHLIGHT: Machine learning algorithms, namely Haar-feature based cascade classifiers and Linear Binary Pattern Histograms (LBPH), are used for face detection and face recognition, respectively.

35, TITLE:?ManiSkill: Learning-from-Demonstrations Benchmark for Generalizable Manipulation Skills
AUTHORS: TONGZHOU MU et. al.
CATEGORY: cs.LG [cs.LG, cs.AI, cs.CV, cs.RO]
HIGHLIGHT: In this work, we focus on object-level generalization and propose SAPIEN Manipulation Skill Benchmark (abbreviated as ManiSkill), a large-scale learning-from-demonstrations benchmark for articulated object manipulation with visual input (point cloud and image).

36, TITLE:?Perceiver IO: A General Architecture for Structured Inputs & Outputs
AUTHORS: ANDREW JAEGLE et. al.
CATEGORY: cs.LG [cs.LG, cs.CL, cs.CV, cs.SD, eess.AS]
HIGHLIGHT: The recently-proposed Perceiver model obtains good results on several domains (images, audio, multimodal, point clouds) while scaling linearly in compute and memory with the input size.

37, TITLE:?On The Efficacy of Small Self-Supervised Contrastive Models Without Distillation Signals
AUTHORS: HAIZHOU SHI et. al.
CATEGORY: cs.LG [cs.LG, cs.CV]
HIGHLIGHT: In this paper, we study the issue of training self-supervised small models without distillation signals.

38, TITLE:?When Deep Learners Change Their Mind: Learning Dynamics for Active Learning
AUTHORS: Javad Zolfaghari Bengar ; Bogdan Raducanu ; Joost van de Weijer
CATEGORY: cs.LG [cs.LG, cs.CV]
HIGHLIGHT: In this paper, we propose a new informativeness-based active learning method.

39, TITLE:?Topological Similarity Index and Loss Function for Blood Vessel Segmentation
AUTHORS: R. J. Ara�jo ; J. S. Cardoso ; H. P. Oliveira
CATEGORY: eess.IV [eess.IV, cs.CV, I.2.10; I.4.6]
HIGHLIGHT: In this paper, we propose a similarity index which captures the topological consistency of the predicted segmentations having as reference the ground truth.

40, TITLE:?Single Image Deep Defocus Estimation and Its Applications
AUTHORS: Fernando J. Galetto ; Guang Deng
CATEGORY: eess.IV [eess.IV, cs.CV]
HIGHLIGHT: Based on this principle and the widely used assumption that Gaussian blur is a good model for defocus blur, we formulate the problem of estimating the spatially varying defocus blurriness as a Gaussian blur classification problem. We have created a dataset of more than 500000 image patches of size 32x32 which are used to train and test several well-known network models.

41, TITLE:?Automatic Multi-Stain Registration of Whole Slide Images in Histopathology
AUTHORS: ABUBAKR SHAFIQUE et. al.
CATEGORY: eess.IV [eess.IV, cs.CV]
HIGHLIGHT: In this paper, we propose a two-step automatic feature-based cross-staining WSI alignment to assist localization of even tiny metastatic foci in the assessment of lymph node.

  人工智能 最新文章
2022吴恩达机器学习课程——第二课(神经网
第十五章 规则学习
FixMatch: Simplifying Semi-Supervised Le
数据挖掘Java——Kmeans算法的实现
大脑皮层的分割方法
【翻译】GPT-3是如何工作的
论文笔记:TEACHTEXT: CrossModal Generaliz
python从零学(六)
详解Python 3.x 导入(import)
【答读者问27】backtrader不支持最新版本的
上一篇文章      下一篇文章      查看所有文章
加:2021-08-03 11:10:51  更:2021-08-03 11:14:48 
 
开发: C++知识库 Java知识库 JavaScript Python PHP知识库 人工智能 区块链 大数据 移动开发 嵌入式 开发工具 数据结构与算法 开发测试 游戏开发 网络协议 系统运维
教程: HTML教程 CSS教程 JavaScript教程 Go语言教程 JQuery教程 VUE教程 VUE3教程 Bootstrap教程 SQL数据库教程 C语言教程 C++教程 Java教程 Python教程 Python3教程 C#教程
数码: 电脑 笔记本 显卡 显示器 固态硬盘 硬盘 耳机 手机 iphone vivo oppo 小米 华为 单反 装机 图拉丁

360图书馆 购物 三丰科技 阅读网 日历 万年历 2024年12日历 -2024/12/22 15:21:12-

图片自动播放器
↓图片自动播放器↓
TxT小说阅读器
↓语音阅读,小说下载,古典文学↓
一键清除垃圾
↓轻轻一点,清除系统垃圾↓
图片批量下载器
↓批量下载图片,美女图库↓
  网站联系: qq:121756557 email:121756557@qq.com  IT数码