[Artificial Intelligence] NIPS2018 - Reducing Network Agnostophobia

ABSTRACT
Agnostophobia, the fear of the unknown, can be experienced by deep learning engineers while applying their networks to real-world applications. Unfortunately, network behavior is not well defined for inputs far from a network's training set. In an uncontrolled environment, networks face many instances that are not of interest to them and have to be rejected in order to avoid a false positive. This problem has previously been tackled by researchers by either a) thresholding softmax, which by construction cannot return "none of the known classes", or b) using an additional background or garbage class. In this paper, we show that both of these approaches help, but are generally insufficient when previously unseen classes are encountered. We also introduce a new evaluation metric that focuses on comparing the performance of multiple approaches in scenarios where such unseen classes or unknowns are encountered. Our major contributions are simple yet effective Entropic Open-Set and Objectosphere losses that train networks using negative samples from some classes. These novel losses are designed to maximize entropy for unknown inputs while increasing separation in deep feature space by modifying magnitudes of known and unknown samples. Experiments on networks trained to classify classes from MNIST and CIFAR-10 show that our novel loss functions are significantly better at dealing with unknown inputs from datasets such as Devanagari, NotMNIST, CIFAR-100, and SVHN.

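Image classification, the most basic task in computer vision, looks simple but often falls short in real applications. Robustness, the open-set problem, and class imbalance are rarely considered on academic datasets, yet they are common in practice and directly affect how well an algorithm performs. This paper addresses the open-set problem: in short, how to predict or classify test-time data from classes that the training set does not contain. (As an aside, face recognition is a typical open-set recognition problem: the IDs in the training set usually differ considerably from the IDs seen in deployment, and it is handled with metric learning.) Ignoring the open-set problem produces many false positives in practice and degrades the user experience.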

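The paper uses handwritten digit recognition as its example (digits 0-9, a 10-class problem), with the Devanagari dataset serving as the open-set data. LeNet++ is the backbone: it maps each image to a 2-dimensional feature space, from which the classes 0-9 are predicted.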
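To make the setup concrete, here is a minimal NumPy sketch of the softmax-thresholding baseline mentioned in the abstract, applied to 2-dimensional deep features. It is not the paper's code: the weights are random stand-ins for a trained final layer, and the names `predict_with_rejection` and `threshold` and the value 0.9 are assumptions chosen for illustration only.

```python
import numpy as np

def softmax(logits):
    """Row-wise softmax; subtracting the row max keeps exp() stable."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)

def predict_with_rejection(features, weights, bias, threshold=0.9):
    """Map 2-D features to 10 digit scores and reject low-confidence inputs.

    Returns (predictions, probabilities). A prediction of -1 means
    "rejected": the maximum softmax score fell below `threshold`.
    """
    logits = features @ weights + bias      # shape (n, 10)
    probs = softmax(logits)                 # each row sums to 1 over the 10 known classes
    labels = probs.argmax(axis=1)
    confidence = probs.max(axis=1)
    return np.where(confidence >= threshold, labels, -1), probs

# Hypothetical final layer (2-D feature -> 10 logits); random, NOT trained.
rng = np.random.default_rng(0)
weights = rng.normal(size=(2, 10))
bias = np.zeros(10)

features = np.array([
    [8.0, -6.0],   # a feature far from the origin
    [0.0, 0.0],    # a zero-magnitude feature
])
preds, probs = predict_with_rejection(features, weights, bias)
print(preds)
print(probs.sum(axis=1))
```

Softmax always spreads a total probability of 1 over the known classes, so the only way to say "none of them" is the external threshold. With a zero bias, a zero-magnitude feature gives equal logits and therefore a uniform, maximum-entropy softmax, which the threshold rejects; this is the kind of response the abstract says the new losses encourage for unknown inputs.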

Notation

Symbol	Meaning
Y	Space of all classes
C	Known classes (1…C)
U	Unknown classes
B	Subset 1 of the unknown classes: background, garbage, or known unknown classes
A	Subset 2 of the unknown classes: unknown unknown classes
D	Test dataset
D’	Training dataset
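
The relations between these class sets can be written compactly. The following is a sketch that assumes the known and unknown classes are disjoint, and that B and A are disjoint and together cover U; the table itself only states that B and A are subsets of the unknown classes.

```latex
\mathcal{Y} = \mathcal{C} \cup \mathcal{U}, \qquad
\mathcal{C} \cap \mathcal{U} = \emptyset, \qquad
\mathcal{U} = \mathcal{B} \cup \mathcal{A}, \qquad
\mathcal{B} \cap \mathcal{A} = \emptyset
```

In the digit example, C would be the ten digit classes, and classes from a dataset such as Devanagari would fall in U.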