Deep anomaly detection with outlier exposure
arXiv preprint arXiv:1812.04606, 2018•arxiv.org
It is important to detect anomalous inputs when deploying machine learning systems. The
use of larger and more complex inputs in deep learning magnifies the difficulty of
distinguishing between anomalous and in-distribution examples. At the same time, diverse
image and text data are available in enormous quantities. We propose leveraging these
data to improve deep anomaly detection by training anomaly detectors against an auxiliary
dataset of outliers, an approach we call Outlier Exposure (OE). This enables anomaly …
use of larger and more complex inputs in deep learning magnifies the difficulty of
distinguishing between anomalous and in-distribution examples. At the same time, diverse
image and text data are available in enormous quantities. We propose leveraging these
data to improve deep anomaly detection by training anomaly detectors against an auxiliary
dataset of outliers, an approach we call Outlier Exposure (OE). This enables anomaly …
It is important to detect anomalous inputs when deploying machine learning systems. The use of larger and more complex inputs in deep learning magnifies the difficulty of distinguishing between anomalous and in-distribution examples. At the same time, diverse image and text data are available in enormous quantities. We propose leveraging these data to improve deep anomaly detection by training anomaly detectors against an auxiliary dataset of outliers, an approach we call Outlier Exposure (OE). This enables anomaly detectors to generalize and detect unseen anomalies. In extensive experiments on natural language processing and small- and large-scale vision tasks, we find that Outlier Exposure significantly improves detection performance. We also observe that cutting-edge generative models trained on CIFAR-10 may assign higher likelihoods to SVHN images than to CIFAR-10 images; we use OE to mitigate this issue. We also analyze the flexibility and robustness of Outlier Exposure, and identify characteristics of the auxiliary dataset that improve performance.
arxiv.org
顯示最佳搜尋結果。 查看所有結果