View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis

发布于 2024-06-27  7 次阅读


AI 摘要

该文章介绍了一种新颖的网络技术OmniAD,用于多物体场景中视角不变的像素级异常检测。传统的无监督像素级异常检测方法主要用于已知和恒定的相机位置的工业场景,但在图像未完全对齐的情况下往往难以推广。作者提出的OmniAD网络通过改进逆向蒸馏异常检测方法,在像素级别异常检测性能上提高了40%。此外,作者还提出了两种新的数据增强策略,利用新颖的视角合成和相机定位来改善泛化性能。作者在新的数据集ToyCity上以及已建立的单一物体中心的数据集MAD上进行了定性和定量结果的验证。

[PDF] [Site] [Kimi]

The inspection and monitoring of infrastructure assets typically requires identifying visual anomalies in scenes periodically photographed over time. Images collected manually or with robots such as unmanned aerial vehicles from the same scene at different instances in time are typically not perfectly aligned. Supervised segmentation methods can be applied to identify known problems, but unsupervised anomaly detection approaches are required when unknown anomalies occur. Current unsupervised pixel-level anomaly detection methods have mainly been developed for industrial settings where the camera position is known and constant. However, we find that these methods fail to generalize to the case when images are not perfectly aligned. We term the problem of unsupervised anomaly detection between two such imperfectly aligned sets of images as Scene Anomaly Detection (Scene AD). We present a novel network termed OmniAD to address the Scene AD problem posed. Specifically, we refine the anomaly detection method reverse distillation to achieve a 40% increase in pixel-level anomaly detection performance. The network's performance is further demonstrated to improve with two new data augmentation strategies proposed that leverage novel view synthesis and camera localization to improve generalization. We validate our approach with qualitative and quantitative results on a new dataset, ToyCity, the first Scene AD dataset with multiple objects, as well as on the established single object-centric dataset, MAD. https://drags99.github.io/OmniAD/

Hello
最后更新于 2024-08-02