Papers
arxiv:2104.10036

VT-ADL: A Vision Transformer Network for Image Anomaly Detection and Localization

Published on Apr 20, 2021
Authors:
,
,
,
,

Abstract

A transformer-based image anomaly detection and localization network uses patch embedding and Gaussian mixture density network to identify and locate anomalies, outperforming other methods on datasets like MNIST and MVTec.

We present a transformer-based image anomaly detection and localization network. Our proposed model is a combination of a reconstruction-based approach and patch embedding. The use of transformer networks helps to preserve the spatial information of the embedded patches, which are later processed by a Gaussian mixture density network to localize the anomalous areas. In addition, we also publish BTAD, a real-world industrial anomaly dataset. Our results are compared with other state-of-the-art algorithms using publicly available datasets like MNIST and MVTec.

Community

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2104.10036
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2104.10036 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2104.10036 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2104.10036 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.