Over the years, researchers have been shedding light on the domain of anomaly detection in sound. The anomalous sounds indicate various symptoms of mistakes or malicious activities in a system and detection of it can possibly prevent several problems. Anomaly detection sound can be used for various purposes such as audio surveillance, product inspection, predictive maintenance and much more.
Anomalous sound data is difficult to collect, as there were no available large-scale datasets for anomaly detection in machine operating sounds until now. Recently, researchers from NTT Media Intelligence Laboratories and Ritsumeikan University, Tokyo, introduced a new dataset known as ToyADMOS which is mainly designed for training and testing ADMOS systems.
About The Dataset
ToyADMOS dataset is a machine operating sounds dataset of approximately 540 hours of normal machine operating sounds and over 12,000 samples of anomalous sounds collected with four microphones at a 48kHz sampling rate.
The dataset consists of three sub-datasets for three types of ADMOS tasks such as machine-condition inspection, fault diagnosis of machines with geometrically fixed tasks, and fault diagnosis of machines with moving tasks. The overview of the three sub-datasets are mentioned below
- Toy Car: Designed for product-inspection task, toy car runs on an inspection device where the sound data are collected with four microphones arranged close to the inspection device.
- Toy Conveyor: Designed for fault diagnosis of a fixed machine, toy conveyor is fixed on a desk, and sound data are collected with four microphones in such a way that one is fixed on the body of the conveyor, and the other three are placed on the desk.
- Toy Train: Designed for fault diagnosis of a moving machine, toy train runs on a railway track where sound data are collected with the help of four microphones surrounding the track.
Here, each sub-dataset consists of three types of sound data, they are normal, anomalous, and environmental. For collecting the sounds, four omnidirectional microphones (SHURE SM11-CN) were used.
The ToyADMOS dataset has the following characteristics:
- It is designed for three ADMOS tasks: product inspection (toy car), fault diagnosis for a fixed machine (toy conveyor), and fault diagnosis for a moving machine (toy train).
- Machine-operating sounds and environmental noise are individually recorded for simulating various noise levels.
- All sounds are recorded with four microphones for testing noise reduction and/or data-augmentation techniques such as mix-up
- In each task, multiple machines of the same class are used where each machine belongs to the same class of toys but has a different detailed structure. Since the collected operating sounds have variations depending on individual differences, the dataset can be used for testing domain-adaptation techniques to absorb individual differences and/or changes in noise level.
- Each anomalous sound was recorded several times for testing a few-shot learning-based ADMOS for obtaining the characteristics of anomalous sounds from only a few samples.
- The released dataset consists of over 180 hours of normal machine-operating sounds and over 4,000 samples of anomalous sounds collected with four microphones at a 48-kHz sampling rate for each task.
Advantages & Limitations
The main advantage of the ToyADMOS dataset over other datasets is that it was built under controlled conditions. Unlike other datasets, this dataset collects all normal sounds under the same condition in order to analyse the system performance or the cause of misdetection. This dataset will assist in advancing research into anomaly detection in sounds.
The limitation of the dataset is that the toy sounds and the real machine sounds do not match exactly. The details of the spectral shape of a toy and a real machine often times sounds different even though the time-frequency structure is similar.