Robust Train Component Detection with Cascade Convolutional Neural Networks based on Structure Knowledge

Please use this identifier to cite or link to this item: https://oar.a-star.edu.sg/communities-collections/articles/17106

Title:

Robust Train Component Detection with Cascade Convolutional Neural Networks based on Structure Knowledge

Journal Title:

IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC) 2020

DOI:

10.1109/ITSC45102.2020.9294755

Publication URL:

https://doi.org/10.1109/ITSC45102.2020.9294755

Authors:

Fan Wu, Zhongyao Cheng, Juelin Zhu, Cen Chen, Xiaoxi Yu, Yue Li, Zeng Zeng

Keywords:

object detection, Feature extraction, deep learning, Fasteners, Prediction algorithms, Training, Search methods

Publication Date:

24 December 2020

Citation:

C. Zhongyao et al., "Robust Train Component Detection with Cascade Convolutional Neural Networks based on Structure Knowledge," 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), Rhodes, Greece, 2020, pp. 1-6, doi: 10.1109/ITSC45102.2020.9294755.

Abstract:

Recently, convolutional neural network (CNN) based methods have achieved superior results in generic object detection and have become the de-facto standard in the domain. However, potential adaptations to industrial areas are not well studied yet. A case worth exploring is the train component detection, in which the components may have strong relationships and some components (e.g., screws and nuts) are very small. Nevertheless, the detection performance of small train components significantly affects the efficiency of overall train component detection. In this work, we propose a novel robust train component detection (RTCD) framework, built on cascading CNNs and utilizing prior structure knowledge of the relationships between train components. The core idea of RTCD is to detect the big and easily detectable component first, and then find the areas that may contain small and challenging to detect components for following fine-grained exploitation. Our proposed attention region mechanism can find regions deserving of further analysis based on the region-of-interest (ROI) detected by the previous CNNs with the known structure knowledge. Then, these areas are cropped, zoomed in and fed into the following deep learning models for further detection. In order to verify the effectiveness of RTCD, 1,130 high-resolution images of moving trains are captured and collected, from which 17,334 critical train components are manually annotated. Extensive experiments therein have demonstrated that RTCD outperforms the existing state-of-the-art baselines significantly. The dataset and corresponding source code will be released to facilitate more future work.
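
The coarse-to-fine flow described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the component labels ("large_part", "nut"), and the way structure knowledge is encoded (an attention region given as fractions of the detected ROI's width and height) are all assumptions made for illustration, and the two detectors are passed in as plain callables rather than real CNNs.

```python
from typing import Callable, Dict, List, Tuple

Box = Tuple[int, int, int, int]          # (x_min, y_min, x_max, y_max) in pixels
Prior = Tuple[float, float, float, float]  # hypothetical: region corners as ROI fractions
Detection = Tuple[str, Box]
Image = List[List[int]]                  # row-major grayscale image


def attention_region(roi: Box, prior: Prior, image_size: Tuple[int, int]) -> Box:
    """Derive a region likely to hold small components from a coarse ROI.

    Assumption: structure knowledge is stored as corner offsets relative to
    the ROI's top-left corner, in units of the ROI's width and height.
    """
    x0, y0, x1, y1 = roi
    w, h = x1 - x0, y1 - y0
    width, height = image_size
    fx0, fy0, fx1, fy1 = prior
    # Clip to the image so the crop below never indexes out of bounds.
    ax0 = min(max(round(x0 + fx0 * w), 0), width)
    ay0 = min(max(round(y0 + fy0 * h), 0), height)
    ax1 = min(max(round(x0 + fx1 * w), 0), width)
    ay1 = min(max(round(y0 + fy1 * h), 0), height)
    return (ax0, ay0, ax1, ay1)


def crop_and_zoom(image: Image, region: Box, scale: int) -> Image:
    """Crop the region and enlarge it by nearest-neighbor repetition."""
    x0, y0, x1, y1 = region
    crop = [row[x0:x1] for row in image[y0:y1]]
    return [[px for px in row for _ in range(scale)]
            for row in crop for _ in range(scale)]


def cascade_detect(image: Image,
                   coarse_detector: Callable[[Image], List[Detection]],
                   fine_detector: Callable[[Image], List[Detection]],
                   priors: Dict[str, List[Prior]],
                   scale: int = 2) -> List[Detection]:
    """Run the coarse detector, then the fine detector on zoomed attention regions."""
    height, width = len(image), len(image[0])
    results: List[Detection] = []
    for label, roi in coarse_detector(image):
        results.append((label, roi))
        for prior in priors.get(label, []):
            region = attention_region(roi, prior, (width, height))
            if region[2] <= region[0] or region[3] <= region[1]:
                continue  # region fell entirely outside the image
            patch = crop_and_zoom(image, region, scale)
            for small_label, (px0, py0, px1, py1) in fine_detector(patch):
                # Map patch coordinates back into full-image coordinates.
                results.append((small_label, (region[0] + px0 // scale,
                                              region[1] + py0 // scale,
                                              region[0] + px1 // scale,
                                              region[1] + py1 // scale)))
    return results
```

In this sketch the second stage only ever sees enlarged crops, so a small component occupies more pixels in the patch than it does in the full image. The nearest-neighbor zoom is a placeholder; any interpolation method could be substituted without changing the control flow.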

License type:

PublisherCopyrights

Funding Info:

The work was supported by Singapore-China NRF-NSFC Grant (Grant No. NRF2016NRF-NSFC001-111)

Description:

© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

URI:

https://oar.a-star.edu.sg/communities-collections/articles/17106

ISBN:

978-1-7281-4150-3
978-1-7281-4149-7

Collections:

Institute for Infocomm Research
