Pyramid Transformer: A Multi-size Object Detection Model with Limited Device Requirements for the Nursing Robot


Bibliographic Details
Main Authors: Li, Jiazheng; Xie, Jiexin; Wang, Jiaxin; Wen, Yujian; Guo, Shijie
Format: Conference Proceeding
Language: English
Description
Summary: Multi-size object detection is a technical difficulty that impedes the development of intelligent nursing robots. To address this problem, this paper proposes a Pyramid Transformer model to detect objects of different sizes in nursing scenarios. Pyramid Transformer consists of three parts: a Transformer Module, a Pyramid Structure, and a Convolution Module. The Transformer Module improves large-object detection through its Multi-head Attention mechanism, and the Pyramid Structure lets the model make predictions from feature maps of different sizes, which benefits the detection of small objects. The Convolution Module is employed to reduce hardware requirements, allowing Pyramid Transformer to be implemented and run on a single graphics card. Experiments show that the mean average precision reaches 72.7%, an improvement over other models. This shows that the proposed Pyramid Transformer model is practical and effective for object detection on the nursing robot. The dataset is available at https://github.com/NotFar1997/NSI-dataset.
ISSN: 2375-0197
DOI: 10.1109/ICTAI56018.2022.00163