Inverse synthetic aperture radar (ISAR) ship target detection is of great significance and has broad application prospects in scenarios such
Inverse synthetic aperture radar (ISAR) ship target detection is of great significance and has broad application prospects in scenarios such as protecting marine resources and maintaining maritime order. Existing ship target detection techniques, especially target detection methods and detection models in complex settings, have problems such as long inference time and unstable robustness, meaning that they can easily miss the best time for detecting ship targets and cause intelligence loss. To solve these problems, this study proposes a new ISAR target detection model for ships based on deep learning—Complex ISAR Detection Net (CIDNet). The model is based on the Boundary Box Efficient Transformer (BETR) architecture, which combines super-resolution preprocessing, a deep feature extraction network, a feature fusion technique, and a coordinate maintenance mechanism to improve the detection accuracy and real-time performance of ship targets in complex settings. The CIDNet improves the resolution of the input image via the super-resolution preprocessing technique, which enhances the rendering of details of ship targets in the image. The feature extraction part of the model combines the efficient feature extraction capability of YOLOv10 with the global attention mechanism of BETR. It efficiently combines information from different scales and levels through a feature fusion strategy. In addition, the model integrates a coordinated attention mechanism to enhance the focus on the target region and optimize the detection accuracy. The experimental results show that CIDNet exhibits stable performance on the test dataset. Compared with existing models such as YOLOv10 and Faster R-CNN, CIDNet improves precision, recall, and the F1 score, especially when dealing with smaller targets and complex background conditions. In addition, CIDNet achieves a detection frame rate of 63, demonstrating its fine real-time processing capabilities.