GDText-VM: an arbitrary-shaped scene text detector based on globally deformable VMamba
Abstract: Detecting arbitrary-shaped text in natural scenes remains a significant challenge in deep learning research. Contemporary text detectors based on Convolutional Neural Networks face challenges in effectively modeling long-range dependencies. While Vision Transformers theoretically enable glo...
| Published in: | Complex & Intelligent Systems |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Springer, 2025-06-01 |
| Online Access: | https://doi.org/10.1007/s40747-025-01987-6 |
