Artificial intelligence (AI)-assisted computer vision is an evolving field in medical imaging. However, accuracy and precision suffer when using the existing AI models for small, easy-to-miss objects such as bone fractures, which affects the models’ applicability and effectiveness in a clinical setting. The proposed integration of the Hybrid-Attention (HA) mechanism into the YOLOv8 architecture offers a robust solution to improve accuracy, reliability, and speed in medical imaging applications. Experimental results demonstrate that our HA-modified YOLOv8 models achieve a 20% higher Mean Average Precision (mAP 50) and improved processing speed in arm fracture detection.