Scopus Indexed Publications

Paper Details


Title
Direct video-based spatiotemporal deep learning for cattle lameness detection

Author
Md Fahimuzzman Sohan,

Email

Abstract

Cattle lameness is a prevalent health problem in livestock farming, often resulting from hoof injuries or infections, and severely impacts animal welfare and productivity. Early and accurate detection is critical for minimizing economic losses and ensuring proper treatment. This study proposes a spatiotemporal deep learning framework for automated cattle lameness detection using publicly available video data. We curate and publicly release a balanced set of 50 online video clips featuring 42 individual cattle, recorded from multiple viewpoints in both indoor and outdoor environments. The videos were categorized into lame and non-lame classes based on visual gait characteristics and metadata descriptions. After applying data augmentation techniques to enhance generalization, two deep learning architectures were trained and evaluated: 3D Convolutional Neural Networks (3D CNN) and Convolutional Long-Short-Term Memory (ConvLSTM2D). The 3D CNN achieved a video-level classification accuracy of 90%, with a precision, recall, and F1 score of 92%, 90%, and 90% respectively, outperforming the ConvLSTM2D model, which achieved 85% accuracy. Unlike conventional approaches that rely on multistage pipelines involving object detection and pose estimation, this study demonstrates the effectiveness of a direct end-to-end video classification approach. Compared with previously reported end-to-end methods (e.g., C3D-ConvLSTM, 90.3%) evaluated on different datasets, our model achieves comparable accuracy while maintaining a simpler, single-stage design. By bypassing computationally expensive pose estimation and object detection steps, the model substantially reduces latency and complexity, making it suitable for real-time farm deployment.


Keywords

Journal or Conference Name
Scientific Reports

Publication Year
2026

Indexing
scopus