Clear Sky Science – Articles (en)

OBJECT DETECTION ARTICLES

Object detection research focuses on automatically identifying and localizing objects within images or video, assigning each object both a category label and a bounding box. Early systems relied on hand-crafted features such as Haar-like features and Histogram of Oriented Gradients, combined with sliding windows and classical classifiers. These methods were computationally expensive and struggled with complex scenes.

Deep learning transformed the field. Region based convolutional neural networks introduced a two stage approach: first generating region proposals, then classifying and refining bounding boxes. Successive variants improved speed and accuracy by sharing convolutional features, optimizing end to end, and reducing redundant computation. Single stage detectors such as the You Only Look Once family and Single Shot MultiBox Detector removed the separate proposal step, directly predicting object classes and box coordinates from dense grids of features, enabling real time performance.

Further refinements include anchor based and anchor free designs, multi scale feature pyramids for handling objects of different sizes, and specialized loss functions to improve bounding box regression. Transformers and attention mechanisms have enabled end to end architectures that can model global relationships without hand designed proposals. Recent work also targets small, occluded, or overlapping objects, leverages unlabeled data through self supervision, and adapts models efficiently to new domains.

Applications span autonomous driving, medical imaging, robotics, video surveillance, environmental monitoring, and augmented reality. Ongoing challenges include robustness to adverse conditions, bias and fairness issues, efficient deployment on edge devices, and interpretability of model decisions. Research continues to balance accuracy, speed, and resource usage while extending object detection to ever more complex real world scenarios.

2026-05-12

POLAR-DETR: Polarized occlusion-aware local-global attention real-time detection transformer for total laboratory automation

2026-03-03

Design and implementation of a 6-DoF robot arm control with object detection based on machine learning using mini microcontroller

2026-01-31

Agriculture surrounding monitoring and object identification based on optimized you only look once and single shot multibox detector setups using combined vision and thermal images

2026-01-12

OBJECT DETECTION ARTICLES

A real-world framework for automated product recognition and catalog generation: dataset, model, and analysis

Intelligent recognition of embroidered purse patterns: comparing YOLO series and RT-DETR

Research on lung nodule detection in X-ray plain films based on improved YOLOv12 model

A novel approach for disease and pests detection in potato production system based on deep learning

Coral morphology detection in underwater imagery using YOLOv12 with CNN and transformer encoder fusion

Real-world road damage dataset with potholes, cracks, and maintenance holes

Diamond-DETR: lightweight real-time quality evaluation algorithm for synthetic diamonds

ScaleMamba-YOLO: a multi-scale MambaYOLO for medical object detection

Prototype-oriented contrastive mean-teacher for unsupervised domain adaptive object detection

Review of large YOLOv8 and RT-DETR energy efficiency on edge devices for real-time detection

Pseudo-depth-based deep neural network model for object detection

SODNet: a scale-oriented detection network for efficient UAV-based sewage outfall detection

Bidirectional state space modeling for lightweight and robust wheat head detection in complex agricultural environments

A lightweight and cross-scale attention network for geological hazard detection in rescue robotics

Leveraging haze-aware features for improved image clarity and detection accuracy with an optimized DCNN-YOLOv8 network

Detection and classification of chromosomes with sister chromatid cohesion defects using object detection models

Sentinel for confidence-aware multi-object tracking

A Benchmark X-ray Dataset for Pediatric Supracondylar Humerus Fractures with Improved YOLOv11-Based Detection

ResNet based backbone integrated YOLO framework for bone fracture detection

Research on target detection algorithm for forest fire images based on multi-scale feature extraction

Multi-dimensional attention transformer for vehicle and pedestrian detection in adverse weather

POLAR-DETR: Polarized occlusion-aware local-global attention real-time detection transformer for total laboratory automation

Foreign object detection in power transmission lines using SESYOLO

A cascaded group attention mechanism-based object detection algorithm for construction and demolition waste

Blasting ore size detection based on efficient dehazing network and multi-dimensional feature fusion

A lightweight multi-scale detection framework for X-ray images with supervised contrastive learning

Infrared ship target detection algorithm PEW_YOLOv8 in complex environments

A foreign object detection dataset and network for electrified railway catenary systems

Lightweight scalable deep learning framework for real time detection of potato leaf diseases

Lightweight target detection and multi target tracking for UAV inspection in open pit mines

Working face status detection in coal mine based on YOLOv8-EST

DeCon-Net: decoupled hierarchical contrast for soccer object detection

The evolution of object detection from CNNs to transformers and multi-modal fusion

Enhanced YOLO12 with spatial pyramid pooling for real-time cotton insect detection

An improved YOLOv11 network for marine debris detection in underwater environment

Design and implementation of a 6-DoF robot arm control with object detection based on machine learning using mini microcontroller

Real-time object detection for unmanned aerial vehicles based on vision transformer and edge computing

Object-guided contrastive language-image pre-training for zero-shot target recognition

Deep learning for construction waste detection using ConvNeXt V2 EMA attention and WIoU v3 loss

Object detection on low-compute edge SoCs: a reproducible benchmark and deployment guidelines

Infrared and visible image fusion via visual enhancement and semantic coupling

High-accuracy brain tumor detection method based on deep learning

Agriculture surrounding monitoring and object identification based on optimized you only look once and single shot multibox detector setups using combined vision and thermal images

A real-time mobile aquatic plant recognition algorithm based on deep learning for intelligent ecological monitoring

POP-YOLOv8: an object detection framework for partially occluded pedestrians in nighttime traffic environments

Domain-adaptive faster R-CNN for non-PPE identification on construction sites from body-worn and general images