Clear Sky Science – Articles (en)

COMPUTER VISION ARTICLES

Computer vision is a field of artificial intelligence focused on enabling computers to interpret and understand visual information from the world. Modern research combines mathematical modeling, physics, machine learning and signal processing to extract structure, motion and meaning from images and video.

One core area is 3D reconstruction, where algorithms recover the geometry of a scene from multiple images or video sequences. By tracking points across frames and modeling camera motion, systems estimate depth and create detailed 3D models. This is vital for robotics, augmented reality and autonomous navigation.

Another key topic is motion analysis. Optical flow methods estimate how each pixel moves between frames, revealing object trajectories, deformations and scene dynamics. Researchers build physically grounded models that relate brightness and motion, and develop numerical algorithms that are both accurate and efficient for large data sets.

Stereo vision is studied to infer depth from two or more cameras. By matching corresponding features and solving constrained optimization problems, algorithms compute disparity maps that approximate human binocular perception. Work in this area addresses challenges such as textureless regions, occlusions and noisy measurements.

Researchers also explore variational and partial differential equation approaches, where vision tasks are formulated as energy minimization problems. This provides a unifying mathematical framework for denoising, segmentation, inpainting and edge detection, often leading to robust, interpretable solutions.

Overall, the research emphasizes mathematically principled methods that scale to real-world data, bridging theory and applications in engineering, medicine, environmental monitoring and autonomous systems, while continually seeking more reliable and physically consistent interpretations of visual scenes.

2026-05-23

PickAMoo: LIDAR-enhanced mask R-CNN segmentation for precision weight estimation in dairy cattle using smartphone imaging

2026-05-15

MFR-YOLO: advancing UAV object detection with multi-scale feature refinement via deformable convolution and global attention

2026-03-31

Bridging mathematical modeling and AI for 3D coordinate recognition of moving objects without external reference and attitude measurement

2026-03-19

Adaptive lightweight mask R-CNN model for underwater debris instance segmentation and classification towards sustainable marine waste management

2026-03-17

Interpretable and granular video-based quantification of motor characteristics from the finger-tapping test in Parkinson’s disease

2026-03-08

POLAR-DETR: Polarized occlusion-aware local-global attention real-time detection transformer for total laboratory automation

2026-03-03

Image processing-based warning system for preventing the fuel selector valve from remaining closed in small trainer aircraft

2026-02-25

TumorSageNet CNN hybrid architecture enables accurate detection of mango leaf pathologies

2026-02-25

Deep residual and hybrid CNN models for confidence-aware real-world waste classification for sustainable waste management

2026-02-24

A framework for enhancing pharmaceutical integrity and patient safety: novel mobile health solution integrating smart packaging and computer vision

2026-02-18

MFDH-Net: defect detection network for multi-level feature fusion and cross-sensing decoupling head

2026-02-17

Enhancing long-range depth estimation via heterogeneous CNN-transformer encoding and cross-dimensional semantic fusion

2026-02-17

Research on super-resolution reconstruction of construction images based on attention mechanism and generative adversarial networks

2026-02-15

Utilizing deep learning models for early detection and classification of fruit diseases: towards sustainable agriculture and enhanced food quality

2026-02-10

A dual-stream deep learning framework for continuous sign language recognition to enhance communication accessibility in the Ha’il region

2026-02-02

Deep atrous context convolution generative adversarial network with corner key point extracted feature for nuts classification

2026-01-27

COMPUTER VISION ARTICLES

PickAMoo: LIDAR-enhanced mask R-CNN segmentation for precision weight estimation in dairy cattle using smartphone imaging

Smart mobility infrastructure: improving campus parking efficiency in real time

A hybrid ConvNeXt–BiLSTM framework for robust scene text recognition

Blind image quality assessment based on statistical features

A multi-modal dataset for insect biodiversity with imagery and DNA at the trap and individual level

StimVision: smartphone video kinematics to optimize DBS programming in Parkinson’s disease

Generating archaeological line drawings from limited reference images

Spatial-frequency complementary fusion network for dehazing with multi-scale and attention modules

Prism-OBI: a novel framework for oracle bone inscription recognition via visual perception and feature decoupling

Fourier transform-based single domain generalization for crowd counting

Toward autonomous weed management systems in sugarcane crops and an assessment of technological readiness

MoSA-Det: motion state adaptive object detection for sports videos

Infrared-visible image fusion with double-attention mechanism and adaptive interaction loss

YOLO-DCF: dual distillation and context-aware fusion for defect detection

Research on identification method and application of unsafe behavior of coal mine personnel

Enhancing single shot unsupervised domain adaptation for inter-camera person re-identification

Optimized K-means algorithm for image segmentation based on improved dung beetle algorithm

An efficient method for monitoring small bird targets in wetland environments based on object detection

Meta-learned dynamic hierarchical fusion for robust multi-scale object classification

Real-world road damage dataset with potholes, cracks, and maintenance holes

MFR-YOLO: advancing UAV object detection with multi-scale feature refinement via deformable convolution and global attention

Deep learning-based visual algorithms for identity and action recognition in engineering practical courses

YOLO-LSBA: A high-precision model for detecting stems of small-sized cherry tomatoes

Enhanced visual-inertial SLAM Using SuperPoint and semantic geometric dynamic feature detection

ClarityTrack for multi object tracking via hierarchical association and environment specific cost matching

Decoding garden design language via semantic segmentation for social aesthetic interaction

Explainable hybrid AI CAD framework for advanced prediction of steel surface defects

RGB-conditioned frequency domain refinement for sparse-to-dense depth completion

Automated landscape element recognition and layout optimization based on image segmentation and object detection

PCB-YOLOV8X: a network for detecting micro-sized defects on PCB surfaces based on enhanced feature information

Efficient monocular 3D lane detection via Mamba-enhanced CM-3DLane framework

A fatigue driving detection method based on driver posture and facial state analysis

Pseudo-depth-based deep neural network model for object detection

YOLO-MFD: a multi-scale feature and dynamic head framework for prefabricated shoreline underwater object detection

MAGNet: enhancing action recognition with multimodal fusion and adaptive graph convolution

LogoXpertNet: a novel lightweight logo classification using deep learning

Video-based cattle behaviour detection for digital twin development in precision dairy systems

Lightweight multiscale behavior recognition for caged laying hens using an enhanced YOLOv8 framework

Sensor fusion of touch & vision in soft manipulators for fruit picking

A smart monocular vision metrology system based on computer for standing long jump

A CNN–Bi-LSTM pipeline and open FSW dataset for freestyle wrestling action recognition

Design of an in-pipe inspection robotic system (IPIRS) with YOLOv8–LSTM integration for real-time in-pipe navigation

An SfM system for mural digitization with attention-guided feature matching and robust sparse reconstruction

Bridging mathematical modeling and AI for 3D coordinate recognition of moving objects without external reference and attitude measurement

A CNN-transformer dual-branch network with structure-aware loss for high-resolution edge detection

A wild fish image dataset for individual re-identification and phenotyping

Camouflaged object detection via context and texture-aware hierarchical interaction

Comparing the performance of deep learning video-based models and trained veterinarians in cattle pain assessment

YOLO-Starfish: fish object detection learning complex underwater features

RT-FogNet: real-time ship detection under low-visibility conditions in inland waterways

Adaptive lightweight mask R-CNN model for underwater debris instance segmentation and classification towards sustainable marine waste management

A lightweight deep learning architecture for automatic shrimp disease classification

Predicting congregational and crowd spread-out flow using YOLOv4 and DeepSORT

Hybrid attention optimized hierarchical multiscale transformer architecture for image super-resolution

Sentinel for confidence-aware multi-object tracking

Cattle lameness detection using depth image and deep learning

SCB-YOLO: a lightweight adaptive attention-enhanced network for student behavior detection in complex classroom settings

Research on batik image pattern detection based on improved YOLOv11

A hybrid convolution and attention-based framework with visual explanation for fruit disease identification

Interpretable and granular video-based quantification of motor characteristics from the finger-tapping test in Parkinson’s disease

Multi-dimensional attention transformer for vehicle and pedestrian detection in adverse weather

An intelligent approach for early smoke/fire detection using vision sensors in smart cities

Real time fire and smoke detection using vision transformers and spatiotemporal learning

Object tracking algorithm based on deformable attention mechanism

POLAR-DETR: Polarized occlusion-aware local-global attention real-time detection transformer for total laboratory automation

3D Magic Mirror: clothing reconstruction from a single image via a causal perspective

A dynamic element-activated non-semantic sparse attention method for remote sensing small object detection

Image-based detection of bolts and bolt-missing defects in multi-angle and complex background scenarios

Blasting ore size detection based on efficient dehazing network and multi-dimensional feature fusion

Optimized wheat seed classification using YOLO with morphological image feature enhancement

AI-powered BlindSpot VisionGuide system on raspberry Pi for enhancing independence of visually impaired users

An underwater image dataset for occlusion-aware fish instance segmentation

MSSA: memory-driven and simplified scaled attention for enhanced image captioning

A lightweight multi-scale detection framework for X-ray images with supervised contrastive learning

Image processing-based warning system for preventing the fuel selector valve from remaining closed in small trainer aircraft

TumorSageNet CNN hybrid architecture enables accurate detection of mango leaf pathologies

Deep residual and hybrid CNN models for confidence-aware real-world waste classification for sustainable waste management

CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx

Edge-guided multi-scale instance segmentation for railway track