Clear Sky Science – Articles (en)

REINFORCEMENT LEARNING ARTICLES

Reinforcement learning is a branch of machine learning in which an agent learns to make sequences of decisions by interacting with an environment and receiving rewards or penalties. The core idea is trial and error guided by feedback. Unlike supervised learning, where correct answers are given, the agent must discover good actions by exploring and exploiting what it has already learned.

Formally, the problem is often described as a Markov decision process with states, actions, transition probabilities, and a reward function. The objective is to learn a policy that maximizes cumulative reward over time. This is typically done through value functions that estimate how good a state or state action pair is, or by directly optimizing the policy.

Classical methods include dynamic programming, which assumes a known model of the environment, and model free approaches such as Monte Carlo methods and temporal difference learning. Temporal difference methods like Q learning and SARSA update value estimates from experience without needing a model of the dynamics.

More recently, deep reinforcement learning combines these ideas with deep neural networks to handle high dimensional inputs such as images. Techniques like deep Q networks approximate value functions, while policy gradient methods and actor critic architectures directly learn parameterized policies, often with improved stability and performance.

Research also addresses exploration strategies, credit assignment over long time horizons, sample efficiency, and safety. Applications range from game playing and robotics to resource management and autonomous systems, where the ability to learn from interaction makes reinforcement learning a powerful framework for sequential decision making under uncertainty.

REINFORCEMENT LEARNING ARTICLES

A novel intelligent hybrid reinforcement learning framework for autonomous decision making in complex health cognitive systems

Multi-objective inventory optimization using reinforcement learning: a comparative study on profitability and carbon emissions

Outplaying elite table tennis players with an autonomous robot

DMARS_WGO: a deep reinforcement-driven hybrid metaheuristic for intelligent adaptive optimization

Optimization of infectious disease intervention measures using reinforcement learning with UK COVID-19 epidemic data

Scalable conflict-free bandit algorithm using a quantum optical setup

DeepSentRec: a deep learning-based sentiment-aware product recommendation system

DRLO-VANET: a deep reinforcement learning-based offloading framework for low-latency and energy-efficient task execution in VANETs

Value-added assessment of career planning for vocational competence based on deep learning

Enhancing IELTS writing automated scoring with M-LoRA fine-tuned LLAMA-3 and human feedback-driven PPO reinforcement learning

Domain knowledge-integrated reinforcement learning control of nonlinear tunable vibration absorber under nonstationary excitation

Acetylcholine demixes heterogeneous dopamine signals for learning and moving

Distinct roles of cortical layer 5 subtypes in associative learning

SVDHLA: symmetric variable depth hybrid learning automaton and its application

Optimizing charge discharge cycles using QPPONet-enabled hybrid learning framework for energy management and safety in electric vehicles

Liver transplant donor-recipient matching with offline reinforcement learning

Sensory-motor control with large language models via iterative policy refinement

Adaptive reinforcement learning for lithography optimization: a scalable AI-driven solution for next-generation semiconductor manufacturing

End-to-end example-based sim-to-real RL policy transfer based on neural stylisation with application to robotic cutting

Energy optimized scheduling in wireless sensor networks (WSNs) using hybrid bio-inspired reinforcement learning approach

Intelligent resource management in UAV-enabled networks using cell-free communications and intelligent reflective surfaces

Accelerating the learning process of deep reinforcement learning algorithms in distribution network reconfiguration using an innovative replay method

Deep recurrent neural networks for water hammer transient prediction and dynamic protection optimization in long distance pipelines

A fuzzy-TD3 hybrid reinforcement learning framework for robust trajectory tracking of the Mitsubishi RV-2AJ robotic arm

Reinforcement learning-based optimal control for stochastic opinion dynamics

Intrinsic gradient oxygen-driven second-order memristors for continual reinforcement learning

Learning-aided observer design for improving autonomous vehicle safety

Evolutionary reinforcement learning framework for energy-efficient fault resilience and topological stability in WSNs

Visualising backward information propagation in deep reinforcement learning from a variational data assimilation perspective

Decoupled safety supervision empowering efficient and safe energy management for fuel cell vehicles

A deep reinforcement learning framework for influence maximization problem on large-scale social networks

Auto-arrange buildings in urban planning with DQN

Fuzzy adaptive nonlinear MIMO control for rigid coupled multibody robots using reinforcement learning model

IntelliScheduler: an edge-cloud computing environment hybrid deep learning framework for task scheduling based on learning

New-generation AI-driven intelligent decision-making and inventory optimization in the full lifecycle of complex product manufacturing integrating LSTM and Q-learning

Breaking through safety performance stagnation in autonomous vehicles with dense learning

Personalized multi-agent reinforcement learning framework for adaptive chronic disease therapy management

A hybrid actor–critic and BERT framework for intelligent course recommendation in IoT-aware e-learning systems

Brain-inspired synaptic transistors for in-situ spiking reinforcement learning with eligibility trace

QLSA-MOEAD integration for precision task scheduling in heterogeneous computing environments

Temporal influence maximization via continuous-time graph neural networks and deep reinforcement learning

Deep reinforcement learning-driven multi-objective optimization and its applications on lighting infrastructure operation and maintenance strategy

Dynamic adaptation of non standard service tasks through reinforcement learning driven task technology fit and service interaction

Duration between rewards controls the rate of behavioral and dopaminergic learning

An Adaptive Blockchain Framework for Federated IoMT with Reinforcement Learning-Based Consensus and Resource Forecasting

Autonomous path planning for intercostal robotic ultrasound imaging using reinforcement learning

Persistent representation of a prior schema in the orbitofrontal cortex facilitates learning of a conflicting schema

DQN-empowered energy optimization for wireless powered communication networks

A cognitive internet of things resource allocation method based on multi-agent reinforcement learning algorithm

Multi-modal and multi-agent reinforcement learning framework for urban traffic flow prediction and signal control optimization

Reinforcement learning framework for computerized adaptive testing using multi armed bandit approach

Adaptive reinforcement learning framework for sustainable microgrid optimization in arid urban environments

NeuroAction: a neuroevolutionary approach to reinforcement learning for autonomous vehicles

Uncertainty and reward histories have distinct effects on decisions after wins and losses

A novel reinforcement learning-based approach for short-term load and price forecasting in energy markets

Risk sensitive twin distributional critics with a lambda lower confidence bound for continuous control reinforcement learning

A hybrid CNN and reinforcement learning framework for speaker identification using Mel-Spectrogram and continuous wavelet transform features

A Q-learning approach to waste rock reduction in open-pit mine design based on cleaner production principles

Behaviorally informed deep reinforcement learning for portfolio optimization with loss aversion and overconfidence

A novel augmented reality and reinforcement learning empowered communication framework for underwater unmanned autonomous vehicle

Bayesian reinforcement learning for adaptive control of energy recuperation in hydraulic excavator arms

A deep reinforcement learning approach to dance movement analysis

Evaluating gait system vulnerabilities through PPO and GAN-generated adversarial attacks

Adaptive hierarchical learning for uncertainty-aware distributed energy resource planning

A hierarchical fusion framework for vehicle to grid energy management using predictive intelligence and learning based pricing

Autonomous navigation in unstructured outdoor environments using semantic segmentation guided reinforcement learning

Active guidance in ultrasound bladder scanning using reinforcement learning

Dynamic causal weighting-based risk propagation modeling for airport movement areas

Reinforcement learning-driven dynamic optimization strategy for parametric design of 3D models