DEEP REINFORCEMENT LEARNING BASED OPTIMAL OPERATION OF X2026