cs.AI 편의 논문 | Gist.Science

Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent Demand

이 논문은 과거 수요에 의존하고 재고 부족으로 인해 판매 데이터가 검열 (censored) 되는 동적 재고 및 가격 결정 문제를 해결하기 위해, 검열된 의존적 수요 환경에서 최적 정책을 학습하는 새로운 데이터 기반 알고리즘을 제안하고 그 성능을 이론적 및 실험적으로 검증합니다.

Korel Gundem, Zhengling Qi2026-03-12📊 stat

Scalable Multi-Task Learning through Spiking Neural Networks with Adaptive Task-Switching Policy for Intelligent Autonomous Agents

이 논문은 고정된 작업 전환 간격의 한계를 극복하고 간섭을 줄이며 확장 가능한 다중 작업 학습을 가능하게 하기 위해, 활성 수상돌기와 듀얼 구조를 갖춘 심층 스파이킹 Q-네트워크와 보상 및 내부 동역학을 기반으로 한 적응형 작업 전환 정책을 결합한 'SwitchMT' 방법론을 제안합니다.

Rachmad Vidya Wicaksana Putra, Avaneesh Devkota, Muhammad Shafique2026-03-12🤖 cs.AI

← 이전 다음 →

cs.AI

Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent Demand

Scalable Multi-Task Learning through Spiking Neural Networks with Adaptive Task-Switching Policy for Intelligent Autonomous Agents

Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement

REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning?

Training with Pseudo-Code for Instruction Following

LLLMs: A Data-Driven Survey of Evolving Research on Limitations of Large Language Models

Consistency-based Abductive Reasoning over Perceptual Errors of Multiple Pre-trained Models in Novel Environments

Comparative Analysis of Modern Machine Learning Models for Retail Sales Forecasting

Self-Improving Loops for Visual Robotic Planning

Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions

Differential Privacy in Machine Learning: A Survey from Symbolic AI to LLMs

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

Technological folie à deux: Feedback Loops Between AI Chatbots and Mental Illness

What Makes Code Generation Ethically Sourced?

IntrinsicWeather: Controllable Weather Editing in Intrinsic Space

Shadow in the Cache: Unveiling and Mitigating Privacy Risks of KV-cache in LLM Inference

The Yokai Learning Environment: Tracking Beliefs Over Space and Time

From Next Token Prediction to (STRIPS) World Models

Global Minimizers of Sigmoid Contrastive Loss

RADAR: Reasoning-Ability and Difficulty-Aware Routing for Reasoning LLMs