cs.LG papers | Gist.Science

When Machine Learning Gets Personal: Evaluating Prediction and Explanation

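This paper proposes a framework for jointly evaluating how personalized machine learning models affect prediction performance and explainability in high-stakes domains such as healthcare, and it characterizes the limits on validating these effects as a function of a dataset's statistical properties.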

Louisa Cornelis, Guillermo Bernárdez, Haewon Jeong, Nina Miolane | Wed, 11 Ma | cs.LG

On the Impact of the Utility in Semivalue-based Data Valuation

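This paper addresses the sensitivity of semivalue-based data valuation to the choice of utility. It introduces the notion of a "spatial signature", which maps data points into a low-dimensional space, and builds on it a practical method for quantifying how robust valuation results are to changes in the utility.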

Mélissa Tamine, Benjamin Heymann, Maxime Vono, Patrick Loiseau | Wed, 11 Ma | cs.AI

A Distributional Treatment of Real2Sim2Real for Object-Centric Agent Adaptation in Vision-Driven Deformable Linear Object Manipulation

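This paper proposes an integrated Real2Sim2Real framework that narrows the gap between simulation and reality through probabilistic estimation of the physical parameters of deformable linear objects (DLOs), so that control policies trained on those estimates can be deployed successfully in the real world without further fine-tuning.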

Georgios Kamaras, Subramanian Ramamoorthy | Wed, 11 Ma | cs.LG

cs.LG

When Machine Learning Gets Personal: Evaluating Prediction and Explanation

On the Impact of the Utility in Semivalue-based Data Valuation

A Distributional Treatment of Real2Sim2Real for Object-Centric Agent Adaptation in Vision-Driven Deformable Linear Object Manipulation

Improving clustering quality evaluation in noisy Gaussian mixtures

HyConEx: Hypernetwork classifier with counterfactual explanations for tabular data

Experiments with Optimal Model Trees

A Consequentialist Critique of Binary Classification Evaluation: Theory, Practice, and Tools

Concept Drift Guided LayerNorm Tuning for Efficient Multimodal Metaphor Identification

Stepwise Guided Policy Optimization: Coloring your Incorrect Reasoning in GRPO

The Gaussian-Multinoulli Restricted Boltzmann Machine: A Potts Model Extension of the GRBM

JULI: Jailbreak Large Language Models by Self-Introspection

Discovering Symbolic Differential Equations with Symmetry Invariants

UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Language Models

A Systematic Evaluation of On-Device LLMs: Quantization, Performance, and Resources

SATURN: SAT-based Reinforcement Learning to Unleash LLMs Reasoning

FrontierCO: Real-World and Large-Scale Evaluation of Machine Learning Solvers for Combinatorial Optimization

Semi-Supervised Conformal Prediction With Unlabeled Nonconformity Score

Pure Exploration with Infinite Answers

Rating Quality of Diverse Time Series Data by Meta-learning from LLM Judgment

Cooperative Game-Theoretic Credit Assignment for Multi-Agent Policy Gradients via the Core