Prompting with the human-touch: evaluating model-sensitivity of foundation models for musculoskeletal CT segmentation
This study evaluates 11 promptable foundation models for musculoskeletal CT segmentation across four anatomical regions, revealing that while specific models like SAM and nnInteractive perform best under ideal conditions, all models exhibit significant sensitivity to human prompting variations, leading to performance drops and highlighting the challenge of selecting robust models for real-world clinical applications.