Biomedical imaging faces a significant hurdle due to batch effects, which are systematic technical variations that can compromise experimental reproducibility and hinder the effectiveness of deep learning systems. These effects can arise from various sources, including differences in instrumentation, sample preparation, and experimental conditions. Researchers have struggled to develop a reliable method to mitigate batch effects, despite years of effort. A recent study proposes a novel approach, leveraging in-context control samples to bridge the domain gap in biomedical imaging1. This technique aims to improve the robustness and generalizability of deep learning models, enabling them to perform well across different experimental batches. By addressing the batch effect problem, this research has the potential to enhance the reliability and practicality of deep learning systems in biomedical imaging, ultimately benefiting practitioners and researchers in the field. This development matters because it can increase the adoption of AI-powered biomedical imaging tools, leading to improved diagnosis and treatment outcomes.