Researchers have introduced CoMet, a novel approach to uncertainty estimation in multimodal large language models (MLLMs), addressing a long-standing challenge in AI models. CoMet focuses on context and multiplicity decomposition to better estimate uncertainty, a crucial aspect of metacognition that is difficult even for humans. This method is particularly important in MLLMs, where uncertainty estimation is becoming increasingly vital. By improving uncertainty estimation, CoMet can help mitigate risks associated with MLLMs, such as overconfidence in incorrect predictions. The development of CoMet has significant implications for the security and reliability of MLLMs, as it can help identify potential vulnerabilities and improve overall performance1. This matters to practitioners because as MLLMs continue to evolve and become more widespread, the need for effective uncertainty estimation and risk management will only continue to grow, making CoMet a crucial step forward in addressing these challenges.