Model Science: Getting Serious about Verifying, Explaining and Controlling AI
Abstract: The recent proliferation of foundation models calls for a paradigm shift from Data Science to Model Science. Unlike data-centric approaches, Model Science places the trained model at the core of analysis, aiming to interact, verify, explain, and control its behavior across diverse operational contexts. This talk introduces a structured framework for a new discipline called Model Science, highlighting four key pillars: Verification, which establishes rigorous, context-aware evaluation protocols; Explanation, which leverages analysis of internal model operations; Control, which integrates alignment techniques to steer model behavior; and Interface, which develops interactive and visual explanation tools to improve human calibration and decision-making. The proposed framework aims to guide the development of credible, safe, and scientifically grounded AI systems.
Bio: Prof. Dr. Przemysław Biecek is a Full Professor at both the Warsaw University of Technology and the University of Warsaw, specializing in mathematical statistics, machine learning, and explainable artificial intelligence (XAI). He leads the MI² Data Lab, focusing on developing tools and methods for responsible machine learning, with applications in healthcare, education, and public policy. His notable projects include the DALEX package and the DrWhy.AI framework, which support model interpretability and fairness assessments. Prof. Biecek’s research emphasizes the integration of statistical rigor with practical applications, aiming to enhance human decision-making through transparent AI systems. He has collaborated with organizations such as Samsung, IBM, and Disney, and is the founder of Solutions42.ai, a company dedicated to deploying responsible AI solutions. A strong advocate for data literacy, he actively contributes to the open-source community and promotes evidence-based approaches in AI. His work has been recognized in leading conferences, including ICML, CVPR, and ECAI.