Date of Award
Spring 2024
Document Type
Dissertation
Degree Name
Doctor of Philosophy (PhD)
Department
Electrical Engineering (ENAS)
First Advisor
Duncan, James
Abstract
In the rapidly growing area of machine learning (ML), there is profound promise in crafting intelligent, data-driven methods for diverse real-world applications. Yet, in safety-critical domains like healthcare, some fundamental challenges remain. First, the insufficiency of raw biomedical data emphasizes the need for data-efficient and robust learning approaches.Second, the imperative of safety and stability necessitates a cohesive framework that unifies learning with theoretical guarantees. Third, the inherent heterogeneity and distribution shifts in real-world clinical data call for robust and generalizable learning methods. Amid the challenges we face, my research vision is to establish a solid foundation for healthcare and machine learning that enables the sustained and trustworthy deployments of artificial intelligence (AI) systems within the complex landscape of real-world clinical settings. Fundamentally, my research agenda is centered around building robust, reliable and equitable biomedical AI systems, that not only efficiently harness the large amount of raw biomedical data, but also accurately reconstruct anatomical structures (e.g., organs, bones, tumors) from sensory signals, intelligently process imperfect biomedical data (e.g., images, text, signals), and are more crucially anchored with theoretical grounding. To this end, I systematically studies representation learning for medical computing and analysis, standing out in three key aspects. First, I formulate the problem from practical and theoretical perspectives, ensuring a comprehensive understanding. Second, I seamlessly integrate the theoretical foundations into state-of-the-art algorithms, not only improving benchmark performance but also providing certifiable theoretical guarantees, where unified medical- and learning- theoretic analysis is just beginning to take shape. Third, my work equips medical AI agents with the capability to generalize across diverse scenarios, significantly broadening their applicability and effectiveness. I have incorporated ideas from infant development into designing mechanisms that enable medical AI systems to explore and understand their environment through experimentation. The findings indicate that my approach to learning empowers the AI agent to seamlessly adapt to unseen clinical settings and leverage its accumulated knowledge to efficiently tackle new clinical challenges.
Recommended Citation
You, Chenyu, "Robust Machine Learning for Biomedical Data: Efficiency, Reliability, and Generalizability" (2024). Yale Graduate School of Arts and Sciences Dissertations. 1358.
https://elischolar.library.yale.edu/gsas_dissertations/1358