Profile picture

Yue Zhao

Assistant Professor of Computer Science

Education

  • 2023, Doctoral Degree, Information Systems, Carnegie-Mellon University
  • 2016, Master's Degree, Computer Science, University of Toronto
  • 2015, Bachelor's Degree, Computer Engineering, University of Cincinnati

Biography

Dr. Yue Zhao is an Assistant Professor of Computer Science at the University of Southern California and a faculty member of the USC Machine Learning Center. He leads the FORTIS Lab (Foundations Of Reliable and Trustworthy Intelligent Systems), where his research focuses on building reliable, safe, and scalable AI systems. His work centers on understanding and mitigating failure modes in modern AI—particularly in large language models and agentic systems—through research on LLM safety, robustness, and system-level evaluation. His work is grounded in foundations such as anomaly and out-of-distribution detection and extends to evaluation frameworks and high-impact applications in science and society. Dr. Zhao has authored over 60 papers in top-tier venues and is internationally recognized for his open-source contributions—including PyOD, PyGOD, TDC, and TrustLLM—which collectively exceed 35 million downloads and 22,000 GitHub stars. His tools are widely used across academia, industry, and government, including by NASA, Tesla, Morgan Stanley, and the U.S. Senate. He has received numerous honors, including the Capital One Research Award, Amazon Research Awards, AAAI New Faculty Highlights, Google Cloud Research Innovators, Norton Labs Fellowship, Meta AI4AI Research Award, the Carnegie Mellon University Presidential Fellowship, the 2025 SIGSPATIAL Best Short Paper Award, and the Second Prize CCC Award at the IEEE ICDM 2025 BlueSky Track. He also serves as an Associate or Action Editor for the ACM Transactions on AI for Science, the IEEE Transactions on Neural Networks and Learning Systems, and the Journal of Data-Centric Machine Learning Research, and as an Area Chair for major conferences including ICLR, ICML, and NeurIPS.

Research Summary

Research Interests: My research centers on building reliable, safe, and scalable AI systems, with a focus on understanding and mitigating failure modes in modern foundation models and agentic systems. I organize my work into two tightly connected tiers: (1) advancing the scientific foundations of safety and robustness in AI, and (2) translating these foundations into system-level evaluation frameworks and high-impact applications.

Tier 1: Foundations of Reliable & Safe AI
I study why and how modern AI systems fail under distribution shift, uncertainty, and strategic pressure, and develop methods to make their behavior more predictable and reliable. This tier integrates two complementary threads:
LLM & Agent Safety: Analyzing and mitigating failure modes in large language models and agentic systems, including hallucinations, jailbreaks, privacy leakage, model extraction, and multi-agent instability.
Robustness & Failure Detection: Developing algorithms and benchmarks for identifying abnormal or unreliable behavior, grounded in robustness, out-of-distribution generalization, and anomaly detection.

Keywords: LLM Safety, Robustness, Agents, Hallucination Mitigation, Jailbreak Detection, OOD Generalization, Failure Analysis

Tier 2: System-Level Evaluation & Scientific/Societal Impact

I adopt a system-oriented perspective to evaluate, stress-test, and deploy reliable AI in realistic settings, and apply these methods to domains where failures carry high cost. This tier emphasizes two directions:
Evaluation & Benchmarking: Designing scalable evaluation frameworks, benchmarks, and workflows that probe model and agent behavior under realistic and adversarial conditions.
AI for Science & Society: Applying reliable foundation models to climate and weather forecasting, healthcare and biomedicine, and political or social decision-making.

Keywords: Evaluation, Benchmarking, System-Level Analysis, AI for Science, Scientific Foundation Models, Climate & Weather Modeling, AI for Healthcare

Awards

  • 2025 ACM SIGSPATIAL Best Short Paper
  • 2025 IEEE ICDM BlueSky Track Second Prize CCC Award
  • 2024 Amazon Amazon Research Awards
  • 2024 Google Google Cloud Research Innovators
  • 2024 Capital One Research Awards
  • 2024 Association for the Advancement of Artificial Intelligence AAAI New Faculty Highlights
Appointments
  • Thomas Lord Department of Computer Science
Office
  • Yue Zhao has not listed an office location.
Contact Information
  • yue.z@usc.edu
Links
Social Media