Analysis of errors in generative image and video models
Mai, Hanlin
This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/132611
Description
Title
Analysis of errors in generative image and video models
Author(s)
Mai, Hanlin
Issue Date
2025-12-11
Director of Research (if dissertation) or Advisor (if thesis)
Lazebnik, Svetlana
Department of Study
Electrical & Computer Eng
Discipline
Electrical & Computer Engr
Degree Granting Institution
University of Illinois Urbana-Champaign
Degree Name
M.S.
Degree Level
Thesis
Keyword(s)
Generative AI
Diffusion Models
Abstract
Image and video generative models have become increasingly powerful and produce visuals that are more difficult to distinguish from real ones in recent years. Users can create images and videos to make their imaginations come true easier than ever before. However, on closer examination, these models make interesting mistakes. In this thesis, we first discuss a systematic way of analyzing errors in generated images relating to projective geometry and shadows at a population level. We find that generated images can be reliably distinguished from real images by derived geometric features alone without looking at pixels. Then, we introduce methods that can effectively judge whether a generated video is physically plausible for robotic demonstrations.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.