skip to main content
Advanced Topics in Vision: Large Language and Vision Models
12 units (3-0-9)  | third term
Prerequisites: undergraduate calculus, linear algebra, statistics, computer programming, machine learning. Experience programming in Python, Numpy and PyTorch.
The class will focus on large language models (LLMs) and language-and-vision models, as well as on generative methods for artificial intelligence (AI). Topics include deep neural networks,transformers, large language models, generative adversarial networks, diffusion models, and applications of such architectures and methods to image analysis, image synthesis, and text-to-image translation.
Instructors: Perona, Gkioxari