Large Foundation Models: Mathematics, Algorithms, and Applications
Large foundation models have achieved remarkable success across various domains, including general applications like Natural Language Processing, image, speech, and video, as well as scientific fields such as materials science, molecular biology, and protein engineering. While the underlying techniques are firmly rooted in applied mathematics, their development has often been driven by empirical engineering practices, leading to significant practical breakthroughs. Diffusion models serve as a prime example, demonstrating both substantial engineering benefits and profound mathematical underpinnings.
This lecture series aims to bridge the gap between foundation models and their mathematical foundations, fostering interdisciplinary discussions, particularly between mathematics and machine learning.
Prerequisites:
-- For participants from a machine learning background, a solid understanding of Calculus and graduate-level Probability is required. While knowledge of stochastic processes is not strictly necessary, a basic understanding will significantly enhance comprehension.
-- For those from a mathematics background, prior knowledge of neural networks is essential. If you lack this prerequisite, please refer to the first three chapters of 'Neural Networks and Deep Learning' (http://neuralnetworksanddeeplearning.com).
Course Content:
The course will primarily cover autoregressive models, diffusion models, and discrete diffusion models, including their underlying mathematics and algorithms. We will explore their diverse applications across various domains, with a particular focus on multi-modalities.
This lecture series aims to bridge the gap between foundation models and their mathematical foundations, fostering interdisciplinary discussions, particularly between mathematics and machine learning.
Prerequisites:
-- For participants from a machine learning background, a solid understanding of Calculus and graduate-level Probability is required. While knowledge of stochastic processes is not strictly necessary, a basic understanding will significantly enhance comprehension.
-- For those from a mathematics background, prior knowledge of neural networks is essential. If you lack this prerequisite, please refer to the first three chapters of 'Neural Networks and Deep Learning' (http://neuralnetworksanddeeplearning.com).
Course Content:
The course will primarily cover autoregressive models, diffusion models, and discrete diffusion models, including their underlying mathematics and algorithms. We will explore their diverse applications across various domains, with a particular focus on multi-modalities.
讲师
日期
2025年09月08日 至 2026年01月08日
位置
Weekday | Time | Venue | Online | ID | Password |
---|---|---|---|---|---|
周三 | 09:50 - 12:15 | Shuangqing | ZOOM 01 | 928 682 9093 | BIMSA |
修课要求
Calculus, Probability I, Neural networks
听众
Undergraduate
, Advanced Undergraduate
, Graduate
, 博士后
, Researcher
视频公开
公开
笔记公开
公开
语言
中文
, 英文