BIOSTATISTICS SEMINAR SERIES - Learning from Similar Linear Representations: Adaptivity, Minimaxity, and Robustness.
- Starts: 12:45 pm on Thursday, October 10, 2024
- Ends: 1:45 pm on Thursday, October 10, 2024
Speaker : Yang Feng, PhD, Professor, New York University
Abstract:
Representation multi-task learning (MTL) and transfer learning (TL) have achieved tremendous success in practice. However, the theoretical understanding of these methods is still lacking. Most existing theoretical works focus on cases where all tasks share the same representation and claim that MTL and TL almost always improve performance. However, as the number of tasks grows, assuming all tasks share the same representation is unrealistic. Also, this does not always match empirical findings, which suggest that a shared representation may not necessarily improve single-task or target-only learning performance. In this paper, we aim to understand how to learn from tasks with similar but not exactly the same linear representations, while dealing with outlier tasks. With a known intrinsic dimension, we propose two algorithms that are adaptive to the similarity structure and robust to outlier tasks under both MTL and TL settings. Our algorithms outperform single-task or target-only learning when representations across tasks are sufficiently similar and the fraction of outlier tasks is small. Furthermore, they always perform no worse than single-task learning or target-only learning, even when the representations are dissimilar. We provide information-theoretic lower bounds to show that our algorithms are nearly minimax optimal in a large regime. We also propose an algorithm to adapt to the unknown intrinsic dimension. We conduct two simulation studies to verify our theoretical results.
Bio:
Yang Feng is a Professor and Ph.D. Program Director of Biostatistics in the School of Global Public Health and an affiliate faculty in the Center for Data Science at New York University. He obtained his Ph.D. in Operations Research at Princeton University in 2010. Feng's research interests encompass the theoretical and methodological aspects of machine learning, high-dimensional statistics, network models, and nonparametric statistics, leading to a wealth of practical applications, including Alzheimer's disease, cancer classification, and electronic health records. He has published more than 70 papers in statistical and machine-learning journals. His research has been funded by multiple grants from the National Institutes of Health (NIH) and the National Science Foundation (NSF), notably the NSF CAREER Award. He is currently an Associate Editor for the Journal of the American Statistical Association (JASA), the Journal of Business & Economic Statistics (JBES), Journal of Computational & Graphical Statistics (JCGS), and the Annals of Applied Statistics (AoAS). His professional recognitions include being named a fellow of the American Statistical Association (ASA) and the Institute of Mathematical Statistics (IMS), as well as an elected member of the International Statistical Institute (ISI).
- Location:
- Presentation in CT 305 or Online via Zoom ( Meeting ID: 961 3147 3264, Passcode: 334135)
- Link:
- https://bostonu.zoom.us/j/96131473264?pwd=b1JzZXhvQ0FJQURkUHNHM09IZmR5dz09#success
- Contact Name
- Clara M Pereira
- Contact Email
- claraper@bu.edu
- Video Conference Link (Zoom, GoToMeeting, etc.)
- https://bostonu.zoom.us/j/96131473264?pwd=b1JzZXhvQ0FJQURkUHNHM09IZmR5dz09#success
- Host (Department, School, Center, etc.)
- Department of Biostatistics
- SPH Audience (Staff, Faculty, All Students, On Campus Students, Online MPH Students)
- STAFF, FACULTY, ALL STUDENTS, ON CAMPUS STUDENTS, ONLINE MPH STUDENTS
- Open to the public (Yes, No, By Invitation Only)
- No
- Address
- 801 Mass Ave, Crosstown Building, 3rd Floor, Boston, MA, 02118