PhD Course List
Boston University’s PhD in Computing & Data Sciences offers a rigorous, interdisciplinary curriculum designed to prepare scholars for impactful research and leadership in data-driven fields. Housed within BU’s dynamic Faculty of Computing & Data Sciences, the program provides access to world-class faculty, cutting-edge facilities, and collaborative opportunities that span disciplines—from AI and systems engineering to public health and policy. Students gain both theoretical depth and applied experience, positioning them to drive innovation in academia, industry, and beyond.
Below are the pre-approved courses that fulfill competencies as of Fall 2023. The course list is revised annually. Students may petition for alternative means to satisfy the competencies using other courses not on the list. Note: The categorizations on this website may be out of date (though the requirements are accurate). Please refer to the Official Approved Methodology Competency Document .
Methodology Core
Students are required to take at least five courses and must fulfill at least six of the eight competencies listed below. Note that a single course may satisfy multiple competencies.
Mathematical Foundations
CAS CS 531 Advanced Optimization Algorithms
4 credits.
Undergraduate Prerequisites: CAS MA 123 & 124, or equivalent and CAS CS 132 or equivalent; or conse nt of instructor. - Optimization algorithms, highlighting the fruitful interactions between discrete and continuous. Intended audience is advanced master students and doctoral students. Topics include gradient descent algorithms, online optimization, linear and semidefinite programming, duality, network optimization, submodular optimization, approximation algorithms via continuous relaxations.
CAS CS 538 Fundamentals of Cryptography
4 credits. Fall and Spring
Undergraduate Prerequisites: (CASCS131 & CASCS237 & CASCS357) or consent of instructor. - Graduate Prerequisites: (CASCS332) - Basic Algorithms to guarantee confidentiality and authenticity of data. Definitions and proofs of security for practical constructions. Topics include perfectly secure encryption, pseudorandom generators, RSA and Elgamal encryption, Diffie-Hellman key agreement, RSA signatures, secret sharing, block and stream ciphers.
CAS MA 576 Generalized Linear Models
4 credits. Fall and Spring
Prerequisites: (CASMA 575) or consent of instructor. - Covers topics in linear models beyond MA 575: generalized linear models, analysis of binary and polytomous data, log-linear models, multivariate response models, non-linear models, graphical models, and relevant model selection techniques. Additional topics in modern regression as time allows.
CAS MA 582 Mathematical Statistics
4 credits. Fall and Spring
Prerequisites: (CASMA 581 or ENGEK 381 or ENGEK 500) or consent of instructor. - Point estimation including unbiasedness, efficiency, consistency, sufficiency, minimum variance unbiased estimator, Rao-Blackwell theorem, and Rao-Cramer inequality. Maximum likelihood and method of moment estimations; interval estimation; tests of hypothesis, uniformly most powerful tests, uniformly most powerful unbiased tests, likelihood ratio test, and chi-square test.
CAS MA 588 Nonparametric Statistics
4 credits. Fall and Spring
Undergraduate Prerequisites: CASMA 582 or consent of instructor. - The theory and logic in the development of nonparametric techniques including order statistics, tests based on runs, goodness of fit, rank-order (for location and scale), measures of association, analysis of variance, asymptotic relative efficiency.
CDS DS 574 Algorithmic Game Theory
4 credits. Fall and Spring
This course is an introduction to the interdisciplinary area of Algorithmic Mechanism Design: where computational perspectives are applied to economic problems, and economic techniques are brought to problems from computer science. We will explore a broad range of topics at the frontier of new research, starting with some of the fundamentals, such as welfare-maximizing auctions and types of Nash Equilibria.
Throughout the semester, the class will also learn about prevalent topics such as (1) Data Science & Incentives, (2) Mechanism Design for Social Good, and (3) optimization and robustness in mechanism design. As part of this course, students will engage in a (guided) research project, experiencing the various parts of conducting original research. This course is designed as an introductory graduate - level course but is open to advanced undergraduates with permission from the instructor.
While the formal undergraduate prerequisites are DS 120, DS 121, and DS122 and DS 320 (or equivalent), the course assumes strong proficiency in these topics for graduate students. Students should have: - Mathematical maturity and comfort with formal proofs - A solid understanding of probability (discrete and continuous random variables, moments, and conditional probability) - Familiarity with algorithms and computational efficiency.
Undergraduate students interested in this course should contact Professor Goldner (goldner@bu.edu) before registering for the course.
ENG EC 674 Optimization Theory 2
4 credits. Fall
This course is an introduction to optimization problems and algorithms emphasizing problem formulation, basic methodologies and the underlying mathematical structures. We will cover the classical theory as well as the state of the art. The major topics we will cover are: 1. Theory and algorithms for linear programming. 2. Introduction to combinatorial problems and methods for handling intractable problems. 3. Introduction to nonlinear programming. 4. Introduction to network optimization. Optimization techniques have many applications in science and engineering. To name a few: * Optimal routing in communication networks. * Transmission scheduling and resource allocation in sensor networks. * Production planning and scheduling in manufacturing systems. * Fleet management. * Air traffic flow management by airlines. * Optimal resource allocation in manufacturing and communication systems. * Optimal portfolio selection. * Analysis and optimization of fluxes in metabolic networks. * Protein docking. Prerequisites: Working knowledge of Linear Algebra and some degree of mathematical maturity. Same as ENG EC 674, ENG SE 524, ENG EC 674. Students may not receive credits for both.
Statistical Modeling & Inference
CAS CS 542 Principles of Machine Learning
4 credits.
Undergraduate Prerequisites: (CASCS365) - Introduction to modern machine learning concepts, techniques, and algorithms. Topics include regression, kernels, support vector machines, feature selection, boosting, clustering, hidden Markov models, and Bayesian networks. Programming assignments emphasize taking theory into practice, through applications on real-world data sets.
CAS MA 570 Stochastic Methods of Operations Research
4 credits. Spring
Prerequisites: (CASMA 225 or CASMA 230 or CDSDS 122) and (CASMA 242 or CASMA 442 or ENGEK 103 or CASCS 132) or consent of instructor. - Poisson processes, Markov chains, queuing theory. Matrix differential equations, differential-difference equations, probability-generating functions, single- and multiple-channel queues, steady-state and transient distributions.
CAS MA 575 Linear Models
4 credits. Fall and Spring
Prerequisites: (CASMA 214 or CASMA 116) and (CASMA 581 or ENGEK 381 or ENGEK 500 or CASCS 237) and (CASMA 242 or CASMA 442 or ENGEK 103 or CDSDS 121 or CASCS 132) or consent of instructor. - Post-introductory course on linear models. Topics to be covered include simple and multiple linear regression, regression with polynomials or factors, analysis of variance, weighted and generalized least squares, transformations, regression diagnostics, variable selection, and extensions of linear models. Effective Fall 2019, this course fulfills a single unit in the following BU Hub area: Quantitative Reasoning 2, Teamwork/Collaboration.
CAS MA 578 Bayesian Statistics
4 credits.
Prerequisites: CASMA 575 or consent of instructor. - The principles and methods of Bayesian statistics. Subjective probability, Bayes rule, posterior distributions, predictive distributions. Computationally based inference using Monte Carlo integration, Markov chain simulation. Hierarchical models, mixture models, model checking, and methods for Bayesian model selection.
CAS MA 582 Mathematical Statistics
4 credits. Fall and Spring
Prerequisites: (CASMA 581 or ENGEK 381 or ENGEK 500) or consent of instructor. - Point estimation including unbiasedness, efficiency, consistency, sufficiency, minimum variance unbiased estimator, Rao-Blackwell theorem, and Rao-Cramer inequality. Maximum likelihood and method of moment estimations; interval estimation; tests of hypothesis, uniformly most powerful tests, uniformly most powerful unbiased tests, likelihood ratio test, and chi-square test.
CAS MA 585 Time Series and Forecasting
4 credits.
Prerequisites: (CASMA 581 or ENGEK 381 or ENGEK 500) or consent of instructor. - Autocorrelation and partial autocorrelation functions; stationary and nonstationary processes; ARIMA and Seasonal ARIMA model identification, estimation, diagnostics, and forecasting. Modeling financial data via ARCH and GARCH models. Volatility estimation; additional topics, including long-range dependence and state-space models.
CAS MA 588 Nonparametric Statistics
4 credits. Fall and Spring
Undergraduate Prerequisites: CASMA 582 or consent of instructor. - The theory and logic in the development of nonparametric techniques including order statistics, tests based on runs, goodness of fit, rank-order (for location and scale), measures of association, analysis of variance, asymptotic relative efficiency.
ENG EC 505 Stochastic Processes
4 credits. Fall and Spring
Undergraduate Prerequisites: (ENGEC401 & CASMA142) or equivalent and either ENGEK381 or ENGEK500. - Introduction to discrete and continuous-time random processes. Correlation and power spectral density functions. Linear systems driven by random processes. Optimum detection and estimation. Bayesian, Weiner, and Kalman filtering.
QST DS 925 Methods for Causal Inference in Strategy Research
4 credits. Fall
(Formerly SI 915) This course reviews tools and methods for drawing causal inferences from non-experimental data. The class emphasizes conceptual difficulties associated with establishing causality in observational settings, the strengths and weaknesses of statistical methods based on so-called natural experiments, and the practical problems that arise in the application of these tools. This course is designed to complement a traditional two-semester graduate sequence in econometrics.
Efficient & Scalable Algorithms
CAS CS 530 Advanced Algorithms
4 credits.
Undergraduate Prerequisites: (CASCS330) or consent of instructor. - Graduate Prerequisites: (CASCS330) or consent of instructor. - Studies the design and efficiency of algorithms in several areas of computer science. Topics are chosen from graph algorithms, sorting and searching, NP-complete problems, pattern matching, parallel algorithms, and dynamic programming.
CAS CS 565 Algorithmic Data Mining
4 credits.
Undergraduate Prerequisites: (CASCS 112 & CASCS 330 & CASCS 365). - Introduction to data mining concepts and techniques. Topics include association and correlation discovery, classification and clustering of large datasets, outlier detection. Emphasis on the algorithmic aspects as well as the application of mining in real-world problems.
CAS MA 539 Methods of Scientific Computing
4 credits. Fall and Spring
Prerequisites: (CASMA 225 or CASMA 230) and (CASMA 242 or CASMA 442) and programming experience or consent of instructor. - An introduction to topics including computational linear algebra, solutions of linear equations, numerical integration and solution of differential equations, finite element methods, and methods of stochastic simulation (i.e., Monte Carlo methods).
CDS DS 563 Algorithmic Techniques for Taming Big Data
4 credits. Fall and Spring
Undergraduate Prerequisites: CDSDS110 OR CASCS111 OR ENGEK125 OR equivalent; CDSDS320 OR CASCS330 O R ENGEC330 OR equivalent; CDSDS121 OR CASCS132 OR CASMA242 OR equivale nt; CASMA115 OR CASCS327 OR ENGEK381 OR equivalent, OR conse - Growing amounts of available data lead to significant challenges in processing them efficiently. In many cases, it is no longer possible to design feasible algorithms that can freely access the entire data set. Instead of that we often have to resort to techniques that allow for reducing the amount of data such as sampling, sketching, dimensionality reduction, and core sets. Apart from these approaches, the course will also explore scenarios in which large data sets are distributed across several machines or even geographical locations and the goal is to design efficient communication protocols or MapReduce algorithms. The course will include a final project and programming assignments in which we will explore the performance of our techniques when applied to publicly available data sets.
Predictive Analysis & ML
CAS CS 507 Networks and Markets: Theory and Applications
4 credits.
Prerequisites: For CS students: CASCS 112, CS 131, a probability or statistics course equivalent to CS 237 or MA 214 or CDSDS 122 is required. CS330 is recommended as co-requisite. For DS students: CDSDS 210, DS 122 is required. DS 320 is recommended as co-requisite. Successful digital platforms build on both network science and market design. This course explores the interplay between agents, algorithms, and data in networked settings like matching markets and ad auctions. Digital markets will be studied in depth through independent projects.
CAS CS 542 Principles of Machine Learning
4 credits.
Undergraduate Prerequisites: (CASCS365) - Introduction to modern machine learning concepts, techniques, and algorithms. Topics include regression, kernels, support vector machines, feature selection, boosting, clustering, hidden Markov models, and Bayesian networks. Programming assignments emphasize taking theory into practice, through applications on real-world data sets.
CAS MA 575 Linear Models
4 credits. Fall and Spring
Prerequisites: (CASMA 214 or CASMA 116) and (CASMA 581 or ENGEK 381 or ENGEK 500 or CASCS 237) and (CASMA 242 or CASMA 442 or ENGEK 103 or CDSDS 121 or CASCS 132) or consent of instructor. - Post-introductory course on linear models. Topics to be covered include simple and multiple linear regression, regression with polynomials or factors, analysis of variance, weighted and generalized least squares, transformations, regression diagnostics, variable selection, and extensions of linear models. Effective Fall 2019, this course fulfills a single unit in the following BU Hub area: Quantitative Reasoning 2, Teamwork/Collaboration.
CAS MA 576 Generalized Linear Models
4 credits. Fall and Spring
Prerequisites: (CASMA 575) or consent of instructor. - Covers topics in linear models beyond MA 575: generalized linear models, analysis of binary and polytomous data, log-linear models, multivariate response models, non-linear models, graphical models, and relevant model selection techniques. Additional topics in modern regression as time allows.
ENG EC 503 Introduction to Learning from Data
4 credits. Fall and Spring
Prerequisites: EK381 or equivalent; EK102 or equivalent; MA225 or equivalent; EK125 or equivalent.
This is an introductory graduate course in (classical) machine learning covering the basic principles and methods of four major non-sequential supervised and unsupervised learning problems namely, classification, regression, clustering, and dimensionality reduction. A variety of contemporary applications will be explored through homeworks and a project.
ENG EC 523 Deep Learning
4 credits. Fall and Spring
Prerequisites: A strong mathematical background in calculus, linear algebra, and prob ability & statistics, as well as prior coursework in machine learning and programming experience in Python. Mathematical and machine learning background for deep learning. Feed-forward networks. Backpropagation. Training strategies for deep networks. Convolutional networks. Recurrent neural networks. Transformers. Diffusion Models. Deep unsupervised learning. Exposure to Pytorch and other modern programming tools. Other recent topics, time permitting. Same as CAS CS 523. Students may not receive credit for both.
Optimization Algorithms
CAS CS 507 Networks and Markets: Theory and Applications
4 credits.
Prerequisites: For CS students: CASCS 112, CS 131, a probability or statistics course equivalent to CS 237 or MA 214 or CDSDS 122 is required. CS330 is recommended as co-requisite. For DS students: CDSDS 210, DS 122 is required. DS 320 is recommended as co-requisite. Successful digital platforms build on both network science and market design. This course explores the interplay between agents, algorithms, and data in networked settings like matching markets and ad auctions. Digital markets will be studied in depth through independent projects.
CAS CS 531 Advanced Optimization Algorithms
4 credits.
Undergraduate Prerequisites: CAS MA 123 & 124, or equivalent and CAS CS 132 or equivalent; or conse nt of instructor. - Optimization algorithms, highlighting the fruitful interactions between discrete and continuous. Intended audience is advanced master students and doctoral students. Topics include gradient descent algorithms, online optimization, linear and semidefinite programming, duality, network optimization, submodular optimization, approximation algorithms via continuous relaxations.
CAS MA 569 Optimization Methods of Operations Research
4 credits. Fall
Prerequisites: (CASMA 225 or CASMA 230 or CDSDS 122) and (CASMA 242 or CASMA 442 or ENGEK 103 or CDSCS 132) or consent of instructor. - Optimization of linear functions: linear programming, simplex method; transportation, assignment, and network problems. Optimization of non-linear functions: unconstrained optima, constrained optima and Lagrange multipliers, Kuhn-Tucker conditions, calculus of variations, and Euler's equation.
CDS DS 574 Algorithmic Game Theory
4 credits. Fall and Spring
This course is an introduction to the interdisciplinary area of Algorithmic Mechanism Design: where computational perspectives are applied to economic problems, and economic techniques are brought to problems from computer science. We will explore a broad range of topics at the frontier of new research, starting with some of the fundamentals, such as welfare-maximizing auctions and types of Nash Equilibria.
Throughout the semester, the class will also learn about prevalent topics such as (1) Data Science & Incentives, (2) Mechanism Design for Social Good, and (3) optimization and robustness in mechanism design. As part of this course, students will engage in a (guided) research project, experiencing the various parts of conducting original research. This course is designed as an introductory graduate - level course but is open to advanced undergraduates with permission from the instructor.
While the formal undergraduate prerequisites are DS 120, DS 121, and DS122 and DS 320 (or equivalent), the course assumes strong proficiency in these topics for graduate students. Students should have: - Mathematical maturity and comfort with formal proofs - A solid understanding of probability (discrete and continuous random variables, moments, and conditional probability) - Familiarity with algorithms and computational efficiency.
Undergraduate students interested in this course should contact Professor Goldner (goldner@bu.edu) before registering for the course.
ENG EC 524 Optimization Theory and Methods
4 credits.
Undergraduate Prerequisites: (ENGEK103 OR CASMA142) - Introduction to optimization problems and algorithms emphasizing problem formulation, basic methodologies, and underlying mathematical structures. Classical optimization theory as well as recent advances in the field. Topics include modeling issues and formulations, simplex method, duality theory, sensitivity analysis, large-scale optimization, integer programming, interior-point methods, non-linear programming optimality conditions, gradient methods, and conjugate direction methods. Applications are considered; case studies included. Extensive paradigms from production planning and scheduling in manufacturing systems. Other illustrative applications include fleet management, air traffic flow management, optimal routing in communication networks, and optimal portfolio selection. Same as ENG EC 674, ENG SE 524, ENG SE 674. Students may not receive credit for both.
ENG EC 674 Optimization Theory 2
4 credits. Fall
This course is an introduction to optimization problems and algorithms emphasizing problem formulation, basic methodologies and the underlying mathematical structures. We will cover the classical theory as well as the state of the art. The major topics we will cover are: 1. Theory and algorithms for linear programming. 2. Introduction to combinatorial problems and methods for handling intractable problems. 3. Introduction to nonlinear programming. 4. Introduction to network optimization. Optimization techniques have many applications in science and engineering. To name a few: * Optimal routing in communication networks. * Transmission scheduling and resource allocation in sensor networks. * Production planning and scheduling in manufacturing systems. * Fleet management. * Air traffic flow management by airlines. * Optimal resource allocation in manufacturing and communication systems. * Optimal portfolio selection. * Analysis and optimization of fluxes in metabolic networks. * Protein docking. Prerequisites: Working knowledge of Linear Algebra and some degree of mathematical maturity. Same as ENG EC 674, ENG SE 524, ENG EC 674. Students may not receive credits for both.
Computational Complexity
CAS CS 530 Advanced Algorithms
4 credits.
Undergraduate Prerequisites: (CASCS330) or consent of instructor. - Graduate Prerequisites: (CASCS330) or consent of instructor. - Studies the design and efficiency of algorithms in several areas of computer science. Topics are chosen from graph algorithms, sorting and searching, NP-complete problems, pattern matching, parallel algorithms, and dynamic programming.
CAS CS 531 Advanced Optimization Algorithms
4 credits.
Undergraduate Prerequisites: CAS MA 123 & 124, or equivalent and CAS CS 132 or equivalent; or conse nt of instructor. - Optimization algorithms, highlighting the fruitful interactions between discrete and continuous. Intended audience is advanced master students and doctoral students. Topics include gradient descent algorithms, online optimization, linear and semidefinite programming, duality, network optimization, submodular optimization, approximation algorithms via continuous relaxations.
CAS CS 535 Complexity Theory
4 credits. Fall and Spring
Undergraduate Prerequisites: (CASCS332) or consent of instructor. - Graduate Prerequisites: (CASCS332) - Covers topics of current interest in the theory of computation chosen from computational models, games and hierarchies of problems, abstract complexity theory, informational complexity theory, time-space trade-offs, probabilistic computation, and recent work on particular combinatorial problems.
CAS CS 537 Randomness in Computing
4 credits. Fall and Spring
Undergraduate Prerequisites: (CASCS 330 OR CASCS 530). CASCS 535 is recommended or consent of instructor. - Graduate Prerequisites: CASCS 330 or CASCS 530 is recommended. - Survey of probabilistic ideas of the theory of computation. Topics may include Monte Carlo and Las Vegas probabilistic computations; average case complexity and analysis; random and pseudorandom strings; games and cryptographic protocol; information; inductive inference; reliability; others. (Offered alternate years.)
CAS CS 538 Fundamentals of Cryptography
4 credits. Fall and Spring
Undergraduate Prerequisites: (CASCS131 & CASCS237 & CASCS357) or consent of instructor. - Graduate Prerequisites: (CASCS332) - Basic Algorithms to guarantee confidentiality and authenticity of data. Definitions and proofs of security for practical constructions. Topics include perfectly secure encryption, pseudorandom generators, RSA and Elgamal encryption, Diffie-Hellman key agreement, RSA signatures, secret sharing, block and stream ciphers.
CDS DS 563 Algorithmic Techniques for Taming Big Data
4 credits. Fall and Spring
Undergraduate Prerequisites: CDSDS110 OR CASCS111 OR ENGEK125 OR equivalent; CDSDS320 OR CASCS330 O R ENGEC330 OR equivalent; CDSDS121 OR CASCS132 OR CASMA242 OR equivale nt; CASMA115 OR CASCS327 OR ENGEK381 OR equivalent, OR conse - Growing amounts of available data lead to significant challenges in processing them efficiently. In many cases, it is no longer possible to design feasible algorithms that can freely access the entire data set. Instead of that we often have to resort to techniques that allow for reducing the amount of data such as sampling, sketching, dimensionality reduction, and core sets. Apart from these approaches, the course will also explore scenarios in which large data sets are distributed across several machines or even geographical locations and the goal is to design efficient communication protocols or MapReduce algorithms. The course will include a final project and programming assignments in which we will explore the performance of our techniques when applied to publicly available data sets.
Programming & Software Design
CAS CS 561 Data Systems Architectures
4 credits. Spring
BU Hub Learn More Oral and/or Signed Communication Research and Information Literacy
Undergraduate Prerequisites: CAS CS 210 or equivalent and CAS CS 460/660. - Discusses the design of data systems that can address the modern challenges of managing and accessing large, ever-growing, diverse sets of data, often streaming from heterogenous sources, in the context of continuously evolving hardware and software. We use examples from several data management areas including relational systems, distributed database systems, key value stores, newSQL and NoSQL systems, data systems for machine learning (and machine learning for data systems), interactive analytics, and data management as a service. Effective Spring 2021, this course fulfills a single unit in each of the following BU Hub areas: Oral and/or Signed Communication, Research and Information Literacy.
ENG EC 523 Deep Learning
4 credits. Fall and Spring
Prerequisites: A strong mathematical background in calculus, linear algebra, and prob ability & statistics, as well as prior coursework in machine learning and programming experience in Python. Mathematical and machine learning background for deep learning. Feed-forward networks. Backpropagation. Training strategies for deep networks. Convolutional networks. Recurrent neural networks. Transformers. Diffusion Models. Deep unsupervised learning. Exposure to Pytorch and other modern programming tools. Other recent topics, time permitting. Same as CAS CS 523. Students may not receive credit for both.
ENG EC 527 High Performance Programming with Multicore and GPUs
4 credits. Fall and Spring
Undergraduate Prerequisites: EC413 or equivalent; programming in C
Considers theory and practice of hardware-aware programming. Key theme is obtaining a significant fraction of potential performance through knowledge of the underlying computing platform and how the platform interacts with programs. Studies architecture of, and programming methods for, contemporary high-performance processors. These include complex processor cores, multicore processors, and graphics processors. Laboratory component includes use and evaluation of programming methods on these processors through applications such as matrix operations and the Fast Fourier Transform.
Large-scale Data Management
CAS CS 561 Data Systems Architectures
4 credits. Spring
BU Hub Learn More Oral and/or Signed Communication Research and Information Literacy
Undergraduate Prerequisites: CAS CS 210 or equivalent and CAS CS 460/660. - Discusses the design of data systems that can address the modern challenges of managing and accessing large, ever-growing, diverse sets of data, often streaming from heterogenous sources, in the context of continuously evolving hardware and software. We use examples from several data management areas including relational systems, distributed database systems, key value stores, newSQL and NoSQL systems, data systems for machine learning (and machine learning for data systems), interactive analytics, and data management as a service. Effective Spring 2021, this course fulfills a single unit in each of the following BU Hub areas: Oral and/or Signed Communication, Research and Information Literacy.
CAS CS 562 Advanced Database Applications
4 credits. Fall and Spring
Undergraduate Prerequisites: (CASCS460) or consent of instructor. - Research issues in the design and implementation of modern database systems. Spatial, temporal, and spatiotemporal index structures. Indexing methods for image and multimedia databases and data warehouses. New data analysis techniques for large databases, clustering and rule discovery for very large datasets.
CAS CS 565 Algorithmic Data Mining
4 credits.
Undergraduate Prerequisites: (CASCS 112 & CASCS 330 & CASCS 365). - Introduction to data mining concepts and techniques. Topics include association and correlation discovery, classification and clustering of large datasets, outlier detection. Emphasis on the algorithmic aspects as well as the application of mining in real-world problems.
CDS DS 563 Algorithmic Techniques for Taming Big Data
4 credits. Fall and Spring
Undergraduate Prerequisites: CDSDS110 OR CASCS111 OR ENGEK125 OR equivalent; CDSDS320 OR CASCS330 O R ENGEC330 OR equivalent; CDSDS121 OR CASCS132 OR CASMA242 OR equivale nt; CASMA115 OR CASCS327 OR ENGEK381 OR equivalent, OR conse - Growing amounts of available data lead to significant challenges in processing them efficiently. In many cases, it is no longer possible to design feasible algorithms that can freely access the entire data set. Instead of that we often have to resort to techniques that allow for reducing the amount of data such as sampling, sketching, dimensionality reduction, and core sets. Apart from these approaches, the course will also explore scenarios in which large data sets are distributed across several machines or even geographical locations and the goal is to design efficient communication protocols or MapReduce algorithms. The course will include a final project and programming assignments in which we will explore the performance of our techniques when applied to publicly available data sets.
Subject Core
To satisfy the subject core requirement, students must take a minimum of three courses in a single subject area.
Theoretical Foundations
CAS CS 530 Advanced Algorithms
4 credits.
Undergraduate Prerequisites: (CASCS330) or consent of instructor. - Graduate Prerequisites: (CASCS330) or consent of instructor. - Studies the design and efficiency of algorithms in several areas of computer science. Topics are chosen from graph algorithms, sorting and searching, NP-complete problems, pattern matching, parallel algorithms, and dynamic programming.
CAS CS 531 Advanced Optimization Algorithms
4 credits.
Undergraduate Prerequisites: CAS MA 123 & 124, or equivalent and CAS CS 132 or equivalent; or conse nt of instructor. - Optimization algorithms, highlighting the fruitful interactions between discrete and continuous. Intended audience is advanced master students and doctoral students. Topics include gradient descent algorithms, online optimization, linear and semidefinite programming, duality, network optimization, submodular optimization, approximation algorithms via continuous relaxations.
CAS CS 535 Complexity Theory
4 credits. Fall and Spring
Undergraduate Prerequisites: (CASCS332) or consent of instructor. - Graduate Prerequisites: (CASCS332) - Covers topics of current interest in the theory of computation chosen from computational models, games and hierarchies of problems, abstract complexity theory, informational complexity theory, time-space trade-offs, probabilistic computation, and recent work on particular combinatorial problems.
CAS CS 538 Fundamentals of Cryptography
4 credits. Fall and Spring
Undergraduate Prerequisites: (CASCS131 & CASCS237 & CASCS357) or consent of instructor. - Graduate Prerequisites: (CASCS332) - Basic Algorithms to guarantee confidentiality and authenticity of data. Definitions and proofs of security for practical constructions. Topics include perfectly secure encryption, pseudorandom generators, RSA and Elgamal encryption, Diffie-Hellman key agreement, RSA signatures, secret sharing, block and stream ciphers.
CAS MA 576 Generalized Linear Models
4 credits. Fall and Spring
Prerequisites: (CASMA 575) or consent of instructor. - Covers topics in linear models beyond MA 575: generalized linear models, analysis of binary and polytomous data, log-linear models, multivariate response models, non-linear models, graphical models, and relevant model selection techniques. Additional topics in modern regression as time allows.
CAS MA 582 Mathematical Statistics
4 credits. Fall and Spring
Prerequisites: (CASMA 581 or ENGEK 381 or ENGEK 500) or consent of instructor. - Point estimation including unbiasedness, efficiency, consistency, sufficiency, minimum variance unbiased estimator, Rao-Blackwell theorem, and Rao-Cramer inequality. Maximum likelihood and method of moment estimations; interval estimation; tests of hypothesis, uniformly most powerful tests, uniformly most powerful unbiased tests, likelihood ratio test, and chi-square test.
CAS MA 588 Nonparametric Statistics
4 credits. Fall and Spring
Undergraduate Prerequisites: CASMA 582 or consent of instructor. - The theory and logic in the development of nonparametric techniques including order statistics, tests based on runs, goodness of fit, rank-order (for location and scale), measures of association, analysis of variance, asymptotic relative efficiency.
ENG EC 674 Optimization Theory 2
4 credits. Fall
This course is an introduction to optimization problems and algorithms emphasizing problem formulation, basic methodologies and the underlying mathematical structures. We will cover the classical theory as well as the state of the art. The major topics we will cover are: 1. Theory and algorithms for linear programming. 2. Introduction to combinatorial problems and methods for handling intractable problems. 3. Introduction to nonlinear programming. 4. Introduction to network optimization. Optimization techniques have many applications in science and engineering. To name a few: * Optimal routing in communication networks. * Transmission scheduling and resource allocation in sensor networks. * Production planning and scheduling in manufacturing systems. * Fleet management. * Air traffic flow management by airlines. * Optimal resource allocation in manufacturing and communication systems. * Optimal portfolio selection. * Analysis and optimization of fluxes in metabolic networks. * Protein docking. Prerequisites: Working knowledge of Linear Algebra and some degree of mathematical maturity. Same as ENG EC 674, ENG SE 524, ENG EC 674. Students may not receive credits for both.
Software & Systems
CAS CS 561 Data Systems Architectures
4 credits. Spring
BU Hub Learn More Oral and/or Signed Communication Research and Information Literacy
Undergraduate Prerequisites: CAS CS 210 or equivalent and CAS CS 460/660. - Discusses the design of data systems that can address the modern challenges of managing and accessing large, ever-growing, diverse sets of data, often streaming from heterogenous sources, in the context of continuously evolving hardware and software. We use examples from several data management areas including relational systems, distributed database systems, key value stores, newSQL and NoSQL systems, data systems for machine learning (and machine learning for data systems), interactive analytics, and data management as a service. Effective Spring 2021, this course fulfills a single unit in each of the following BU Hub areas: Oral and/or Signed Communication, Research and Information Literacy.
CAS CS 562 Advanced Database Applications
4 credits. Fall and Spring
Undergraduate Prerequisites: (CASCS460) or consent of instructor. - Research issues in the design and implementation of modern database systems. Spatial, temporal, and spatiotemporal index structures. Indexing methods for image and multimedia databases and data warehouses. New data analysis techniques for large databases, clustering and rule discovery for very large datasets.
CDS DS 537 Data Science for Conservation Decisions
4 credits. Fall and Spring
BU Hub Learn More Digital/Multimedia Expression Quantitative Reasoning II Research and Information Literacy
This course covers the application of quantitative methods to support conservation decisions. Ecosystem value mapping, systematic conservation planning, policy instrument design, rigorous impact evaluation, decision theory, data visualization. Implementations in state-of-the-art open-source software. Real-life case studies from the U.S. and abroad. Effective Fall 2021, this course fulfills a single unit in each of the following BU Hub areas: Digital/Multimedia Expression, Quantitative Reasoning II, Research and Information Literacy.
CDS DS 563 Algorithmic Techniques for Taming Big Data
4 credits. Fall and Spring
Undergraduate Prerequisites: CDSDS110 OR CASCS111 OR ENGEK125 OR equivalent; CDSDS320 OR CASCS330 O R ENGEC330 OR equivalent; CDSDS121 OR CASCS132 OR CASMA242 OR equivale nt; CASMA115 OR CASCS327 OR ENGEK381 OR equivalent, OR conse - Growing amounts of available data lead to significant challenges in processing them efficiently. In many cases, it is no longer possible to design feasible algorithms that can freely access the entire data set. Instead of that we often have to resort to techniques that allow for reducing the amount of data such as sampling, sketching, dimensionality reduction, and core sets. Apart from these approaches, the course will also explore scenarios in which large data sets are distributed across several machines or even geographical locations and the goal is to design efficient communication protocols or MapReduce algorithms. The course will include a final project and programming assignments in which we will explore the performance of our techniques when applied to publicly available data sets.
ENG EC 527 High Performance Programming with Multicore and GPUs
4 credits. Fall and Spring
Undergraduate Prerequisites: EC413 or equivalent; programming in C
Considers theory and practice of hardware-aware programming. Key theme is obtaining a significant fraction of potential performance through knowledge of the underlying computing platform and how the platform interacts with programs. Studies architecture of, and programming methods for, contemporary high-performance processors. These include complex processor cores, multicore processors, and graphics processors. Laboratory component includes use and evaluation of programming methods on these processors through applications such as matrix operations and the Fast Fourier Transform.
Data Mining & Machine Learning
CAS CS 542 Principles of Machine Learning
4 credits.
Undergraduate Prerequisites: (CASCS365) - Introduction to modern machine learning concepts, techniques, and algorithms. Topics include regression, kernels, support vector machines, feature selection, boosting, clustering, hidden Markov models, and Bayesian networks. Programming assignments emphasize taking theory into practice, through applications on real-world data sets.
CAS CS 565 Algorithmic Data Mining
4 credits.
Undergraduate Prerequisites: (CASCS 112 & CASCS 330 & CASCS 365). - Introduction to data mining concepts and techniques. Topics include association and correlation discovery, classification and clustering of large datasets, outlier detection. Emphasis on the algorithmic aspects as well as the application of mining in real-world problems.
ENG EC 523 Deep Learning
4 credits. Fall and Spring
Prerequisites: A strong mathematical background in calculus, linear algebra, and prob ability & statistics, as well as prior coursework in machine learning and programming experience in Python. Mathematical and machine learning background for deep learning. Feed-forward networks. Backpropagation. Training strategies for deep networks. Convolutional networks. Recurrent neural networks. Transformers. Diffusion Models. Deep unsupervised learning. Exposure to Pytorch and other modern programming tools. Other recent topics, time permitting. Same as CAS CS 523. Students may not receive credit for both.
AI and Human-Centered Computing
CDS DS 682 Responsible AI, Law, Ethics & Society
4 credits. Spring
This course addresses the deployment of Artificial Intelligence systems across various societal domains, raising fundamental challenges and concerns such as accountability, liability, fairness, transparency, and privacy. Tackling these challenges necessitates an interdisciplinary approach, integrating principles and practices from data science, ethics, and law. This unique course will bring together students from computing and data science disciplines as well as law and public policy disciplines from multiple institutions.
Permission is required to register for this course. Course page: https://learn.responsibly.ai. Please fill out an application form here: https://forms.gle/bMRECdYcMUwHj7xG8. Instructor: shlomi@bu.edu.
DS for Social & Behavioral Sciences
CDS DS 574 Algorithmic Game Theory
4 credits. Fall and Spring
This course is an introduction to the interdisciplinary area of Algorithmic Mechanism Design: where computational perspectives are applied to economic problems, and economic techniques are brought to problems from computer science. We will explore a broad range of topics at the frontier of new research, starting with some of the fundamentals, such as welfare-maximizing auctions and types of Nash Equilibria.
Throughout the semester, the class will also learn about prevalent topics such as (1) Data Science & Incentives, (2) Mechanism Design for Social Good, and (3) optimization and robustness in mechanism design. As part of this course, students will engage in a (guided) research project, experiencing the various parts of conducting original research. This course is designed as an introductory graduate - level course but is open to advanced undergraduates with permission from the instructor.
While the formal undergraduate prerequisites are DS 120, DS 121, and DS122 and DS 320 (or equivalent), the course assumes strong proficiency in these topics for graduate students. Students should have: - Mathematical maturity and comfort with formal proofs - A solid understanding of probability (discrete and continuous random variables, moments, and conditional probability) - Familiarity with algorithms and computational efficiency.
Undergraduate students interested in this course should contact Professor Goldner (goldner@bu.edu) before registering for the course.
CDS DS 682 Responsible AI, Law, Ethics & Society
4 credits. Spring
This course addresses the deployment of Artificial Intelligence systems across various societal domains, raising fundamental challenges and concerns such as accountability, liability, fairness, transparency, and privacy. Tackling these challenges necessitates an interdisciplinary approach, integrating principles and practices from data science, ethics, and law. This unique course will bring together students from computing and data science disciplines as well as law and public policy disciplines from multiple institutions.
Permission is required to register for this course. Course page: https://learn.responsibly.ai. Please fill out an application form here: https://forms.gle/bMRECdYcMUwHj7xG8. Instructor: shlomi@bu.edu.