Calendar

IS&T RCS Tutorial - GPT & Transformers (LLMs Part 3) (Hands-on)

Starts:
10:00 am on Friday, June 13, 2025
Ends:
12:00 pm on Friday, June 13, 2025
Location:
Biological Science Center, 2 Cummington Mall, Room 107
URL:
https://www.bu.edu/tech/about/training/classroom/rcs-tutorials/
Training Large Language Models (LLMs) requires a large neural network, a large dataset, and a large amount of compute. We will discuss the difficulties these requirements create. We will look at the Transformer architecture in detail to develop a quantitative understanding of how it works and of how tools such as ChatGPT, DeepSeek, and Llama work. We will then use a pre-trained SentenceTransformer model to perform a range of classification tasks on real-world data.
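
For a flavor of the quantitative view mentioned above, here is a minimal sketch of scaled dot-product attention, the operation at the core of the Transformer architecture. It is an illustration only and is not taken from the tutorial materials; the function names `softmax` and `attention` and the toy inputs are assumptions made for this example, and it uses plain Python lists rather than a tensor library.

```python
import math


def softmax(scores):
    """Turn a list of raw scores into weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention on lists of vectors (illustrative sketch).

    Each output row is a weighted average of the value vectors, with weights
    given by softmax(q . k / sqrt(d_k)) over all keys.
    """
    d_k = len(keys[0])
    outputs = []
    for q in queries:
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        weights = softmax(scores)
        out = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(out)
    return outputs


# Toy example: two identical keys receive equal weight,
# so the output is the plain average of the two value vectors.
result = attention(
    queries=[[1.0, 0.0]],
    keys=[[1.0, 0.0], [1.0, 0.0]],
    values=[[0.0, 2.0], [4.0, 6.0]],
)
```

In the toy example the two keys are identical, so the query attends to both equally and the result is the average of the two value vectors. Real models apply the same idea to learned projections of token embeddings, across many attention heads and layers.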