Data Science

Data Science Blog

Reading the Pseudo-code for Transformers

April 10, 2024

Transformers are the underlying algorithms powering large language models like GPT, Bert, and Llama. In this seminar, Dr. Yixiang Wu, with MTSU’s Department of Mathematical Sciences will delve into a paper authored by the DeepMind team, focusing on the formal algorithms that define transformers. To grasp these algorithms, attendees need only a basic understanding of linear algebra (matrix multiplications) and probability theory (conditional probabilities).

See the video here.

Follow Us!

Contact Us

Data Science Institute
MTSU Box 0499
1301 East Main Street

Murfreesboro, TN 31732
615-898-2122

Data Science

Data Science Blog

Reading the Pseudo-code for Transformers

Recent Posts

Follow Us!

Contact Us