Skip to content

Commit 1281a02

Browse files
author
Azure Pipelines
committed
publish [GPU]: course_UvA-DL/05-transformers-and-MH-attention
1 parent fcbd970 commit 1281a02

File tree

3 files changed

+55
-0
lines changed

3 files changed

+55
-0
lines changed
Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
version https://git-lfs.github.com/spec/v1
2+
oid sha256:e7ce19c647782db988a6ca05c534f8e919f064e314d91fc19730f20b0cbae428
3+
size 3024932
Loading
Lines changed: 52 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,52 @@
1+
title: 'Tutorial 5: Transformers and Multi-Head Attention'
2+
author: Phillip Lippe
3+
created: 2021-06-30
4+
updated: 2023-03-14
5+
license: CC BY-SA
6+
build: 0
7+
tags:
8+
- Text
9+
description: 'In this tutorial, we will discuss one of the most impactful architectures
10+
of the last 2 years: the Transformer model.
11+
12+
Since the paper Attention Is All You Need by Vaswani et al. had been published in
13+
2017,
14+
15+
the Transformer architecture has continued to beat benchmarks in many domains, most
16+
importantly in Natural Language Processing.
17+
18+
Transformers with an incredible amount of parameters can generate long, convincing
19+
essays, and opened up new application fields of AI.
20+
21+
As the hype of the Transformer architecture seems not to come to an end in the next
22+
years,
23+
24+
it is important to understand how it works, and have implemented it yourself, which
25+
we will do in this notebook.
26+
27+
This notebook is part of a lecture series on Deep Learning at the University of
28+
Amsterdam.
29+
30+
The full list of tutorials can be found at https://uvadlc-notebooks.rtfd.io.
31+
32+
'
33+
requirements:
34+
- torchvision
35+
- matplotlib
36+
- seaborn
37+
- lightning>=2.0.0
38+
accelerator:
39+
- GPU
40+
environment:
41+
- pytorch-lightning==2.0.9.post0
42+
- lightning==2.3.3
43+
- setuptools==69.0.3
44+
- ipython==8.16.1
45+
- numpy==1.26.4
46+
- torchvision==0.15.2
47+
- seaborn==0.13.2
48+
- torchmetrics==1.2.1
49+
- matplotlib==3.8.4
50+
- torch==2.0.1
51+
- urllib3==2.2.2
52+
published: '2024-07-19T19:40:50.645247'

0 commit comments

Comments
 (0)