Transformer Architectures From Scratch Download - Transformer Architectures From Scratch Source code download

Transformer Architectures From Scratch

Other source code

1.0.0

Download

Transformer Architecure From Scratch Using PyTorch

1) TRANSFORMER -

A Self attention based Encoder-Decoder Architecture. It is mostly used for

Machine Translation
Document Summaraization
Text extraction

Paper - https://arxiv.org/abs/1706.03762

2) BERT -

A Self-attention based Encoder Architecture. It is mostly used for

Sentiment Classification
Named Entity Recognition
Question and Answering
Sentence Embedding Extraction
Document Matching

Paper - https://arxiv.org/abs/1810.04805

3) GPT-1 -

A Self-attention based Decoder based Autoregressive model. It is mostly used for

Sentence Completion
Generating Text
Sentiment Classification

Paper - https://paperswithcode.com/method/gpt

4) GPT-2 -

A Self-attention based Decoder based Autoregressive model with a slight change in architecture and trained on larger corpus of text than GPT-1. It is mostly used for

Sentence Completion
Generating Text
Sentiment Classification

Paper - https://d4mucfpksywv.cloudfront.net/better-language-models/language-models.pdf

5) ViT -

A State of the art Self-attention based Encoder Architecture for Computer Vision application. It is mostly used for

Image Classification
Image Encoding
Backbone for Object Detection

Paper - https://arxiv.org/abs/2006.03677

6) PERFORMER -

A Self-attention based Encoder-Decoder Architecture with a linear time complexity other than transformer which has quadratic time complexity. It is mostly used