File:Full GPT architecture.png

Original file(863 × 1,038 pixels, file size: 129 KB, MIME type: image/png)

Commons-logo.svg This is a file from the Wikimedia Commons. The description on its description page there is shown below.
Commons is a freely licensed media file repository. You can help.

Summary

Description
English: The full architecture of a generative pre-trained transformer (GPT) model.
Date
Source Own work
Author Marxav
Other versions

Licensing

I, the copyright holder of this work, hereby publish it under the following license:
Creative Commons CC-Zero This file is made available under the Creative Commons CC0 1.0 Universal Public Domain Dedication.
The person who associated a work with this deed has dedicated the work to the public domain by waiving all of their rights to the work worldwide under copyright law, including all related and neighboring rights, to the extent allowed by law. You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission.

Captions

The full architecture of a GPT model.

Items portrayed in this file

depicts

27 December 2022

image/png

File history

Click on a date/time to view the file as it appeared at that time.

Date/TimeDimensionsUserComment
current02:00, 2 January 2023863 × 1,038 (129 KB)Marxavadded a dropout module

The following page uses this file: