Latest News
- Home
- Blog
BERT Models and Its Variants
This article is divided into two parts; they are: • Architecture and Training of BERT • Variations of BERT BERT is an encoder-only model.
Read More...The Journey of a Token: What Really Happens Inside a Transformer
Large language models (LLMs) are based on the transformer architecture, a complex deep neural network whose input is a sequence of token embeddings.
Read More...
By Adrian Tam