Page 2 - LARGE LANUGAGE MODEL AND SMALL LANUGAGE MODEL
P. 2
Definition of LLM
A large language model (LLM) is a computational model notable for its ability to
achieve general-purpose language generation and other natural language
processing tasks.
LLMs acquire these abilities by learning statistical relationships from vast amounts of
text during a computationally intensive self-supervised and semi-supervised training
process.
LLMs are artificial neural networks that utilize the transformer architecture, invented
in 2017. The largest and most capable LLMs, as of June 2024, are built with a
decoder-only transformer-based architecture, which enables efficient processing
and generation of large-scale text data.
In a simple term large language models are AI systems capable of understanding
and generating human language by processing vast amounts of text data.