Models

Large Language Model (LLM)

A type of foundational AI model trained on vast amounts of text to understand and generate human language.

Detailed Explanation

A Large Language Model (LLM) is an artificial intelligence model built on a neural network architecture, typically the Transformer architecture. These models are trained on massive datasets comprising text from the internet, books, and articles. By predicting the next word in a sequence, LLMs learn grammar, facts, reasoning abilities, and coding skills. Examples include OpenAI's GPT-4, Google's Gemini, and Meta's Llama 3.

Related Terms