Models

Large Language Model (LLM)

A type of foundational AI model trained on vast amounts of text to understand and generate human language.

Detailed Explanation

A Large Language Model (LLM) is an artificial intelligence model built on a neural network architecture, typically the Transformer architecture. These models are trained on massive datasets comprising text from the internet, books, and articles. By predicting the next word in a sequence, LLMs learn grammar, facts, reasoning abilities, and coding skills. Examples include OpenAI's GPT-4, Google's Gemini, and Meta's Llama 3.

Related Terms

Generative AI

AI systems capable of creating new, original content like text, images, or audio.

Prompt Engineering

The practice of designing and refining inputs (prompts) to get optimal results from AI models.

Transformer Architecture

The underlying neural network architecture introduced by Google that powers modern LLMs.

Back to Glossary