Large Language Models

Definition

Large Language Models (LLMs) are neural networks trained on massive text datasets to predict and generate human-like text. They are the technology underlying tools like ChatGPT, Claude, and Gemini — and represent the current frontier of AI capability in language, reasoning, and code generation.

Why It Matters

LLMs represent a qualitative shift in AI capability — they are not specialized tools (like chess engines or image classifiers) but general-purpose systems that can write, reason, translate, summarize, code, and converse across an enormous range of tasks. Understanding how they work is essential for using them well and governing them responsibly.

How They Work

Trained to predict the next token (word fragment) in a sequence, given all preceding tokens
Scale is the key driver: more parameters + more data + more compute = more capable models
Emergent capabilities appear at scale that were not present in smaller models — reasoning, code generation, few-shot learning
They do not “know” facts; they pattern-match against training data, which produces both remarkable coherence and confident hallucination

Key Properties

Hallucination: LLMs generate plausible-sounding text even when producing false information — they have no ground truth to verify against
Context window: The amount of text they can “see” at once; beyond it, they lose access to earlier content
Prompt sensitivity: Small changes in phrasing can dramatically change outputs — prompting is a skill, not just input
Alignment gap: What an LLM says it will do and what it actually does can diverge

Connection to Broader AI Themes

The jagged frontier (Co-Intelligence): LLMs are powerful in unexpected places and weak in equally unexpected ones
The alignment problem (The Alignment Problem): Getting LLMs to reliably do what users want is technically unsolved
Governance (The Coming Wave): LLMs are proliferating faster than regulatory frameworks can adapt

Human-AI Collaboration — LLMs are the primary tool for AI collaboration in knowledge work
AI Ethics — LLMs encode biases from training data and can generate harmful content at scale
AI Safety — aligning LLMs with human values is the core challenge
Future of Work — LLMs are the technology most immediately disrupting knowledge work
Prompt Engineering — the practice of eliciting good outputs from LLMs

Key Books

Co-Intelligence — the most practical guide to working with LLMs
The Alignment Problem — why making LLMs reliably safe and helpful is hard
The Coming Wave — LLMs as a component of the broader wave requiring governance

The Library Vault

Explorer

Large Language Models

Large Language Models

Definition

Why It Matters

How They Work

Key Properties

Connection to Broader AI Themes

Key Books

Graph View

Table of Contents

Backlinks

The Library Vault

Explorer

Large Language Models

Large Language Models

Definition

Why It Matters

How They Work

Key Properties

Connection to Broader AI Themes

Related Concepts

Key Books

Graph View

Table of Contents

Backlinks