Foundation Model

A large pretrained model intended as a shared starting point for many downstream tasks via fine-tuning or prompting.

What It Is

In this reference, a foundation model is a general-purpose pretrained checkpoint: wide data, large capacity, and an explicit role as a base for downstream use. It is still a model in the taxonomy sense—weights you can load—not an architecture diagram alone.

Why It Matters

Labeling a release as a foundation model signals adaptation paths: which generative or discriminative heads are common, and how it relates to the plain model glossary entry for artifacts versus roles.

Simple Example

A 70B language model trained on mixed web and code text, then instruction-tuned for chat, often has a foundation pretraining stage and a later alignment stage. The foundation stage checkpoint is the shared base many products build on.

Common Confusions

A foundation model is not automatically generative: the same base can feed classifiers or generators depending on heads and training. It is also not the same as any large model—a narrow specialist can be huge but not intended as a reusable base.

Foundation Model

What It Is

Why It Matters

Simple Example

Common Confusions

Tags

References

On this page

Foundation Model

What It Is

Why It Matters

Simple Example

Common Confusions

Related Concepts And Modules

Tags

References

On this page