
Training Data

The dataset from which an AI model learns. For language models, this is text drawn from books, websites, and other sources. For image generators, it is typically image-text pairs. The quality, diversity, and size of training data directly affect a model's capabilities and its potential biases.

Related terms

Machine Learning (ML), Fine-Tuning
