Training Data
The dataset used to teach an AI model. For language models, this includes text from books, websites, and other sources. For image generators, it typically consists of images paired with text descriptions. The quality, diversity, and size of the training data directly affect a model's capabilities and its potential biases.