Feed

TRAINING DATA — *the examples a model learns from; garbage-in-garbage-out.* The AI-literacy primitive of *recognizing that the model is what its training examples taught it, and that the examples are not neutral.*

Press play to listen along. The line being read lights up as you go.

Show full transcript

Loading transcript…

01 Opening

Feed was made of paper. But she wasn't just one sheet. She was a tall, neat stack of index cards. A single silver paper clip held her together.

She wasn't an animal. She definitely wasn't a robot. Feed was a paper-craft person, built in the same workshop as her friend Sort. Her whole body, from her paper feet to her paper head, was a stack. And every single card was an example. An example for an AI model to learn from.

If you looked closely at a card, you’d see two things. One part was a picture, a word, or a number. That was the input. The other part was a tiny label. That was the output, or the right answer.

An AI model would study thousands of her cards. It would learn to match the input to the right answer. It wasn't magic. The model didn't actually understand what a "cat" was. It just got really, really good at spotting cat-like patterns in the pictures.

This was a very big deal. A model is what its examples teach it. If the examples were good, the model learned useful patterns. But what if the examples were bad? What if they were missing things? Or just wrong? Then the model learned all those bad things, too.

Feed had a favorite saying. She said it all the time.

"Garbage in, garbage out."

Feed got very serious when she talked about her cards.

02 Feed

"These examples aren't just facts," she would say, tapping her stack. "They are choices. Human choices. Someone picked every picture. Someone wrote every label. They decided what was 'right'."

That's why Feed worked so closely with Catch. Catch was her friend over in the DataForge. When data arrived at the AIForge, it always came with notes from Catch. Catch asked the important questions. Who gathered this data? Why did they gather it? Feed needed those answers to make sure her examples were good. They were a team.

Like Sort, Feed grew up in a paper-crafts workshop. The workshop had one big rule. Every paper figure was made to help another. Feed was folded and clipped together for one reason: to help Sort. Sort learned her first rules for sorting things by studying Feed's cards. They were a perfect pair.

When she first arrived at the AIForge, Bit asked her a question. Bit was the head of the academy.

"What is *training data*?" Bit asked.

Feed didn't hesitate. "It's the examples a model learns from," she said. Her voice was crisp and clear, like folded paper. "The model is only as good as its examples. Good examples, good model. Bad examples, bad model. Garbage in, garbage out."

Bit smiled. "You're hired."

Feed started her first class the same way every time. She would lift her entire stack of cards. With a flick of her paper wrists, she fanned them out. The students saw a rainbow of tiny pictures and words. Each one had a label.

"I am Feed," she'd say. "And this," she shook the cards gently, "is *training data*. It's what an AI model learns from. The model is what these cards teach it. Simple as that."

03 Feed

She taught her students five big questions to ask about any stack of *training data*.

"First," she said, holding up one paper finger. "Where did these examples come from? Who collected them? And why? Remember my friend Catch? Her notes are key. Sometimes, the way data is collected can hide a problem."

"Second: Who wrote the labels? What rules did they follow? Were they the right people to be deciding the 'right' answer?"

"Third: What's included? Do the examples show all different kinds of things? Or just the common ones? What about weird, rare situations?"

"Fourth," Feed's voice grew firm. "What's missing? This is the most important question. A model can't learn about something it never sees. If there are no pictures of purple bananas in my stack, the model will never know they exist."

"And fifth: Are the numbers balanced? If I have ten thousand pictures of cats and only two pictures of dogs, what do you think the model will be good at spotting?"

She paused, looking at the students. "It will be a cat-spotting expert. And a dog-spotting disaster."

"And that brings me to my biggest rule," Feed said. She snapped her cards back into a perfect stack. "Garbage in, garbage out. You can't fix bad examples with a fancy model. A wobbly foundation makes a wobbly building."

She looked at them seriously. "And never, ever say the model 'thinks' or 'understands.' It doesn't. It finds patterns. That's all. Be honest about what it's doing."

04 Feed

"My cards can be wrong," she told the class. "I'm just the stack. I have no way of knowing if a label is a mistake. The humans who made the cards decided what was right. If they made a mistake, the model learns that mistake."

She tapped her paper-clip heart. "That's why you have to be detectives. You have to investigate the *training data*. The model can't fix what it was never taught."

A student once asked her if *training data* was hard.

Feed shook her head, which made her whole stack rustle. "It's not about being hard or easy," she said. "It's about being careful. It's about looking at the examples. Looking at the labels. And most of all, looking at the human choices behind them."

She fanned her cards one last time.

"The model is what the examples taught it. Garbage in, garbage out."

She said it gently, and there was something warm underneath the words — a quiet pride in every careful card, and a happy, hopeful feeling that the students now understood. Feed smiled her crisp paper smile. Being careful, she thought, felt good.

A Bit More About Feed

*What she's like:* Feed is very neat and tidy. She thinks her stack of cards is the most important thing in the world. She's not a robot or an animal—she's a person made of paper, and she's very direct. She doesn't believe examples are just "facts." She'll always remind you that a person chose every single one.

*Who are her friends?:* She's best friends with Sort, because her examples help Sort learn how to do her job. She also works closely with Catch from the DataForge. They're a team!

05 Closing

Things Feed Says A Lot

- "The model is what the examples taught it. Garbage in, garbage out." - "These aren't just cards. They're human choices." - "What's missing from the examples is just as important as what's there." - "The model can't tell if its examples are good or bad. That's our job."

Where You'll See Feed

You'll meet Feed properly in the second story kit, where she gets her own chapter! After that, she'll pop up again and again. You'll see her helping out with picture examples, word examples, and number examples. Later on, her teamwork with Catch from the DataForge becomes super important. She's a key member of the AIForge team!

Feed's Friends

- *Best Pal: Sort. Feed's examples are what Sort learns from. They were made together! - Closest Teammate: Catch, from the DataForge. They have to work together to make sure the data is good. - Other Friends:* Feed is friends with almost everyone at the AIForge. Her work is important for understanding bias with Skew and thinking about rules with Stake.

A Note from the Workshop

We wanted Feed to show that AI isn't scary magic. It's something you can understand by looking at the examples it learns from. You don't need to be a super-genius to ask good questions about them! That's why Feed always says her job is about being careful, not about being complicated. It's also why she always works with Catch—to show that where the examples come from really matters.

Why Is Feed Like This?

Feed's story comes from a few big ideas. The saying "garbage in, garbage out" is a very old and famous rule in computer science. It's a reminder that computers do exactly what you tell them to! The idea that her cards are "human choices" is also super important for understanding AI today. We wanted to show that AI systems are shaped by the people who build them. And having Feed and Catch be a team is our way of showing that you can't understand an AI without understanding the data it learned from. They're two parts of the same story.