Suppose you had a dataset that looks like this:
| date | individual | direction of travel | individual natal group |
|---|---|---|---|
| Jan 1 | LeRoy | North | Green Stars |
| Jan 3 | LeRoy | North | Green Stars |
| Jan 1 | Lucinda | South | Black Stars |
| Jan 3 | Lucinda | North | Black Stars |
It is redundant to keep track of the natal group for each individual in this format, as the natal group never changes.
Its not so bad in small datasets, but gets super redundant when you gets 100s of rows