With anonymization, personal data is permanently and irreversibly removed. This can be achieved through redaction (black bars), deletion or generalisation (e.g. replacing exact age with an age bracket). The result: the document context is lost.
Example: Contract Anonymized
Imagine a purchase contract that is anonymized before being submitted to ChatGPT:
Anonymized: "The contract between [REMOVED] and [REMOVED] for the delivery of [REMOVED] valued at [REMOVED] was signed on [REMOVED]. [REMOVED] commits to delivering the goods by [REMOVED] to the address [REMOVED]."
The AI cannot work with this text. It does not know who the contracting parties are, what is being delivered, what the value is or when delivery is due. Meaningful analysis is impossible.
Example: Contract Pseudonymized
The same contract, pseudonymized:
Pseudonymized: "The contract between Person_A and Company_A for the delivery of Product_A valued at Amount_A was signed on Date_A. Company_A commits to delivering the goods by Date_B to Address_A."
The AI can fully analyse the contract. It understands the structure, the obligations and the deadlines. After analysis, the pseudonyms are replaced with the original data via the replacement table — the result is fully usable.