Skip to main content

How to Handle Duplicated Documents in Tofu

Tofu automatically detects when the same file is uploaded more than once. These files are labeled Duplicated, allowing you to review and resolve them. This helps prevent duplicate records and keeps your document list organized.

B
Written by Bianca Raquel
Updated this week

How to Review Duplicated Documents

You can review duplicates directly from the Source Documents panel, which displays a list of uploaded documents. When duplicates are detected, Tofu provides two ways to review them:

Option 1: Use the Duplicate Notification Banner

At the top of the Source Documents panel, Tofu will display a yellow notification banner that typically says:

“Detected # set of duplicated documents.”

To review them:

  1. Click Review on the right side of the banner.

  2. This opens the duplicate review mode where you can compare and resolve the files.

Option 2: Review from the Document

You can also review duplicates directly from the document.

  1. Locate the document labeled Duplicated.

  2. Click the Duplicated label to open the duplicate review mode and resolve the files.


Resolving Duplicated Documents

Once in Review mode, Tofu will guide you depending on whether the original document already has an extraction.

Scenario 1: Original Document Has No Extraction Yet

You will see the message:

“Original is not yet linked to any extraction.”

Options:

  • Extract original and resolve (Recommended)

    • Creates an extraction from the original document.

    • Both the original and duplicated documents are marked Done, which removes them from the Needs Review list.

  • Ignore and Resolve

    • Marks both the original and duplicated documents as Done, removing them from the Needs Review list.

    • No extraction is created.


Scenario 2: Original Document Already Has an Extraction

If the original document is already linked to an extraction, you will see a message such as:

“Original already linked to: [Extraction Type]”

The extraction type depends on which extraction type the file was uploaded (e.g. AP Bills, AR Invoices, Bank Statements, or Direct Expense).

  • Resolve Duplicates

    • Marks both the original and duplicated documents as Done.

    • Removes them from the Needs Review list.

💡 Tip:
If a document shows it’s linked to an extraction or invoice, it means an extraction has been created from that file. Click the eye icon to isolate and view linked extractions in the Extractions panel.


Best Practices

  • Always review duplicated documents before resolving them.

  • Resolve duplicates after confirming the correct document has been processed to keep the Needs Review list organized.

Did this answer your question?