data-diff

Data-diff is a command-line tool and Python library to efficiently diff rows across two different databases.

⇄ Verifies across many different databases (e.g. PostgreSQL -> Snowflake) !

πŸ” Outputs diff of rows in detail

🚨 Simple CLI/API to create monitoring and alerts

πŸ”₯ Verify 25M+ rows in <10s, and 1B+ rows in ~5min.

♾️ Works for tables with 10s of billions of rows

For more information, See our README

Resources