Seamless Change Data Capture and Database Replication
On-prem and cloud-based transactional and operational databases are an important source of the data needed for data science, machine learning, and ad hoc analytics. The best way to maintain an up-to-date replica of these databases in your cloud data lake is to create a log-based change data capture (CDC) pipeline. Such a pipeline copies an initial snapshot of the data, then monitors the source database log and streams the changes it records to keep the copy in sync with the original.
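To make the snapshot-then-stream idea concrete, here is a minimal in-memory sketch in Python. It is an illustration of the general log-based CDC pattern only, not Upsolver's implementation or any database's actual log format. Every name in it (ChangeEvent, take_snapshot, apply_changes, the integer log positions) is hypothetical and chosen for the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ChangeEvent:
    """One entry in a (hypothetical) source database change log."""
    position: int               # monotonically increasing log offset (assumed)
    op: str                     # "insert", "update", or "delete"
    key: int                    # primary key of the affected row
    row: Optional[dict] = None  # new row image; None for deletes


def take_snapshot(source_table: dict, log_position: int):
    """Copy the table and remember the log position the copy reflects."""
    return {k: dict(v) for k, v in source_table.items()}, log_position


def apply_changes(replica: dict, events: list, last_applied: int) -> int:
    """Apply log events in order, skipping any the replica already reflects."""
    for event in sorted(events, key=lambda e: e.position):
        if event.position <= last_applied:
            continue  # already covered by the snapshot or an earlier run
        if event.op in ("insert", "update"):
            replica[event.key] = dict(event.row)
        elif event.op == "delete":
            replica.pop(event.key, None)
        else:
            raise ValueError(f"unknown operation: {event.op}")
        last_applied = event.position
    return last_applied


# Step 1: baseline snapshot, taken at (assumed) log position 10.
source = {1: {"name": "Ada"}, 2: {"name": "Grace"}}
replica, position = take_snapshot(source, log_position=10)

# Step 2: stream changes recorded in the log after the snapshot.
log = [
    ChangeEvent(9, "insert", 2, {"name": "Grace"}),   # older than the snapshot
    ChangeEvent(11, "update", 1, {"name": "Ada L."}),
    ChangeEvent(12, "insert", 3, {"name": "Linus"}),
    ChangeEvent(13, "delete", 2),
]
position = apply_changes(replica, log, position)

print(replica)   # {1: {'name': 'Ada L.'}, 3: {'name': 'Linus'}}
print(position)  # 13
```

Tracking the last applied log position is what lets the replica resume after an interruption without reapplying changes it already holds. A production pipeline would read a real transaction log and persist that position durably, both of which this sketch leaves out.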
How Upsolver’s CDC solution works
Upsolver’s MySQL CDC feature lets you continuously replicate any on-prem or cloud MySQL database into your AWS or Azure data lake in a few simple steps, without writing code. Upsolver first replicates a baseline snapshot of the source database, then keeps that copy up to date by monitoring the transaction log and replicating changes as they occur.
The replicated data can then be read directly by query engines or written to external data stores such as your cloud data warehouse.