Databricks has been granted a patent for a method and system that facilitates merging target and source tables. The process involves executing jobs to identify matching rows, perform merge operations, and manage deletion vectors, ultimately producing a resulting table based on these operations. GlobalData’s report on Databricks gives a 360-degree view of the company including its patenting strategy. Buy the report here.

According to GlobalData’s company profile on Databricks, was a key innovation area identified from patents. Databricks's grant share as of June 2024 was 57%. Grant share is based on the ratio of number of grants to total number of patents.

Table merging with deletion vector management system

Source: United States Patent and Trademark Office (USPTO). Credit: Databricks Inc

The patent US12045220B2 outlines a system and method for merging data from a target table and a source table, particularly when the source table contains files that differ from those in the target table. The system comprises a memory with encoded instructions and one or more processors that execute these instructions. The process begins with determining the need to merge the two tables, followed by a first job that identifies matching files between the target and source tables. This job involves storing information about these matches and the corresponding rows that align in both tables. A second job is then performed, which executes a merge operation based on the identified rows and any deletion vectors associated with previously removed rows in the target table. The outcome of this operation is a resulting table that integrates the relevant data from both sources.
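The two-job flow described above can be illustrated with a small in-memory sketch. This is not the patented implementation or any Databricks API: the `Table` class, the `find_matches` and `apply_merge` functions, the `key` column, and the `merged-0` file name are all hypothetical, and the merge is assumed to be a simple upsert in which matched target rows are replaced by the source rows.

```python
from dataclasses import dataclass, field


@dataclass
class Table:
    # Hypothetical layout: file name -> list of rows, each row a dict with a "key" column.
    files: dict
    # File name -> set of row positions already marked as deleted.
    deletion_vectors: dict = field(default_factory=dict)

    def live_rows(self):
        """Yield (file, position, row) for rows not masked by a deletion vector."""
        for name, rows in self.files.items():
            deleted = self.deletion_vectors.get(name, set())
            for pos, row in enumerate(rows):
                if pos not in deleted:
                    yield name, pos, row


def find_matches(target, source_rows):
    """Job 1 (assumed shape): record which target files and row positions match a source key."""
    source_keys = {row["key"] for row in source_rows}
    matches = {}
    for name, pos, row in target.live_rows():
        if row["key"] in source_keys:
            matches.setdefault(name, {})[pos] = row["key"]
    return matches


def apply_merge(target, source_rows, matches, new_file="merged-0"):
    """Job 2 (assumed shape): mask matched rows and write the source rows to a new file."""
    files = dict(target.files)
    # Copy existing deletion vectors so the input table is left unchanged.
    vectors = {name: set(dv) for name, dv in target.deletion_vectors.items()}
    for name, positions in matches.items():
        vectors.setdefault(name, set()).update(positions)
    files[new_file] = [dict(row) for row in source_rows]
    return Table(files, vectors)
```

In this sketch, rows already masked by a deletion vector are skipped during matching, so a previously deleted key is treated as an insert rather than an update. Existing files are never rewritten; the merge only extends the deletion vectors and adds one new file.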

Additionally, the patent details various functionalities related to managing deletion vectors, which track previously deleted rows in the target table. The system can determine and store these vectors based on metadata, ensuring that the merge operation accurately reflects the current state of the data. The instructions also allow for post-processing tasks, such as updating metadata statistics for the resulting table and ensuring the validity of these statistics. The patent emphasizes the importance of maintaining data integrity during the merging process, including the handling of files that may not be keyed to the target table. Overall, the invention provides a structured approach to data merging that enhances the efficiency and accuracy of data management systems.
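As a rough illustration of the two ideas in this paragraph, the sketch below derives deletion vectors from a metadata log and then recomputes and checks per-file statistics as a post-processing step. The log format (`action`, `file`, `positions`), the statistic names, and all function names are assumptions made for the example; the patent text summarized here does not specify them.

```python
def deletion_vectors_from_metadata(log):
    """Fold a hypothetical metadata log into one deletion vector (set of positions) per file."""
    vectors = {}
    for entry in log:
        if entry["action"] == "delete_rows":
            vectors.setdefault(entry["file"], set()).update(entry["positions"])
    return vectors


def compute_stats(files, vectors):
    """Post-processing: recompute per-file statistics over rows that are still live."""
    stats = {}
    for name, rows in files.items():
        deleted = vectors.get(name, set())
        keys = [row["key"] for pos, row in enumerate(rows) if pos not in deleted]
        stats[name] = {
            "num_live_rows": len(keys),
            "min_key": min(keys) if keys else None,
            "max_key": max(keys) if keys else None,
        }
    return stats


def stats_are_valid(stats, files, vectors):
    """Check stored statistics against a fresh recomputation."""
    return stats == compute_stats(files, vectors)
```

Statistics computed before a merge can go stale once new rows are masked, which is one plausible reason to recompute and validate them afterwards; the validity check here is deliberately the simplest possible one, a full recomputation.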

