Databricks. has been granted a patent for a data tree management system that utilizes a KD-epsilon tree structure. This system efficiently organizes data files in a data table, enabling transaction operations while minimizing data file rewrites through a hierarchical node and edge configuration. GlobalData’s report on Databricks gives a 360-degree view of the company including its patenting strategy. Buy the report here.

According to GlobalData’s company profile on Databricks, was a key innovation area identified from patents. Databricks's grant share as of July 2024 was 56%. Grant share is based on the ratio of number of grants to total number of patents.

Data tree for efficient data file management and transactions

Source: United States Patent and Trademark Office (USPTO). Credit: Databricks Inc

The patent US12072863B1 outlines a method for efficiently ingesting records into a data table within a data storage system. The process begins with receiving a request from a client device to add a set of records. The method involves accessing a data tree structure that organizes records based on key-value conditions. Each node in this tree represents specific conditions, with leaf nodes containing data files that hold subsets of records. The method checks if the parent node has enough buffer space to accommodate the incoming records. If the buffer is insufficient, the system writes the records to associated child nodes, ensuring that data storage is optimized and preventing overflow.

Additionally, the patent describes a mechanism for handling multiple requests and performing query operations on the data table. When a second request is received, the system again assesses the buffer capacity of the relevant parent node. If sufficient space is available, the records are ingested directly; otherwise, they are distributed among child nodes. The method also includes a metadata tree that aids in efficiently querying records based on specified key-value ranges. This metadata provides essential information about the data files, such as size and key-value ranges, facilitating quick access to relevant records. Overall, the patent presents a structured approach to data management that enhances storage efficiency and query performance in data storage systems.

To know more about GlobalData’s detailed insights on Databricks, buy the report here.

Data Insights

From

The gold standard of business intelligence.

Blending expert knowledge with cutting-edge technology, GlobalData’s unrivalled proprietary data will enable you to decode what’s happening in your market. You can make better informed decisions and gain a future-proof advantage over your competitors.

GlobalData

GlobalData, the leading provider of industry intelligence, provided the underlying data, research, and analysis used to produce this article.

GlobalData Patent Analytics tracks bibliographic data, legal events data, point in time patent ownerships, and backward and forward citations from global patenting offices. Textual analysis and official patent classifications are used to group patents into key thematic areas and link them to specific companies across the world’s largest industries.