delta-lake
Here are 141 public repositories matching this topic...
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.
-
Updated
Jun 11, 2024 - Java
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
-
Updated
Jun 11, 2024 - Java
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
-
Updated
Jun 11, 2024 - Java
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
-
Updated
Jun 11, 2024 - Scala
This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you could also launch an EMR notebook via cluster template to check the outcome from the EMR Serverless application.
-
Updated
Jun 11, 2024 - TypeScript
An open protocol for secure data sharing
-
Updated
Jun 10, 2024 - Scala
Free High-Quality Financial Data in Azure
-
Updated
Jun 10, 2024 - Python
Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.
-
Updated
Jun 10, 2024 - Python
This project builds an End-to-End Azure Data Engineering Pipeline, performing ETL and Analytics Reporting on the AdventureWorks2017LT Database.
-
Updated
Jun 10, 2024 - Jupyter Notebook
PawMark is a platform for developers to build, schedule and monitor data pipelines.
-
Updated
Jun 10, 2024 - JavaScript
Analytical database for data-driven Web applications 🪶
-
Updated
Jun 10, 2024 - Rust
A native Rust library for Delta Lake, with bindings into Python
-
Updated
Jun 10, 2024 - Rust
🦖 Efficiently evolve your old fixed-length data files into more modern file formats, fully parallelized!
-
Updated
Jun 7, 2024 - Rust
Hackolade plugin for Delta Lake on Databricks
-
Updated
May 31, 2024 - JavaScript
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
-
Updated
May 29, 2024 - Rust
-
Updated
May 28, 2024 - Jupyter Notebook
Schema mappings in SQL and PySpark for ELT pipelines to normalize data to OCSF
-
Updated
May 28, 2024 - Python
Improve this page
Add a description, image, and links to the delta-lake topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the delta-lake topic, visit your repo's landing page and select "manage topics."