parquet
High-performance time-series data warehouse built on DuckDB, Parquet, and MinIO.
Time-series data warehouse built for speed: 1.89M records/sec on native deployment. DuckDB + Parquet + MinIO. Licensed under AGPL-3.0.
DuckLake is an integrated data lake and catalog format
DuckLake delivers advanced data lake features without traditional lakehouse complexity by using Parquet files and your SQL database. It's an open, standalone format from the DuckDB team.
DuckLake is an open lakehouse format built on SQL and Parquet. It stores metadata in a catalog database and data in Parquet files. The DuckLake extension lets DuckDB read from and write to DuckLake directly.
Look At Your Data 👀.
Data quality is the most important factor in machine learning success. Hyperparam brings exploration and analysis of massive text datasets to the browser.
Your Data Pipeline, Simplified. GlareDB: An analytics DBMS for distributed data.
Data exists everywhere: on your laptop, in Postgres, in Snowflake, and as files in S3. It comes in various formats such as Parquet, CSV, and JSON. Regardless of where it lives or what format it takes, getting the insights you need always involves multiple steps spanning several destinations.
GlareDB is designed to query your data wherever it lives using SQL that you already know.