<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <title>data-transformation</title>
    <link rel="self" type="application/atom+xml" href="https://links.biapy.com/guest/tags/855/feed"/>
    <updated>2026-06-15T03:11:00+00:00</updated>
    <id>https://links.biapy.com/guest/tags/855/feed</id>
            <entry>
            <id>https://links.biapy.com/links/11155</id>
            <title type="text"><![CDATA[Singer]]></title>
            <link rel="alternate" href="https://www.singer.io/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/11155"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Simple, Composable, Open Source ETL

Singer powers data extraction and consolidation for all of your organization’s tools.

- [Singer @ GitHub](https://github.com/singer-io).]]>
            </summary>
            <updated>2025-12-02T16:18:00+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/10846</id>
            <title type="text"><![CDATA[Bruin]]></title>
            <link rel="alternate" href="https://getbruin.com/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/10846"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Your last data platform.
Reliable data. 10x faster, 90% less complexity.

 Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows. 

Bruin is a data pipeline tool that brings together data ingestion, data transformation with SQL &amp;amp; Python, and data quality into a single framework. It works with all the major data platforms and runs on your local machine, an EC2 instance, or GitHub Actions.

- [Bruin @ GitHub](https://github.com/bruin-data/bruin).

Related contents:

- [Digest #186: Inside the AWS Outage, Docker Compose in Production, F1 Hacks and 86,000 npm Packages Attacks @ DevOps Bulletin](https://www.devopsbulletin.com/p/digest-186-inside-the-aws-outage).]]>
            </summary>
            <updated>2025-11-03T10:19:09+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/10252</id>
            <title type="text"><![CDATA[Malloy]]></title>
            <link rel="alternate" href="https://www.malloydata.dev/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/10252"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[A modern open source language for analyzing,
transforming, and modeling data.

Malloy is a modern open source language for describing data relationships and transformations. It is both a semantic modeling language and a query language that uses an existing SQL engine to execute queries. Malloy currently can connect to BigQuery, Snowflake, PostgreSQL, MySQL, Trino, or Presto, and natively supports DuckDB. We&amp;#039;ve built a Visual Studio Code extension to facilitate building Malloy data models, querying and transforming data, and creating simple visualizations and dashboards.

- [Malloy @ GitHub](https://github.com/malloydata/malloy).

Related contents:

- [Lessons on building an AI data analyst @ Pedro Nascimento](https://www.pedronasc.com/articles/lessons-building-ai-data-analyst).]]>
            </summary>
            <updated>2025-09-17T14:29:48+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/10245</id>
            <title type="text"><![CDATA[Chronon]]></title>
            <link rel="alternate" href="https://github.com/airbnb/chronon" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/10245"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Chronon is a data platform for serving for AI/ML applications. 

Chronon is a platform that abstracts away the complexity of data computation and serving for AI/ML applications. Users define features as transformation of raw data, then Chronon can perform batch and streaming computation, scalable backfills, low-latency serving, guaranteed correctness and consistency, as well as a host of observability and monitoring tools.

It allows you to utilize all of the data within your organization, from batch tables, event streams or services to power your AI/ML projects, without needing to worry about all the complex orchestration that this would usually entail.]]>
            </summary>
            <updated>2025-09-17T11:19:20+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/812</id>
            <title type="text"><![CDATA[CocoIndex]]></title>
            <link rel="alternate" href="https://cocoindex.io/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/812"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Extract, Transform, Index Data. Easy and Fresh.
CocoIndex is the world&amp;#039;s first open-source engine that supports both custom transformation logic and incremental updates specialized for data indexing.

 With CocoIndex, users declare the transformation, CocoIndex creates &amp;amp; maintains an index, and keeps the derived index up to date based on source update, with minimal computation and changes. 

- [CocoIndex @ GitHub](https://github.com/cocoindex-io/cocoindex).]]>
            </summary>
            <updated>2025-08-28T18:14:23+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/1578</id>
            <title type="text"><![CDATA[Data Formulator]]></title>
            <link rel="alternate" href="https://github.com/microsoft/data-formulator" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/1578"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[🪄 Create rich visualizations with AI

Data Formulator is an application from Microsoft Research that uses large language models to transform data, expediting the practice of data visualization.

Data Formulator is an AI-powered tool for analysts to iteratively create rich visualizations. Unlike most chat-based AI tools where users need to describe everything in natural language, Data Formulator combines user interface interactions (UI) and natural language (NL) inputs for easier interaction. This blended approach makes it easier for users to describe their chart designs while delegating data transformation to AI.]]>
            </summary>
            <updated>2025-09-04T09:35:37+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/1736</id>
            <title type="text"><![CDATA[SQLMesh]]></title>
            <link rel="alternate" href="https://sqlmesh.readthedocs.io/en/stable/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/1736"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Efficient data transformation and modeling framework that is backwards compatible with dbt.

SQLMesh is a next-generation data transformation framework designed to ship data quickly, efficiently, and without error. Data teams can efficiently run and deploy data transformations written in SQL or Python with visibility and control at any size.

- [SQLMesh @ GitHub](https://github.com/TobikoData/sqlmesh).

Related contents:

- [Why SQLMesh Might be The Best dbt Alternative @ The Data Toolbox](https://thedatatoolbox.substack.com/p/why-sqlmesh-might-be-the-best-dbt).]]>
            </summary>
            <updated>2025-08-28T20:45:46+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/2222</id>
            <title type="text"><![CDATA[SDF Labs]]></title>
            <link rel="alternate" href="https://www.sdf.com/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/2222"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Data Runs Better on SDF. Transform Data Better with SDF.
SDF is the fastest way to build a scalable, reliable, and optimized data warehouse.

SDF is a developer platform for data that scales SQL understanding across an organization, empowering all data teams to unlock the full potential of their data.

SDF is a multi-dialect SQL compiler, transformation framework, and analytical database engine. It natively compiles SQL dialects, like Snowflake, and connects to their corresponding data warehouses to materialize models.

- [SDF Docs](https://docs.sdf.com/).
- [SDF @ GitHub](https://github.com/sdf-labs/sdf-cli).

Source: [Testing is Not Enough: Transforming Data Quality with Write, Audit, Publish using SDF Build @ SDF Blog](https://blog.sdf.com/p/testing-is-not-enough-transforming).]]>
            </summary>
            <updated>2025-08-28T22:06:34+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/2939</id>
            <title type="text"><![CDATA[Substrait]]></title>
            <link rel="alternate" href="https://substrait.io/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/2939"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Cross-Language Serialization for Relational Algebra.
 A cross platform way to express data transformation, relational algebra, standardized record expression and plans. 

Substrait is a format for describing compute operations on structured data. It is designed for interoperability across different languages and systems.

- [Substrait @ GitHub](https://github.com/substrait-io/substrait).
- [Your November Dose of Data - November 2024 @ Data Council](https://mailchi.mp/datacouncil/october-6416684).]]>
            </summary>
            <updated>2025-08-29T00:07:18+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/3093</id>
            <title type="text"><![CDATA[RSS-lambda]]></title>
            <link rel="alternate" href="https://rss-lambda.xyz/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/3093"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[RSS-lambda transforms RSS feeds without RSS client lock-in.

There are RSS clients that can perform transformations on RSS feeds, e.g. only keep entries with certain keywords, or translate texts of the entries

However, using those features from the RSS clients will create RSS client lock-in that prevents you from moving to another RSS client if you desire

RSS-lambda is an application that perform transformations on the server-side instead so that you can freely move to another RSS client while keeping the transformations. It&amp;#039;s also self-hostable so that you don&amp;#039;t even need to rely on the official server instance!

- [RSS-lambda @ GitHub](https://github.com/sekai-soft/rss-lambda).]]>
            </summary>
            <updated>2025-08-29T00:32:30+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/4323</id>
            <title type="text"><![CDATA[dbt]]></title>
            <link rel="alternate" href="https://www.getdbt.com/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/4323"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Transform Data in Your Warehouse. Build trusted data products faster.

Accelerate your data transformation process with dbt Cloud and start delivering data that you and your team can rely on.  dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications. Analysts using dbt can transform their data by simply writing select statements, while dbt handles turning these statements into tables and views in a data warehouse.

- [dbt Core @ GitHub](https://github.com/dbt-labs/dbt-core).
- [dbt Developer Hub](https://docs.getdbt.com/).

Sources:

- [268 - Résilience de la data - Sammy Teillet @ &amp;lt;ifttd&amp;gt; :fr:](https://www.ifttd.io/episodes/resilience-de-la-data).
- [Optimizing SQL queries for speed with dbt @ DataDuel.co](https://www.dataduel.co/optimizing-sql-queries-for-speed-with-dbt/).
- [Test Driven Development (TDD) with dbt: Test First, SQL Later @ Xebia](https://xebia.com/blog/test-driven-development-tdd-with-dbt/).
- [Understanding dbt: basics and best practices @ Datadog](https://www.datadoghq.com/blog/understanding-dbt/).]]>
            </summary>
            <updated>2025-09-08T13:29:23+00:00</updated>
        </entry>
    </feed>
