<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <title>benchmark</title>
    <link rel="self" type="application/atom+xml" href="https://links.biapy.com/guest/tags/452/feed"/>
    <updated>2026-04-22T03:55:34+00:00</updated>
    <id>https://links.biapy.com/guest/tags/452/feed</id>
            <entry>
            <id>https://links.biapy.com/links/12126</id>
            <title type="text"><![CDATA[BIRD-bench]]></title>
            <link rel="alternate" href="https://bird-bench.github.io/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/12126"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[A BIg Bench for Large-Scale Relational Database Grounded Text-to-SQLs.

BIRD (BIg Bench for LaRge-scale Database Grounded Text-to-SQL Evaluation) is a pioneering cross-domain dataset that examines the impact of extensive database contents on text-to-SQL parsing. BIRD contains over 12,751 unique question-SQL pairs and 95 large databases with a total size of 33.4 GB. It covers more than 37 professional domains, such as blockchain, hockey, healthcare, and education.

- [BIRD-SQL @ GitHub](https://github.com/AlibabaResearch/DAMO-ConvAI/tree/main/bird).

Related contents:

- [SQL Is Solved. Here's Where Chat-BI Still Breaks @ Ju Data Engineering Newsletter](https://juhache.substack.com/p/sql-is-solved-heres-where-chat-bi).]]>
            </summary>
            <updated>2026-03-16T07:02:35+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/10262</id>
            <title type="text"><![CDATA[GSO]]></title>
            <link rel="alternate" href="https://gso-bench.github.io/index.html" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/10262"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Challenging Software Optimization Tasks for Evaluating SWE-Agents.

A benchmark for evaluating language models' capabilities in developing high-performance software.

GSO (Global Software Optimization) is a benchmark for evaluating language models' capabilities in developing high-performance software. We present 100+ challenging optimization tasks across 10 codebases spanning diverse domains and programming languages. Each task provides a codebase and a performance test as a precise specification; agents must optimize the codebase and are measured against expert developer commits.

- [GSO @ GitHub](https://github.com/gso-bench/gso).]]>
            </summary>
            <updated>2025-09-18T06:03:30+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/10091</id>
            <title type="text"><![CDATA[minification benchmarks]]></title>
            <link rel="alternate" href="https://github.com/privatenumber/minification-benchmarks" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/10091"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[What's the best JavaScript minifier?

🏃‍♂️🏃‍♀️🏃 JS minification benchmarks: babel-minify, esbuild, terser, uglify-js, swc, google closure compiler, tdewolff/minify, oxc-minify]]>
            </summary>
            <updated>2025-09-09T15:28:15+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/278</id>
            <title type="text"><![CDATA[BenchBase]]></title>
            <link rel="alternate" href="https://db.cs.cmu.edu/projects/benchbase/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/278"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Multi-DBMS SQL Benchmarking Framework via JDBC.

BenchBase (formerly OLTPBench) is a Multi-DBMS SQL Benchmarking Framework via JDBC.

- [BenchBase @ GitHub](https://github.com/cmu-db/benchbase).

Related contents:

- [Making Postgres 42,000x slower because I am unemployed @ ByteofDev](https://byteofdev.com/posts/making-postgres-slow/).]]>
            </summary>
            <updated>2025-12-31T08:19:20+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/2174</id>
            <title type="text"><![CDATA[BenchJS]]></title>
            <link rel="alternate" href="https://benchjs.com/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/2174"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[JavaScript Benchmarking.
Browser-based JavaScript benchmarking tool. 

Run, compare, and share JavaScript benchmarks in your browser.

- [BenchJS @ GitHub](https://github.com/3rd/benchjs).]]>
            </summary>
            <updated>2025-08-28T21:58:28+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/3336</id>
            <title type="text"><![CDATA[Stabilizer]]></title>
            <link rel="alternate" href="https://github.com/ccurtsinger/stabilizer" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/3336"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Statistically Sound Performance Evaluation.

Stabilizer is a system that enables the use of the powerful statistical techniques required for sound performance evaluation on modern architectures. Stabilizer forces executions to sample the space of memory configurations by repeatedly rerandomizing layouts of code, stack, and heap objects at runtime.

- ["Performance Matters" by Emery Berger @ Strange Loop Conference's YouTube](https://www.youtube.com/watch?v=r-TLSBdHe1A).
- [Playing with BOLT and Postgres @ Tomas Vondra](https://vondra.me/posts/playing-with-bolt-and-postgres/).]]>
            </summary>
            <updated>2025-08-29T01:12:57+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/3393</id>
            <title type="text"><![CDATA[tachometer]]></title>
            <link rel="alternate" href="https://github.com/google/tachometer" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/3393"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Statistically rigorous benchmark runner for the web.

tachometer is a tool for running benchmarks in web browsers. It uses repeated sampling and statistics to reliably identify even tiny differences in runtime.

- [Improving rendering performance with CSS content-visibility @ Read the Tea Leaves](https://nolanlawson.com/2024/09/18/improving-rendering-performance-with-css-content-visibility/).]]>
            </summary>
            <updated>2025-08-29T01:21:58+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/3421</id>
            <title type="text"><![CDATA[mitata]]></title>
            <link rel="alternate" href="https://github.com/evanwashere/mitata" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/3421"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[benchmark tooling that loves you ❤️ 

Mitata is a benchmark tooling library for JavaScript and C++ that offers accurate timing down to picoseconds, helpful visualizations, and features like automatic garbage collection and argument handling for benchmarks.]]>
            </summary>
            <updated>2025-08-29T01:26:59+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/3922</id>
            <title type="text"><![CDATA[LLM Benchmark]]></title>
            <link rel="alternate" href="https://llm.aidatatools.com/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/3922"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Benchmark the throughput performance of local large language models (LLMs) running via ollama.

- [llm-benchmark (ollama-benchmark) @ GitHub](https://github.com/aidatatools/ollama-benchmark).]]>
            </summary>
            <updated>2025-08-29T02:50:43+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/4448</id>
            <title type="text"><![CDATA[BrowserBench.org]]></title>
            <link rel="alternate" href="https://browserbench.org/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/4448"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Browser Benchmarks

Speedometer is a browser benchmark that measures the responsiveness of Web applications. It uses demo web applications to simulate user actions such as adding to-do items.

- [Speedometer 3.0: The Best Way Yet to Measure Browser Performance @ WebKit blog](https://webkit.org/blog/15131/speedometer-3-0-the-best-way-yet-to-measure-browser-performance/).]]>
            </summary>
            <updated>2025-08-29T04:18:37+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/5974</id>
            <title type="text"><![CDATA[hyperfine]]></title>
            <link rel="alternate" href="https://github.com/sharkdp/hyperfine" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/5974"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[A command-line benchmarking tool.

Related contents:

- [Episode 636: Engineering the Future @ Linux Unplugged](https://linuxunplugged.com/636).]]>
            </summary>
            <updated>2025-10-14T05:57:29+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/6575</id>
            <title type="text"><![CDATA[HASTY]]></title>
            <link rel="alternate" href="https://hasty.dev/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/6575"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[JS performance - Dev tool.
Benchmark your JS snippets for optimized performance.]]>
            </summary>
            <updated>2025-08-29T10:13:38+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/6712</id>
            <title type="text"><![CDATA[UserBenchmark]]></title>
            <link rel="alternate" href="https://www.userbenchmark.com/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/6712"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[UserBenchmark
Speed test your PC in less than a minute.]]>
            </summary>
            <updated>2025-08-29T10:35:48+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/7725</id>
            <title type="text"><![CDATA[OpenBenchmarking.org]]></title>
            <link rel="alternate" href="http://openbenchmarking.org/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/7725"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[An Open, Collaborative Testing Platform For Benchmarking & Performance Analysis]]>
            </summary>
            <updated>2025-08-29T13:25:18+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/7782</id>
            <title type="text"><![CDATA[Human Benchmark]]></title>
            <link rel="alternate" href="http://www.humanbenchmark.com/dashboard" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/7782"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Test your human brain processing capacities.]]>
            </summary>
            <updated>2025-08-29T13:35:28+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/9665</id>
            <title type="text"><![CDATA[Tsung]]></title>
            <link rel="alternate" href="http://tsung.erlang-projects.org/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/9665"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Tsung is a high-performance benchmark framework for various protocols including HTTP, XMPP, LDAP, etc. 

- [Tsung @ GitHub](https://github.com/processone/tsung).

Related contents:

- [Performance testing your website with Tsung @ L'admin sous GNU / Linux :fr:](https://blog.admin-linux.org/serveurs-web-dapplication/realiser-des-tests-de-performances-de-site-web-avec-tsung).]]>
            </summary>
            <updated>2025-08-29T18:50:14+00:00</updated>
        </entry>
    </feed>
