Skip to main content

Query 1B Rows in PostgreSQL >25x Faster with Squirrels!

· 5 min read
Tim Huang
Co-Founder of Squirrels Analytics

The One Billion Row Challenge has been making waves in the data engineering community lately. Originally created to test CSV parsing performance, the challenge involves processing a file containing 1 billion weather measurements to calculate basic temperature statistics for each city. In this post, I'll tackle a variation of this challenge using PostgreSQL and demonstrate how to achieve dramatic performance improvements using Squirrels.

The Challenge​

The original One Billion Row Challenge focuses on raw CSV processing performance. For our variation, we'll:

  1. Load 1 billion rows into PostgreSQL with additional columns
  2. Query for city-level temperature statistics
  3. Create a Squirrels project to serve these analytics via REST API
  4. Demonstrate significant query performance improvements
  5. Show how to handle incremental data updates

Squirrels - What Problems Does It Solve?

· 5 min read
Tim Huang
Co-Founder of Squirrels Analytics

Squirrels is a REST API framework for data analytics. APIs built with this framework return datasets, dashboards, or parameters for datasets/dashboards. APIs for authentication, data catalog, and testing UI are also included.

Problems Solved​

The following are details on the problems that Squirrels is meant to solve:

Welcome to Squirrels

· 2 min read
Tim Huang
Co-Founder of Squirrels Analytics

Welcome to the official documentation website for Squirrels, an open-source project framework for building parameterizable REST APIs for dynamic data analytics! This is an easy-to-use framework for anyone who is familiar with SQL and YAML.