ParadeDB is an Elasticsearch alternative built on Postgres.
The published benchmarks are based on ParadeDB version v0.8.4.
To run the benchmarks:
- Manually start an AWS EC2 instance
c6a.4xlarge- Ubuntu Server 22.04 LTS (HVM), SSD Volume Type*
- Root 500GB gp2 SSD**
- Wait for the status check to pass, then SSH into the instance via EC2 Instance Connect
- Clone this repository via
git clone https://github.com/ClickHouse/ClickBench - Navigate to the
paradedbdirectory viacd ClickBench/paradedb - Run the benchmark via
./benchmark.sh. This will run the benchmarks against the default settings below.
The benchmark script takes the following parameters:
-w- Type of workload, eithersingleorpartitioned. The default issingle, meaning it uses thehits.parquetClickBench dataset. Thepartitionedoption uses the Clickbench partitioned dataset.
The benchmark should be completed within a few minutes. If you'd like to benchmark against a different version of ParadeDB, modify the Docker tag in the benchmark.sh script. You can find the list of available tags here.