What happens?
Loading hits.parquet into a DuckDB table uses all available memory, and the process is terminated by the OOM killer.
To Reproduce
Allocate a machine with 32 GB RAM, like c6a.4xlarge on AWS, with Ubuntu 22.04.
ssh into that machine.
Run the following commands:
sudo apt-get update
sudo apt-get install python3-pip
pip install duckdb
wget 'https://datasets.clickhouse.com/hits_compatible/hits.parquet'
Create the following run.py file:
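#!/usr/bin/env python3
import duckdb
import timeit
con = duckdb.connect(database='my-db.duckdb', read_only=False)
print("Will load the data")
# timeit.default_timer() reads the clock; timeit.timeit() would time a no-op statement instead.
start = timeit.default_timer()
con.execute("CREATE TABLE hits AS SELECT * FROM parquet_scan('hits.parquet')")
end = timeit.default_timer()
# Elapsed load time in seconds (never reached when the process is killed).
print(end - start)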
Make it executable:
chmod +x run.py
Run it:
./run.py
Wait around 10 minutes...
Will load the data
Killed
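To back up the "Killed" line with numbers, the script can be launched from a small wrapper that reports how the child process ended and its peak resident memory. This is only a sketch using the Python standard library on Linux; the helper names run_and_measure and describe_exit are made up for illustration and are not part of DuckDB.

```python
import resource
import signal
import subprocess
import sys


def run_and_measure(cmd):
    """Run cmd to completion; return (returncode, peak RSS of waited-for children in MiB)."""
    proc = subprocess.run(cmd)
    peak = resource.getrusage(resource.RUSAGE_CHILDREN).ru_maxrss
    # ru_maxrss is reported in kilobytes on Linux and in bytes on macOS.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return proc.returncode, peak / divisor


def describe_exit(returncode):
    """Translate a subprocess return code into a readable description."""
    if returncode < 0:
        # A negative return code means the child was terminated by that signal.
        return f"killed by {signal.Signals(-returncode).name}"
    return f"exited with status {returncode}"


# Example (assumes run.py from the steps above is in the current directory):
#   code, peak_mib = run_and_measure(["./run.py"])
#   print(f"{describe_exit(code)}; peak RSS {peak_mib:.0f} MiB")
```

A process terminated by the kernel OOM killer receives SIGKILL, so the wrapper would be expected to report "killed by SIGKILL" with a peak RSS close to the machine's 32 GB. The kernel log (dmesg) normally records the OOM kill as well.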
Environment (please complete the following information):
- OS: Ubuntu 22.04
- DuckDB Version: 0.4.0
- DuckDB Client: Python
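The environment details above can be collected programmatically, which helps when comparing against another machine. The following is a minimal sketch using only the standard library; environment_report is a hypothetical helper name, and it reads the installed package version from pip metadata rather than from DuckDB itself.

```python
import platform
from importlib import metadata


def environment_report(package="duckdb"):
    """Collect OS, Python, and installed package version as a dict."""
    try:
        pkg_version = metadata.version(package)
    except metadata.PackageNotFoundError:
        pkg_version = "not installed"
    return {
        "os": platform.platform(),
        "python": platform.python_version(),
        package: pkg_version,
    }


# Print the report in the same bullet style as the issue template.
for key, value in environment_report().items():
    print(f"- {key}: {value}")
```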
Identity Disclosure:
- Full Name: Alexey Milovidov
- Affiliation: ClickHouse, Inc
Because of the OOM, DuckDB cannot qualify for the ClickHouse benchmark.
Before Submitting
- Tried on the master branch? No.
- Python: pip install duckdb --upgrade --pre installs the same version, 0.4.0.
- R: I don't use R.
- Other Platforms: I don't use other platforms.