Commit

Attempt to limit memory usage
ddobie committed Feb 13, 2024
1 parent b3e217b commit 41bd240
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion vast_pipeline/pipeline/utils.py
@@ -706,7 +706,7 @@ def parallel_groupby(df: pd.DataFrame) -> pd.DataFrame:
     }
     n_cpu = 10 #cpu_count() - 1 # temporarily hardcode n_cpu
     from dask.distributed import Client
-    client = Client(n_workers=n_cpu, memory_limit="5GB")
+    client = Client(n_workers=n_cpu, memory_limit="3GB")
 
     logger.debug(f"Running parallel_groupby with {n_cpu} CPUs....")
     out = dd.from_pandas(df, n_cpu)
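
Dask's `memory_limit` applies to each worker, so the aggregate ceiling for the local cluster is the worker count times the per-worker limit. The following is a minimal sketch of that arithmetic for the values in this diff, assuming decimal gigabytes; `cluster_memory_budget` is a hypothetical helper for illustration and is not part of `vast_pipeline` or Dask.

```python
GB = 10**9  # decimal gigabyte, assumed here for "5GB" and "3GB"


def cluster_memory_budget(n_workers: int, per_worker_limit_gb: float) -> int:
    """Return the aggregate memory ceiling in bytes for a local cluster.

    Hypothetical helper: it only multiplies the per-worker limit by the
    number of workers. It does not start or inspect any Dask cluster.
    """
    return int(n_workers * per_worker_limit_gb * GB)


n_cpu = 10  # hardcoded worker count taken from the diff
before = cluster_memory_budget(n_cpu, 5)  # memory_limit="5GB"
after = cluster_memory_budget(n_cpu, 3)   # memory_limit="3GB"

print(f"before: {before / GB:.0f} GB total")
print(f"after:  {after / GB:.0f} GB total")
print(f"saved:  {(before - after) / GB:.0f} GB")
```

This is a ceiling rather than a measurement of actual usage. A worker that approaches its limit may spill to disk, pause, or be restarted depending on the Dask configuration, so a lower limit can trade peak memory for slower or retried tasks.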