I have been able to use the GPU backend for DMRG studies, and some preliminary benchmarking shows that it becomes significantly faster than the QN conserving CPU backend as the bond dimension increases (since the GPU backend does not support QN conservation at this moment). However, with high bond dimensions, I quickly run out of memory on a single GPU. What are my options for addressing this limitation? Are there approaches that may reduce the memory footprint? Is there a way to distribute the computation over multiple GPUs?
I haven’t tested it in conjunction with the GPU code, but you could try out the write-to-disk feature we have for DMRG, which stores tensors that are not immediately needed in the calculation (like the environment tensors) on disk to reduce memory usage. You can enable it by setting the keyword argument `write_when_maxdim_exceeds` (see the DMRG docs: DMRG · ITensors.jl). I’m curious whether that works with the GPU backend, so please report back and let us know!
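For reference, here is a minimal sketch of what enabling that option looks like in a plain CPU DMRG run. The model, sweep schedule, and threshold value below are just illustrative placeholders, not recommendations, and as noted above this is untested in combination with the GPU backend:

```julia
using ITensors, ITensorMPS

# Illustrative system: a spin-1 Heisenberg chain
N = 100
sites = siteinds("S=1", N)

os = OpSum()
for j in 1:(N - 1)
  os += "Sz", j, "Sz", j + 1
  os += 1/2, "S+", j, "S-", j + 1
  os += 1/2, "S-", j, "S+", j + 1
end
H = MPO(os, sites)
psi0 = random_mps(sites; linkdims=10)

# Once the requested maxdim exceeds 500 (a placeholder threshold),
# environment tensors get written to disk instead of kept in memory:
energy, psi = dmrg(H, psi0;
  nsweeps=10,
  maxdim=[100, 200, 500, 1000, 2000],
  cutoff=1e-10,
  write_when_maxdim_exceeds=500)
```

The idea is to pick the threshold so that the early, cheap sweeps stay fully in memory and only the large-bond-dimension sweeps pay the disk I/O cost.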
We plan to start working on QN-conserving calculations on GPU soon, which should help with memory. We would also like to investigate multi-GPU calculations, but that is a longer-term project.