Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

provided example does not use GPU #95

Open
ekg opened this issue Feb 24, 2023 · 1 comment
Open

provided example does not use GPU #95

ekg opened this issue Feb 24, 2023 · 1 comment

Comments

@ekg
Copy link

ekg commented Feb 24, 2023

I'm following your example https://github.com/jerryji1993/DNABERT#22-model-training. I did not use apex as I am unable to compile it under the python3.6 environment. Otherwise, I've exactly followed the provided code.

I do not appear to have any GPU utilization. My system has two V100s and I can confirm that they are functioning based on other tests.

The process has been running nearly a day now, and says "Epoch: 1%". I did not expect the example training to be so slow and not GPU driven...

Is this normal? If not, what am I doing wrong?

@CandideThunder
Copy link

I had a similar issue. Please try a:
echo "import torch;print (torch.cuda.is_available())"|python
that should return 'True'
In my case, it did not. I updated cudatoolkit to the newest version, than everything worked (on gpu).:

 2058  conda remove pytorch torchvision cudatoolkit
 2059  conda install pytorch torchvision cudatoolkit -c pytorch
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants