Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cuda blas backward compatibility #1747

Open
wants to merge 6 commits into
base: master
Choose a base branch
from

Conversation

borg323
Copy link
Member

@borg323 borg323 commented May 4, 2022

@ankan-ban is there a reason to use cublasGemmStridedBatchedEx() here?

@ankan-ban
Copy link
Member

I remember seeing some performance difference on RTX GPUs using the "Ex" vs the non-Ex versions (the Ex version was faster). Not sure if that bug is still present.

@mooskagh
Copy link
Member

Should we maybe compile two versions and then ask people on Discord with various GPUs to compare benchmarks?

@Naphthalin Naphthalin added the testing required Feature/bug fix needs more testing. Implies not for merge. label Nov 3, 2022
@borg323
Copy link
Member Author

borg323 commented Dec 23, 2022

Switched it to check CC at runtime like in other uses of cublasGemmStridedBatchedEx(). Should fix #1820.

@borg323 borg323 requested a review from ankan-ban December 23, 2022 23:17
@borg323
Copy link
Member Author

borg323 commented Jun 28, 2023

Reported to work with a GTX660.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
testing required Feature/bug fix needs more testing. Implies not for merge.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants