PGI User Forum

large matrix multiply

 
mcoffey



Joined: 26 Mar 2011
Posts: 16

Posted: Sun Sep 18, 2011 12:57 pm    Post subject: large matrix multiply

I'm trying to multiply large matrices on a CUDA device and want to know if my supposition is correct.
I assume that if a device has limited RAM, the calling routines need to break the matrices into blocks that fit on the device. I'm currently trying to multiply two matrices of around 1.75GB each and am writing a wrapper to do it in blocks. However, are there routines already out there that do this? Has anyone already done this? The idea is to produce a wrapper that automatically breaks a matrix into the number of blocks appropriate to the number of devices and the memory available on each.
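To make the blocking idea concrete, here is a minimal host-side sketch in plain Python. It is not CUDA code and not an existing library routine: `matmul_tile`, `slice_block`, and `blocked_matmul` are hypothetical names, and `matmul_tile` only stands in for whatever device GEMM call a real wrapper would make after copying a pair of tiles to the GPU. The `block` size is an assumed parameter that would be chosen from the device's free memory.

```python
def matmul_tile(a_tile, b_tile):
    """Stand-in for a device GEMM on one pair of tiles (hypothetical)."""
    rows, inner, cols = len(a_tile), len(b_tile), len(b_tile[0])
    return [[sum(a_tile[i][k] * b_tile[k][j] for k in range(inner))
             for j in range(cols)]
            for i in range(rows)]


def slice_block(m, r0, r1, c0, c1):
    """Copy the sub-matrix m[r0:r1, c0:c1] out of a list-of-lists matrix."""
    return [row[c0:c1] for row in m[r0:r1]]


def blocked_matmul(a, b, block):
    """Compute a @ b by accumulating products of block x block tiles."""
    n, k, m = len(a), len(b), len(b[0])
    c = [[0.0] * m for _ in range(n)]
    for i0 in range(0, n, block):
        i1 = min(i0 + block, n)
        for j0 in range(0, m, block):
            j1 = min(j0 + block, m)
            for k0 in range(0, k, block):
                k1 = min(k0 + block, k)
                # In a real wrapper, these two tiles would be copied to the
                # device, multiplied there, and the partial result copied back.
                partial = matmul_tile(slice_block(a, i0, i1, k0, k1),
                                      slice_block(b, k0, k1, j0, j1))
                for i in range(i1 - i0):
                    for j in range(j1 - j0):
                        c[i0 + i][j0 + j] += partial[i][j]
    return c


if __name__ == "__main__":
    a = [[1, 2, 3], [4, 5, 6]]
    b = [[7, 8], [9, 10], [11, 12]]
    print(blocked_matmul(a, b, block=2))
```

Only three tiles (one from each input and one partial result) need to be resident at a time, which is the property a block wrapper relies on. Spreading the work over several devices could be done by handing different `(i0, j0)` output tiles to different devices, since those tiles are independent of one another.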

I'm trying to multiply and then invert a matrix of 30,000 x 50,000 elements at full precision.
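As a rough sizing aid for a matrix of these dimensions, the sketch below estimates the storage it needs and how many row blocks it would have to be split into for a given device memory budget. It assumes "full precision" means 8-byte double precision, which the post does not state, and the 1 GiB budget in the example is purely illustrative. `matrix_bytes` and `row_blocks_needed` are hypothetical helper names.

```python
import math


def matrix_bytes(rows, cols, bytes_per_element=8):
    """Storage for a dense rows x cols matrix (8 bytes = double, assumed)."""
    return rows * cols * bytes_per_element


def row_blocks_needed(rows, cols, device_bytes, bytes_per_element=8):
    """Number of row blocks so that each block fits in device_bytes."""
    rows_per_block = device_bytes // (cols * bytes_per_element)
    if rows_per_block == 0:
        raise ValueError("a single row does not fit in the given budget")
    return math.ceil(rows / rows_per_block)


if __name__ == "__main__":
    total = matrix_bytes(30_000, 50_000)
    print(total)  # 12000000000 bytes under the 8-byte assumption
    # Illustrative budget of 1 GiB per block; a real wrapper would query
    # the device for its free memory instead of hard-coding a value.
    print(row_blocks_needed(30_000, 50_000, 2**30))
```

With 4-byte single precision the total halves, so the element size is worth pinning down before choosing a block count.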

Thanks for any guidance.
All times are GMT - 7 Hours