| Author |
Message |
Topic: Initial value for constant variable in cuda fortran |
tlstar
Replies: 2
Views: 1198
|
Forum: Programming and Compiling Posted: Thu Apr 14, 2011 3:29 pm Subject: Initial value for constant variable in cuda fortran |
Please test following code by pgfortran V11.4:
program test
use kernel
IMPLICIT NONE
double precision, device :: data1(10)
double precision :: data ... |
Topic: optimization errors in cuda fortran |
tlstar
Replies: 2
Views: 1649
|
Forum: Programming and Compiling Posted: Thu Apr 14, 2011 3:23 pm Subject: optimization errors in cuda fortran |
Please test the following code by pgfortran V11.4.
program test
use kernel
IMPLICIT NONE
double precision, device :: data1(10)
double precision :: ... |
Topic: Compute Visual Profiler- global memory throughput |
tlstar
Replies: 1
Views: 1398
|
Forum: Programming and Compiling Posted: Thu Apr 14, 2011 11:05 am Subject: Re: Compute Visual Profiler- global memory throughput |
Hi there,
At the moment one of my kernels in my code runs with an overall global memory throughput of 1.8 GB/s (a little disappointing). As it happens, this is the slowest kernel in my code so I t ... |
Topic: Registers in Fermi (cc2.0) for cuda fortran |
tlstar
Replies: 0
Views: 3768
|
Forum: Programming and Compiling Posted: Thu Apr 14, 2011 7:31 am Subject: Registers in Fermi (cc2.0) for cuda fortran |
I'm using Tesla M2050 (Fermi cc2.0), which should have 32K 32-bit registers. I setup 64 threads per SM, which is quite good value for my case. So the maximum number of registers per threads is 512.
... |
Topic: CUDA Fortran- threads |
tlstar
Replies: 5
Views: 1831
|
Forum: Programming and Compiling Posted: Thu Apr 14, 2011 7:09 am Subject: CUDA Fortran- threads |
Hi crip_crop,
Thanks a lot for kindest reply. Your information are very helpful to me.
Gaofeng |
| |