Michael Wolfe
Joined: 19 Jan 2010 Posts: 36
|
Posted: Wed Aug 08, 2012 3:40 pm Post subject: |
|
|
In the first place, we have to link against the driver API, not the runtime API.
In the second place, these aren't callbacks; they would require polling or waiting for the event to finish, and neither of those is asynchronous from the host code's point of view.
We are looking at reimplementing async support and perhaps getting rid of the auxiliary thread, but we're not sure about the effect on performance. The real problem is that truly asynchronous data copies require pinned host memory, and the interface for pinning host data is not composable: for example, the user program and the OpenACC runtime may both try to pin and unpin the same memory. |
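To make the composability point concrete, here is a minimal sketch in Python. It is a toy model only, not the CUDA driver API or the OpenACC runtime: `ToyPinner`, `RefCountedPinner`, and `demo_conflict` are hypothetical names invented for illustration, and the "fail on double pin" behaviour is an assumption about how a non-composable pin/unpin interface behaves when two owners touch the same buffer.

```python
class ToyPinner:
    """Hypothetical stand-in for a non-composable pin/unpin interface."""

    def __init__(self):
        self.pinned = set()  # addresses currently pinned

    def pin(self, addr):
        # Assumption: pinning an already-pinned buffer is an error.
        if addr in self.pinned:
            raise RuntimeError("buffer already pinned")
        self.pinned.add(addr)

    def unpin(self, addr):
        if addr not in self.pinned:
            raise RuntimeError("buffer not pinned")
        self.pinned.remove(addr)


class RefCountedPinner:
    """Hypothetical wrapper that counts owners so pin/unpin pairs can nest."""

    def __init__(self, backend):
        self.backend = backend
        self.counts = {}  # address -> number of outstanding pins

    def pin(self, addr):
        count = self.counts.get(addr, 0)
        if count == 0:
            self.backend.pin(addr)  # first owner really pins
        self.counts[addr] = count + 1

    def unpin(self, addr):
        count = self.counts.get(addr, 0)
        if count == 0:
            raise RuntimeError("unpin without matching pin")
        if count == 1:
            self.backend.unpin(addr)  # last owner really unpins
            del self.counts[addr]
        else:
            self.counts[addr] = count - 1


def demo_conflict():
    """User code and a runtime both pin the same buffer through the raw interface."""
    raw = ToyPinner()
    raw.pin(0x1000)          # the user program pins its buffer
    try:
        raw.pin(0x1000)      # the runtime tries to pin the same buffer
    except RuntimeError:
        return "conflict"
    return "ok"
```

In this model the raw interface reports a conflict as soon as a second owner pins the buffer, while the reference-counted wrapper lets both owners pin and unpin in any order. The catch is that the wrapper only helps if every party goes through it, and a runtime cannot assume that of user code that pins memory directly.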
|
Alexey A. Romanenko
Joined: 17 Feb 2012 Posts: 31
|
Posted: Fri Feb 22, 2013 12:59 am Post subject: |
|
|
Hi guys,
I'm now working with 13.1 and 13.2. It looks like the situation with overlapping data transfers and kernel execution has not changed significantly. Could you please comment on the future of this feature?
WBR,
Alexey |
|
mkcolg
Joined: 30 Jun 2004 Posts: 4996 Location: The Portland Group Inc.
|
Posted: Fri Feb 22, 2013 9:22 am Post subject: |
|
|
Hi Alexey,
Sorry, nothing has changed with regard to the synchronization behaviour. The same challenges outlined by Michael still exist, and they are still investigating solutions. There is no time frame for when this will be improved.
- Mat |
|