gcc/libgomp
Cesar Philippidis bd9b3d3d1a [nvptx] Use CUDA driver API to select default runtime launch geometry
The CUDA driver API starting version 6.5 offers a set of runtime functions to
calculate several occupancy-related measures, as a replacement for the occupancy
calculator spreadsheet.

This patch adds a heuristic for default runtime launch geometry, based on the
new runtime function cuOccupancyMaxPotentialBlockSize.

Build on x86_64 with nvptx accelerator and ran libgomp testsuite.

2018-08-13  Cesar Philippidis  <cesar@codesourcery.com>
	    Tom de Vries  <tdevries@suse.de>

	PR target/85590
	* plugin/cuda/cuda.h (CUoccupancyB2DSize): New typedef.
	(cuOccupancyMaxPotentialBlockSize): Declare.
	* plugin/cuda-lib.def (cuOccupancyMaxPotentialBlockSize): New
	CUDA_ONE_CALL_MAYBE_NULL.
	* plugin/plugin-nvptx.c (CUDA_VERSION < 6050): Define
	CUoccupancyB2DSize and declare
	cuOccupancyMaxPotentialBlockSize.
	(nvptx_exec): Use cuOccupancyMaxPotentialBlockSize to set the
	default num_gangs and num_workers when the driver supports it.

Co-Authored-By: Tom de Vries <tdevries@suse.de>

From-SVN: r263505
2018-08-13 12:04:24 +00:00
..
config [libgomp] Truncate config/nvptx/oacc-parallel.c 2018-08-01 13:01:45 -07:00
plugin [nvptx] Use CUDA driver API to select default runtime launch geometry 2018-08-13 12:04:24 +00:00
testsuite [nvptx] Ignore c++ exceptions 2018-08-02 15:59:01 +00:00
ChangeLog [nvptx] Use CUDA driver API to select default runtime launch geometry 2018-08-13 12:04:24 +00:00
ChangeLog.graphite
Makefile.am
Makefile.in re PR fortran/84381 (replace non-std 'call abort' by 'stop 1' in gfortran testsuite) 2018-03-25 16:00:52 +00:00
acinclude.m4
aclocal.m4
affinity.c
alloc.c
atomic.c
barrier.c
config.h.in
configure [libgomp, nvptx, --without-cuda-driver] Don't use system cuda driver 2018-08-04 20:07:22 +00:00
configure.ac
configure.tgt
critical.c
env.c [openacc] Move GOMP_OPENACC_DIM parsing out of nvptx plugin 2018-05-02 17:53:56 +00:00
error.c
fortran.c
hashtab.h
icv-device.c
icv.c
iter.c
iter_ull.c
libgomp-plugin.c
libgomp-plugin.h
libgomp.h Update OpenACC data clause semantics to the 2.5 behavior 2018-06-20 09:35:15 -07:00
libgomp.map Update OpenACC data clause semantics to the 2.5 behavior 2018-06-20 09:35:15 -07:00
libgomp.spec.in
libgomp.texi libgomp.texi (Top): Move www.openmp.org to https. 2018-06-24 20:38:14 +00:00
libgomp_f.h.in
libgomp_g.h
lock.c
loop.c
loop_ull.c
oacc-async.c [openacc] Factor out async argument utility functions 2018-05-09 13:52:49 +00:00
oacc-cuda.c [openacc] Factor out async argument utility functions 2018-05-09 13:52:49 +00:00
oacc-host.c
oacc-init.c
oacc-int.h [openacc] Factor out async argument utility functions 2018-05-09 13:52:49 +00:00
oacc-mem.c Update OpenACC data clause semantics to the 2.5 behavior 2018-06-20 09:35:15 -07:00
oacc-parallel.c Update OpenACC data clause semantics to the 2.5 behavior 2018-06-20 09:35:15 -07:00
oacc-plugin.c [openacc] Move GOMP_OPENACC_DIM parsing out of nvptx plugin 2018-05-02 17:53:56 +00:00
oacc-plugin.h [openacc] Move GOMP_OPENACC_DIM parsing out of nvptx plugin 2018-05-02 17:53:56 +00:00
omp.h.in
omp_lib.f90.in
omp_lib.h.in
openacc.f90 Update OpenACC data clause semantics to the 2.5 behavior 2018-06-20 09:35:15 -07:00
openacc.h Update OpenACC data clause semantics to the 2.5 behavior 2018-06-20 09:35:15 -07:00
openacc_lib.h Update OpenACC data clause semantics to the 2.5 behavior 2018-06-20 09:35:15 -07:00
ordered.c
parallel.c
priority_queue.c
priority_queue.h
sections.c
secure_getenv.h
single.c
splay-tree.c
splay-tree.h
target.c Update OpenACC data clause semantics to the 2.5 behavior 2018-06-20 09:35:15 -07:00
task.c
taskloop.c
team.c
work.c