gcc/libgomp
Tom de Vries 4cdfee3f20 [libgomp, nvptx] Handle per-function max-threads-per-block in default dims
Currently parallel-loop-1.c fails at -O0 on a Quadro M1200, because one of the
kernel launch configurations exceeds the resources available in the device, due
to the default dimensions chosen by the runtime.

This patch fixes that by ensuring that the default worker and vector
dimensions do not exceed the per-function max_threads_per_block.
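
The clamping described above can be sketched as follows. This is a minimal
illustration, not the code from plugin/plugin-nvptx.c: the struct
launch_dims and the function clamp_default_dims are hypothetical names, and
only MIN, MAX and max_threads_per_block come from the ChangeLog entry. It
assumes that one block uses workers * vectors threads.

```c
#include <assert.h>

#define MIN(X, Y) ((X) < (Y) ? (X) : (Y))
#define MAX(X, Y) ((X) > (Y) ? (X) : (Y))

/* Hypothetical container for the launch dimensions.  */
struct launch_dims
{
  int gangs;
  int workers;
  int vectors;
};

/* Return D with the worker and vector dimensions reduced so that
   workers * vectors does not exceed MAX_THREADS_PER_BLOCK, the
   per-function limit.  Gangs are left unchanged.  Assumption: a block
   runs workers * vectors threads.  */
static struct launch_dims
clamp_default_dims (struct launch_dims d, int max_threads_per_block)
{
  assert (max_threads_per_block > 0);

  /* Cap the vector length first, keeping it at least 1.  */
  d.vectors = MAX (1, MIN (d.vectors, max_threads_per_block));

  /* Give the workers whatever is left of the per-block budget.  */
  d.workers = MAX (1, MIN (d.workers, max_threads_per_block / d.vectors));

  return d;
}
```

With default dims of 32 workers and 32 vectors, a function limited to 512
threads per block would be launched with 16 workers and 32 vectors in this
sketch, while a function with a limit of 1024 keeps the defaults.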

2018-07-30  Tom de Vries  <tdevries@suse.de>

	* plugin/plugin-nvptx.c (MIN, MAX): Redefine.
	(nvptx_exec): Ensure worker and vector default dims don't exceed
	targ_fn->max_threads_per_block.

From-SVN: r263062
2018-07-30 08:17:26 +00:00
config
plugin [libgomp, nvptx] Handle per-function max-threads-per-block in default dims 2018-07-30 08:17:26 +00:00
testsuite re PR middle-end/86660 (libgomp.c++/for-15.C ICEs with nvptx offloading) 2018-07-26 18:12:58 +02:00
acinclude.m4
aclocal.m4
affinity.c
alloc.c
atomic.c
barrier.c
ChangeLog [libgomp, nvptx] Handle per-function max-threads-per-block in default dims 2018-07-30 08:17:26 +00:00
ChangeLog.graphite
config.h.in
configure
configure.ac
configure.tgt
critical.c
env.c
error.c
fortran.c
hashtab.h
icv-device.c
icv.c
iter_ull.c
iter.c
libgomp_f.h.in
libgomp_g.h
libgomp-plugin.c
libgomp-plugin.h
libgomp.h
libgomp.map
libgomp.spec.in
libgomp.texi libgomp.texi (Top): Move www.openmp.org to https. 2018-06-24 20:38:14 +00:00
lock.c
loop_ull.c
loop.c
Makefile.am
Makefile.in
oacc-async.c
oacc-cuda.c
oacc-host.c
oacc-init.c
oacc-int.h
oacc-mem.c
oacc-parallel.c
oacc-plugin.c
oacc-plugin.h
omp_lib.f90.in
omp_lib.h.in
omp.h.in
openacc_lib.h
openacc.f90
openacc.h
ordered.c
parallel.c
priority_queue.c
priority_queue.h
sections.c
secure_getenv.h
single.c
splay-tree.c
splay-tree.h
target.c
task.c
taskloop.c
team.c
work.c