Commit Graph

187762 Commits

Author SHA1 Message Date
Segher Boessenkool 2484f7a4b0 rs6000: Don't use r12 for CR save on ELFv2 (PR102107)
CR is saved and/or restored on some paths where GPR12 is already live
since it has a meaning in the calling convention in the ELFv2 ABI.

It is not completely clear to me that we can always use r11 here, but
it does seem safe, there is checking code (to detect conflicts here),
and it is stage 1.  So here goes.

2021-09-03  Segher Boessenkool <segher@kernel.crashing.org>

	PR target/102107
	* config/rs6000/rs6000-logue.c (rs6000_emit_prologue): On ELFv2 use r11
	instead of r12 for CR save, in all cases.
2021-09-03 21:04:23 +00:00
Iain Sandoe addf167a23 coroutines: Support for debugging implementation state.
Some of the state that is associated with the implementation
is of interest to a user debugging a coroutine.  In particular
items such as the suspend point, promise object, and current
suspend point.

These variables live in the coroutine frame, but we can inject
proxies for them into the outermost bind expression of the
coroutine.  Such variables are automatically moved into the
coroutine frame (if they need to persist across a suspend
expression).  Placing the proxies thus allows the user to
inspect them by name in the debugger.

To implement this, we ensure that (at the outermost scope) the
frame entries are not mangled (coroutine frame variables are
usually mangled with scope nesting information so that they do
not clash).  We can safely avoid doing this for the outermost
scope so that we can map frame entries directly to the variables.

This is a partial contribution to debug support (PR 99215).

Signed-off-by: Iain Sandoe <iain@sandoe.co.uk>

gcc/cp/ChangeLog:

	* coroutines.cc (register_local_var_uses): Do not mangle
	frame entries for the outermost scope.  Record the outer
	scope as nesting depth 0.
2021-09-03 19:42:43 +01:00
Iain Sandoe a45a7ecdf3 coroutines: Add a helper for creating local vars.
This is primarily code factoring, but we take this opportunity
to rename some of the implementation variables (which we intend
to expose to debugging) so that they are in the implementation
namespace.

Signed-off-by: Iain Sandoe <iain@sandoe.co.uk>

gcc/cp/ChangeLog:

	* coroutines.cc (coro_build_artificial_var): New.
	(build_actor_fn): Use var builder, rename vars to use
	implementation namespace.
	(coro_rewrite_function_body): Likewise.
	(morph_fn_to_coro): Likewise.
2021-09-03 19:42:31 +01:00
Iain Sandoe 88974974d8 coroutines: Use DECL_VALUE_EXPR instead of rewriting vars.
Variables that need to persist over suspension expressions
must be preserved by being copied into the coroutine frame.

The initial implementations do this manually in the transform
code.  However, that has various disadvantages - including
that the debug connections are lost between the original var
and the frame copy.

The revised implementation makes use of DECL_VALUE_EXPRs to
contain the frame offset expressions, so that the original
var names are preserved in the code.

This process is also applied to the function parms which are
always copied to the frame.  In this case the decls need to be
copied since they are used in two different contexts during
the re-write (in the building of the ramp function, and in
the actor function itself).

This will assist in improvement of debugging (PR 99215).

Signed-off-by: Iain Sandoe <iain@sandoe.co.uk>

gcc/cp/ChangeLog:

	* coroutines.cc (transform_local_var_uses): Record
	frame offset expressions as DECL_VALUE_EXPRs instead of
	rewriting them.
2021-09-03 19:42:09 +01:00
Andrew Pinski 6b69bf5729 Fix target/102173 ICE after error recovery
After the recent r12-3278-823685221de986a change, the testcase
gcc.target/aarch64/sve/acle/general-c/type_redef_1.c started
to ICE as the code was not ready for error_mark_node in the
type.  This fixes that and the testcase now passes.

gcc/ChangeLog:

	* config/aarch64/aarch64-sve-builtins.cc (register_vector_type):
	Handle error_mark_node as the type of the type_decl.
2021-09-03 16:56:33 +00:00
Andrew Pinski 98f1dd0212 Fix some GC issues in the aarch64 back-end.
I got some ICEs in my latest testing while running the libstdc++ testsuite.
The problem appeared to be connected to types.  I had just touched the
builtins code, but nothing there could have caused this, so I looked for
types/variables that were not being marked with GTY.

OK? Bootstrapped and tested on aarch64-linux-gnu with no regressions.

gcc/ChangeLog:

	* config/aarch64/aarch64-builtins.c (struct aarch64_simd_type_info):
	Mark with GTY.
	(aarch64_simd_types): Likewise.
	(aarch64_simd_intOI_type_node): Likewise.
	(aarch64_simd_intCI_type_node): Likewise.
	(aarch64_simd_intXI_type_node): Likewise.
	* config/aarch64/aarch64.h (aarch64_fp16_type_node): Likewise.
	(aarch64_fp16_ptr_type_node): Likewise.
	(aarch64_bf16_type_node): Likewise.
	(aarch64_bf16_ptr_type_node): Likewise.
2021-09-03 16:56:33 +00:00
Aldy Hernandez 8af8abfbba Implement POINTER_DIFF_EXPR entry in range-op.
I've seen cases in the upcoming jump threader enhancements where we see
a difference of two pointers that are known to be equivalent, and yet we
fail to return 0 for the range.  This is because we have no working
range-op entry for POINTER_DIFF_EXPR.  The entry we currently have is
a mere placeholder to avoid ignoring POINTER_DIFF_EXPR's so
adjust_pointer_diff_expr() could get a whack at it here:

//	def = __builtin_memchr (arg, 0, sz)
//	n = def - arg
//
// The range for N can be narrowed to [0, PTRDIFF_MAX - 1].

This patch adds the relational magic to range-op, which we can just
steal from the minus_expr code.

gcc/ChangeLog:

	* range-op.cc (operator_minus::op1_op2_relation_effect): Abstract
	out to...
	(minus_op1_op2_relation_effect): ...here.
	(class operator_pointer_diff): New.
	(operator_pointer_diff::op1_op2_relation_effect): Call
	minus_op1_op2_relation_effect.
	(integral_table::integral_table): Add entry for POINTER_DIFF_EXPR.
2021-09-03 18:40:02 +02:00
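
The two facts that commit 8af8abfbba relies on can be illustrated in plain C++. This is only a source-level sketch, not GCC's range-op code; the function names `offset_of_first_nul`, `diff_of_equal_pointers` and `demo_pointer_diff` are hypothetical and exist only for this illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

// Hypothetical helper: offset of the first NUL byte within the first SZ
// bytes of ARG, or -1 when there is none.  Mirrors the pattern
//   def = __builtin_memchr (arg, 0, sz);  n = def - arg;
std::ptrdiff_t offset_of_first_nul(const char *arg, std::size_t sz)
{
  const char *def = static_cast<const char *>(std::memchr(arg, 0, sz));
  if (def == nullptr)
    return -1;
  std::ptrdiff_t n = def - arg;
  // DEF points into [arg, arg + sz), so N can never be negative.
  assert(n >= 0 && static_cast<std::size_t>(n) < sz);
  return n;
}

// Two pointers known to be equal always differ by zero; this is the
// relation the commit message wants the range machinery to exploit.
std::ptrdiff_t diff_of_equal_pointers(const char *p)
{
  const char *q = p;
  return q - p;
}

// Small usage example for the two helpers above.
void demo_pointer_diff()
{
  assert(offset_of_first_nul("abc", 4) == 3);   // NUL terminator at index 3
  assert(offset_of_first_nul("abc", 3) == -1);  // no NUL in the first 3 bytes
  assert(diff_of_equal_pointers("x") == 0);
}
```

A compiler that tracks the equality relation between `p` and `q` can fold the second function to a constant 0, which is the kind of result the new POINTER_DIFF_EXPR entry is meant to enable.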
Patrick Palka 47543e5f9d c++: shortcut bad convs during overload resolution [PR101904]
In the context of overload resolution we have the notion of a "bad"
argument conversion, which is a conversion that "would be permitted
with a bending of the language standards", and we handle such bad
conversions specially.  In particular, we rank a bad conversion as
better than no conversion but worse than a good conversion, and a bad
conversion doesn't necessarily make a candidate unviable.  With the
flag -fpermissive, we permit the situation where overload resolution
selects a candidate that contains a bad conversion (which we call a
non-strictly viable candidate).  And without the flag, the caller
of overload resolution usually issues a distinct permerror in this
situation instead.

One consequence of this de facto behavior is that in order to distinguish
a non-strictly viable candidate from an unviable candidate, if we
encounter a bad argument conversion during overload resolution we must
keep converting subsequent arguments because a subsequent conversion
could render the candidate unviable instead of just non-strictly viable.
But checking subsequent arguments can force template instantiations and
result in otherwise avoidable hard errors.  And in particular, all
'this' conversions are at worst bad, so this means the const/ref-qualifiers
of a member function can't be used to prune a candidate quickly, which
is the subject of the mentioned PR.

This patch tries to improve the situation without changing the de facto
output of add_candidates.  Specifically, when considering a candidate
during overload resolution this patch makes us shortcut argument
conversion checking upon encountering the first bad conversion
(tentatively marking the candidate as non-strictly viable, though it
could ultimately be unviable) under the assumption that we'll eventually
find a strictly viable candidate anyway (which renders moot the
distinction between non-strictly viable and unviable, since both are
worse than a strictly viable candidate).  If this assumption turns out
to be false, we'll fully reconsider the candidate under the de facto
behavior (without the shortcutting) so that all its conversions are
computed.

So in the best case (there's a strictly viable candidate), we avoid
some argument conversions and/or template argument deduction that may
cause a hard error.  In the worst case (there's no such candidate), we
have to redundantly consider some candidates twice.  (In a previous
version of the patch, to avoid this redundant checking I created a new
"deferred" conversion type that represents a conversion that is yet to
be computed, and instead of reconsidering a candidate I just realized
its deferred conversions.  But this redundancy doesn't seem to be a
significant enough performance issue to justify the added complexity of
that other approach.)

	PR c++/101904

gcc/cp/ChangeLog:

	* call.c (build_this_conversion): New function, split out from
	add_function_candidate.
	(add_function_candidate): New parameter shortcut_bad_convs.
	Document it.  Use build_this_conversion.  Stop at the first bad
	argument conversion when shortcut_bad_convs is true.
	(add_template_candidate_real): New parameter shortcut_bad_convs.
	Use build_this_conversion to check the 'this' conversion before
	attempting deduction.  When the rejection reason code is
	rr_bad_arg_conversion, pass -1 instead of 0 as the viable
	parameter to add_candidate.  Pass 'convs' to add_candidate.
	(add_template_candidate): New parameter shortcut_bad_convs.
	(add_template_conv_candidate): Pass false as shortcut_bad_convs
	to add_template_candidate_real.
	(add_candidates): Prefer to shortcut bad conversions during
	overload resolution under the assumption that we'll eventually
	see a strictly viable candidate.  If this assumption turns out
	to be false, re-process the non-strictly viable candidates
	without shortcutting those bad conversions.

gcc/testsuite/ChangeLog:

	* g++.dg/template/conv17.C: New test.
2021-09-03 11:33:41 -04:00
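
The two-pass strategy described in commit 47543e5f9d can be sketched with a toy model. This is not the code in call.c: the types `Conv`, `Viability` and `Candidate` and the functions `classify`, `add_candidates_sketch` and `demo_shortcut` are hypothetical, and each candidate is reduced to the list of ranks its argument conversions would get if they were computed. The counter shows how many conversions each strategy actually evaluates.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

enum class Conv { Good, Bad, None };                  // rank of one argument conversion
enum class Viability { Strict, NonStrict, Unviable }; // outcome for one candidate
using Candidate = std::vector<Conv>;

// Classify one candidate.  With SHORTCUT_BAD_CONVS we stop at the first bad
// conversion and tentatively call the candidate non-strictly viable, even
// though a later argument might have made it unviable.
Viability classify(const Candidate &cand, bool shortcut_bad_convs, int &computed)
{
  bool saw_bad = false;
  for (Conv c : cand)
    {
      ++computed;  // models the cost of computing one conversion
      if (c == Conv::None)
        return Viability::Unviable;
      if (c == Conv::Bad)
        {
          if (shortcut_bad_convs)
            return Viability::NonStrict;
          saw_bad = true;
        }
    }
  return saw_bad ? Viability::NonStrict : Viability::Strict;
}

// First pass shortcuts bad conversions.  Only if no strictly viable candidate
// turns up are the tentative candidates re-processed without the shortcut.
std::vector<Viability> add_candidates_sketch(const std::vector<Candidate> &cands,
                                             int &computed)
{
  std::vector<Viability> result;
  bool any_strict = false;
  for (const Candidate &c : cands)
    {
      result.push_back(classify(c, true, computed));
      if (result.back() == Viability::Strict)
        any_strict = true;
    }
  if (!any_strict)
    for (std::size_t i = 0; i < cands.size(); ++i)
      if (result[i] == Viability::NonStrict)
        result[i] = classify(cands[i], false, computed);
  return result;
}

// Best case: a strictly viable candidate exists, so the second argument of
// the first candidate is never looked at.
void demo_shortcut()
{
  std::vector<Candidate> cands;
  cands.push_back(Candidate{Conv::Bad, Conv::None});
  cands.push_back(Candidate{Conv::Good, Conv::Good});
  int computed = 0;
  std::vector<Viability> r = add_candidates_sketch(cands, computed);
  assert(r[0] == Viability::NonStrict && r[1] == Viability::Strict);
  assert(computed == 3);
}
```

In the best case the first candidate stays marked non-strictly viable although it is really unviable, which is harmless because the strictly viable candidate wins either way.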
Iain Sandoe 3ccb523bdd libgcc, soft-float: Fix strong_alias macro use for Darwin.
Darwin does not support strong symbol aliases and a work-
around is provided in sfp-machine.h where a second function
is created that simply calls the original.  However this
needs the arguments to the synthesized function to track
the mode of the original function.

So the fix here is to match known floating point modes from
the incoming function and apply the one found to the new
function args.

The matching is highly specific to the current set of modes
and will need adjusting should more cases be added.

Signed-off-by: Iain Sandoe <iain@sandoe.co.uk>

libgcc/ChangeLog:

	* config/i386/sfp-machine.h (alias_HFtype, alias_SFtype,
	alias_DFtype, alias_TFtype): New.
	(ALIAS_SELECTOR): New.
	(strong_alias): Use __typeof and a _Generic selector to
	provide the type to the synthesized function.
2021-09-03 16:25:40 +01:00
Aldy Hernandez 0100555037 Do not assume loop header threading in backward threader.
The registry's thread_through_all_blocks() has a may_peel_loop_headers
argument.  When refactoring the backward threader code, I removed this
argument for the local passthru method because it was always TRUE.  This
may not necessarily be true in the future, if the backward threader is
called from another context.  This patch removes the default definition,
in favor of an argument that is exactly the same as the identically
named function in tree-ssa-threadupdate.c.  I think this also makes it
less confusing when looking at both methods across the source base.

Tested on x86-64 Linux.

gcc/ChangeLog:

	* tree-ssa-threadbackward.c (back_threader::thread_through_all_blocks):
	Add may_peel_loop_headers.
	(back_threader_registry::thread_through_all_blocks): Same.
	(try_thread_blocks): Pass may_peel_loop_headers argument.
	(pass_early_thread_jumps::execute): Same.
2021-09-03 17:22:04 +02:00
Aldy Hernandez 62099645c2 Abstract PHI and forwarder block checks in jump threader.
This patch abstracts out a couple common idioms in the forward
threader that I found useful while navigating the code base.

Tested on x86-64 Linux.

gcc/ChangeLog:

	* tree-ssa-threadedge.c (has_phis_p): New.
	(forwarder_block_p): New.
	(potentially_threadable_block): Call forwarder_block_p.
	(jump_threader::thread_around_empty_blocks): Call has_phis_p.
	(jump_threader::thread_through_normal_block): Call
	forwarder_block_p.
2021-09-03 17:19:54 +02:00
Aldy Hernandez 779275c083 Improve backwards threader debugging dumps.
This patch adds debugging helpers to the backwards threader.  I have
also noticed that profitable_path_p() can bail early on paths that
cross loops and leave the dump of blocks incomplete.  Fixed as
well.

Unfortunately the new methods cannot be marked const, because we call
the solver's dump which is not const.  I believe this was because the
ranger dump calls m_cache.block_range().  This could probably use a
cleanup at a later time.

Tested on x86-64 Linux.

gcc/ChangeLog:

	* tree-ssa-threadbackward.c (back_threader::dump): New.
	(back_threader::debug): New.
	(back_threader_profitability::profitable_path_p): Dump blocks
	even if we are bailing early.
2021-09-03 17:19:54 +02:00
Aldy Hernandez a3ff15afb4 Dump reason why threads are being cancelled and abstract code.
We are inconsistent on dumping out reasons why a thread was canceled.
This makes debugging jump threading problems harder because paths can be
canceled with no reason given.  This patch abstracts out the thread
canceling code and adds a reason for every cancellation.

Tested on x86-64 Linux.

gcc/ChangeLog:

	* tree-ssa-threadupdate.c (cancel_thread): New.
	(jump_thread_path_registry::thread_block_1): Use cancel_thread.
	(jump_thread_path_registry::mark_threaded_blocks): Same.
	(jump_thread_path_registry::register_jump_thread): Same.
2021-09-03 17:19:53 +02:00
Jason Merrill 5ec4990bc7 c++: Avoid bogus -Wunused with recent change
My change to make limit_bad_template_recursion avoid instantiating members
of erroneous classes produced a bogus "used but not defined" warning for
23_containers/unordered_set/instantiation_neg.cc; it's not defined because
we decided not to instantiate it.  So we need to suppress that warning.

gcc/cp/ChangeLog:

	* pt.c (limit_bad_template_recursion): Suppress -Wunused for decls
	we decide not to instantiate.
2021-09-03 11:12:09 -04:00
Tobias Burnus 943c65c449 Fortran: Fix Bind(C) char-len check, add ptr-contiguous check
Add the F2018, 18.3.6 (5) check that pointer + contiguous is not
permitted for dummies in BIND(C) procs.

Fix misreading of F2018, 18.3.4/18.3.5 + 18.3.6 (5) regarding
character dummies passed as byte stream to a bind(C) dummy arg:
Per F2018, 18.3.1 only len=1 is interoperable (since F2003).
F2008 added 'constant expression' for vars (F2018, 18.3.4/18.3.5),
applicable to dummy args per F2018, C1554. I misread this such
that len > 1 is permitted if len is a constant expr.

While the latter would work, as character len=1 a(10) and len=2 a(5)
have the same storage sequence and len is fixed, it is still invalid.
Hence, it is now rejected again.

gcc/fortran/ChangeLog:

	* decl.c (gfc_verify_c_interop_param): Reject pointer with
	CONTIGUOUS attributes as dummy arg. Reject character len > 1
	when passed as byte stream.

gcc/testsuite/ChangeLog:

	* gfortran.dg/bind_c_char_6.f90: Update dg-error.
	* gfortran.dg/bind_c_char_7.f90: Likewise.
	* gfortran.dg/bind_c_char_8.f90: Likewise.
	* gfortran.dg/iso_c_binding_char_1.f90: Likewise.
	* gfortran.dg/pr32599.f03: Likewise.
	* gfortran.dg/bind_c_char_9.f90: Comment testcase bits which are
	implementable but not valid F2018.
	* gfortran.dg/bind_c_contiguous.f90: New test.
2021-09-03 16:28:04 +02:00
Aldy Hernandez 2fcfc03459 Avoid using unavailable objects in jt_state.
The jump threading state is about to get more interesting, and it may
be set up with a ranger or with the const_copies/etc helpers.  This patch
makes sure we have an object before we attempt to call push_marker or
pop_to_marker.

Tested on x86-64 Linux.

gcc/ChangeLog:

	* tree-ssa-threadedge.c (jt_state::push): Only call methods for
	which objects are available.
	(jt_state::pop): Same.
	(jt_state::register_equiv): Same.
	(jt_state::register_equivs_on_edge): Same.
2021-09-03 15:47:34 +02:00
Aldy Hernandez b237eb9dfd Do not release state location until after path registry.
We are popping state and then calling the registry code.  This causes
the registry to have incorrect information.  This isn't visible in
current trunk, but will be an issue when I submit further enhancements
to the threading code.  However, it is a cleanup on its own so I am
pushing it now.

Tested on x86-64 Linux.

gcc/ChangeLog:

	* tree-ssa-threadedge.c (jump_threader::thread_across_edge):
	Move pop until after a thread is registered.
2021-09-03 15:42:22 +02:00
Aldy Hernandez 9fa5ba4c56 Add debug helper for jump thread paths.
Tested on x86-64 Linux.

gcc/ChangeLog:

	* tree-ssa-threadupdate.c (debug): New.
2021-09-03 15:35:46 +02:00
Aldy Hernandez 7200a4424c RAII class to change dump_file.
The function dump_ranger() shows everything the ranger knows at the
current time.  To do this, we tickle all the statements to force ranger
to provide as much information as possible.  During this process, the
relation code will dump status out to the dump_file, whereas in
dump_ranger, we want to dump it out to a specific file (most likely
stderr).  This patch changes the dump_file through the life of
dump_ranger() and resets it when it is done.

This patch only affects dump/debugging code.

Tested on x86-64 Linux.

gcc/ChangeLog:

	* gimple-range-trace.cc (push_dump_file::push_dump_file): New.
	(push_dump_file::~push_dump_file): New.
	(dump_ranger): Change dump_file temporarily while dumping
	ranger.
	* gimple-range-trace.h (class push_dump_file): New.
2021-09-03 15:30:57 +02:00
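
The RAII idea behind commit 7200a4424c can be sketched as follows. This is a minimal stand-alone illustration, not the class added to gimple-range-trace.h: the global `dump_file` here is a local stand-in for GCC's variable of the same name, and `scoped_dump_file`, `report` and `demo_scoped_dump_file` are hypothetical names.

```cpp
#include <cassert>
#include <cstdio>

// Stand-in for the global dump stream; a null pointer means "no dump".
static std::FILE *dump_file = nullptr;

// Redirect dump_file for the lifetime of the object and restore the previous
// value on every exit path, including early returns.
class scoped_dump_file
{
public:
  explicit scoped_dump_file(std::FILE *f) : m_old(dump_file) { dump_file = f; }
  ~scoped_dump_file() { dump_file = m_old; }
  scoped_dump_file(const scoped_dump_file &) = delete;
  scoped_dump_file &operator=(const scoped_dump_file &) = delete;

private:
  std::FILE *m_old;  // value to restore in the destructor
};

// Hypothetical helper that, like the relation code in the message, writes to
// whatever dump_file currently points at.
void report(const char *msg)
{
  if (dump_file)
    std::fprintf(dump_file, "%s\n", msg);
}

// Usage: the redirection is visible only inside the inner scope.
void demo_scoped_dump_file()
{
  assert(dump_file == nullptr);
  {
    scoped_dump_file redirect(stderr);
    assert(dump_file == stderr);
    report("dumping while redirected");
  }
  assert(dump_file == nullptr);  // restored by the destructor
}
```

Tying the restore to the destructor means a dumper with several return paths cannot forget to reset the stream.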
Aldy Hernandez 4db10cbf21 Add function name when dumping ranger contents.
These are minor cleanups to the dumping code.

Tested on x86-64 Linux.

gcc/ChangeLog:

	* gimple-range-trace.cc (debug_seed_ranger): Remove static.
	(dump_ranger): Dump function name.
2021-09-03 15:30:57 +02:00
Aldy Hernandez 410e874263 Use non-null knowledge in path_range_query.
This patch improves ranges for pointers we are interested in along a path, by
using the non-null class from the ranger.  This allows us to thread more
paths with minimal effort.

Tested on x86-64 Linux.

gcc/ChangeLog:

	* gimple-range-path.cc (path_range_query::range_defined_in_block):
	Adjust for non-null.
	(path_range_query::adjust_for_non_null_uses): New.
	(path_range_query::precompute_ranges): Call
	adjust_for_non_null_uses.
	* gimple-range-path.h: Add m_non_null and
	adjust_for_non_null_uses.
2021-09-03 15:30:57 +02:00
Aldy Hernandez 1342891464 Improve path_range_query dumps.
Tested on x86-64 Linux.

gcc/ChangeLog:

	* gimple-range-path.cc (path_range_query::dump): Dump path
	length.
	(path_range_query::precompute_ranges): Dump entire path.
2021-09-03 15:30:57 +02:00
Aldy Hernandez abcd237363 Implement relation_oracle::debug.
Tested on x86-64 Linux.

gcc/ChangeLog:

	* value-relation.cc (relation_oracle::debug): New.
	* value-relation.h (relation_oracle::debug): New.
2021-09-03 15:30:56 +02:00
Aldy Hernandez d2e278e26a Remove unnecessary include from tree-ssa-loop-ch.c
Tested on x86-64 Linux.

gcc/ChangeLog:

	* tree-ssa-loop-ch.c: Remove unnecessary include file.
2021-09-03 15:30:56 +02:00
Aldy Hernandez 5db93cd083 Skip statements with no BB in ranger.
The function postfold_gcond_edges() registers relations coming out of a
GIMPLE_COND.  With upcoming changes, we may be called with statements
not in the IL (for example, dummy statements created by the
forward threader).  This patch avoids breakage by exiting if the
statement does not have a defining basic block.  There is a similar
change to the path solver.

Tested on x86-64 Linux.

gcc/ChangeLog:

	* gimple-range-fold.cc (fold_using_range::postfold_gcond_edges):
	Skip statements with no defining BB.
	* gimple-range-path.cc (path_range_query::range_defined_in_block):
	Do not get confused by statements with no defining BB.
2021-09-03 15:30:56 +02:00
Aldy Hernandez bccf4b88e1 Improve support for IMAGPART_EXPR and REALPART_EXPR in ranger.
Currently we adjust statements containing an IMAGPART_EXPR if the
defining statement was one of a few built-ins known to return boolean
types.  We can also adjust statements for both IMAGPART_EXPR and
REALPART_EXPR where the defining statement is a constant.

This patch adds such support, and cleans up the code a bit.

Tested on x86-64 Linux.

gcc/ChangeLog:

	* gimple-range-fold.cc (adjust_imagpart_expr): Move from
	gimple_range_adjustment.  Add support for constants.
	(adjust_realpart_expr): New.
	(gimple_range_adjustment): Move IMAGPART_EXPR code to
	adjust_imagpart_expr.
	* range-op.cc (integral_table::integral_table): Add entry for
	REALPART_CST.
2021-09-03 15:30:56 +02:00
Tobias Burnus 4ce90454c2 libgomp.*/error-1.{c,f90}: Fix dg-output newline pattern
libgomp/ChangeLog:

	* testsuite/libgomp.c-c++-common/error-1.c: Use \r\n not \n\r in
	dg-output.
	* testsuite/libgomp.fortran/error-1.f90: Likewise.
2021-09-03 15:27:00 +02:00
Eric Botcazou 8d34ffb4e8 Improve compatibility of -fdump-ada-spec with warnings
This makes sure that the style and warning settings used in the
C/C++ bindings generated by -fdump-ada-spec do not leak into the
units that use them.

gcc/c-family/
	* c-ada-spec.c (dump_ads): Generate pragmas to disable style checks
	and -gnatwu warning for the package specification.
2021-09-03 11:19:23 +02:00
Jakub Jelinek 090f0d78f1 openmp: Improve expand_omp_atomic_pipeline
When __atomic_* builtins were introduced, omp-expand.c (omp-low.c
at that point) was adjusted in several spots so that it uses
the atomic builtins instead of sync builtins, but
expand_omp_atomic_pipeline was not, because the __atomic_compare_exchange_*
APIs take the address of the expected argument, so it kept using
__sync_val_compare_and_swap_*.  That means it always uses seq_cst though.
This patch changes it to use the ATOMIC_COMPARE_EXCHANGE ifn which gimple-fold
folds __atomic_compare_exchange_* into - that ifn also passes expected
directly.

2021-09-03  Jakub Jelinek  <jakub@redhat.com>

	* omp-expand.c (expand_omp_atomic_pipeline): Use
	IFN_ATOMIC_COMPARE_EXCHANGE instead of
	BUILT_IN_SYNC_VAL_COMPARE_AND_SWAP_? so that memory order
	can be provided.
2021-09-03 09:54:58 +02:00
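
The compare-and-exchange loop that commit 090f0d78f1 is concerned with can be written at the source level with `std::atomic`. This is a sketch of the general technique only, not of what omp-expand.c emits; `atomic_multiply` and `demo_atomic_multiply` are hypothetical names. Note how the expected value is passed by reference and how the memory order is a parameter, which is what the older `__sync_*` builtins could not express.

```cpp
#include <atomic>
#include <cassert>

// Atomically replace TARGET with TARGET * FACTOR and return the old value.
// ORDER is used for the successful exchange; a failed attempt only needs to
// reload the current value, so relaxed ordering is enough there.
double atomic_multiply(std::atomic<double> &target, double factor,
                       std::memory_order order = std::memory_order_relaxed)
{
  double expected = target.load(std::memory_order_relaxed);
  while (!target.compare_exchange_weak(expected, expected * factor,
                                       order, std::memory_order_relaxed))
    {
      // On failure EXPECTED has been updated to the value currently stored,
      // so the next iteration recomputes the product from fresh data.
    }
  return expected;
}

// Usage example with the default (relaxed) ordering.
void demo_atomic_multiply()
{
  std::atomic<double> x(1.5);
  double old = atomic_multiply(x, 2.0);
  assert(old == 1.5);
  assert(x.load() == 3.0);
}
```

A caller that needs stronger guarantees can pass, for example, `std::memory_order_seq_cst` as the third argument.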
Jakub Jelinek e902136b31 c++, abi: Set DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD on C++ zero width bitfields [PR102024]
The removal of remove_zero_width_bitfields function and its call from
C++ FE layout_class_type (which I've done in the P0466R5
layout-compatible helper intrinsics patch, so that the FE can actually
determine what is and isn't layout-compatible according to the spec)
unfortunately changed the ABI on various platforms.
The C FE has been keeping zero-width bitfields in the types, while
the C++ FE has been removing them after structure layout, so in various
cases when passing such structures in registers we had different ABI
between C and C++.

While both the C and C++ FE had some code to remove zero width bitfields
after structure layout, in both FEs it was buggy and didn't really remove
any.  In the C FE that code has been removed later on, while in the C++ FE
for GCC 4.5 in PR42217 it has been actually fixed, so the C++ FE started
to remove those bitfields.

The following patch doesn't change anything ABI-wise, but allows the
targets to decide what to do, emit -Wpsabi warnings etc.
Non-C zero width bitfields will be seen by the backends as normal
zero width bitfields, C++ zero width bitfields that used to be previously
removed will have DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD flag set.
I've reused the DECL_FIELD_ABI_IGNORED flag which is only used on non-bitfield
FIELD_DECLs right now, but the macros now check DECL_BIT_FIELD flag.

Each backend can then decide what it wants: whether to keep a
different ABI between C and C++ as in GCC 11 and older (i.e. incompatible
with G++ <= 4.4, compatible with G++ 4.5 .. 11), in which case it would
ignore all DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD FIELD_DECLs for the
aggregate passing/returning decisions; whether to never ignore zero width
bitfields (no changes needed for that case, except perhaps a -Wpsabi
warning should be added, for which DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD
can be tested); or whether to always ignore zero width bitfields
(I think e.g. riscv in GCC 10+ does that).

All this patch does is set the flag which the backends can then use.

2021-09-03  Jakub Jelinek  <jakub@redhat.com>

	PR target/102024
gcc/
	* tree.h (DECL_FIELD_ABI_IGNORED): Changed into rvalue only macro
	that is false if DECL_BIT_FIELD.
	(SET_DECL_FIELD_ABI_IGNORED, DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD,
	SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD): Define.
	* tree-streamer-out.c (pack_ts_decl_common_value_fields): For
	DECL_BIT_FIELD stream DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead
	of DECL_FIELD_ABI_IGNORED.
	* tree-streamer-in.c (unpack_ts_decl_common_value_fields): Use
	SET_DECL_FIELD_ABI_IGNORED instead of writing to
	DECL_FIELD_ABI_IGNORED and for DECL_BIT_FIELD use
	SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead.
	* lto-streamer-out.c (hash_tree): For DECL_BIT_FIELD hash
	DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead of DECL_FIELD_ABI_IGNORED.
gcc/cp/
	* class.c (build_base_field): Use SET_DECL_FIELD_ABI_IGNORED
	instead of writing to DECL_FIELD_ABI_IGNORED.
	(layout_class_type): Likewise.  In the place where zero-width
	bitfields used to be removed, use
	SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD on those fields instead.
gcc/lto/
	* lto-common.c (compare_tree_sccs_1): Also compare
	DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD values.
2021-09-03 09:46:32 +02:00
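
For readers unfamiliar with zero-width bit-fields, the source-level construct that commit e902136b31 deals with looks like this. The sketch only shows the declarations and that such types behave normally in ordinary code; the register-passing difference between C and C++ discussed in the message is target-specific and is not observable from this example. All type and function names are hypothetical.

```cpp
#include <cassert>
#include <cstddef>

// Both bit-fields may share one allocation unit.
struct packed_together
{
  unsigned a : 3;
  unsigned b : 3;
};

// The unnamed zero-width bit-field asks for `b` to start in a new
// allocation unit, which typically makes the struct larger.
struct split_by_zero_width
{
  unsigned a : 3;
  unsigned : 0;
  unsigned b : 3;
};

// An aggregate of the kind whose by-value passing a target's ABI may treat
// differently depending on whether the zero-width field is seen.
struct two_floats
{
  float a;
  int : 0;
  float b;
};

// Passed by value on purpose: this is where the calling convention matters.
float sum_fields(two_floats s) { return s.a + s.b; }

split_by_zero_width make_split(unsigned a, unsigned b)
{
  assert(a < 8 && b < 8);  // each field is 3 bits wide
  split_by_zero_width s;
  s.a = a;
  s.b = b;
  return s;
}

// Usage example; exact sizes are ABI-dependent, so only the ordering is checked.
void demo_zero_width()
{
  split_by_zero_width s = make_split(5, 6);
  assert(s.a == 5 && s.b == 6);
  assert(sizeof(split_by_zero_width) >= sizeof(packed_together));
}
```

The caller and callee of `sum_fields` must agree on how `two_floats` is passed, which is why a front end silently dropping the zero-width field changes the ABI on some targets.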
liuhongt de6795bbf5 Remove macro check for __AMX_BF16/INT8/TILE__ in header file.
gcc/ChangeLog:

	PR target/102166
	* config/i386/amxbf16intrin.h: Remove macro check for __AMX_BF16__.
	* config/i386/amxint8intrin.h: Remove macro check for __AMX_INT8__.
	* config/i386/amxtileintrin.h: Remove macro check for __AMX_TILE__.

gcc/testsuite/ChangeLog:

	PR target/102166
	* g++.target/i386/pr102166.C: New test.
2021-09-03 13:04:41 +08:00
GCC Administrator 9f7c2bad52 Daily bump.
2021-09-03 00:16:33 +00:00
Sandra Loosemore 93b6b2f614 libgfortran: Further fixes for GFC/CFI descriptor conversions.
This patch is for:
PR100907 - Bind(c): failure handling wide character
PR100911 - Bind(c): failure handling C_PTR
PR100914 - Bind(c): errors handling complex
PR100915 - Bind(c): failure handling C_FUNPTR
PR100917 - Bind(c): errors handling long double real

All of these problems are related to the GFC descriptors constructed
by the Fortran front end containing ambiguous or incomplete
information.  This patch does not attempt to change the GFC data
structure or the front end, and only makes the runtime interpret it in
more reasonable ways.  It's not a complete fix for any of the listed
issues.

The Fortran front end does not distinguish between C_PTR and
C_FUNPTR, mapping both onto BT_VOID.  That is what this patch does also.

The other bugs are related to GFC descriptors only containing elem_len
and not kind.  For complex types, the elem_len needs to be divided by
2 and then mapped onto a real kind.  On x86 targets, the kind
corresponding to C long double is different than its elem_len; since
we cannot accurately distinguish a 16-byte kind 10 long
double from __float128, this patch arbitrarily prefers to interpret that as
the standard long double type rather than the GNU extension.

Similarly, for character types, the GFC descriptor cannot distinguish
between character(kind=c_char, len=4) and character(kind=ucs4, len=1).
But since the front end currently rejects anything other than len=1
(PR92482) this patch uses the latter interpretation.

2021-09-01  Sandra Loosemore  <sandra@codesourcery.com>
	    José Rui Faustino de Sousa  <jrfsousa@gmail.com>

gcc/testsuite/
	PR fortran/100911
	PR fortran/100915
	PR fortran/100916
	* gfortran.dg/PR100911.c: New file.
	* gfortran.dg/PR100911.f90: New file.
	* gfortran.dg/PR100914.c: New file.
	* gfortran.dg/PR100914.f90: New file.
	* gfortran.dg/PR100915.c: New file.
	* gfortran.dg/PR100915.f90: New file.

libgfortran/
	PR fortran/100907
	PR fortran/100911
	PR fortran/100914
	PR fortran/100915
	PR fortran/100917
	* ISO_Fortran_binding-1-tmpl.h (CFI_type_cfunptr): Make equivalent
	to CFI_type_cptr.
	* runtime/ISO_Fortran_binding.c (cfi_desc_to_gfc_desc): Fix
	handling of CFI_type_cptr and CFI_type_cfunptr.  Additional error
	checking and code cleanup.
	(gfc_desc_to_cfi_desc): Likewise.  Also correct kind mapping
	for character, complex, and long double types.
2021-09-02 16:41:02 -07:00
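
The kind mapping described in commit 93b6b2f614 can be sketched as below. This is an assumption-laden illustration, not the code in runtime/ISO_Fortran_binding.c: the function names are hypothetical, only the x86 case named in the message (a 16-byte element read as kind 10) is modelled, and every other size is assumed to map to a kind equal to its byte count.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical mapping from a real element length in bytes to a kind.
// A 16-byte element is ambiguous on x86 (kind 10 long double stored in
// 16 bytes versus __float128); following the message, prefer long double.
int real_kind_sketch(std::size_t elem_len)
{
  if (elem_len == 16)
    return 10;
  return static_cast<int>(elem_len);  // assumption: kind equals byte size
}

// A complex element stores two reals, so halve the length before mapping.
int complex_kind_sketch(std::size_t elem_len)
{
  assert(elem_len % 2 == 0);
  return real_kind_sketch(elem_len / 2);
}

// Usage example showing why the division has to come first.
void demo_kind_mapping()
{
  assert(real_kind_sketch(8) == 8);
  assert(real_kind_sketch(16) == 10);
  assert(complex_kind_sketch(16) == 8);   // two 8-byte reals, not kind 10
  assert(complex_kind_sketch(32) == 10);  // two 16-byte long doubles
}
```

The third assertion is the interesting one: a 16-byte complex element is a pair of 8-byte reals, so applying the long double rule before halving would give the wrong kind.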
Sandra Loosemore cb17b50541 Fortran: TS 29113 testsuite
Add tests to exercise features added to Fortran via TS 29113, "Further
Interoperability of Fortran with C":

https://wg5-fortran.org/N1901-N1950/N1942.pdf

2021-09-01  Sandra Loosemore  <sandra@codesourcery.com>

	gcc/testsuite/
	* gfortran.dg/c-interop/allocatable-dummy-c.c: New file.
	* gfortran.dg/c-interop/allocatable-dummy.f90: New file.
	* gfortran.dg/c-interop/allocatable-optional-pointer.f90: New file.
	* gfortran.dg/c-interop/allocate-c.c: New file.
	* gfortran.dg/c-interop/allocate-errors-c.c: New file.
	* gfortran.dg/c-interop/allocate-errors.f90: New file.
	* gfortran.dg/c-interop/allocate.f90: New file.
	* gfortran.dg/c-interop/argument-association-assumed-rank-1.f90:
	New file.
	* gfortran.dg/c-interop/argument-association-assumed-rank-2.f90:
	New file.
	* gfortran.dg/c-interop/argument-association-assumed-rank-3.f90:
	New file.
	* gfortran.dg/c-interop/argument-association-assumed-rank-4.f90:
	New file.
	* gfortran.dg/c-interop/argument-association-assumed-rank-5.f90:
	New file.
	* gfortran.dg/c-interop/argument-association-assumed-rank-6.f90:
	New file.
	* gfortran.dg/c-interop/argument-association-assumed-rank-7.f90:
	New file.
	* gfortran.dg/c-interop/argument-association-assumed-rank-8.f90:
	New file.
	* gfortran.dg/c-interop/assumed-type-dummy.f90: New file.
	* gfortran.dg/c-interop/c-interop.exp: New file.
	* gfortran.dg/c-interop/c1255-1.f90: New file.
	* gfortran.dg/c-interop/c1255-2.f90: New file.
	* gfortran.dg/c-interop/c1255a.f90: New file.
	* gfortran.dg/c-interop/c407a-1.f90: New file.
	* gfortran.dg/c-interop/c407a-2.f90: New file.
	* gfortran.dg/c-interop/c407b-1.f90: New file.
	* gfortran.dg/c-interop/c407b-2.f90: New file.
	* gfortran.dg/c-interop/c407c-1.f90: New file.
	* gfortran.dg/c-interop/c516.f90: New file.
	* gfortran.dg/c-interop/c524a.f90: New file.
	* gfortran.dg/c-interop/c535a-1.f90: New file.
	* gfortran.dg/c-interop/c535a-2.f90: New file.
	* gfortran.dg/c-interop/c535b-1.f90: New file.
	* gfortran.dg/c-interop/c535b-2.f90: New file.
	* gfortran.dg/c-interop/c535b-3.f90: New file.
	* gfortran.dg/c-interop/c535c-1.f90: New file.
	* gfortran.dg/c-interop/c535c-2.f90: New file.
	* gfortran.dg/c-interop/c535c-3.f90: New file.
	* gfortran.dg/c-interop/c535c-4.f90: New file.
	* gfortran.dg/c-interop/cf-descriptor-1-c.c: New file.
	* gfortran.dg/c-interop/cf-descriptor-1.f90: New file.
	* gfortran.dg/c-interop/cf-descriptor-2-c.c: New file.
	* gfortran.dg/c-interop/cf-descriptor-2.f90: New file.
	* gfortran.dg/c-interop/cf-descriptor-3-c.c: New file.
	* gfortran.dg/c-interop/cf-descriptor-3.f90: New file.
	* gfortran.dg/c-interop/cf-descriptor-4-c.c: New file.
	* gfortran.dg/c-interop/cf-descriptor-4.f90: New file.
	* gfortran.dg/c-interop/cf-descriptor-5-c.c: New file.
	* gfortran.dg/c-interop/cf-descriptor-5.f90: New file.
	* gfortran.dg/c-interop/cf-descriptor-6-c.c: New file.
	* gfortran.dg/c-interop/cf-descriptor-6.f90: New file.
	* gfortran.dg/c-interop/cf-descriptor-7-c.c: New file.
	* gfortran.dg/c-interop/cf-descriptor-7.f90: New file.
	* gfortran.dg/c-interop/cf-descriptor-8-c.c: New file.
	* gfortran.dg/c-interop/cf-descriptor-8.f90: New file.
	* gfortran.dg/c-interop/cf-out-descriptor-1-c.c: New file.
	* gfortran.dg/c-interop/cf-out-descriptor-1.f90: New file.
	* gfortran.dg/c-interop/cf-out-descriptor-2-c.c: New file.
	* gfortran.dg/c-interop/cf-out-descriptor-2.f90: New file.
	* gfortran.dg/c-interop/cf-out-descriptor-3-c.c: New file.
	* gfortran.dg/c-interop/cf-out-descriptor-3.f90: New file.
	* gfortran.dg/c-interop/cf-out-descriptor-4-c.c: New file.
	* gfortran.dg/c-interop/cf-out-descriptor-4.f90: New file.
	* gfortran.dg/c-interop/cf-out-descriptor-5-c.c: New file.
	* gfortran.dg/c-interop/cf-out-descriptor-5.f90: New file.
	* gfortran.dg/c-interop/cf-out-descriptor-6-c.c: New file.
	* gfortran.dg/c-interop/cf-out-descriptor-6.f90: New file.
	* gfortran.dg/c-interop/contiguous-1-c.c: New file.
	* gfortran.dg/c-interop/contiguous-1.f90: New file.
	* gfortran.dg/c-interop/contiguous-2-c.c: New file.
	* gfortran.dg/c-interop/contiguous-2.f90: New file.
	* gfortran.dg/c-interop/contiguous-3-c.c: New file.
	* gfortran.dg/c-interop/contiguous-3.f90: New file.
	* gfortran.dg/c-interop/deferred-character-1.f90: New file.
	* gfortran.dg/c-interop/deferred-character-2.f90: New file.
	* gfortran.dg/c-interop/dump-descriptors.c: New file.
	* gfortran.dg/c-interop/dump-descriptors.h: New file.
	* gfortran.dg/c-interop/establish-c.c: New file.
	* gfortran.dg/c-interop/establish-errors-c.c: New file.
	* gfortran.dg/c-interop/establish-errors.f90: New file.
	* gfortran.dg/c-interop/establish.f90: New file.
	* gfortran.dg/c-interop/explicit-interface.f90: New file.
	* gfortran.dg/c-interop/fc-descriptor-1-c.c: New file.
	* gfortran.dg/c-interop/fc-descriptor-1.f90: New file.
	* gfortran.dg/c-interop/fc-descriptor-2-c.c: New file.
	* gfortran.dg/c-interop/fc-descriptor-2.f90: New file.
	* gfortran.dg/c-interop/fc-descriptor-3-c.c: New file.
	* gfortran.dg/c-interop/fc-descriptor-3.f90: New file.
	* gfortran.dg/c-interop/fc-descriptor-4-c.c: New file.
	* gfortran.dg/c-interop/fc-descriptor-4.f90: New file.
	* gfortran.dg/c-interop/fc-descriptor-5-c.c: New file.
	* gfortran.dg/c-interop/fc-descriptor-5.f90: New file.
	* gfortran.dg/c-interop/fc-descriptor-6-c.c: New file.
	* gfortran.dg/c-interop/fc-descriptor-6.f90: New file.
	* gfortran.dg/c-interop/fc-descriptor-7-c.c: New file.
	* gfortran.dg/c-interop/fc-descriptor-7.f90: New file.
	* gfortran.dg/c-interop/fc-descriptor-8-c.c: New file.
	* gfortran.dg/c-interop/fc-descriptor-8.f90: New file.
	* gfortran.dg/c-interop/fc-descriptor-9-c.c: New file.
	* gfortran.dg/c-interop/fc-descriptor-9.f90: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-1-c.c: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-1.f90: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-2-c.c: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-2.f90: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-3-c.c: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-3.f90: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-4-c.c: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-4.f90: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-5-c.c: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-5.f90: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-6-c.c: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-6.f90: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-7-c.c: New file.
	* gfortran.dg/c-interop/fc-out-descriptor-7.f90: New file.
	* gfortran.dg/c-interop/ff-descriptor-1.f90: New file.
	* gfortran.dg/c-interop/ff-descriptor-2.f90: New file.
	* gfortran.dg/c-interop/ff-descriptor-3.f90: New file.
	* gfortran.dg/c-interop/ff-descriptor-4.f90: New file.
	* gfortran.dg/c-interop/ff-descriptor-5.f90: New file.
	* gfortran.dg/c-interop/ff-descriptor-6.f90: New file.
	* gfortran.dg/c-interop/ff-descriptor-7.f90: New file.
	* gfortran.dg/c-interop/note-5-3.f90: New file.
	* gfortran.dg/c-interop/note-5-4-c.c: New file.
	* gfortran.dg/c-interop/note-5-4.f90: New file.
	* gfortran.dg/c-interop/optional-c.c: New file.
	* gfortran.dg/c-interop/optional.f90: New file.
	* gfortran.dg/c-interop/rank-class.f90: New file.
	* gfortran.dg/c-interop/rank.f90: New file.
	* gfortran.dg/c-interop/removed-restrictions-1.f90: New file.
	* gfortran.dg/c-interop/removed-restrictions-2.f90: New file.
	* gfortran.dg/c-interop/removed-restrictions-3.f90: New file.
	* gfortran.dg/c-interop/removed-restrictions-4.f90: New file.
	* gfortran.dg/c-interop/section-1-c.c: New file.
	* gfortran.dg/c-interop/section-1.f90: New file.
	* gfortran.dg/c-interop/section-1p.f90: New file.
	* gfortran.dg/c-interop/section-2-c.c: New file.
	* gfortran.dg/c-interop/section-2.f90: New file.
	* gfortran.dg/c-interop/section-2p.f90: New file.
	* gfortran.dg/c-interop/section-3-c.c: New file.
	* gfortran.dg/c-interop/section-3.f90: New file.
	* gfortran.dg/c-interop/section-3p.f90: New file.
	* gfortran.dg/c-interop/section-4-c.c: New file.
	* gfortran.dg/c-interop/section-4.f90: New file.
	* gfortran.dg/c-interop/section-errors-c.c: New file.
	* gfortran.dg/c-interop/section-errors.f90: New file.
	* gfortran.dg/c-interop/select-c.c: New file.
	* gfortran.dg/c-interop/select-errors-c.c: New file.
	* gfortran.dg/c-interop/select-errors.f90: New file.
	* gfortran.dg/c-interop/select.f90: New file.
	* gfortran.dg/c-interop/setpointer-c.c: New file.
	* gfortran.dg/c-interop/setpointer-errors-c.c: New file.
	* gfortran.dg/c-interop/setpointer-errors.f90: New file.
	* gfortran.dg/c-interop/setpointer.f90: New file.
	* gfortran.dg/c-interop/shape.f90: New file.
	* gfortran.dg/c-interop/size.f90: New file.
	* gfortran.dg/c-interop/tkr.f90: New file.
	* gfortran.dg/c-interop/typecodes-array-basic-c.c: New file.
	* gfortran.dg/c-interop/typecodes-array-basic.f90: New file.
	* gfortran.dg/c-interop/typecodes-array-char-c.c: New file.
	* gfortran.dg/c-interop/typecodes-array-char.f90: New file.
	* gfortran.dg/c-interop/typecodes-array-float128-c.c: New file.
	* gfortran.dg/c-interop/typecodes-array-float128.f90: New file.
	* gfortran.dg/c-interop/typecodes-array-int128-c.c: New file.
	* gfortran.dg/c-interop/typecodes-array-int128.f90: New file.
	* gfortran.dg/c-interop/typecodes-array-longdouble-c.c: New file.
	* gfortran.dg/c-interop/typecodes-array-longdouble.f90: New file.
	* gfortran.dg/c-interop/typecodes-sanity-c.c: New file.
	* gfortran.dg/c-interop/typecodes-sanity.f90: New file.
	* gfortran.dg/c-interop/typecodes-scalar-basic-c.c: New file.
	* gfortran.dg/c-interop/typecodes-scalar-basic.f90: New file.
	* gfortran.dg/c-interop/typecodes-scalar-float128-c.c: New file.
	* gfortran.dg/c-interop/typecodes-scalar-float128.f90: New file.
	* gfortran.dg/c-interop/typecodes-scalar-int128-c.c: New file.
	* gfortran.dg/c-interop/typecodes-scalar-int128.f90: New file.
	* gfortran.dg/c-interop/typecodes-scalar-longdouble-c.c: New file.
	* gfortran.dg/c-interop/typecodes-scalar-longdouble.f90: New file.
	* gfortran.dg/c-interop/ubound.f90: New file.
	* lib/target-supports.exp
	(check_effective_target_fortran_real_c_float128): New function.
2021-09-02 16:34:27 -07:00
Jonathan Wakely 89cf858571 libstdc++: Implement std::atomic<T*>::compare_exchange_weak
For some reason r170217 didn't add compare_exchange_weak to the
__atomic_base<T*> partial specialization, and so weak compare exchange
operations on pointers use compare_exchange_strong instead.

This adds __atomic_base<T*>::compare_exchange_weak and then uses it in
std::atomic<T*>::compare_exchange_weak.
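
As a minimal sketch of where the weak form matters for pointers, the retry loop below pushes onto a lock-free stack. The Node type and push function are hypothetical names used for illustration only; they are not part of libstdc++ or of this patch.

```cpp
#include <atomic>
#include <cassert>

// Hypothetical node type for a minimal lock-free stack (illustration only).
struct Node {
  int value;
  Node* next;
};

// Push with the weak form in a retry loop.  A weak compare-exchange may
// fail spuriously, so it is retried; on failure `expected` is updated
// with the value currently stored in `head`.
inline void push(std::atomic<Node*>& head, Node* n) {
  assert(n != nullptr);
  Node* expected = head.load(std::memory_order_relaxed);
  do {
    n->next = expected;
  } while (!head.compare_exchange_weak(expected, n,
                                       std::memory_order_release,
                                       std::memory_order_relaxed));
}
```

Because the call already sits in a loop, a spurious failure only costs one extra iteration, which is why the weak form is the usual choice here.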

Signed-off-by: Jonathan Wakely <jwakely@redhat.com>

libstdc++-v3/ChangeLog:

	* include/bits/atomic_base.h (__atomic_base<P*>::compare_exchange_weak):
	Add new functions.
	* include/std/atomic (atomic<T*>::compare_exchange_weak): Use
	it.
2021-09-02 18:22:33 +01:00
Jonathan Wakely 892400f1f2 libstdc++: Tweak whitespace in <atomic>
Signed-off-by: Jonathan Wakely <jwakely@redhat.com>

libstdc++-v3/ChangeLog:

	* include/std/atomic: Tweak whitespace.
2021-09-02 18:22:32 +01:00
Jonathan Wakely dba1ab2122 libstdc++: Remove "no stronger" assertion in compare exchange [PR102177]
P0418R2 removed some preconditions from std::atomic::compare_exchange_*
but we still enforce them via __glibcxx_assert. This removes those
assertions.
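
A minimal sketch of what the relaxed precondition permits, assuming a C++17 or later library: the failure order may now be stronger than the success order, as long as it is not a release-type order. The helper valid_failure_order and the function claim are hypothetical; they only illustrate the kind of check described here and are not the libstdc++ code.

```cpp
#include <atomic>
#include <cassert>

// Hypothetical helper (not the libstdc++ function): the failure case only
// performs a load, so release and acq_rel make no sense for it.
constexpr bool valid_failure_order(std::memory_order m) {
  return m != std::memory_order_release && m != std::memory_order_acq_rel;
}

// Try to change `flag` from 0 to 1.  The success order is relaxed while the
// failure order is acquire, a combination the old "no stronger" rule
// rejected.  On failure `seen` holds the value that was observed.
inline bool claim(std::atomic<int>& flag, int& seen) {
  static_assert(valid_failure_order(std::memory_order_acquire),
                "acquire is a valid failure order");
  seen = 0;
  return flag.compare_exchange_strong(seen, 1,
                                      std::memory_order_relaxed,
                                      std::memory_order_acquire);
}
```

Older compilers may still warn about a failure order that is stronger than the success order.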

Signed-off-by: Jonathan Wakely <jwakely@redhat.com>

libstdc++-v3/ChangeLog:

	PR c++/102177
	* include/bits/atomic_base.h (__is_valid_cmpexch_failure_order):
	New function to check if a memory order is valid for the failure
	case of compare exchange operations.
	(__atomic_base<I>::compare_exchange_weak): Simplify assertions
	by using __is_valid_cmpexch_failure_order.
	(__atomic_base<I>::compare_exchange_strong): Likewise.
	(__atomic_base<P*>::compare_exchange_weak): Likewise.
	(__atomic_base<P*>::compare_exchange_strong): Likewise.
	(__atomic_impl::compare_exchange_weak): Add assertion.
	(__atomic_impl::compare_exchange_strong): Likewise.
	* include/std/atomic (atomic::compare_exchange_weak): Likewise.
	(atomic::compare_exchange_strong): Likewise.
2021-09-02 18:21:23 +01:00
Jonathan Wakely 5b73abd1a5 libstdc++: Define std::invoke_r for C++23 (P2136R3)
We already supported this feature as std::__invoke<R>, for internal use.
This just adds a public version of it to <functional>.

Internal uses should continue to include <bits/invoke.h> and use
std::__invoke<R> so that they don't need to include all of <functional>.
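
For readers unfamiliar with the feature, here is a rough C++17 sketch of what an invoke-with-return-type helper does. The name invoke_as is hypothetical and the code is not the libstdc++ implementation; it omits the constraints and noexcept handling that the real std::invoke_r has.

```cpp
#include <cassert>
#include <functional>
#include <type_traits>
#include <utility>

// Hypothetical stand-in for the idea behind std::invoke_r<R>: call `f` with
// `args` and convert the result to R, or discard it when R is void.
template <typename R, typename F, typename... Args>
R invoke_as(F&& f, Args&&... args) {
  if constexpr (std::is_void_v<R>)
    std::invoke(std::forward<F>(f), std::forward<Args>(args)...);
  else
    return std::invoke(std::forward<F>(f), std::forward<Args>(args)...);
}
```

With a library that defines __cpp_lib_invoke_r, std::invoke_r<R>(f, args...) from <functional> would be the call to use instead.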

Signed-off-by: Jonathan Wakely <jwakely@redhat.com>

libstdc++-v3/ChangeLog:

	* include/std/functional (invoke_r): Define.
	* include/std/version (__cpp_lib_invoke_r): Define.
	* testsuite/20_util/function_objects/invoke/version.cc: Check
	for __cpp_lib_invoke_r as well as __cpp_lib_invoke.
	* testsuite/20_util/function_objects/invoke/4.cc: New test.
2021-09-02 17:48:45 +01:00
Martin Sebor 9695e1c23b Improve -Wuninitialized note location.
Related:
PR tree-optimization/17506 - warning about uninitialized variable points to wrong location
PR testsuite/37182 - Revision 139286 caused gcc.dg/pr17506.c and gcc.dg/uninit-15.c

gcc/ChangeLog:

	PR tree-optimization/17506
	PR testsuite/37182
	* tree-ssa-uninit.c (warn_uninit): Remove conditional guarding note.

gcc/testsuite/ChangeLog:

	PR tree-optimization/17506
	PR testsuite/37182
	* gcc.dg/diagnostic-tree-expr-ranges-2.c: Add expected output.
	* gcc.dg/uninit-15-O0.c: Remove xfail.
	* gcc.dg/uninit-15.c: Same.
2021-09-02 09:20:09 -06:00
Marcel Vollweiler 5960477a43 Add support for device-modifiers for 'omp target device'.
gcc/testsuite/ChangeLog:

	* gfortran.dg/gomp/target-device-ancestor-4.f90: Comment out dg-final to avoid
	UNRESOLVED.
2021-09-02 06:01:22 -07:00
Richard Biener 483e400870 Refine fix for PR78185, improve LIM for code after inner loops
This refines the fix for PR78185 after understanding that the code
relating to the comment 'In a loop that is always entered we may
proceed anyway.  But record that we entered it and stop once we leave
it.' was supposed to protect us from leaving possibly infinite inner
loops.  The simpler fix of moving the misplaced stopping code
can then be refined to continue processing when the exited inner
loop is finite, improving invariant motion for cases like in the
added testcase.
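
The shape of code this targets can be sketched as follows. This is a hypothetical example, not the added ssa-lim-16.c testcase, and whether GCC actually hoists the expression depends on the version and options in use.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical example: the inner loop has a known, finite trip count, and
// the division after it is invariant in the outer loop.  The division may
// trap, so it can only be moved out of the outer loop if it is known to
// execute on every iteration, including after the inner loop exits.
inline long sum_scaled(const int* a, std::size_t n, int x, int y) {
  assert(y != 0);
  long total = 0;
  for (std::size_t i = 0; i < n; ++i) {
    for (int j = 0; j < 4; ++j)  // finite inner loop
      total += a[i];
    total += x / y;              // invariant, placed after the inner loop
  }
  return total;
}
```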

2021-09-02  Richard Biener  <rguenther@suse.de>

	* tree-ssa-loop-im.c (fill_always_executed_in_1): Refine
	fix for PR78185 and continue processing when leaving
	finite inner loops.

	* gcc.dg/tree-ssa/ssa-lim-16.c: New testcase.
2021-09-02 11:39:27 +02:00
Jakub Jelinek 2af6dd77ea match.pd: Demote IFN_{ADD,SUB,MUL}_OVERFLOW operands [PR99591]
The overflow builtins work on infinite precision integers and then convert
to the result type's precision, so any argument promotions are useless.
The expand_arith_overflow expansion is able to demote the arguments itself
through get_range_pos_neg and get_min_precision calls and if needed promote
to whatever mode it decides to perform the operations in, but if there are
any promotions it demoted, those are already expanded.  Normally combine
would remove the useless sign or zero extensions when it sees the result
of those is only used in a lowpart subreg, but typically those lowpart
subregs appear multiple times in the pattern so that they describe properly
the overflow behavior and combine gives up, so we end up with e.g.
        movswl  %si, %esi
        movswl  %di, %edi
        imulw   %si, %di
        seto    %al
where both movswl insns are useless.

The following patch fixes it by demoting operands of the ifns (only gets
rid of integral to integral conversions that increase precision).
While IFN_{ADD,MUL}_OVERFLOW are commutative and just one simplify would be
enough, IFN_SUB_OVERFLOW is not, therefore two simplifications.
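
A small sketch of why the promotions are useless, assuming GCC or Clang (for __builtin_mul_overflow) and a 16-bit short: the builtin computes the exact product and checks it against the type of the result, so widening the operands first cannot change the outcome. The function names are hypothetical.

```cpp
#include <cassert>

// Multiply two 16-bit values and report whether the exact product fits in
// the 16-bit result.
inline bool mul16(short a, short b, short& out) {
  return __builtin_mul_overflow(a, b, &out);
}

// The same operation with the operands explicitly promoted to int first.
// The result type is still short, so the overflow flag and the stored
// value are the same as in mul16.
inline bool mul16_promoted(short a, short b, short& out) {
  return __builtin_mul_overflow(static_cast<int>(a), static_cast<int>(b),
                                &out);
}
```

Both functions should therefore be able to compile to the same 16-bit multiply without sign extensions.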

2021-09-02  Jakub Jelinek  <jakub@redhat.com>

	PR tree-optimization/99591
	* match.pd: Demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
	were promoted.

	* gcc.target/i386/pr99591.c: New test.
	* gcc.target/i386/pr97950.c: Match or reject setb or jn?b instructions
	together with seta or jn?a.
2021-09-02 11:25:07 +02:00
Richard Biener 1e6267b335 Revert "tree-optimization/102155 - fix LIM fill_always_executed_in CFG walk"
This reverts commit f482bf2af8.
2021-09-02 09:58:46 +02:00
liuhongt b387e664cf libgcc: Enable hfmode soft-sf/df/xf/tf extensions and truncations.
libgcc/ChangeLog:

	* config/i386/32/sfp-machine.h (_FP_NANFRAC_H): New macro.
	* config/i386/64/sfp-machine.h (_FP_NANFRAC_H): Ditto.
	* config/i386/sfp-machine.h (_FP_NANSIGN_H): Ditto.
	* config/i386/t-softfp: Add hf soft-fp.
	* config.host: Add i386/64/t-softfp.
	* config/i386/64/t-softfp: New file.
2021-09-02 14:11:59 +08:00
liuhongt 7cbc870c49 Enable _Float16 type for TARGET_SSE2 and above.
gcc/ChangeLog:

	* config/i386/i386-modes.def (FLOAT_MODE): Define ieee HFmode.
	* config/i386/i386.c (enum x86_64_reg_class): Add
	X86_64_SSEHF_CLASS.
	(merge_classes): Handle X86_64_SSEHF_CLASS.
	(examine_argument): Ditto.
	(construct_container): Ditto.
	(classify_argument): Ditto, and set HFmode/HCmode to
	X86_64_SSEHF_CLASS.
	(function_value_32): Return _Float16/Complex Float16 by
	%xmm0.
	(function_value_64): Return _Float16/Complex Float16 by SSE
	register.
	(ix86_print_operand): Handle CONST_DOUBLE HFmode.
	(ix86_secondary_reload): Require gpr as intermediate register
	to store _Float16 from sse register when sse4 is not
	available.
	(ix86_libgcc_floating_mode_supported_p): Enable _Float16 under
	sse2.
	(ix86_scalar_mode_supported_p): Ditto.
	(TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Defined.
	* config/i386/i386.h (VALID_SSE2_REG_MODE): Add HFmode.
	(VALID_INT_MODE_P): Add HFmode and HCmode.
	* config/i386/i386.md (*pushhf_rex64): New define_insn.
	(*pushhf): Ditto.
	(*movhf_internal): Ditto.
	* doc/extend.texi (Half-Precision Floating Point): Document
	_Float16 for x86.

gcc/lto/ChangeLog:

	* lto-lang.c (lto_type_for_mode): Return float16_type_node
	when mode == TYPE_MODE (float16_type_node).

gcc/testsuite/ChangeLog:

	* gcc.target/i386/sse2-float16-1.c: New test.
	* gcc.target/i386/sse2-float16-2.c: Ditto.
	* gcc.target/i386/sse2-float16-3.c: Ditto.
	* gcc.target/i386/float16-5.c: New test.
2021-09-02 14:11:57 +08:00
liuhongt e42d2d2a20 Update hf soft-fp from glibc.
libgcc/ChangeLog:

	* soft-fp/eqhf2.c: New file.
	* soft-fp/extendhfdf2.c: New file.
	* soft-fp/extendhfsf2.c: New file.
	* soft-fp/half.h (FP_CMP_EQ_H): New macro.
	* soft-fp/truncdfhf2.c: New file.
	* soft-fp/truncsfhf2.c: New file.
2021-09-02 14:11:53 +08:00
Richard Biener f482bf2af8 tree-optimization/102155 - fix LIM fill_always_executed_in CFG walk
This fixes the CFG walk order of fill_always_executed_in to use
RPO order rather than the dominator based order computed by
get_loop_body_in_dom_order.  That fixes correctness issues with
unordered dominator children.

The RPO order computed by rev_post_order_and_mark_dfs_back_seme in
its for-iteration mode is a good match for the algorithm.
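
To illustrate what a reverse post-order walk provides, here is a generic sketch over a small control-flow graph. The Cfg alias and both functions are hypothetical; this is a textbook depth-first RPO, not GCC's rev_post_order_and_mark_dfs_back_seme.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical CFG representation: succs[b] lists the successors of block
// b, and block 0 is the entry block.
using Cfg = std::vector<std::vector<int>>;

// Depth-first walk that records each block after all of its unvisited
// successors, which yields a post-order.
inline void dfs_post_order(const Cfg& succs, int b, std::vector<bool>& seen,
                           std::vector<int>& post) {
  seen[b] = true;
  for (int s : succs[b])
    if (!seen[s])
      dfs_post_order(succs, s, seen, post);
  post.push_back(b);
}

// Reversing the post-order places every block before its successors,
// except along back edges.
inline std::vector<int> reverse_post_order(const Cfg& succs) {
  assert(!succs.empty());
  std::vector<bool> seen(succs.size(), false);
  std::vector<int> post;
  dfs_post_order(succs, 0, seen, post);
  std::reverse(post.begin(), post.end());
  return post;
}
```

In such an order a loop header is reached before the rest of the loop body, which is the property a walk that processes loops as their headers are encountered relies on.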

2021-09-01  Richard Biener  <rguenther@suse.de>

	PR tree-optimization/102155
	* tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
	over a part of the RPO array and do not recurse here.
	Dump blocks marked as always executed.
	(fill_always_executed_in): Walk over the RPO array and
	process loops whose header we run into.
	(loop_invariant_motion_in_fun): Compute the first RPO
	using rev_post_order_and_mark_dfs_back_seme in iteration
	order and pass that to fill_always_executed_in.
2021-09-02 07:55:29 +02:00
YunQiang Su 6e16b2123d Revert "MIPS: add .module mipsREV to all output asm file"
This reverts commit d904008df2.

".module MIPSRev" has higher priority than -march=octeon or the like.
This prevents the assembler from recognizing the extension instructions
of octeon (see pr62030-octeon.c).
2021-09-02 10:17:18 +08:00
GCC Administrator e11c6046f9 Daily bump. 2021-09-02 00:16:59 +00:00
Jeff Law 165446a1e8 Call reduce_vector_comparison_to_scalar_comparison earlier
As noted in the PR, we can get an ICE after the introduction of code to reduce a vector comparison to a scalar.  The problem is we left the operand cache in an inconsistent state because we called the new function too late.   This is trivially fixed by making the transformation before we call update_stmt_if_modified.

The irony here is the whole point of calling reduce_vector_comparison_to_scalar_comparison when we did was to expose these kinds of secondary opportunities.  In this particular case we collapsed the test to a comparison of constants (thus no SSA operands).

Anyway, this fixes the problem in the obvious way.  This may all end up being moot if I can twiddle Richi's match.pd pattern to work.  It doesn't work as-written due to a couple issues that I haven't worked totally through yet.

Installed on the trunk after bootstrap & regression testing on x86 and verifying it addresses the aarch64 issue.

gcc/
	PR tree-optimization/102152
	* tree-ssa-dom.c (dom_opt_dom_walker::optimize_stmt): Reduce a vector
	comparison to a scalar comparison before calling
	update_stmt_if_modified.

gcc/testsuite/
	PR tree-optimization/102152
	* gcc.dg/pr102152.c: New test.
2021-09-01 19:13:58 -04:00