qemu-e2k

Commit Graph

Author	SHA1	Message	Date
Alistair Francis	4a2fdb78e7	target/arm: Require alignment for load exclusive According to the ARM ARM exclusive loads require the same alignment as exclusive stores. Let's update the memops used for the load to match that of the store. This adds the alignment requirement to the memops. Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com> Signed-off-by: Alistair Francis <alistair.francis@xilinx.com> Signed-off-by: Richard Henderson <richard.henderson@linaro.org> Message-id: 20170815145714.17635-4-richard.henderson@linaro.org [rth: Require 16-byte alignment for 64-bit LDXP.] Signed-off-by: Richard Henderson <richard.henderson@linaro.org> Signed-off-by: Peter Maydell <peter.maydell@linaro.org>	2017-08-15 17:38:44 +01:00
Richard Henderson	19514cde3b	target/arm: Correct load exclusive pair atomicity We are not providing the required single-copy atomic semantics for the 64-bit operation that is the 32-bit paired load. At the same time, leave the entire 64-bit value in cpu_exclusive_val and stop writing to cpu_exclusive_high. This means that we do not have to re-assemble the 64-bit quantity when it comes time to store. At the same time, drop a redundant temporary and perform all loads directly into the cpu_exclusive_* globals. Tested-by: Alistair Francis <alistair.francis@xilinx.com> Reviewed-by: Alistair Francis <alistair.francis@xilinx.com> Signed-off-by: Richard Henderson <richard.henderson@linaro.org> Message-id: 20170815145714.17635-3-richard.henderson@linaro.org Signed-off-by: Peter Maydell <peter.maydell@linaro.org>	2017-08-15 17:38:44 +01:00
Alistair Francis	955fd0ad5d	target/arm: Correct exclusive store cmpxchg memop mask When we perform the atomic_cmpxchg operation we want to perform the operation on a pair of 32-bit registers. Previously we were just passing the register size in which was set to MO_32. This would result in the high register to be ignored. To fix this issue we hardcode the size to be 64-bits long when operating on 32-bit pairs. Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com> Tested-by: Portia Stephens <portia.stephens@xilinx.com> Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org> Signed-off-by: Alistair Francis <alistair.francis@xilinx.com> Signed-off-by: Richard Henderson <richard.henderson@linaro.org> Message-id: 20170815145714.17635-2-richard.henderson@linaro.org Message-Id: <bc18dddca56e8c2ea4a3def48d33ceb5d21d1fff.1502488636.git.alistair.francis@xilinx.com> Signed-off-by: Richard Henderson <richard.henderson@linaro.org> Signed-off-by: Peter Maydell <peter.maydell@linaro.org>	2017-08-15 16:11:22 +01:00
Emilio G. Cota	e4256c3cbf	target/arm: fix TCG temp leak in aarch64 rev16 Fix a TCG temporary leak in the new aarch64 rev16 handling. Signed-off-by: Emilio G. Cota <cota@braap.org> Reviewed-by: Peter Maydell <peter.maydell@linaro.org> Signed-off-by: Peter Maydell <peter.maydell@linaro.org>	2017-07-24 17:59:28 +01:00
Lluís Vilanova	9c489ea6be	tcg: Pass generic CPUState to gen_intermediate_code() Needed to implement a target-agnostic gen_intermediate_code() in the future. Reviewed-by: David Gibson <david@gibson.dropbear.id.au> Reviewed-by: Richard Henderson <rth@twiddle.net> Reviewed-by: Alex Benneé <alex.benee@linaro.org> Reviewed-by: Emilio G. Cota <cota@braap.org> Signed-off-by: Lluís Vilanova <vilanova@ac.upc.edu> Message-Id: <150002025498.22386.18051908483085660588.stgit@frigg.lan> Signed-off-by: Richard Henderson <rth@twiddle.net>	2017-07-19 14:45:16 -07:00
Richard Henderson	abb1066df3	target/arm: Optimize aarch64 rev16 It is much shorter to reverse all 4 half-words in parallel than extract, reverse, and deposit each in turn. Suggested-by: Aurelien Jarno <aurelien@aurel32.net> Signed-off-by: Richard Henderson <rth@twiddle.net>	2017-07-19 14:45:15 -07:00
Alex Bennée	b29fd33db5	target/arm: use DISAS_EXIT for eret handling Previously DISAS_JUMP did ensure this but with the optimisation of `8a6b28c7` (optimize indirect branches) we might not leave the loop. This means if any pending interrupts are cleared by changing IRQ flags we might never get around to servicing them. You usually notice this by seeing the lookup_tb_ptr() helper gainfully chaining TBs together while cpu->interrupt_request remains high and the exit_request has not been set. This breaks amongst other things the OPTEE test suite which executes an eret from the secure world after a non-secure world IRQ has gone pending which then never gets serviced. Instead of using the previously implied semantics of DISAS_JUMP we use DISAS_EXIT which will always exit the run-loop. CC: Etienne Carriere <etienne.carriere@linaro.org> CC: Joakim Bech <joakim.bech@linaro.org> CC: Jaroslaw Pelczar <j.pelczar@samsung.com> CC: Peter Maydell <peter.maydell@linaro.org> CC: Emilio G. Cota <cota@braap.org> Signed-off-by: Alex Bennée <alex.bennee@linaro.org> Reviewed-by: Richard Henderson <rth@twiddle.net> Message-id: 20170713141928.25419-7-alex.bennee@linaro.org Signed-off-by: Peter Maydell <peter.maydell@linaro.org>	2017-07-17 13:36:07 +01:00
Alex Bennée	0b609cc128	target/arm: use gen_goto_tb for ISB handling While an ISB will ensure any raised IRQs happen on the next instruction it doesn't cause any to get raised by itself. We can therefore use a simple tb exit for ISB instructions and rely on the exit_request check at the top of each TB to deal with exiting if needed. Signed-off-by: Alex Bennée <alex.bennee@linaro.org> Reviewed-by: Richard Henderson <rth@twiddle.net> Message-id: 20170713141928.25419-6-alex.bennee@linaro.org Signed-off-by: Peter Maydell <peter.maydell@linaro.org>	2017-07-17 13:36:07 +01:00
Alex Bennée	e8d5230221	target/arm/translate: make DISAS_UPDATE match declared semantics DISAS_UPDATE should be used when the wider CPU state other than just the PC has been updated and we should therefore exit the TCG runtime and return to the main execution loop rather assuming DISAS_JUMP would do that. Signed-off-by: Alex Bennée <alex.bennee@linaro.org> Reviewed-by: Richard Henderson <rth@twiddle.net> Message-id: 20170713141928.25419-3-alex.bennee@linaro.org Signed-off-by: Peter Maydell <peter.maydell@linaro.org>	2017-07-17 13:36:07 +01:00
Richard Henderson	8da54b2507	target/arm: Exit after clearing aarch64 interrupt mask Exit to cpu loop so we reevaluate cpu_arm_hw_interrupts. Tested-by: Emilio G. Cota <cota@braap.org> Tested-by: Alex Bennée <alex.bennee@linaro.org> Reviewed-by: Emilio G. Cota <cota@braap.org> Reviewed-by: Alex Bennée <alex.bennee@linaro.org> Signed-off-by: Richard Henderson <rth@twiddle.net>	2017-06-19 11:11:26 -07:00
Emilio G. Cota	e75449a346	target/aarch64: optimize indirect branches Measurements: [Baseline performance is that before applying this and the previous commit] - NBench, aarch64-softmmu. Host: Intel i7-4790K @ 4.00GHz 1.7x +-+--------------------------------------------------------------------------------------------------------------+-+ \| \| \| cross \| 1.6x +cross+jr.................................................####...................................................+-+ \| #++# \| \| # # \| 1.5x +-+...................................................****..#...................................................+-+ \| +++* # \| \| * * # \| 1.4x +-+........................................................#...................................................+-+ \| * * # \| \| ##### * * # \| 1.3x +-+................................***+++#................#...................................................+-+ \| ++* # * * # \| \| * * # * * # \| 1.2x +-+.....................................#................#...................................................+-+ \| * * # * * # \| \| #### * * # * * # \| 1.1x +-+.......................+++#..#.......#................#...................................................+-+ \| **** # * * # * * # ***#### \| \| * # * * # * * # **### +++#### *### * # \| 1x +-++-++++++-++++***###++-++++#+++++-+#++**++++++++++#++++-+#++**++#++*###-++++-+#+++-+++#+-++-+ \| ***### * # * * # * * # ++### * * # * * # * * # * ++# * # * * # \| \| * ++# * # * * # * * # * * # * * # * * # * * # * * # * * # * * # \| 0.9x +-+---***###--###---###--####--###--*###--###--*###--###---###--####---+-+ ASSIGNMENT BITFIELD FOURFP EMULATION HUFFMAN LU DECOMPOSITIONNEURAL NUMERIC SORSTRING SORT hmean png: http://imgur.com/qO9ubtk NB. cross here represents the previous commit. - SPECint06 (test set), aarch64-linux-user. Host: Intel i7-4790K @ 4.00GHz 1.5x +-+--------------------------------------------------------------------------------------------------------------+-+ \| *** \| \| +++ jr \| \| * * \| 1.4x +-+.............................................................................................+++............+-+ \| * * \| \| \| ***** * * \| \| \| * * * * ***** \| 1.3x +-+...........................................................................................\|............+-+ \| +++ * * * * * \| * \| \| ***** * * * * +++ \| \| * * * * * * * * \| 1.2x +-+...............................................................................****..................+-+ \| **** * * * * * * * * * * +++ \| \| * * * * * * * * * * * * ***** \| \| * * * * ***** * * * * * * * * * * \| 1.1x +-+....................................................................+++.......................+-+ \| * * * * * * * * * * ***** * * * * * * \| \| * * * * * * * * ***** * * * * * * * * * * \| \| * * ***** * * * * * * * * ****** * * * * * * * * * * \| 1x +-++-++++-++++++++++-++++-+++++-++++++++++-++++-++****+++++-+++++-++++-++++++++++-++++-++-+ \| * * * * * * * * * * * * * +++ * * * * * * * * * * \| \| * * * * * * * * * * * * * * * * * * * * * * * * * * \| \| * * * * * * * * * * * * * * * * * * * * * * * * * * \| 0.9x +-+---***---*----*---*---*---*---**---*---*---*---*----*---*---+-+ astar bzip2 gcc gobmk h264ref hmmlibquantum mcf omnetpperlbench sjengxalancbmk hmean png: http://imgur.com/3Dp4vvq - SPECint06 (train set), aarch64-linux-user. Host: Intel i7-4790K @ 4.00GHz 1.7x +-+--------------------------------------------------------------------------------------------------------------+-+ \| \| \| jr \| 1.6x +-+...............................................................................................+++............+-+ \| *** \| \| +++ \| \| * * \| 1.5x +-+............................................................................................................+-+ \| +++ * * \| \| ***** * * \| 1.4x +-+.....................................................................+++..................................+-+ \| * * * * \| \| ***** * * * * \| \| * * * * ***** * * \| 1.3x +-+......................................................................................................+-+ \| +++ * * * * * * * * \| \| ***** * * * * * * ***** * * \| 1.2x +-+.............................................................................+++..........****...+-+ \| * * * * * * * * * * * +++ \| \| ***** * * ***** * * * * * * * * * * * * \| \| * * * * +++ * * * * * * * * * * * * \| 1.1x +-+............................................................................................+-+ \| * * ***** * * * * * * ***** * * * * * * * * * * \| \| * * * * * * * * * * +++ ****** +++ * * * * * * * * * * \| 1x +-+---***---*----*---*---*---*---**---*---*---*---*----*---***---+-+ astar bzip2 gcc gobmk h264ref hmmlibquantum mcf omnetpperlbench sjengxalancbmk hmean png: http://imgur.com/vRrdc9j Signed-off-by: Emilio G. Cota <cota@braap.org> Signed-off-by: Richard Henderson <rth@twiddle.net>	2017-06-05 09:25:42 -07:00
Emilio G. Cota	e78722368c	target/aarch64: optimize cross-page direct jumps in softmmu Perf numbers in next commit's log. Signed-off-by: Emilio G. Cota <cota@braap.org> Signed-off-by: Richard Henderson <rth@twiddle.net>	2017-06-05 09:25:42 -07:00
Peter Maydell	8bd5c82030	arm: Add support for M profile CPUs having different MMU index semantics The M profile CPU's MPU has an awkward corner case which we would like to implement with a different MMU index. We can avoid having to bump the number of MMU modes ARM uses, because some of our existing MMU indexes are only used by non-M-profile CPUs, so we can borrow one. To avoid that getting too confusing, clean up the code to try to keep the two meanings of the index separate. Instead of ARMMMUIdx enum values being identical to core QEMU MMU index values, they are now the core index values with some high bits set. Any particular CPU always uses the same high bits (so eventually A profile cores and M profile cores will use different bits). New functions arm_to_core_mmu_idx() and core_to_arm_mmu_idx() convert between the two. In general core index values are stored in 'int' types, and ARM values are stored in ARMMMUIdx types. Signed-off-by: Peter Maydell <peter.maydell@linaro.org> Message-id: 1493122030-32191-3-git-send-email-peter.maydell@linaro.org	2017-06-02 11:51:47 +01:00
Nick Reilly	a4f5c5b723	Add missing fp_access_check() to aarch64 crypto instructions The aarch64 crypto instructions for AES and SHA are missing the check for if the FPU is enabled. Signed-off-by: Nick Reilly <nreilly@blackberry.com> Reviewed-by: Peter Maydell <peter.maydell@linaro.org> Signed-off-by: Peter Maydell <peter.maydell@linaro.org>	2017-02-28 12:08:15 +00:00
Alex Bennée	c22edfebff	target-arm: don't generate WFE/YIELD calls for MTTCG The WFE and YIELD instructions are really only hints and in TCG's case they were useful to move the scheduling on from one vCPU to the next. In the parallel context (MTTCG) this just causes an unnecessary cpu_exit and contention of the BQL. Signed-off-by: Alex Bennée <alex.bennee@linaro.org> Reviewed-by: Richard Henderson <rth@twiddle.net> Reviewed-by: Peter Maydell <peter.maydell@linaro.org>	2017-02-24 10:32:46 +00:00
Peter Maydell	9bb6558a21	target/arm: A32, T32: Create Instruction Syndromes for Data Aborts Add support for generating the ISS (Instruction Specific Syndrome) for Data Abort exceptions taken from AArch32. These syndromes are used by hypervisors for example to trap and emulate memory accesses. This is the equivalent for AArch32 guests of the work done for AArch64 guests in commit `aaa1f954d4`. Signed-off-by: Peter Maydell <peter.maydell@linaro.org> Reviewed-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com>	2017-02-07 18:30:00 +00:00
Richard Henderson	86c9ab2776	target/arm: Fix ubfx et al for aarch64 The patch in `59a71b4c5b` suffered from a merge failure when compared to the original patch in http://lists.nongnu.org/archive/html/qemu-devel/2016-12/msg00137.html Signed-off-by: Richard Henderson <rth@twiddle.net>	2017-01-13 09:48:20 -08:00
Richard Henderson	bc21dbcc12	target-arm: Use clrsb helper Reviewed-by: Alex Bennée <alex.bennee@linaro.org> Signed-off-by: Richard Henderson <rth@twiddle.net>	2017-01-10 08:47:48 -08:00
Richard Henderson	7539a012f6	target-arm: Use clz opcode Reviewed-by: Alex Bennée <alex.bennee@linaro.org> Signed-off-by: Richard Henderson <rth@twiddle.net>	2017-01-10 08:06:11 -08:00
Richard Henderson	59a71b4c5b	target-arm: Use new deposit and extract ops Use the new primitives for UBFX and SBFX. Signed-off-by: Richard Henderson <rth@twiddle.net>	2017-01-10 08:06:10 -08:00
Richard Henderson	0a97c40f8e	target-arm: Fix aarch64 disas_ldst_single_struct We add s->be_data within do_vec_ld/st. Adding it here means that we have the wrong bits set in SIZE for a big-endian host, leading to g_assert_not_reached in write_vec_element and read_vec_element. Signed-off-by: Richard Henderson <rth@twiddle.net> Message-id: 1481085020-2614-3-git-send-email-rth@twiddle.net Reviewed-by: Peter Maydell <peter.maydell@linaro.org> Signed-off-by: Peter Maydell <peter.maydell@linaro.org>	2016-12-27 14:59:24 +00:00
Richard Henderson	416d72b97b	target-arm: Fix aarch64 vec_reg_offset Since CPUARMState.vfp.regs is not 16 byte aligned, the ^ 8 fixup used for a big-endian host doesn't do what's intended. Fix this by adding in the vfp.regs offset after computing the inter-register offset. Signed-off-by: Richard Henderson <rth@twiddle.net> Message-id: 1481085020-2614-2-git-send-email-rth@twiddle.net Reviewed-by: Peter Maydell <peter.maydell@linaro.org> Signed-off-by: Peter Maydell <peter.maydell@linaro.org>	2016-12-27 14:59:24 +00:00
Thomas Huth	fcf5ef2ab5	Move target-* CPU file into a target/ folder We've currently got 18 architectures in QEMU, and thus 18 target-xxx folders in the root folder of the QEMU source tree. More architectures (e.g. RISC-V, AVR) are likely to be included soon, too, so the main folder of the QEMU sources slowly gets quite overcrowded with the target-xxx folders. To disburden the main folder a little bit, let's move the target-xxx folders into a dedicated target/ folder, so that target-xxx/ simply becomes target/xxx/ instead. Acked-by: Laurent Vivier <laurent@vivier.eu> [m68k part] Acked-by: Bastian Koppelmann <kbastian@mail.uni-paderborn.de> [tricore part] Acked-by: Michael Walle <michael@walle.cc> [lm32 part] Acked-by: Cornelia Huck <cornelia.huck@de.ibm.com> [s390x part] Reviewed-by: Christian Borntraeger <borntraeger@de.ibm.com> [s390x part] Acked-by: Eduardo Habkost <ehabkost@redhat.com> [i386 part] Acked-by: Artyom Tarasenko <atar4qemu@gmail.com> [sparc part] Acked-by: Richard Henderson <rth@twiddle.net> [alpha part] Acked-by: Max Filippov <jcmvbkbc@gmail.com> [xtensa part] Reviewed-by: David Gibson <david@gibson.dropbear.id.au> [ppc part] Acked-by: Edgar E. Iglesias <edgar.iglesias@xilinx.com> [cris&microblaze part] Acked-by: Guan Xuetao <gxt@mprc.pku.edu.cn> [unicore32 part] Signed-off-by: Thomas Huth <thuth@redhat.com>	2016-12-20 21:52:12 +01:00

... 2 3 4 5 6

273 Commits