otp.git - Mirror of Erlang/OTP repository.

Age	Commit message (Collapse)	Author
2016-09-09	Merge branch 'sverker/hipe-speedy-reg-alloc/PR-1159/OTP-13879'	Sverker Eriksson
	* sverker/hipe-speedy-reg-alloc/PR-1159: hipe: Refactor ra callbacks to accept context arg hipe: Reuse liveness between regalloc iterations hipe: Add ra_partitioned to o1 and up hipe_regalloc_prepass: Change splitting heuristic hipe: Make sure prepass temps are below SpillLimit hipe_regalloc_prepass: Rename coloring collisions hipe_ppc: Add code rewrite RA callbacks hipe_sparc: Add code rewrite RA callbacks hipe_arm: Add code rewrite RA callbacks hipe_x86: Add code rewrite RA callbacks hipe: Remove defun_to_cfg/1 RA callback Add new sanity assertion to hipe_regalloc_prepass Simplify hipe_x86_ra_finalise:conv_ra_maplet/3 hipe_x86: Simplify ra_postconditions is_mem_opnd hipe_x86: Fix pseudo_tailcall prettyprinting hipe_x86: Extra sanity assertions hipe: clean up unnecessary catches hipe: Remove temp reuse from call_fun hipe: Add IG partitioning to hipe_regalloc_prepass hipe: Add hipe_regalloc_prepass
2016-09-07	Merge branch 'maint'	Björn-Egil Dahlberg

2016-09-05	hipe: Refactor ra callbacks to accept context arg	Magnus Lång
	This allows us to pass around the context data that hipe_regalloc_prepass needs cleanly, without using process dictionary or parameterised modules (like it was previous to this change).
2016-09-05	hipe: Reuse liveness between regalloc iterations	Magnus Lång
	This is sound because the liveness data structure only stores liveness info at basic block boundaries, and the rewrites that happen in TargetSpecific:check_and_rewrite/2 preserves all existing definitions and uses, and all new liveness intervals, belonging to newly introduced temporaries, are always local to a basic block, and thus do not show up in the liveout or livein sets for the basic block.
2016-09-05	hipe: Fix erl_types opaque match order	Björn-Egil Dahlberg

2016-09-05	hipe: Add ra_partitioned to o1 and up	Magnus Lång
	ra_partitioned significantly speeds up register allocation of larger functions without affecting allocation quality negatively. This is the final change needed to make o1 suitable for compiling really large functions without choking.
2016-09-05	hipe_regalloc_prepass: Change splitting heuristic	Magnus Lång
	The division into an initial pass that may introduce temps, and following passes that must not forces us to make the same heuristic decision during each of these passes. Thus, the splitting heuristic can't be based on the number of temporaries -- at least without excluding temporaries above SpillLimit.
2016-09-05	Implement the new ceil/1 and floor/1 guard BIFs	Björn Gustavsson
	Implement as ceil/1 and floor/1 as new guard BIFs (essentially part of Erlang language). They are guard BIFs because trunc/1 is a guard BIF. It would be strange to have trunc/1 as a part of the language, but not ceil/1 and floor/1.
2016-09-02	hipe: Make sure prepass temps are below SpillLimit	Magnus Lång
	If temps introduced by hipe_regalloc_prepass end up above SpillLimit, the register allocators will not spill them. This constraint is unnecessarily limiting the allocators and might theoretically lead to unallocatable programs (more temps above SpillLimit alive at a time than there are physical registers).
2016-09-02	hipe_regalloc_prepass: Rename coloring collisions	Magnus Lång

2016-09-02	hipe_ppc: Add code rewrite RA callbacks	Magnus Lång
	These will not only be useful for hipe_regalloc_prepass, but also, after the introduction of a mk_move/2 (or similar) callback, for the purpose of range splitting. Since the substitution needed to case over all the instructions, a new module, hipe_ppc_subst, was introduced to the ppc backend.
2016-09-02	hipe_sparc: Add code rewrite RA callbacks	Magnus Lång
	These will not only be useful for hipe_regalloc_prepass, but also, after the introduction of a mk_move/2 (or similar) callback, for the purpose of range splitting. Since the substitution needed to case over all the instructions, a new module, hipe_sparc_subst, was introduced to the sparc backend.
2016-09-02	hipe_arm: Add code rewrite RA callbacks	Magnus Lång
	These will not only be useful for hipe_regalloc_prepass, but also, after the introduction of a mk_move/2 (or similar) callback, for the purpose of range splitting. Since the substitution needed to case over all the instructions, a new module, hipe_arm_subst, was introduced to the arm backend.
2016-09-02	hipe_x86: Add code rewrite RA callbacks	Magnus Lång
	These will not only be useful for hipe_regalloc_prepass, but also, after the introduction of a mk_move/2 (or similar) callback, for the purpose of range splitting. Since the substitution needed to case over all the instructions, a new module, hipe_x86_subst, was introduced to the x86 backend. Due to differences in the 'jtab' field of a #jmp_switch{} between x86 and amd64, it regrettably needed to be duplicated to hipe_amd64_subst.
2016-09-02	hipe: Remove defun_to_cfg/1 RA callback	Magnus Lång
	Now that all backends do register allocation on a CFG directly and define the defun_to_cfg/1 callback as the identity function, it can be removed.
2016-09-02	Add new sanity assertion to hipe_regalloc_prepass	Magnus Lång
	As the just_as_good_as assertion was loosened with the `NowRegs >= CheckRegs` check, it no longer verified that hipe_regalloc_prepass had not incorrectly labeled a temp as unallocatable. We add that behaviour back.
2016-09-02	Simplify hipe_x86_ra_finalise:conv_ra_maplet/3	Magnus Lång

2016-09-02	hipe_x86: Simplify ra_postconditions is_mem_opnd	Magnus Lång
	This is due to the improvements in hipe_temp_map, removing the need for duplicated logic in the backends.
2016-09-02	hipe_x86: Fix pseudo_tailcall prettyprinting	Magnus Lång

2016-09-02	hipe_x86: Extra sanity assertions	Magnus Lång

2016-09-02	hipe: clean up unnecessary catches	Magnus Lång

2016-09-02	hipe: Remove temp reuse from call_fun	Magnus Lång

2016-09-02	hipe: Add IG partitioning to hipe_regalloc_prepass	Magnus Lång

2016-09-02	hipe: Add hipe_regalloc_prepass	Magnus Lång
	hipe_regalloc_prepass speeds up register allocation by spilling any temp that is live over a call (which clobbers all register). In order to detect these, a new function was added to the target interface; defines_all_alloc/1, that takes an instruction and returns a boolean.
2016-09-02	Merge branch 'sverker/hipe-performance-o1/PR-1154/OTP-13862'	Sverker Eriksson
	* sverker/hipe-performance-o1/PR-1154: hipe_sparc: Minimise CFG<->linear conversions hipe_ppc: Minimise CFG<->linear conversions hipe_arm: Minimise CFG<->linear conversions hipe_x86: Use lea instead of move+add hipe_arm: Improve peephole optimiser hipe_arm: Be resilient to crappy RTL hipe_ppc: Be resilient to crappy RTL hipe_sparc: Be resilient to crappy RTL hipe: Reuse liveness info for spillmin hipe_x86: Minimise CFG<->linear conversions hipe: Fix o0 and o1 hipe: Add o0 and o1 to tests hipe_rtl_binary:get_word_integer/4: Handle imms hipe_x86: Be resilient to crappy RTL hipe_x86: LSRA for SSE2
2016-09-02	Merge branch 'maint'	Sverker Eriksson

2016-09-02	Merge branch 'sverker/hipe-sparc-19/PR-1148/OTP-13861' into maint	Sverker Eriksson
	* sverker/hipe-sparc-19/PR-1148: Eliminate catch-all clause from two functions Increase the time limit used by the test suite
2016-09-01	Merge branch 'maint'	Hans Bolinder
	* maint: dialyzer: Increase time limit of suites dialyzer: Remove a check that always fails dialyzer: Optimize an opaque type case
2016-08-31	dialyzer: Optimize an opaque type case	Hans Bolinder
	Fix a mistake in commit 85f6fe3b. Instead of using the declared opaque type, the form's type is used in a case where the opaque type is turned into a non-opaque type. The result is more general types (smaller Erlang terms) and faster analyses.
2016-08-30	hipe_sparc: Minimise CFG<->linear conversions	Magnus Lång
	Now, there will only ever be a single Linear->CFG conversion, just after lowering from RTL, and only ever a single CFG->Linear conversion, just before the finalise pass. Both of these now happen in hipe_sparc_main.
2016-08-30	hipe_ppc: Minimise CFG<->linear conversions	Magnus Lång
	Now, there will only ever be a single Linear->CFG conversion, just after lowering from RTL, and only ever a single CFG->Linear conversion, just before the finalise pass. Both of these now happen in hipe_ppc_main.
2016-08-30	hipe_arm: Minimise CFG<->linear conversions	Magnus Lång
	Now, there will only ever be a single Linear->CFG conversion, just after lowering from RTL, and only ever a single CFG->Linear conversion, just before the finalise pass. Both of these now happen in hipe_arm_main.
2016-08-30	hipe_x86: Use lea instead of move+add	Magnus Lång
	This is primarily useful for heap allocations, as a two-address 'add' can't be used to both copy the heap pointer to another register, and add the tag.
2016-08-30	hipe_arm: Improve peephole optimiser	Magnus Lång

2016-08-30	hipe_arm: Be resilient to crappy RTL	Magnus Lång
	The ARM backend crashes if certain RTL optimisations were omitted, preventing it from being usable at lower optimisation levels. One of the problems were caused by shift-by-immediate-zero, which wraps to immediate-32 with some shiftops. TODO: Someplace should be modified to crash when these are generated so debuging further instances of this gets easier in the future.
2016-08-30	hipe_ppc: Be resilient to crappy RTL	Magnus Lång
	The PowerPC backend crashes if certain RTL optimisations were omitted, preventing it from being usable at lower optimisation levels.
2016-08-30	hipe_sparc: Be resilient to crappy RTL	Magnus Lång
	The SPARC backend crashes if certain RTL optimisations were omitted, preventing it from being usable at lower optimisation levels.
2016-08-30	hipe: Reuse liveness info for spillmin	Magnus Lång
	For x86, additionally reuse liveness from float LSRA for the GP LSRA.
2016-08-30	hipe_x86: Minimise CFG<->linear conversions	Magnus Lång
	Most x86 passes were either linearise(pass(to_cfg(Code))) or trivially rewritable to process a CFG. This saves a great deal of time and memory churn when compiling large programs. Now, there will only ever be a single Linear->CFG conversion, just after lowering from RTL, and only ever a single CFG->Linear conversion, just before the finalise pass. Both of these now happen in hipe_x86_main.
2016-08-30	hipe: Fix o0 and o1	Magnus Lång
	These options would not do anything, because they would not supress the 'o2' in ?COMPILE_DEFAULTS. Such behaviour is added to expand_options/2.
2016-08-30	hipe: Add o0 and o1 to tests	Magnus Lång
	Now that x86 is no longer broken with these optimisation levels, we add them to the test suite to ensure they do not break again. Bump timeout to 6min since tests are run twice as many times. The option set of o1 was changed to all optimisations that run fast on both big and small programs, incurring only a slight compile time increase compared to the old set, but with a, presumably, significant improvement to speed of compiled code. Change o0 register allocator to linear_scan.
2016-08-30	hipe_rtl_binary:get_word_integer/4: Handle imms	Magnus Lång
	Immediate arguments to get_word_integer/4 would lead to bad but unreachable RTL being generated. We omit its generation by testing for immediates and performing the logic at compile time.
2016-08-30	hipe_x86: Be resilient to crappy RTL	Magnus Lång
	The x86 backend crashes if certain RTL optimisations were omitted, preventing it from being usable at lower optimisation levels.
2016-08-30	hipe_x86: LSRA for SSE2	Magnus Lång
	There is little point offering LSRA for x86 if we're still going to call hipe_graph_coloring_regalloc for the floats. In particular, all allocators except LSRA allocates an N^2 interference matrix, making them unusable for really large functions.
2016-08-26	Merge branch 'maint'	Sverker Eriksson

2016-08-26	Eliminate catch-all clause from two functions	Kostis Sagonas
	A stronger version of Dialyzer complained that some case clauses in functions xaluop_is_shift/1 and xaluop_normalise/1 are unreachable. These clauses are now commented out. While at it, I thought that it would be better to eliminate the catch-all clauses in order to be certain we properly handle all RTL instructions that are used as inputs to these functions. Note: The code will now crash if there are unhandled cases.
2016-08-25	Increase the time limit used by the test suite	Kostis Sagonas
	This is required in some really old SPARC machines running Solaris we still have access to.
2016-08-22	hipe: Fix amd64 SSE2 encoding crash	Magnus Lång
	Register allocation could transform something like fmove u32, d99 to fmove $rdx, 0x20($rsp) which is an invalid instruction.
2016-08-22	hipe: Fix tailcall stackarg clobber bug	Magnus Lång
	Since the link register/return address is restored before stack arguments are stored to the frame, we must not use it to store a stack argument. We do that by adding it to the registers clobbered by pseudo_tailcall_prepare.
2016-08-22	hipe_arm: Fix translation of shift by 0	Magnus Lång
	The problem was caused by shift-by-immediate-zero, which wraps to immediate-32 with some shiftops. TODO: Someplace should be modified to crash when these are generated so debugging further instances of this gets easier in the future.