otp.git - Mirror of Erlang/OTP repository.

Age	Commit message (Collapse)	Author
2015-03-09	Don't inline core_parse	Björn Gustavsson
	Inlining the core_parse module is slow (the inline pass alone takes more than 6 seconds on my computer) and has no benefit.
2015-03-09	v3_core: Teach pat_alias/2 to eliminate duplicated variables	Björn Gustavsson
	Duplicated variables as aliases in patterns, such as: f({_,_}=Dup=Dup) -> ... will work, but produce sub-optimal code similar to: f({_,_}=Dup=NewVar) when Dup =:= NewVar -> ... with one extra guard test for each duplicated variable. Rewrite pat_alias/2 to eliminate all duplicated variables. While we are at it, also simplify handling of tuples, conses, and literals by using the data functions in the cerl module.
2015-03-09	beam_dead: Improve optimization by eliminating fallthroughs	Björn Gustavsson

2015-03-09	beam_dead: Optimize Var =:= Var	Björn Gustavsson

2015-03-09	beam_peep: Optimize away redundant use of is_boolean tests	Björn Gustavsson

2015-03-09	beam_bool: Correct initialized_regs/2	Björn Gustavsson
	initialized_regs/2 did not handle allocating instructions; instead treating them as any other 'set' instruction. The consequences could be one or both of the following: Going past the allocating instruction (looking at more instructions) would mean that initialized_regs/2 could return registers that were not actually initialized. That could mean that MustBeKilled in ensure_opt_safe/6 could contain too few registers, and that the code that followed tried to use an uninitialized register. The beam_validator should have detected that problem. Not taking account the number of live registers in the allocating instruction could mean that some registers were not found to be initialized, which could mean that MustBeKilled would contain too many registers. That would mean a missed optimization.
2015-03-09	sys_core_fold: Generalize case optimization	Björn Gustavsson
	For a long time, there has been an optimization for: {V1,V2,...} = case Expr of Pat -> ... {Val1,Val2,...}; ... end that avoids building the tuples. The construct looks like this in Core Erlang: let <V> = case X of Pattern -> {Y,Z} end in case V of {A,B} -> A+B end The current optimization will try to replace the second 'case' with a 'let': let <A,B> = case X of Pattern -> <Y,Z> end in A+B Simple variations of the construct would prevent the optimizations; for example this one: let <V> = case X of Pattern -> {'ok',Val} end in case V of {ok,Val} -> Val end The problem is that the optimization tries to do too much. By making the optimization do less and have it depend on other optimizations to finish the job, it will become more powerful. Thus we can rewrite the code like this: let <V1,V2> = case X of Pattern -> <'ok',Val> end in let <V> = {V1,V2} in case V of {ok,Val} -> Val end Note that the second case is unchanged. The other optimizations in the sys_core_fold module will optimize the second 'case' and eliminate the building of the tuple.
2015-03-09	sys_core_fold: Improve optimization of 'not'	Björn Gustavsson
	Optimize away 'not' in sys_core_fold instead of in beam_block and beam_dead, as we can do a better job in sys_core_fold. I modified the test suite temporarily to never turn off Core Erlang modifications and looked at the coverage. With the new optimizations active in sys_core_fold, the code in beam_block and beam_dead did not find a single 'not' that it could optimize. That proves that the new optimization is at least as good as the old one. Manually, I could also verify that the new optimization would optimize some variations of 'not' that the old one would not handle.
2015-03-09	sys_core_fold: Suppress compiler warnings when evaluating element/2	Björn Gustavsson
	More aggressive optimizations that we plan to introduce could cause spurious compiler warnings.
2015-03-09	Clean up evaluation of setelement/3	Björn Gustavsson
	The 'try' ... 'catch' is problematic. Firstly, if no optimization is possible, an exception will always be thrown. Secondly, bugs in the code will go unnoticed.
2015-03-09	Replace '==' with '=:=' when both operands are integers	Björn Gustavsson
	'=:=' is a cheaper operation than '==', so we should always use '=:=' if the result will be the same as if '==' were used.
2015-03-09	Update type information based on BIFs that returns integers	Björn Gustavsson

2015-03-09	sys_core_fold: Strengthen type optimization in lets	Björn Gustavsson
	Make sure that we take extract all possible type information when optimizing a 'let' construct. Since the stronger optimization may generate false warnings, we also need to take special care to suppress false warnings.
2015-03-09	v3_codegen: Teach the put_map_* instructions to reuse source registers	Björn Gustavsson
	The put_map_assoc and put_map_exact instructions in the run-time system will support that the target register is the same as one of the source registers. Teach the code generator to take advantage of that. The disadvantages of not reusing register when possible is that the garbage collector may retain dead terms longer than necessary.
2015-03-09	beam_validator: Tighten tests of maps	Björn Gustavsson

2015-03-09	v3_core: Eliminate the sloppiness-encouraging get_ianno/1 function	Björn Gustavsson
	get_ianno/1 would retrieve either a bare annotation or an annotation wrapped in an #a{} record. In both cases, it would return a wrapped annotation. We can replace the calls to get_ianno/1 with calls to get_anno/1, because the argument is always an #iclause{} and all iclause records are always initialized with a wrapped annotation.
2015-03-09	v3_core: Add is_map tests before map instructions	Björn Gustavsson
	If we have a sequence of put_map_* instructions operating on the same map, it will be more efficient if we can have one is_map/2 instruction before put_map_* instructions, so that each put_map_* does not need to test whether the argument is a map.
2015-03-09	beam_type: Use the complete register map when calculating liveness	Björn Gustavsson
	When calculating the number of live registers for allocation instruction, it is not always safe to start with the number of live registers at the start of the block. We will need to use the register map to know whether there are any holes (dead registers) that are not subsequently filled. If the allocation instruction already has a number of live registers calculated, there is nothing to be gained by raising it.
2015-03-09	Introduce '%live' annotations with a complete register map	Björn Gustavsson
	As a preparation for fixing a bug, introduce a complete register map in the '%live' annotations.
2015-02-27	beam_validator: Teach bif_type/3 and is_bif_safe/2 about is_map/1	Björn Gustavsson

2015-02-27	v3_core: Simplify conversion of map patterns	Björn Gustavsson

2015-02-23	Merge branch 'bjorn/compiler/beam_jump-share'	Björn Gustavsson
	* bjorn/compiler/beam_jump-share: beam_jump: Don't jump into the middle of a 'try'
2015-02-23	Merge branch 'bjorn/compiler/sys_core_fold'	Björn Gustavsson
	* bjorn/compiler/sys_core_fold: sys_core_fold: Fix non-tail-recursive list comprehensions
2015-02-20	Merge branch 'bjorn/compiler/beam_jump'	Björn Gustavsson
	* bjorn/compiler/beam_jump: beam_jump: Eliminate pathologically slow compilation
2015-02-20	Merge branch 'bjorn/compiler/beam_validator'	Björn Gustavsson
	* bjorn/compiler/beam_validator: beam_validator: Exit immediately on crashes beam_validator: Remove the file/1 and files/1 functions beam_validator: Remove support for all other unsupported instructions beam_validator: Remove support for unsupported bit syntax instructions
2015-02-20	sys_core_fold: Fix non-tail-recursive list comprehensions	Björn Gustavsson
	649d6e73 simplified opt_simple_let_2/6 a little bit too much, so that some list comprehensions in effect context were not properly tail-recursive.
2015-02-20	beam_jump: Eliminate pathologically slow compilation	Björn Gustavsson
	José Valim noticed that code such as: match(1) -> 1; match(2) -> 2; match(3) -> 3; ... match(1000) -> 1000. would compile very slowly. The culprit is opt/3 in beam_jump. What happens is that opt/3 will rewrite this code: select_val ... label 1 jump 1000 label 2 jump 1000 ... label 999 jump 1000 label 1000 return very slowly to this code: select_val ... label 1 label 2 ... label 999 label 1000 return The reason for the slowness is that when opt/3 sees this sequence: label 1 jump 1000 ... it will remove the label (storing it in a dictionary), and pick up the previously processed instruction from the accumulator: select_val ... jump 1000 label 2 jump 1000 ... That is done in order to process all labels before the jump and also to get rid of the jump instruction if the previous instruction is an "unreachable after". In this case, re-processing the sequence will remove the now unreachable jump instruction: select_val ... label 2 jump 1000 ... The problem is that re-processing the select_val instruction is expensive. The instruction has a list of 1000 labels, all of which will be added (again) to the set of referenced labels. The select_val instruction will be re-processed again and again until all labels and jumps have been gobbled up. In the original version of beam_jump, opt/3 was not called repeatedly until a fixpoint was found, but was expected to do all its optimizations in one pass. The fixpoint iteration was added later. Since we now have the fixpoint iteration, there is no need to do everything in a single pass. When we encounter a jump, we will collect all previously seen labels and put them into the dictionary, and then we will move on. As a further optimization, we will look for sequences like this: jump X label ... jump X and replace them with: label ... jump X In the example above, that will avoid 1000 updates of the dictionary. After applying this optimization, compilation of the pattern went from roughly 55 s to 0.1 s for the example above but with 10000 clauses. Reported-by: José Valim
2015-02-20	beam_jump: Don't jump into the middle of a 'try'	Björn Gustavsson
	The code sharing optimization could produce a jump into the middle of a 'try' block. beam_validator would reject the code. Reported-by: Ulf Norell
2015-02-18	beam_validator: Exit immediately on crashes	Björn Gustavsson
	The beam_validator catches all exceptions and collect them. It makes more sense to don't catch 'error' and 'exit' exceptions, but to just print out the name of the current function and pass on the exception just as all other compilation passes do. Those kind of exceptions are the symptoms of the kind of severe but easily catched bugs that occur during development.
2015-02-18	beam_validator: Remove the file/1 and files/1 functions	Björn Gustavsson
	Before the beam_validator was added as compiler pass, it was a standalone module that could analyse existing .beam files and .S files. Even though beam_validator has been part of the compiler for many releases, it still supports the analysis of .beam and .S files. To reduce the code bloat and to improve coverage of beam_validator, remove the file/1 and files/1 functions and all associated help functions. We'll need to update the test suite, since some of the checked in .S files have errors that beam_validator ignores, but that will not be accepted when running them throught the compiler using the 'from_asm' option. In particular, we will need to export all functions that should be validated (since the beam_clean pass will remove any function that is not possible to call).
2015-02-18	beam_validator: Remove support for all other unsupported instructions	Björn Gustavsson

2015-02-18	beam_validator: Remove support for unsupported bit syntax instructions	Björn Gustavsson

2015-02-18	beam_validator: Tighten and simplify map validation code	Björn Gustavsson
	The assert_strict_literal_termorder/1 function is used to validate the get_map_elements and has_map_fields instructions. In neither case is it useful to allow an empty lists of fields, so we should no longer allow an empty list. The mmap/2 function is cute, but it is used in only one place, so it is much simpler to write a special-purpose function to extract the keys from the list of map pairs.
2015-02-18	beam_utils: Correct test for has_map_fields in is_pure_test/1	Björn Gustavsson
	The has_map_fields test was not recognized in is_pure_test/1, because beam_a has rewritten the {list,_} part of instruction.
2015-02-18	map_SUITE: Cover comparisons of 'nil' in v3_codegen	Björn Gustavsson

2015-02-12	cerl: Teach is_literal_term/1 to handle maps	Björn Gustavsson

2015-02-12	cerl: Add missing is_c_map/1 function	Björn Gustavsson

2015-02-12	v3_core: Simplify translation of maps	Björn Gustavsson
	There is no need to always introduce a new variable to hold a map. Maps are novars (constructs that don't export variables).
2015-02-12	sys_core_fold: Simplify opt_simple_let_2/6	Björn Gustavsson
	In cd1eaf0116190, opt_simple_let_2/6 was updated to do the same optimizations in 'value' and 'effect' context. Coalesce the clauses for 'value' and 'effect' context to one to make it clear that they do the same thing.
2015-02-12	Break out inlining of 'lists' functions to a new module	Björn Gustavsson
	The code for inlining high-order functions from the lists module is quite annoying when you try to navigate the sys_core_fold module. Break out the code into its own module.
2015-02-12	sys_core_fold: Add is_int_type/2 and is_tuple_type/2	Björn Gustavsson
	Those functions allow us to clean up some more code.
2015-02-12	sys_core_fold: Refactor type information access	Björn Gustavsson
	Introduce access functions to hide the low-level details of how type information is implemented.
2015-02-12	core_lib: Deprecate functions that are no longer used by the compiler	Björn Gustavsson

2015-02-12	Eliminate use of core_lib:literal_value/1	Björn Gustavsson
	Essentially, core_lib:literal_value/1 became useless when literals were introduced in R12. Since we always create #c_literal{} records whenever possible, literal_value/1 would only succeed when it was passed a #c_literal{} argument.
2015-02-12	Eliminate all uses of core_lib:get_anno/1 and core_lib:set_anno/2	Björn Gustavsson
	We are about to deprecate core_lib:get_anno/1 and core_lib:set_anno/2, so we should stop using them in the compiler.
2015-02-12	core_lint: Eliminate call to core_lib:is_literal/1	Björn Gustavsson
	Attributes must be literals. Since 1fcdcd50, both core_parse and v3_core guarantees all Core Erlang terms that may be represented as literals in fact are represented as literals. Therefore, we no longer need to call core_lib:is_literal/1, but can test for a #c_literal{} directly.
2015-02-12	test_lib: Include test_server.hrl using -include_lib	Björn Gustavsson
	This module is the last that uses -include() instead of include_lib(). With that change, a quick smoke test can be done simply like this: PATH=$ERL_TOP/bin:$PATH erlc -W0 *.erl Without the change, you would also need to add -I $ERL_TOP/lib/test_server/include to the command line.
2015-02-11	sys_core_fold: Rename add_scope/2 to fit in the sub_* family	Björn Gustavsson
	Rename add_scope/2 to sub_add_scope/2 to be similar in naming as the other functions that operates on #sub{} (in particular, sub_subst_scope/1). Also, move the definition to be near to the other sub_* functions.
2015-02-11	v3_core: Suppress compiler-generated calls in guards	Björn Gustavsson
	Compiling the following function: f(V) when not (bar and V) -> true; %Line 4 f(_) -> false. would produce the following warnings: no_file: Warning: the call to is_boolean/1 has no effect t.erl:4: Warning: the guard for this clause evaluates to 'false' t.erl:4: Warning: use of operator '=:=' has no effect Two of the warnings refer to calls to is_boolean/1 and '=:='/2 which v3_core added when translating the code to Core Erlang. The only relevant warning is: t.erl:4: Warning: the guard for this clause evaluates to 'false' Suppress the other two warning by marking the compiler-generated calls with a 'compiler_generated' annotation.
2015-02-11	v3_core: Remove out-commented code	Björn Gustavsson