otp.git - Mirror of Erlang/OTP repository.

Age	Commit message (Collapse)	Author
2019-04-30	Merge branch 'john/compiler/fix-missing-match-reposition/ERL-923'	John Högberg
	* john/compiler/fix-missing-match-reposition/ERL-923: compiler: Propagate match context position on fail path
2019-04-29	compiler: Propagate match context position on fail path	John Högberg

2019-04-23	beam_validator: Don't infer types for dead values	John Högberg

2019-04-15	Optimize encoding of simple literals	Björn Gustavsson
	`beam_asm` would encode `{literal,[]}`, `{literal,erlang}`, and `{literal,42}` in a less efficient way than the equivalent values `nil`, `{atom,erlang}`, and `{integer,42}`. That would increase the size of BEAM files and could increase the loaded code size. It would probably not harm performance, because `literal` was only used this way in code that generates `badmatch` and `case_clause` exceptions.
2019-04-10	compiler: Add missing header dependencies	John Högberg

2019-04-08	Merge pull request #2200 from ↵	Björn Gustavsson
	bjorng/bjorn/deprecation-warnings/ERL-904/OTP-15749 Add compiler option for suppressing warnings about removed functions/modules
2019-04-05	Add options for suppressing warnings about removed functions	Björn Gustavsson
	An appliction outside of OTP may want to reuse then name of a module that was previously included in OTP. Therefore, there should be a way to suppress warnings for removed functions.
2019-04-04	cerl_clauses: Fix broken link	Björn Gustavsson

2019-04-04	erts: Fix more internal docs xmllint	Lukas Larsson

2019-04-04	compiler: Add internal docs section	Lukas Larsson

2019-03-28	compile: Add some types to core_parse.hrl	Hans Bolinder
	The core_parse.hrl can be used in applications with the +warn_untyped_record compiler flag.
2019-03-28	compiler: Fix documentation of cerl:c_binary/1	Hans Bolinder

2019-03-28	Merge branch 'bjorn/compiler/cuddle-with-tests'	Björn Gustavsson
	* bjorn/compiler/cuddle-with-tests: Verify the highest opcode for the r21 test suites Add test_lib:highest_opcode/1 sys_core_fold: Simplify case_expand_var/2 beam_validator: Remove uncovered lines in lists_mod_return_type/3 Cover return type determination of lists functions
2019-03-26	compiler: Fully disable no_return optimizations in try blocks	John Högberg
	Validation could fail when a function that never returned was used in a try block (see attached test case). It's possible to solve this without disabling the optimization as the generated code is sound, but I'm not comfortable making such a large change this close to the OTP 22 release.
2019-03-25	Verify the highest opcode for the r21 test suites	Björn Gustavsson

2019-03-25	Add test_lib:highest_opcode/1	Björn Gustavsson

2019-03-25	sys_core_fold: Simplify case_expand_var/2	Björn Gustavsson
	5239eb0c62a9 stopped storing patterns that a variable has matched. The only tuples that are stored in the type database are tuples that have been previously constructed. Therefore, the code in sys_core_fold:case_expand_var/2 and friends that converts a tuple pattern to a tuple construction can be simplified to used the stored tuple directly.
2019-03-25	beam_validator: Remove uncovered lines in lists_mod_return_type/3	Björn Gustavsson
	Don't bother infering the return types for lists function that beam_ssa_type does not handle.
2019-03-25	Cover return type determination of lists functions	Björn Gustavsson

2019-03-21	Merge branch 'bjorn/hipe-compilation/OTP-15596'	Björn Gustavsson
	* bjorn/hipe-compilation/OTP-15596: HiPE: Don't fail the compilation for unimplemented instructions
2019-03-21	lib/compiler/scripts/smoke: Compile some cloudi packages	Björn Gustavsson

2019-03-20	HiPE: Don't fail the compilation for unimplemented instructions	Björn Gustavsson

2019-03-19	Merge branch 'john/compiler/fix-eq-type-infererence-in-validator/ERL-886'	John Högberg
	* john/compiler/fix-eq-type-infererence-in-validator/ERL-886: beam_validator: Infer types on both sides of '=:='
2019-03-18	lib/compiler/scripts/smoke: Force a local hex	Björn Gustavsson
	The script would hang if no local hex had been installed previously.
2019-03-18	beam_validator: Infer types on both sides of '=:='	John Högberg

2019-03-13	Merge pull request #2177 from bjorng/bjorn/erts/tail-recursive-bifs	Björn Gustavsson
	Optimize tail-recursive calls of BIFs OTP-15674
2019-03-12	Merge branch 'maint'	Henrik Nord
	* maint: Updated OTP version Prepare release
2019-03-11	Prepare release	Erlang/OTP

2019-03-09	Optimize tail-recursive calls of BIFs	Björn Gustavsson
	BEAM currently does not call BIFs at the end of a function in a tail-recursive way. That is, when calling a BIF at the end of a function, the BIF is first called, and then the stack frame is deallocated, and then control is transferred to the caller. If there is no stack frame when a BIF is called in the tail position, the loader will emit a sequence of three instructions: first an instruction that allocates a stack frame and saves the continuation pointer (`allocate`), then an instruction that calls the BIF (`call_bif`), and lastly an instruction that deallocates the stack frame and returns to the caller (`deallocate_return`). The old compiler would essentially allocate a stack frame for each clause in a function, so it would not be that common that a BIF was called in the tail position when there was no stack frame, so the three-instruction sequence was deemed acceptable. The new compiler only allocates stack frames when truly needed, so the three-instruction BIF call sequence has become much more common. This commit introduces a new `call_bif_only` instruction so that only one instruction will be needed when calling a BIF in the tail position when there is no stack frame. This instruction is also used when there is a stack frame to make it possible to deallocate the stack frame before calling the BIF, which may make a subsequent garbage collection at the end of the BIF call cheaper (copying less garbage). The one downside of this change is that the function that called the BIF will not be included in the stack backtrace (similar to how a tail-recursive call to an Erlang function will not be included in the backtrace). That was the quick summary of the commit. Here comes a detailed look at how BIF calls are translated by the loader. The first example is a function that calls `setelement/3` in the tail position: update_no_stackframe(X) -> setelement(5, X, new_value). Here is the BEAM code: {function, update_no_stackframe, 1, 12}. {label,11}. {line,[...]}. {func_info,{atom,t},{atom,update_no_stackframe},1}. {label,12}. {move,{x,0},{x,1}}. {move,{atom,new_value},{x,2}}. {move,{integer,5},{x,0}}. {line,[...]}. {call_ext_only,3,{extfunc,erlang,setelement,3}}. Because there is no stack frame, the `call_ext_only` instruction will be used to call `setelement/3`: {call_ext_only,3,{extfunc,erlang,setelement,3}}. The loader will transform this instruction to a three-instruction sequence: 0000000020BD8130: allocate_tt 0 3 0000000020BD8138: call_bif_e erlang:setelement/3 0000000020BD8148: deallocate_return_Q 0 Using the `call_bif_only` instruction introduced in this commit, only one instruction is needed: 000000005DC377F0: call_bif_only_e erlang:setelement/3 `call_bif_only` calls the BIF and returns to the caller. Now let's look at a function that already has a stack frame when `setelement/3` is called: update_with_stackframe(X) -> foobar(X), setelement(5, X, new_value). Here is the BEAM code: {function, update_with_stackframe, 1, 14}. {label,13}. {line,[...]}. {func_info,{atom,t},{atom,update_with_stackframe},1}. {label,14}. {allocate,1,1}. {move,{x,0},{y,0}}. {line,[...]}. {call,1,{f,16}}. {move,{y,0},{x,1}}. {move,{atom,new_value},{x,2}}. {move,{integer,5},{x,0}}. {line,[...]}. {call_ext_last,3,{extfunc,erlang,setelement,3},1}. Since there is a stack frame, the `call_ext_last` instruction will be used to deallocate the stack frame and call the function: {call_ext_last,3,{extfunc,erlang,setelement,3},1}. Before this commit, the loader would translate this instruction to: 0000000020BD81B8: call_bif_e erlang:setelement/3 0000000020BD81C8: deallocate_return_Q 1 That is, the BIF is called before deallocating the stack frame and returning to the calling function. After this commit, the loader will translate the `call_ext_last` like this: 000000005DC37868: deallocate_Q 1 000000005DC37870: call_bif_only_e erlang:setelement/3 There are still two instructions, but now the stack frame will be deallocated before calling the BIF, which could make the potential garbage collection after the BIF call slightly more efficient (copying less garbage). We could have introduced a `call_bif_last` instruction, but the code for calling a BIF is relatively large and there does not seem be a practical way to share the code between `call_bif` and `call_bif_only` (since the difference is at the end, after the BIF call). Therefore, we did not want to clone the BIF calling code yet another time to make a `call_bif_last` instruction.
2019-03-08	beam_ssa_opt: Fix crash in ssa_opt_float	John Högberg
	For reasons better explained in the source code, ssa_opt_float skips optimizing inside guards but it failed to do so consistently; while the pass never processed guard blocks, it was still possible to erroneously defer error checking to a guard block, crashing the compiler once it realized its state was invalid.
2019-03-06	beam_validator: Express test_arity/is_tagged_tuple as type tests	John Högberg
	This ensures that unreachable branches are properly ignored on repeated checks (although tuple type subtraction isn't complete yet).
2019-03-06	beam_validator: Fix type subtraction on select_* and inequality	John Högberg
	Type subtraction never resulted in the 'none' type, even when it was obvious that it should. Once that was fixed it became apparent that inequality checks also fell into the same subtraction trap that the type pass warned about in a comment. This then led to another funny problem with select_val, consider the following code: {bif,'>=',{f,0},[{x,0},{integer,1}],{x,0}}. {select_val,{x,0},{f,70},{list,[{atom,false},{f,69}, {atom,true},{f,68}]}}. The validator knows that '>=' can only return a boolean, so once it has subtracted 'false' and 'true' it killed the state because all all valid branches had been taken, so validation would crash once it tried to branch off the fail label.
2019-03-05	beam_validator: Style fix for asserts in list comprehensions	John Högberg

2019-03-05	beam_validator: Infer types from bs_put instructions	John Högberg

2019-03-05	beam_validator: is_map no longer needs special treatment	John Högberg

2019-03-05	beam_validator: Refactor type conflict resolution	John Högberg
	The current type conflict resolution works well for the example case in the comment, but doesn't handle branched code properly, consider the following: {label,2}. {test,is_tagged_tuple,{f,ignored},[{x,0},3,{atom,r}]}. {allocate_zero,2,1}. {move,{x,0},{y,0}}. %% {y,0} is known to be {r, _, _} now. {get_tuple_element,{x,0},2,{x,0}}. {'try',{y,1},{f,3}}. %% ... snip ... {jump,{f,5}}. {label,3}. {try_case,{y,1}}. %% {x,0} is the error class (an atom), {x,1} is the error term. {test,is_eq_exact,{f,ignored},[{x,0},{y,0}]}. %% ... since tuples and atoms can't meet, the type of {y,0} is %% now {atom,[]} because the current code assumes the type %% we're updating with. {move,{x,1},{x,0}}. {jump,{f,5}}. {label,5}. %% ... joining tuple (block 2) and atom (block 3) means 'term', %% so the get_tuple_element instruction fails to validate %% despite this being unrechable from block 3. {test_heap,3,1}. {get_tuple_element,{y,0},1,{x,1}}. {put_tuple2,{x,0},{list,[{x,1},{x,0}]}}. {deallocate,2}. return. This commit kills the state on type conflicts, making unreachable instructions truly unreachable.
2019-03-05	beam_validator: Refactor branch handling	John Högberg
	While complex_test made certain branching instructions a lot easier to read, we're still using `branch_state` for many others which is hard to read and makes it impossible to "abort" branches on type conflicts. This commit replaces nearly all uses of `branch_state` with a general branching mechanism, improving readability and paving the way for proper type conflict resolution.
2019-03-04	beam_validator: Fix element/2 BIF handling	John Högberg
	The element type can not be extracted before the tuple type has been updated.
2019-03-04	beam_validator: Rename get_move_term_type and clean up get_raw_type	John Högberg

2019-03-04	Merge pull request #2168 from josevalim/jv-v3_kernel-binary-all	Björn Gustavsson
	Move size=all binary clause pruning to v3_kernel
2019-03-04	Merge pull request #2167 from bjorng/bjorn/tune-beam	Björn Gustavsson
	Tune BEAM instructions for the new compiler (part 1)
2019-03-04	Merge pull request #2166 from bjorng/bjorn/compiler/fix-slow-beam_ssa_dead	Björn Gustavsson
	Optimize the beam_ssa_dead sub pass
2019-03-01	Move size=all binary clause prunning to v3_kernel	José Valim
	The advantage of moving it up is that it reduces the size of the code emitted by v3_kernel, speeding v3_kernel itself and beam_kernel_to_ssa pass.
2019-03-01	Optimize v3_kernel for thousands of clauses	José Valim
	Prior to this patch, v3_kernel would do multiple passes on the clauses to group them. This commit unrolls those passes, making v3_kernel up to 10% faster in those cases.
2019-03-01	Add a comment about the time complexity of beam_ssa_dead	Björn Gustavsson

2019-03-01	Pass the from node as a function argument instead of in a map	Björn Gustavsson
	This is cleaner and slightly faster.
2019-03-01	Do some minor optimizations of compilation times	Björn Gustavsson
	The general complexity of the shortcut sub pass of `beam_ssa_dead` is quadratic, but those optimizations will reduce the constant factor somewhat.
2019-03-01	Keep the set of unset variables as small as possible	Björn Gustavsson
	Refactor the code to avoid putting any variable from a skippable block into the set of unset variables. Keeping the set of unset variables as small as possible will make beam_ssa_dead almost twice as fast when compiling lib/unicode/tokenizer.ex in elixir.
2019-02-28	beam_ssa_opt: Use is_tagged_tuple more	Björn Gustavsson
	Consider this code: foo(X) -> case X of {ok,A} -> A; error -> X end. The `is_tagged_tuple` instruction would not be used because not all instructions in the tuple matching sequence had the same failure label: function t:foo(_0) { 0: @ssa_bool:7 = bif:is_tuple _0 br @ssa_bool:7, label 8, label 4 8: @ssa_arity = bif:tuple_size _0 @ssa_bool:9 = bif:'=:=' @ssa_arity, literal 2 br @ssa_bool:9, label 6, label 3 6: _4 = get_tuple_element _0, literal 0 @ssa_bool = bif:'=:=' _4, literal ok br @ssa_bool, label 5, label 3 5: _3 = get_tuple_element _0, literal 1 ret _3 4: @ssa_bool:11 = bif:'=:=' _0, literal error br @ssa_bool:11, label 10, label 3 10: ret _0 3: _2 = put_tuple literal case_clause, _0 %% t.erl:5 @ssa_ret:12 = call remote (literal erlang):(literal error)/1, _2 ret @ssa_ret:12 } Enhance the ssa_opt_record optimization to use `is_tagged_tuple` even if all failure labels are not the same: function t:foo(_0) { 0: @ssa_bool:7 = bif:is_tuple _0 br @ssa_bool:7, label 8, label 4 8: @ssa_bool:9 = is_tagged_tuple _0, literal 2, literal ok br @ssa_bool:9, label 6, label 3 6: _3 = get_tuple_element _0, literal 1 ret _3 4: @ssa_bool:11 = bif:'=:=' _0, literal error br @ssa_bool:11, label 10, label 3 10: ret _0 3: _2 = put_tuple literal case_clause, _0 %% t.erl:5 @ssa_ret:12 = call remote (literal erlang):(literal error)/1, _2 ret @ssa_ret:12 } The tuple test will be repeated, but since four instructions are replaced by two instructions, the code will still be faster and smaller.
2019-02-28	beam_ssa_opt: Order consecutive get_tuple_element instructions	Björn Gustavsson