The syntax of Erlang tokens allow the use of the full ISO-8859-1 (Latin-1) character set. This is noticeable in the following ways:
All the Latin-1 printable characters can be used and are shown without the escape backslash convention.
Atoms and variables can use all Latin-1 letters.
In Erlang/OTP R16B the syntax of Erlang tokens was extended to
handle Unicode. The support was limited to
string literals and comments.
More about the usage of Unicode in Erlang source files
can be found in
From Erlang/OTP 20, atoms and function names are also allowed to contain Unicode characters outside the ISO-Latin-1 range. Module names, application names, and node names are still restricted to the ISO-Latin-1 range.
The Erlang source file
The following example selects UTF-8 as default encoding:
%% coding: utf-8
Two more examples, both selecting Latin-1 as default encoding:
%% For this file we have chosen encoding = Latin-1
%% -*- coding: latin-1 -*-
The default encoding for Erlang source files is changed from Latin-1 to UTF-8 since Erlang/OTP 17.0.