<?xml version="1.0" encoding="latin1" ?> <!DOCTYPE chapter SYSTEM "chapter.dtd"> <chapter> <header> <copyright> <year>1997</year><year>2009</year> <holder>Ericsson AB. All Rights Reserved.</holder> </copyright> <legalnotice> The contents of this file are subject to the Erlang Public License, Version 1.1, (the "License"); you may not use this file except in compliance with the License. You should have received a copy of the Erlang Public License along with this software. If not, it can be retrieved online at http://www.erlang.org/. Software distributed under the License is distributed on an "AS IS" basis, WITHOUT WARRANTY OF ANY KIND, either express or implied. See the License for the specific language governing rights and limitations under the License. </legalnotice> <title>Getting Started with Mnesia</title> <prepared>Claes Wikström, Hans Nilsson and Håkan Mattsson</prepared> <responsible></responsible> <docno></docno> <approved></approved> <checked></checked> <date></date> <rev>C</rev> <file>Mnesia_chap2.xml</file> </header> <p>This chapter introduces Mnesia. Following a brief discussion about the first initial setup, a Mnesia database example is demonstrated. This database example will be referenced in the following chapters, where this example is modified in order to illustrate various program constructs. In this chapter, the following mandatory procedures are illustrated by examples: </p> <list type="bulleted"> <item>Starting an Erlang session and specifying a directory for the Mnesia database. </item> <item>Initializing a database schema. </item> <item>Starting Mnesia and creating the required tables.</item> </list> <section> <title>Starting Mnesia for the first time</title> <p>Following is a simplified demonstration of a Mnesia system startup. This is the dialogue from the Erlang shell: </p> <pre><![CDATA[ unix> erl -mnesia dir '"/tmp/funky"' Erlang (BEAM) emulator version 4.9 Eshell V4.9 (abort with ^G) 1> 1> mnesia:create_schema([node()]). ok 2> mnesia:start(). ok 3> mnesia:create_table(funky, []). {atomic,ok} 4> mnesia:info(). ---> Processes holding locks <--- ---> Processes waiting for locks <--- ---> Pending (remote) transactions <--- ---> Active (local) transactions <--- ---> Uncertain transactions <--- ---> Active tables <--- funky : with 0 records occupying 269 words of mem schema : with 2 records occupying 353 words of mem ===> System info in version "1.0", debug level = none <=== opt_disc. Directory "/tmp/funky" is used. use fall-back at restart = false running db nodes = [nonode@nohost] stopped db nodes = [] remote = [] ram_copies = [funky] disc_copies = [schema] disc_only_copies = [] [{nonode@nohost,disc_copies}] = [schema] [{nonode@nohost,ram_copies}] = [funky] 1 transactions committed, 0 aborted, 0 restarted, 1 logged to disc 0 held locks, 0 in queue; 0 local transactions, 0 remote 0 transactions waits for other nodes: [] ok ]]></pre> <p>In the example above the following actions were performed: </p> <list type="bulleted"> <item>The Erlang system was started from the UNIX prompt with a flag <c>-mnesia dir '"/tmp/funky"'</c>. This flag indicates to Mnesia which directory will store the data. </item> <item>A new empty schema was initialized on the local node by evaluating <c>mnesia:create_schema([node()]).</c> The schema contains information about the database in general. This will be thoroughly explained later on. </item> <item>The DBMS was started by evaluating <c>mnesia:start()</c>. </item> <item>A first table was created, called <c>funky</c> by evaluating the expression <c>mnesia:create_table(funky, [])</c>. The table was given default properties. </item> <item><c>mnesia:info()</c> was evaluated and subsequently displayed information regarding the status of the database on the terminal. </item> </list> </section> <section> <title>An Introductory Example</title> <p>A Mnesia database is organized as a set of tables. Each table is populated with instances (Erlang records). A table also has a number of properties, such as location and persistence. </p> <p>In this example we shall: </p> <list type="bulleted"> <item>Start an Erlang system, and specify the directory where the database will be located. </item> <item>Initiate a new schema with an attribute that specifies on which node, or nodes, the database will operate. </item> <item>Start Mnesia itself. </item> <item>Create and populate the database tables. </item> </list> <section> <title>The Example Database</title> </section> <p>In this database example, we will create the database and relationships depicted in the following diagram. We will call this database the <em>Company</em> database. </p> <image file="company.gif"> <icaption>Company Entity-Relation Diagram</icaption> </image> <p>The database model looks as follows: </p> <list type="bulleted"> <item>There are three entities: employee, project, and department. </item> <item> <p>There are three relationships between these entities:</p> <list type="ordered"> <item>A department is managed by an employee, hence the <em>manager</em> relationship. </item> <item>An employee works at a department, hence the <em>at_dep</em> relationship. </item> <item>Each employee works on a number of projects, hence the <em>in_proj</em> relationship. </item> </list> </item> </list> <section> <title>Defining Structure and Content</title> <p>We first enter our record definitions into a text file named <c>company.hrl</c>. This file defines the following structure for our sample database: </p> <codeinclude file="company.hrl" tag="%0" type="erl"></codeinclude> <p>The structure defines six tables in our database. In Mnesia, the function <c>mnesia:create_table(Name, ArgList)</c> is used to create tables. <c>Name</c> is the table name <em>Note:</em> The current version of Mnesia does not require that the name of the table is the same as the record name, See Chapter 4: <seealso marker="Mnesia_chap4#recordnames_tablenames">Record Names Versus Table Names.</seealso></p> <p>For example, the table for employees will be created with the function <c>mnesia:create_table(employee, [{attributes, record_info(fields, employee)}]).</c> The table name <c>employee</c> matches the name for records specified in <c>ArgList</c>. The expression <c>record_info(fields, RecordName)</c> is processed by the Erlang preprocessor and evaluates to a list containing the names of the different fields for a record. </p> </section> <section> <title>The Program</title> <p>The following shell interaction starts Mnesia and initializes the schema for our <c>company</c> database: </p> <pre> % <input>erl -mnesia dir '"/ldisc/scratch/Mnesia.Company"'</input> Erlang (BEAM) emulator version 4.9 Eshell V4.9 (abort with ^G) 1> mnesia:create_schema([node()]). ok 2> mnesia:start(). ok </pre> <p>The following program module creates and populates previously defined tables: </p> <codeinclude file="company.erl" tag="%0" type="erl"></codeinclude> </section> <section> <title>The Program Explained</title> <p>The following commands and functions were used to initiate the Company database: </p> <list type="bulleted"> <item><c>% erl -mnesia dir '"/ldisc/scratch/Mnesia.Company"'.</c> This is a UNIX command line entry which starts the Erlang system. The flag <c>-mnesia dir Dir</c> specifies the location of the database directory. The system responds and waits for further input with the prompt <em>1></em>. </item> <item><c>mnesia:create_schema([node()]).</c> This function has the format <c>mnesia:create_schema(DiscNodeList)</c> and initiates a new schema. In this example, we have created a non-distributed system using only one node. Schemas are fully explained in Chapter 3:<seealso marker="Mnesia_chap3#def_schema">Defining a Schema</seealso>. </item> <item><c>mnesia:start().</c> This function starts Mnesia. This function is fully explained in Chapter 3: <seealso marker="Mnesia_chap3#start_mnesia">Starting Mnesia</seealso>. </item> </list> <p>Continuing the dialogue with the Erlang shell will produce the following:</p> <pre><![CDATA[ 3> company:init(). {atomic,ok} 4> mnesia:info(). ---> Processes holding locks <--- ---> Processes waiting for locks <--- ---> Pending (remote) transactions <--- ---> Active (local) transactions <--- ---> Uncertain transactions <--- ---> Active tables <--- in_proj : with 0 records occuping 269 words of mem at_dep : with 0 records occuping 269 words of mem manager : with 0 records occuping 269 words of mem project : with 0 records occuping 269 words of mem dept : with 0 records occuping 269 words of mem employee : with 0 records occuping 269 words of mem schema : with 7 records occuping 571 words of mem ===> System info in version "1.0", debug level = none <=== opt_disc. Directory "/ldisc/scratch/Mnesia.Company" is used. use fall-back at restart = false running db nodes = [nonode@nohost] stopped db nodes = [] remote = [] ram_copies = [at_dep,dept,employee,in_proj,manager,project] disc_copies = [schema] disc_only_copies = [] [{nonode@nohost,disc_copies}] = [schema] [{nonode@nohost,ram_copies}] = [employee,dept,project,manager,at_dep,in_proj] 6 transactions committed, 0 aborted, 0 restarted, 6 logged to disc 0 held locks, 0 in queue; 0 local transactions, 0 remote 0 transactions waits for other nodes: [] ok ]]></pre> <p>A set of tables is created: </p> <list type="bulleted"> <item><c>mnesia:create_table(Name,ArgList)</c>. This function is used to create the required database tables. The options available with <c>ArgList</c> are explained in Chapter 3: <seealso marker="Mnesia_chap3#create_tables">Creating New Tables</seealso>. </item> </list> <p>The <c>company:init/0</c> function creates our tables. Two tables are of type <c>bag</c>. This is the <c>manager</c> relation as well the <c>in_proj</c> relation. This shall be interpreted as: An employee can be manager over several departments, and an employee can participate in several projects. However, the <c>at_dep</c> relation is <c>set</c> because an employee can only work in one department. In this data model we have examples of relations that are one-to-one (<c>set</c>), as well as one-to-many (<c>bag</c>). </p> <p><c>mnesia:info()</c> now indicates that a database which has seven local tables, of which, six are our user defined tables and one is the schema. Six transactions have been committed, as six successful transactions were run when creating the tables. </p> <p>To write a function which inserts an employee record into the database, there must be an <c>at_dep</c> record and a set of <c>in_proj</c> records inserted. Examine the following code used to complete this action: </p> <codeinclude file="company.erl" tag="%1" type="erl"></codeinclude> <list type="bulleted"> <item> <p><c>insert_emp(Emp, DeptId, ProjNames) -></c>. The <c>insert_emp/3</c> arguments are:</p> <list type="ordered"> <item><c>Emp</c> is an employee record. </item> <item><c>DeptId</c> is the identity of the department where the employee is working. </item> <item><c>ProjNames</c> is a list of the names of the projects where the employee are working.</item> </list> </item> </list> <p>The <c>insert_emp(Emp, DeptId, ProjNames) -></c> function creates a <em>functional object</em>. Functional objects are identified by the term <c>Fun</c>. The Fun is passed as a single argument to the function <c>mnesia:transaction(Fun)</c>. This means that Fun is run as a transaction with the following properties: </p> <list type="bulleted"> <item>Fun either succeeds or fails completely. </item> <item>Code which manipulates the same data records can be run concurrently without the different processes interfering with each other. </item> </list> <p>The function can be used as:</p> <code type="none"> Emp = #employee{emp_no= 104732, name = klacke, salary = 7, sex = male, phone = 98108, room_no = {221, 015}}, insert_emp(Me, 'B/SFR', [Erlang, mnesia, otp]). </code> <note> <p>Functional Objects (Funs) are described in the Erlang Reference Manual, "Fun Expressions". </p> </note> </section> <section> <title>Initial Database Content</title> <p>After the insertion of the employee named <c>klacke</c> we have the following records in the database: </p> <marker id="table2_1"></marker> <table> <row> <cell align="left" valign="middle">emp_no</cell> <cell align="left" valign="middle">name</cell> <cell align="left" valign="middle">salary</cell> <cell align="left" valign="middle">sex</cell> <cell align="left" valign="middle">phone</cell> <cell align="left" valign="middle">room_no</cell> </row> <row> <cell align="left" valign="middle">104732</cell> <cell align="left" valign="middle">klacke</cell> <cell align="left" valign="middle">7</cell> <cell align="left" valign="middle">male</cell> <cell align="left" valign="middle">99586</cell> <cell align="left" valign="middle">{221, 015}</cell> </row> <tcaption> Employee</tcaption> </table> <p>An employee record has the following Erlang record/tuple representation: <c>{employee, 104732, klacke, 7, male, 98108, {221, 015}}</c></p> <marker id="table2_2"></marker> <table> <row> <cell align="left" valign="middle">emp</cell> <cell align="left" valign="middle">dept_name</cell> </row> <row> <cell align="left" valign="middle">klacke</cell> <cell align="left" valign="middle">B/SFR</cell> </row> <tcaption> At_dep</tcaption> </table> <p>At_dep has the following Erlang tuple representation: <c>{at_dep, klacke, 'B/SFR'}</c>. </p> <marker id="table3_3"></marker> <table> <row> <cell align="left" valign="middle">emp</cell> <cell align="left" valign="middle">proj_name</cell> </row> <row> <cell align="left" valign="middle">klacke</cell> <cell align="left" valign="middle">Erlang</cell> </row> <row> <cell align="left" valign="middle">klacke</cell> <cell align="left" valign="middle">otp</cell> </row> <row> <cell align="left" valign="middle">klacke</cell> <cell align="left" valign="middle">mnesia</cell> </row> <tcaption> In_proj</tcaption> </table> <p>In_proj has the following Erlang tuple representation: <c>{in_proj, klacke, 'Erlang', klacke, 'otp', klacke, 'mnesia'}</c></p> <p>There is no difference between rows in a table and Mnesia records. Both concepts are the same and will be used interchangeably throughout this book. </p> <p>A Mnesia table is populated by Mnesia records. For example, the tuple <c>{boss, klacke, bjarne}</c> is a record. The second element in this tuple is the key. In order to uniquely identify a table row both the key and the table name is needed. The term <em>object identifier</em>, (oid) is sometimes used for the arity two tuple {Tab, Key}. The oid for the <c>{boss, klacke, bjarne}</c> record is the arity two tuple <c>{boss, klacke}</c>. The first element of the tuple is the type of the record and the second element is the key. An oid can lead to zero, one, or more records depending on whether the table type is <c>set</c> or <c>bag</c>. </p> <p>We were also able to insert the <c>{boss, klacke, bjarne}</c> record which contains an implicit reference to another employee which does not yet exist in the database. Mnesia does not enforce this. </p> </section> <section> <title>Adding Records and Relationships to the Database</title> <p>After adding additional record to the Company database, we may end up with the following records: </p> <p><em>Employees</em></p> <code type="none"> {employee, 104465, "Johnson Torbjorn", 1, male, 99184, {242,038}}. {employee, 107912, "Carlsson Tuula", 2, female,94556, {242,056}}. {employee, 114872, "Dacker Bjarne", 3, male, 99415, {221,035}}. {employee, 104531, "Nilsson Hans", 3, male, 99495, {222,026}}. {employee, 104659, "Tornkvist Torbjorn", 2, male, 99514, {222,022}}. {employee, 104732, "Wikstrom Claes", 2, male, 99586, {221,015}}. {employee, 117716, "Fedoriw Anna", 1, female,99143, {221,031}}. {employee, 115018, "Mattsson Hakan", 3, male, 99251, {203,348}}. </code> <p><em>Dept</em></p> <code type="none"> {dept, 'B/SF', "Open Telecom Platform"}. {dept, 'B/SFP', "OTP - Product Development"}. {dept, 'B/SFR', "Computer Science Laboratory"}. </code> <p><em>Projects</em></p> <code type="none"> %% projects {project, erlang, 1}. {project, otp, 2}. {project, beam, 3}. {project, mnesia, 5}. {project, wolf, 6}. {project, documentation, 7}. {project, www, 8}. </code> <p>The above three tables, titled <c>employees</c>, <c>dept</c>, and <c>projects</c>, are the tables which are made up of real records. The following database content is stored in the tables which is built on relationships. These tables are titled <c>manager</c>, <c>at_dep</c>, and <c>in_proj</c>. </p> <p><em>Manager</em></p> <code type="none"> {manager, 104465, 'B/SF'}. {manager, 104465, 'B/SFP'}. {manager, 114872, 'B/SFR'}. </code> <p><em>At_dep</em></p> <code type="none"> {at_dep, 104465, 'B/SF'}. {at_dep, 107912, 'B/SF'}. {at_dep, 114872, 'B/SFR'}. {at_dep, 104531, 'B/SFR'}. {at_dep, 104659, 'B/SFR'}. {at_dep, 104732, 'B/SFR'}. {at_dep, 117716, 'B/SFP'}. {at_dep, 115018, 'B/SFP'}. </code> <p><em>In_proj</em></p> <code type="none"> {in_proj, 104465, otp}. {in_proj, 107912, otp}. {in_proj, 114872, otp}. {in_proj, 104531, otp}. {in_proj, 104531, mnesia}. {in_proj, 104545, wolf}. {in_proj, 104659, otp}. {in_proj, 104659, wolf}. {in_proj, 104732, otp}. {in_proj, 104732, mnesia}. {in_proj, 104732, erlang}. {in_proj, 117716, otp}. {in_proj, 117716, documentation}. {in_proj, 115018, otp}. {in_proj, 115018, mnesia}. </code> <p>The room number is an attribute of the employee record. This is a structured attribute which consists of a tuple. The first element of the tuple identifies a corridor, and the second element identifies the actual room in the corridor. We could have chosen to represent this as a record <c>-record(room, {corr, no}).</c> instead of an anonymous tuple representation. </p> <p>The Company database is now initialized and contains data. </p> </section> <section> <title>Writing Queries</title> <p>Retrieving data from DBMS should usually be done with <c>mnesia:read/3</c> or <c>mnesia:read/1</c> functions. The following function raises the salary:</p> <codeinclude file="company.erl" tag="%5" type="erl"></codeinclude> <p>Since we want to update the record using <c>mnesia:write/1</c> after we have increased the salary we acquire a write lock (third argument to read) when we read the record from the table. </p> <p>It is not always the case that we can directly read the values from the table, we might need to search the table or several tables to get the data we want, this is done by writing database queries. Queries are always more expensive operations than direct lookups done with <c>mnesia:read</c> and should be avoided in performance critical code.</p> <p>There are two methods for writing database queries: </p> <list type="bulleted"> <item>Mnesia functions </item> <item>QLC</item> </list> <section> <title>Mnesia functions </title> <p></p> <p>The following function extracts the names of the female employees stored in the database: </p> <pre> mnesia:select(employee, [{#employee{sex = female, name = '$1', _ = '_'},[], ['$1']}]). </pre> <p>Select must always run within an activity such as a transaction. To be able to call from the shell we might construct a function as: </p> <codeinclude file="company.erl" tag="%20" type="erl"></codeinclude> <p>The select expression matches all entries in table employee with the field sex set to female. </p> <p>This function can be called from the shell as follows: </p> <pre> (klacke@gin)1> <input>company:all_females().</input> {atomic, ["Carlsson Tuula", "Fedoriw Anna"]} </pre> <p>See also the <seealso marker="Mnesia_chap4#matching">Pattern Matching </seealso> chapter for a description of select and its syntax. </p> </section> <section> <title>Using QLC </title> <p>This section contains simple introductory examples only. Refer to <em>QLC reference manual</em> for a full description of the QLC query language. Using QLC might be more expensive than using Mnesia functions directly but offers a nice syntax. </p> <p>The following function extracts a list of female employees from the database: </p> <pre> Q = qlc:q([E#employee.name || E <![CDATA[<-]]> mnesia:table(employee), E#employee.sex == female]), qlc:e(Q), </pre> <p>Accessing mnesia tables from a QLC list comprehension must always be done within a transaction. Consider the following function: </p> <codeinclude file="company.erl" tag="%2" type="erl"></codeinclude> <p>This function can be called from the shell as follows: </p> <pre> (klacke@gin)1> <input>company:females().</input> {atomic, ["Carlsson Tuula", "Fedoriw Anna"]} </pre> <p>In traditional relational database terminology, the above operation would be called a selection, followed by a projection. </p> <p>The list comprehension expression shown above contains a number of syntactical elements. </p> <list type="bulleted"> <item>the first <c>[</c> bracket should be read as "build the list" </item> <item>the <c>||</c> "such that" and the arrow <c><![CDATA[<-]]></c> should be read as "taken from" </item> </list> <p>Hence, the above list comprehension demonstrates the formation of the list <c>E#employee.name</c> such that <c>E</c> is taken from the table of employees and the <c>sex</c> attribute of each records is equal with the atom <c>female</c>. </p> <p>The whole list comprehension must be given to the <c>qlc:q/1</c> function. </p> <p>It is possible to combine list comprehensions with low level Mnesia functions in the same transaction. If we want to raise the salary of all female employees we execute: </p> <codeinclude file="company.erl" tag="%4" type="erl"></codeinclude> <p>The function <c>raise_females/1</c> returns the tuple <c>{atomic, Number}</c>, where <c>Number</c> is the number of female employees who received a salary increase. Should an error occur, the value <c>{aborted, Reason}</c> is returned. In the case of an error, Mnesia guarantees that the salary is not raised for any employees at all. </p> <pre> 33><input>company:raise_females(33).</input> {atomic,2} </pre> </section> </section> </section> </chapter>