This HTML document is automatically generated from the Help file that accompanies Bracmat, using Bracmat to do the conversion.

BRACMAT

starting with Bracmat
pattern matching
grammar
binary operators
prefixes/unary operators
strings or atoms
symbols
the four evaluation contexts
programming in Bracmat
functions
data structures
objects
hash tables
how Bracmat evolved
why the name 'Bracmat'?
how to obtain Bracmat
more example code

If you haven't seen Bracmat before ..

Bracmat is an interpreted programming language that will evaluate instructions that you type at the keyboard. For example, Bracmat can

handle big, irregular data structures,
let you delve into a data structure by means of pattern matching,
simplify an algebraic formula,
read a file with source text and execute this code in one go, and
save neatly formatted source text in a file.

In Bracmat, data and instructions have the same syntax: a mix of parentheses, binary operators, operands and prefixes preceding those operands or parentheses. In fact, in many cases you can't tell the difference between the data and instructions. Bracmat has a REPL (Read Eval Print Loop), like Lisp and many other programming languages. It offers a simple environment for input of both data and code. After the prompt {?} you can write. When you hit Enter Bracmat evaluates your input and writes the result to the screen following a {!} sequence (unless there was no visible result). Under the result follows a line that tells whether the evaluation was successful (S) or not (F) (in rare cases you may see an "I", which, for the time being, you may interpret as failure). In the same line the machine shows how much processor time it needed. Intermediate results may also appear on the screen.

Bracmat and calculators have an important common feature: as long as the input data are simple enough, the user need not (and cannot) specify how these data have to be processed. Calculators handle basic calculations in a predictable, unchangeable way, e.g. "4+7" will always result in "11", and not in, say, "10" or "23-12". That is because the manufacturer had good reasons to think that "11" is what the user expects and nothing else. In the same way, Bracmat handles basic "calculations" with a much wider variety of data: rational numbers, symbols, words and collections thereof. For example, "a+b+a" becomes "2*a+b" and not, say, "x.a" or "b+2*a". Again, Bracmat takes decisions that you can't easily circumvent. However, the more complex the data are, the greater the chance that not all results, although defensible, have an appearance that suits you. It is here that programming comes in: Bracmat leaves certain kinds of data unchanged, but opens the possibility to dissect data, to perform calculations on the parts, and to assemble an answer from the resulting sub-answers.

First and foremost Bracmat was developed with symbolic algebra in mind. You may add, multiply, take powers and logarithms and differentiate. Oddly enough, Bracmat has no operators for subtraction and division. This reduces the amount of arbitrariness in the presentation of formulae. "a-b" must be written as "a+-1*b" and "a/b" must be written as "a*b^-1". Calculations are always exact. Number expressions for which no rational representation exists are not further evaluated. Bracmat knows how to handle the special symbols i, e and pi, but it offers no numerical representation for e and pi. Examples:

{?} a+b+-1*a                 { this is how you subtract in Bracmat }
{?} 1+(1+i)*(1+-1*i)+-1      { the leading and trailing terms force
                               Bracmat to expand the product }
{?} -12345/54321 ^ 1/2       { the square root of -12345/54321 }
{?} x^(a+x\L2*1000)
{?} 5/2 \L 987654
{?} y\D(x\D((x+y)^-2))

Bracmat can only do arithmetic operations with integer and rational numbers. There is virtually no limit to their size, but the program is not optimised for number crunching.

Bracmat handles non-integer powers of positive rational numbers provided that the number (if it is an integer) or the numerator and the denominator (if the number is a fraction) are less than 2^32 or 2^64 respectively, depending on whether Bracmat is compiled for a 32 or a 64 bit platform.

Bracmat happily adds and multiplies numbers of, say, a few hundred digits. There is a pre-defined function, flt$, that represents rational numbers in a "scientific" floating point notation, but Bracmat cannot do calculations with these "reals", unless you write a function to convert them back to rational numbers.

The binary operators have an ordering of precedence. Of the mathematical operators, + has the lowest precedence and \D the highest (+ * ^ \L \D). You may use parentheses to overrule this ordering:

{?} (a+b)*(a+c)+a^(-1*d^2+(d+1)*(d+-1))

When instructions are entered from the keyboard, the program waits until all of the conditions below are fulfilled:

<return> (or <enter>) is the last key that has been pressed
every opening parenthesis has a closing counterpart
every opening brace (start of comment) has a closing counterpart
every string has either two enclosing double quotes, or none.

You can write a multiple-line instruction by putting the instruction inside an extra pair of parentheses. After each non-terminating <return>, Bracmat shows in the next line how many closing parentheses are needed for the completion of the instruction. If you are in the middle of a string or a comment when pressing <return>, the next line starts with {str} or {com}, respectively.

If you want to enter several instructions on the same line, you should write a semicolon ";" between the instructions. These instructions are executed in the same order. If you do not want to see the result of a calculation, you may write a semicolon after the last instruction. Instructions in a text file must be separated by semicolons.

You may freely surround binary operators with white-space characters (e.g. space, tab, line feed). Take care not to put spaces between the characters that make up a fraction or negative number: - 1234 / 5678 is not the same as -1234/5678.

Bracmat can open and read a text file and execute the instructions as they are read. After the last instruction has been executed, the file is closed. The program that currently produces this information is read from the file named "help". The instruction for reading a file "myprog" and executing the instructions therein is "get$myprog". The "$" is a binary operator that has to do with function evaluation. "get" is the function name of one of the few built-in functions.

The result of the last executed instruction in a file is written to the screen. For better control over screen output one may use the built-in function "put", which writes from the current cursor position to the right, or the pre-defined (but changeable) function "out", which writes an extra line feed after its argument has been written to the screen.

{?} put$(x*x);put$(y+y)

Often you will need the result of the last evaluated instruction in the next one. You can use the exclamation mark "!" instead of re-entering the result. (Because Bracmat is still evaluating the instruction "get$help", this trick is of no use in the context of this program.) Example (try later):

{?} 1+1
{?} !^!^!^!

When you write programs in the Bracmat language you will normally use an external text editor. You can enter small programs directly at the Bracmat prompt {?}, but a small change in an instruction can only be done by re-entering the whole instruction. You may save instructions that are still held in memory (which is only possible if they are bound to variables) with the built-in "lst" function. This function takes a number of optional parameters that tell the system whether the instruction has to be written in a compact and barely readable form, or in a prettier form, with lots of indentation. Comments are never written, as they are ignored at input time. As an example, you can write the definition of the pre-defined factorisation function "fct" to a file "factorise" by entering the following instruction:

{?} lst$(fct,factorise,NEW) {NEW:replace old file with same name}

At first sight, a Bracmat program doesn't look like programs written in other languages. This may even become a permanent impression. So here are some recommendations about programming habits:

Experiment with the examples from this information program.
Use a modular and incremental style of programming: test every function before going on to the next one.
Keep functions small, to begin with. Error messages point to functions, not to line numbers.
Use meaningful names. There is no limit to their length and you may use the minus-sign as separator in multiple-word-names.
Write comments if variable names do not tell enough.
Feel free to write parentheses if that clarifies the program structure.
Write parentheses in a BEGIN-END-style, the closing parenthesis below the opening parenthesis.
Check whether Bracmat interprets your code in the way you intended by listing your functions without the LIN option.
Try to avoid the "not" prefix ~ and the use of arrays.

Pattern matching

The single most outstanding feature of Bracmat is how it can recognise patterns in data. The data can be an algebraic expression, a directory listing in table form, a thesaurus structured like a tree, a text or any other data that can be expressed as a string of characters or as a tree of such strings. Bracmat's patterns are much more advanced than regular expressions. Regular expressions are fixed patterns once the matching operation is started. Bracmat implements not only propositional rules comparable to regular expressions, but also first order predicate rules, backtracking to search the space of combinations of data that make its predicates come true. In this way it is very easy to implement a relational database. But it doesn't stop here, because Bracmat supports recursive invocations of pattern matching and other operations. Examples:

{?} S=(|0 !S|1 !T);T=(0 !T|1 !S); { Regular grammar }
{?} 0 1 0 1 0:!S { Check whether subject contains an even number of 1's. }
{?} P=(|0 ?x 1 & !x:!P); { context free grammar }
{?} 0 0 0 1 1 1:!P { Check whether subject consists of a row of 0's
                     followed by a row of 1's of the same length. }
{?} connected=("South America"."North America") (Africa.Asia) (Asia.Europe);
{?} (reachable = a b f
    .     !arg:(?a.?b.?f)
        & !f:? ((!a.!b)|(!b.!a)) ?
      |   !f:?A ((!a.?c)|(?c.!a)) ?Z
        & reachable$(!c.!b.!A !Z)
    ); {Remove used fact from fact base.}
{?} (   Antarctic Europe Australia Africa Asia "North America" "South America"
    :   ?
        %@?x  {Pick a continent.}
        ?
        ( %@?y {Pick another continent}
        & reachable$(!x.!y.!connected) {Are they reachable?}
        & out$(!x "is reachable from" !y)
        & ~  {Force backtracking to collect all answers.}
        )
        ?
    );    { pattern using second order logic }

Europe is reachable from Africa
Europe is reachable from Asia
Africa is reachable from Asia
North America is reachable from South America

Pattern matching in character strings
Pattern matching in tree structures

Pattern matching in strings

@(<string> : <pattern>)   "match <string> with <pattern>"

Pattern matching in a string of characters (a single atom) is like pattern matching in a string of atoms. Use the '@' to instruct the program to look inside the atom and use space operators to combine subpatterns. The space operator does not itself match any characters. To match a space in an atom, use a space in an atom!
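
As a small sketch of this (the subject strings and the variable names first, rest, x and y are chosen here purely for illustration), the following string matches split an atom at a space and at a given character:

{?} @("hello world":?first " " ?rest) { " " is an atom that contains a space }
{?} !first                            { hello }
{?} !rest                             { world }
{?} @(abcdef:?x d ?y)                 { x becomes abc, y becomes ef }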

You cannot negate the result of string pattern matching by adding the ~ prefix, unless there are other prefixes than @ and ~ present.

{?} a b:(~@(? b:a %)) {succeeds, ~@ means:'not an atom'}
{?} ~@(a b:a ?)       {succeeds, ~@ means: 'not a string match'}
{?} ~@(a:a)           {succeeds, same reason}
{?} 12/34:@(?x:#?a (~#%@:?y) #?b) {succeeds, ?x matches the atom 12/34,
                       while #?a (~#%@:?y) #?b matches 12/34 as a string.}
{?} 12:~/@(?x:#%?a #%?b) {succeeds, ~ negates /, not @, so we have a string
                       match.}

Matching a number in a string

In a string match, the % can be used to force characterwise matching if the subject is a number and the pattern otherwise would have been treated as a number. You have to take care with minuses: the patterns %"-20/5" and %-20/5 are different. In %"-20/5", the % is superfluous and the pattern matches characterwise. In %-20/5, the pattern matches 20/5 and the minus is ignored!

{?} @(abcd40/10efgh:?a    20/5  ?z)    {succeeds, because 40/10 = 20/5 = 4}
{?} @(abcd52/13efgh:?a    20/5  ?z)    {succeeds, because 52/13 = 20/5 = 4}
{?} @(abcd40/10efgh:?a   %20/5  ?z)    {fails}
{?} @(abcd-20/5efgh:?a %"-20/5" ?z)    {succeeds}
{?} @(abcd-20/5efgh:?a  %-20/5  ?z)    {succeeds, a = abcd-}
{?} @(abcd-20/5efgh:?a   -20/5  ?z)    {succeeds, a = abcd }

Binary operators in pattern matching

<subject> : <pattern>   "match <subject> with <pattern>"

A match succeeds if <subject> succeeds and <pattern> is successfully matched with <subject>. The returned value is the left operand, <subject>.

Patterns may be built up from sub-patterns and may also include actions that are triggered if a sub-pattern successfully matches (part of) the subject.

As with the evaluation of other binary operators, the left operand of the ":" operator is evaluated first. The other operand, <pattern>, is not evaluated, but in the process of pattern matching (parts of) <pattern> may be evaluated several times. This is the case with function calls, atoms with a "!" prefix and all right hand sides of the "&" operator. The use of the involved binary operators ($ ' &) and prefixes (! !!) as "meta operators" does not restrict the range of matchable expressions in a serious way, as these operators and prefixes normally do not occur in evaluated subject expressions. The same is true for some other operators (: | _ and =). These operators, too, have a special meaning within patterns. All other binary operators occurring in a pattern are searched for in the subject expression as part of the pattern matching.

The & | and : operators are especially helpful in formulating complex patterns with alternatives, conjunctions and side effects in the form of actions. In the following examples, !s stands for the subject expression, the expressions in parentheses are patterns and !p, !pa, !pb, etc. are sub-patterns therein. !a, !aa, etc. stand for actions (parts of the pattern that are conditionally evaluated).


    !s:(!p&!a)

If !p matches successfully with !s, then !a is evaluated. If !a fails, the whole match fails. In more complex patterns, only part of the match might fail, resulting in backtracking and retry.


    !s:(!pa|!pb)

If pattern !pa does not match with subject !s, then !pb is tried.


    !s:(!pa:!pb)

If pattern !pa matches with !s, then pattern !pb is also tried.

The next example combines these operators in a grammar-like expression:


    !s:( !pa         & !aa       If either !pa, or !pb or both of !pc1 and !pc2
       | !pb         & !ab       "fire", actions !aa, !ab and !ac, respectively
       | (!pc1:!pc2) & !ac       are triggered.
       )
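
A concrete sketch of the three operators inside a pattern, with made-up subjects and the variable x chosen for illustration:

{?} 5:(>3:<10)                 { both sub-patterns match: succeeds }
{?} 12:(>3:<10)                { the second sub-pattern fails: the match fails }
{?} b:(a|b)                    { a does not match, so b is tried: succeeds }
{?} abc:(?x&out$(matched !x))  { ?x matches, then the action writes: matched abc }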

Take care with the grouping of the : & and | operators:

(!s:!pa):!pb and !s:(!pa:!pb) happen to have the same effect, but the following expressions are very different:


    (!s:!p)&!e
 or  !s:!p&!e

If !s matches with !p, !e is returned.


    !s:(!p&!a)

If !s matches with !p, !a is evaluated, but the expression as a whole returns !s.


    (!s:!p)|!e
 or !s:!p|!e

If !s matches with !p, !s is returned. Otherwise, !e is returned.


    !s:(!pa|!pb)

If !s matches with either !pa or !pb (in that order), !s is returned.

The possibility that !s might fail further complicates the above examples.
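
The difference in grouping can be observed with atoms instead of the placeholders !s, !p and !e (a sketch; the atoms a, b and c and the variable x are arbitrary):

{?} a:?x&b       { grouped as (a:?x)&b: the match succeeds and the result is b }
{?} a:(?x&b)     { b is merely an action inside the pattern: the result is a }
{?} a:c|b        { grouped as (a:c)|b: the match fails, so the result is b }
{?} a:(c|b)      { neither c nor b matches a: the whole expression fails }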

Escaping operator in patterns

Some operators cannot be part of a pattern unless 'escaped', because these operators play an active role in pattern matching instead of being a passive part of a pattern. These operators are = | & : ' $ _

The normal role of these operators is ignored by the pattern matching evaluator if they are 'escaped' with a $ node with an empty lhs.

{?} (=foo'bar):(=$(foo'bar))
{?} (=foo'bar):(=$(?f'?x)) & !f !x
{!} foo bar

The escape operator only affects the top node of the escape operator's rhs. The lhs and rhs of the affected node are matched against the subject in the normal way. The escape operator functions with all Bracmat operators, but the operators . , whitespace, + * ^ \L and \D should not be escaped normally.

The grammar of Bracmat

Note: in long lists the "|" is left out.

<input>           ::= [<expression>] [;<input>]
<expression>      ::=   <white space> <expression> <white space>
                      | [<prefixes>] ( <expression> )
                      | <leaf>
                      | <expression> <binop> <expression>
<leaf>            ::= [<prefixes>] <atom-or-nil>
<atom-or-nil>     ::= <atom> | <nil>
<atom>            ::= "<string>" | <string>
<string>          ::= <character> [<string>]
<character>       ::= any printable character except \ and " | <spec>
<spec>            ::= \a \b \t \n \v \f \r \" \\
<nil>             ::= ""   (or nothing at all, such as in "()")
<binop>           ::= = . , | & : <white space> + * ^ \L \D ' $ _
<prefixes>        ::= <prefix> [<prefixes>]
<prefix>          ::= [ ~ / # < > % @ ` ? ! !!
<white space>     ::= spaces, tabs, new line and form feed characters

White space (operator/cosmetic measure) almost never leads to confusion. It does in (some) cases where a <nil> leaf without prefixes is adjacent to the white space operator. For example: "get' out$now". Bracmat interprets this as "get'(out$now)". Writing "" or () fixes the problem: "get'() out$now". Quotation marks are not part of the string they surround. They should be used if necessary, e.g. "in this case" or "he{this is not a comment}re". Comments can be written everywhere, except in the middle of a string in quotation marks. Comments are enclosed in {} and may be nested.

Binary operators

choose a specific binary operator
algebraic operations
program flow
pattern matching
data structures
assignment
functions and macros
the dummy operator

=.,|&: +*^LD'$_

These are, in order of growing priority, the 15 binary Bracmat operators.
= operator (assignment)
. operator (lists)
, operator (lists with "unwrinkling")
| operator (OR ELSE)
& operator (AND THEN)
: operator (match subject with pattern)
[blank] operator (word lists)
+ operator (addition)
* operator (multiplication)
^ operator (exponentiation)
\L operator (logarithm)
\D operator (differentiation)
' operator (function, macro)
$ operator (function, macro)
_ operator (dummy)

The = operator

This operator ensures that the right hand operand stays unevaluated. It is mainly used in the definition of pieces of code (e.g. functions). The code on the right is bound to the name on the left.
<atom>=<expression>
Each time when the value of <atom> is asked for, a fresh copy of <expression> is made available. <expression> itself is unchangeable and can only be wiped out by removing the binding between <expression> and its name, <atom>. This has, in turn, no influence on the copies made earlier.

{?} a=2    { create binding }
{?} !a:?b  { bind copy to b }
{?} !b     { show b's value }
{?} a=3    { remove a's binding to 2}
{?} !b     { show b's value }

There is a second way of using the = operator, with a slightly different syntax :
<nil>=<expression>
The = operator serves as a shock proof container for <expression>. The effect of evaluating this type of expression is almost the same as that of the macro instruction ()'<expression>. Indeed, after evaluating a macro instruction we have an expression with the <nil>=<expression> syntax.

{?} out$(b+a)
{?} out$(=b+a)
{?} out$('(b+a))
{?} c=3
{?} out$(=b+a+$c)
{?} out$('(b+a+$c))

Objects

With the = and . operators you can construct and dereference conventional data structures and even objects with methods. In an expression, each subexpression with a = operator in the top node and an atom in the lhs of the top node indicates a field or object method that can be accessed and changed independently of other fields and methods, i.e. without the need to dissect and reassemble the whole expression. Such expressions are objects. An object member (a field or method) is addressed by using the lhs of the = operator as the member's name, preceded by the object's name. The name of the object and the name of the member must be separated by a dot operator.

In the example below an object named "John" is created with the members "length", "age" and "name". The "name" member has two sub-members "first" and "family":

{?} John = (length = 180),(age = 30),(name = (first=John) (family=Bull))

There is no prescribed way in which the members should be glued together to form an object. Here, the comma operator and blank operator are used, but any operator but the = operator can be used to separate field names. John's length can be changed to 185 in the following ways:

{?} John.length = 185

{?} 185 : ?(John.length)

The same object can be assigned to another variable, creating an alias, but we have to take care not to evaluate John, because that would create or overwrite the variables "length", "age" and "name":

{?} !John:?alias {Wrong, (alias=length,age,name);}

{?} '$John : (=?alias) {Right,  (alias=
                                  (length=185)
                                , (age=30)
                                , (name=(first=John) (family=Bull)));}

Bracmat replaces the expression '$John by the value of John, protected against evaluation by a = operator. For that reason, the pattern on the rhs of the match operator : contains a = operator as well. Now we can change John's age by operating on the variable "alias":

{?} alias.age = 31

To see that the above expression indeed has the wanted (side-)effect, we can inspect John:

{?} lst$John
 (John=
   (length=185)
 , (age=31)
 , (name=(first=John) (family=Bull)));

Alternatively, we can also just show the field "age" in John:

{?} !(John.age)
{!} 31

It is also possible to create an alias for a sub-object. Taking the previous example, we could create an alias for the name member:

{?} '$(John.name):(=?nm)

Now assign a new family name:

{?} Flinter:?(nm.family)
{?} lst$John
 (John=
   (length=185)
 , (age=31)
 , (name=(first=John) (family=Flinter)));

Using an alias for a sub-object can save some code and processing time if the sub-object is accessed many times. Without the alias for John's name, we can change his family name in this way:

{?} Flinter:?(John.name.family)

It is valid to have an empty name for a member:

{?} x=(header=blabla) (=(a=1) (b=2))

Here, a and b are fields in a "nameless" sub-object of x. We can ask for the value of b:

{?} !(x..b)
{!} 2

To retrieve the whole sub-object:

{?} '$(x.):(=?sub-object)
{?} lst$sub-object
 (sub-object=
 (a=1) (b=2));

An alias can also be created for part of an object:

{?} x=(a=) (b=) (c=) (d=)
{?} '$x:(=(a=) ?alias (d=))

Now 'alias' only shares the members x.b and x.c with x. The same result follows from

{?} '$x:(=? ((b=) (c=):?alias) ?)

Objects can be composed to form new objects containing the union of the members of the contributing objects:

{?} x=(a=) (b=)
{?} '((p=) ($x) (q=)):(=?r)

Evaluation of an expression that contains '=' operators can have unexpected side effects, as the following example shows.

First suppose that x (containing one record with one anonymous field) is unevaluated (case A) and assigned to two other variables:

{?} x=(=)
{?} !x:?y
{?} !x:?z

In this case, x, y and z are different objects. For example, the assignment

{?} 2:?(y.)

does not affect x and z. Do the assignment again, but this time evaluating x only once:

{?} !x:?y:?z

Now y and z are the same object, but still different from x. A change made to y affects z but does not affect x.

Suppose that x IS evaluated (case B):

{?} (=):?x
{?} !x:?y
{?} !x:?z

Now x, y and z are the same object.

Explanation: in (A) the value of x is not evaluated, especially the lhs of the '=' operator. Therefore, a new '=' node is created each time x is evaluated. In (B), the value of x IS evaluated, so no new copies of the '=' node are made.

Assignment to variables

There are two forms of assignment to a variable:

<variable> = <expression>: <expression> is not evaluated before assignment to <variable>.
<expression> : ?<variable>: <expression> is evaluated before assignment takes place.

The = operator is used to bind (still) unevaluated expressions such as patterns and functions to variables.

Assignment with the ":" makes use of pattern matching with a universally unifying pattern. This way of assignment is very powerful and can even be used to assign unevaluated expressions, by preceding the subject with an = or an ' operator. Example: define Lisp's car-function, first using = and then using : to bind the function definition to the variable "car".

{?} car=.!arg:(?%arg ?)&!arg          { one may freely reuse arg ! }
{?} (=(.!arg:(?%arg ?)&!arg)):(=?car) { another way to define car }
{?} car$(one two three)
{?} (four five six):(?`%first ?rem) {"`": 0 or 1, "%": 1 or more, together 1}
{?} The first element is !first and the remainder is !rem.
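
The difference between the two forms of assignment can be made visible with lst (a sketch; x and y are arbitrary variable names):

{?} x=1+2    { x is bound to the unevaluated expression 1+2 }
{?} 1+2:?y   { 1+2 is evaluated first, so y is bound to 3 }
{?} lst$x    { the listing shows that x still holds 1+2 }
{?} lst$y    { the listing shows that y holds 3 }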

Construction of data structures

In Bracmat, linear lists can be made by separating the elements with comma, plus sign, asterisk, space and dot. The first four operators create linear structures (right descending lists), moving nodes as necessary, whereas the dot operator creates any tree structure. In addition, the plus sign and the asterisk (times) do not preserve the order of the elements if they are not in canonical order. Which operator one should use in a given situation depends on the following considerations:

Space, comma, plus and asterisk offer automatic concatenation of lists, but are slower than the dot.
Space, plus and asterisk are useful if there is a need to search in a list, because these operators support backtracking, but they are slower than the comma.
The comma can be used in linear lists that are accessed recursively and in record-like structures, where the elements must have absolute positions. The dot can be used for the same purpose.
If a function has a fixed number of parameters, you should use dots to separate them.
If there is a need to sort elements alphabetically, use the plus operator.
Lists constructed with plus or asterisk behave like sets. Adding an element that already is present does not make the list any longer, but will instead increase a factor (plus) or exponent (asterisk).

Examples :

{?} x=a.b.c
{?} y=p.q
{?} !x.!y
{!} (a.b.c).p.q
{?} x=a b c
{?} y=p q
{?} !x !y
{!} a b c p q
{?} set=jan+piet+klaas
{?} !set
{!} jan+klaas+piet
{?} !set+klaas
{!} jan+2*klaas+piet

{?} rotate=car,cdr.!arg:(?car,?cdr) & (!cdr,!car)

{?} rotate$(one,two,three,four)
{?} rotate$((one,two),(three,four))
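
The asterisk behaves analogously to the plus sign in the set example above, except that a repeated element gets an exponent instead of a factor (a sketch; the variable name bag is made up):

{?} bag=jan*piet*klaas
{?} !bag*klaas         { klaas now occurs as klaas^2 }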

By combining dots, commas and spaces, one may build any tree-like data structure that, thanks to the backtracking mechanism on space-separated lists, makes the formulation of queries (goals) almost as easy as in Prolog. This is an example of a simple database, in which each row starts with a descriptor field, followed by a varying number of similar fields.

{?} M=( (odd  ,1 3 5 7 9)
        (even ,0 2 4 6 8)
        (prime,2 3 5 7)
      )

We choose the space operator to form the backbone of the lists of numbers, because we want to access these numbers associatively, by using the backtracking mechanism.

Let us formulate a query that searches for all numbers that occur in two or more categories (odd, even, prime). The findings are to be printed to the screen.

{?} ( !M                                     
    :   ?                                   { skip 0 or more rows ---               }
        (?c1,?row)                          { ---fetch (number type, number row)--- }
        ?                                   { ---skip 0 or more rows ---            }
        ( ?c2                               { ---fetch another number type,...      }
        ,   ?                               { ...skip 0 or more numbers...          }
            ( %?`el                         { ...fetch a number...                  }
            & !row:? !el ?                  { does number occur in earlier row ?    }
            & out$(!el is both !c1 and !c2) { yes?  show result                     }
            & ~                             { not satisfied yet: fail and backtrack }
            )                               
            ?                               { ...skip rest of numbers---            }
        )                                   
        ?                                   { ---skip rest of rows                  }
    )

This prints


    3 is both odd and prime
    5 is both odd and prime
    7 is both odd and prime
    2 is both even and prime

and finally fails when backtracking (induced by the ~) has found all answers to the query.

Experimentation with the implementation of matrices in Bracmat has shown that lists (of lists (of lists..)) lead to smaller and faster programs than arrays, artificially made multidimensional by playing with the index. A drawback of the list approach is its unconventionality. Much time has to be spent reformulating existing algorithms based on indices. On the other hand, the list approach is essentially insensitive to the dimensionality of the matrix at hand, and may even be indifferent to the number of indices.

Binary operators in program flow

<exprA> & <exprB>  ("<exprA> and then <exprB>")

<exprB> is only evaluated if <exprA> succeeds.

<exprA> | <exprB>  ("<exprA> or else <exprB>")

<exprB> is only evaluated if <exprA> does not succeed.

In both cases <exprA> is always evaluated and <exprB> only conditionally. If <exprB> is to be evaluated, <exprA> and the & or | operator have served their purpose. Therefore, they are eliminated before <exprB> is evaluated. In this way, the program stack doesn't grow indefinitely when recursive calls are made from the right hand side of any & or | operator occurring in an expression. Even a conventional sequence of instructions (where the success or failure of each instruction does not matter) can make use of this tail recursion optimisation. In that case one uses the pacifier (shortcut prefix) ` .


(`!a & !b)     !b is always evaluated. (sequence)
(`!a | !b)     !b is not evaluated. (useless in this form)

The pacifier or shortcut prefix is inherited by higher levels: it percolates towards operators that are closer to the root of the tree, until it is subsumed in situations like the ones above.
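
A sketch of how & and | combine into a conditional; the numbers and the atoms that are written to the screen are arbitrary:

{?} 3:>2 & out$bigger | out$"not bigger"   { the match succeeds: writes bigger }
{?} 1:>2 & out$bigger | out$"not bigger"   { the match fails: writes not bigger }

Because & has a higher priority than |, both lines are grouped as (match & then-part) | else-part. If the then-part itself could fail, the else-part would be evaluated as well.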

Program flow

Most binary operators are used in expressions that "flow on their own" or "flow not at all". In the first group are the arithmetic operators, in the second is the dot-operator. In between are the two other structuring operators, comma and white space.

Branching to a function is done with the $ and ' operators :


a $ b (or a'b)       "evaluate function a with argument b"

Branching without argument passing and local variables is done with the unary operator (prefix) ! :


!X                   "do subroutine X"

but often this prefix and its cousin !! are used for the purpose of variable expansion; it just depends on whether a variable is bound to an unevaluated or to an evaluated expression :


!X                   "expand X"
!!Y                  "expand expansion of Y" (two !'s is the maximum)

Conditional evaluation is decided by the success or failure of subexpressions. Every (sub)expression has two kinds of value: a visible value and a success(S)/failure(F)/ignore(I) value. Success and failure are primarily decided by the low level functions in the interpreter. The ignore value is generated if a failing expression is back-quoted. The & and | operators are sensitive to the S/F/I value of the left operand (where I counts as S). Often this left operand is a matching expression.


!a & !b              "if !a succeeds do !b"
!a | !b              "if !a fails do !b"
!subject : !pattern  "try to prove that !pattern describes !subject"

The back quote ` can be used to overrule the failure of a subexpression. The tilde ~ negates failure and success.


`!p & !q             "do !p and then do !q"
!a:!p & `!b | !c     "if !a matches !p do !b else do !c"
~!a                  "succeed if !a fails and fail if !a succeeds"
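
A minimal sketch of conditional evaluation, using nothing beyond the operators described in this section: a match decides which of two branches is evaluated, here to obtain the absolute value of a number. The variable name x is arbitrary.

```bracmat
{?} -5:?x                      { bind -5 to x }
{?} !x:<0 & -1*!x | !x         { if x is negative, negate it, else return x unchanged }
{!} 5
```

The : operator binds more strongly than &, which in turn binds more strongly than |, so no parentheses are needed.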

⇑ =.,|&: +*^LD'$_.
⇑ Programming advice.
⇐ Binary operators in program flow.
⇒ Binary operators in pattern matching.
⇒ Some often used control structures.

Algebraic operations

<term> + <term>         addition
<factor> * <factor>     multiplication
<base> ^ <exponent>     exponentiation
<base> \L <expr>        logarithm
<variable> \D <expr>    differentiation

Subtraction and division are treated as special forms of addition and multiplication. Therefore there are no binary operators for subtraction and division. (The minus sign - and the slash / can be used in numbers, however.)

If one operand of an algebraic operator is evaluated then the other one is normally evaluated as well, even if this may seem unnecessary (multiplication by 0). This is done to ensure that all side effects take place as intended. However, if an operand fails to evaluate then the algebraic expression fails too and if the failing operand is the left hand side of the expression, then the right hand side is not evaluated. In this sense algebraic operators behave like the logical & operator.

Bracmat gives the user practically NO control over the format of evaluated algebraic expressions, such as the order of terms or factors. Bracmat tries to present algebraic objects in a unique (canonical) form. This is in many cases an unattainable goal : the forms (a+b)*(c+d) and a*c+a*d+b*c+b*d are both stable expressions. On the other hand, (a+b)*(c+d)+e becomes e+a*c+a*d+b*c+b*d. Bracmat keeps "expensive" completely factorised expressions, but does not automatically factorise factorisable expressions. Expressions with logarithms are another domain where such dual forms occur.

Sums and products start with rational numbers, followed by pi, i and e (if present, that is). Then follow other terms and factors. It is recommended not to assume anything about the ordering of these terms and factors, as this may change in later versions of the program.
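
The session below is a small sketch of the points made in this section: rational arithmetic is exact, like terms are collected, and subtraction and division are written with -1 as a factor or as an exponent. For the last two inputs no output is shown, since the exact appearance of the result should not be relied upon.

```bracmat
{?} 1/3+1/6              { exact rational arithmetic }
{!} 1/2
{?} 2^10
{!} 1024
{?} a+b+a                { like terms are collected }
{!} 2*a+b
{?} a+-1*b               { "a minus b": subtraction is addition of -1*b }
{?} a*b^-1               { "a divided by b": division is multiplication by b^-1 }
```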

⇑ =.,|&: +*^LD'$_.
⇑ Binary operators.
⇐ Differentiation.

Differentiation

<variable> \D <expr>
Bracmat knows how to differentiate expressions in which no other binary operators occur but + * ^ and \L. Example:

{?} y\Dx\D(a^(x^2+y^2))
{!} 4*a^(x^2+y^2)*x*y*e\La^2
{?} y\Dr
{!} 0

The last example gives zero, which in many applications isn't what we want. Often, with y we express the y-component of a vector with length r, and r consequently is a function of y (and the other components). We can solve this as follows:

{?} dep=(r.x) (r.y) (r.z) {'dep' is a special variable}
{?} y\Dr
{!} y\Dr

Now the expression is just left unevaluated. Later, you can substitute an expression for r in terms of its components:

{?} y\D(r^-1):?derivative
{!} -1*r^-2*y\Dr
{?} sub$(!derivative.r.(x^2+y^2+z^2)^1/2):?derivative
{!} -1*y*(x^2+y^2+z^2)^-3/2

And, if you like, you can simplify the result by putting r back in:

{?} sub$(!derivative.x^2+y^2+z^2.r^2):?derivative
{!} -1*r^-3*y

⇑ =.,|&: +*^LD'$_.
⇒ Algebraic operations.

function evaluation

The binary operators $ and ' are similar in most respects. In general, the left operand evaluates to the name of a built-in or defined function, whereas the right operand is an expression that is passed as an argument to the function. The $ evaluates the right operand before it is passed over, the ' doesn't. Parameter passing is by value, although the implementation postpones and limits copying of data as much as possible. In the code of the called function, the passed argument is bound to a local variable that is always called arg.

Most often, the left operand of the $ and the ' operator evaluates to an alphanumeric name. There are a few special function names:

No name at all. Here, the $ and the ' operator have decidedly different and complementary roles. (Forced evaluation of subexpressions in otherwise unevaluated expressions, such as patterns)
An integral number. (Array indexing.)
Only prefixes. (Prefix pasting or influencing success and failure of non-atomic expressions.)

Function calls are even effective in patterns, as it is fair to assume that the $ and ' operators seldom occur in subjects and so need not be matched (the same is, a fortiori, true for the & and | operators). In patterns, the return value of a function is part of the pattern. A function may be called several times during one evaluation of a matching expression, due to backtracking and retrying.
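
To make the difference between $ and ' concrete, here is a short sketch. The function name square is hypothetical. whl is the loop function used in the control structures elsewhere in this document; it has to receive its argument unevaluated, because it evaluates that argument itself, over and over, until it fails.

```bracmat
{?} (square=.!arg^2)     { no declared local variables; arg holds the argument }
{?} square$(1+2)         { $ evaluates 1+2 to 3 before the call }
{!} 9
{?} 0:?i
{?} whl'(!i:<3 & out$!i & 1+!i:?i)   { ' passes the loop body unevaluated; prints 0, 1 and 2 }
```

Had the loop been written with $, the body would have been evaluated once before whl ever saw it.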

⇑ =.,|&: +*^LD'$_.
⇑ Binary operators.
⇑ functions.
⇒ macro evaluation.
⇒ Escaping operator in patterns.
⇒ The "nameless" functions $<expression> and '<expression>.

macro evaluation

The ' operator with empty lhs is the macro evaluator. The macro evaluator returns the rhs unchanged, except where the $ operator (also with empty lhs) occurs. The expressions headed by such $ operators are replaced as follows:

'$$

If the rhs is the $ operator with empty lhs, the macro evaluator replaces the expression with the rhs of the heading $, and macro-evaluates the rhs of the result.

{?} '(one dollar ($($)))
{!} =one dollar ()$
{?} '(two dollars ($($($($)))))
{!} =two dollars ()$($)

'$_

If the rhs is headed by the _ operator, the expression is replaced by the rhs, where the dummy operator is evaluated to its current value.

{?} (=a^b):(=?_?) & '(dummy (a_b) power ($(a_b)))
{!} =dummy a_b power a^b

In all other cases the rhs is evaluated to a variable name, a member name or a definition. The expression is thereafter replaced by the value of the variable, the value of the member or the definition itself. More examples:

{?} '(b+a c)
{!} =b+a c
{?} (x=value) & '(a ($x) z)
{!} =a value z
{?} (object=(member=value)) & '(a ($(object.member)) z)
{!} =a value z
{?} '(a ($(=value)) z)
{!} =a value z

⇑ =.,|&: +*^LD'$_.
⇑ Binary operators.
⇑ functions.
⇐ function evaluation.
⇐ The "nameless" functions $<expression> and '<expression>.
⇒ Escaping operator in patterns.

The dummy operator "_"

Bracmat has only one "variable" that binds to a binary operator, the "_" operator. Worse still, this variable is global. Nevertheless this variable is most useful in definitions of certain types of recursive functions ("tree walkers").

The assignment of a new value to the "_" variable can only take place in a match. A "_" in a pattern is always "receiving", whereas a "_" outside a pattern is either "giving" or left unchanged. Try this :

{?} a_b             { This has unpredictable results. }
{?} x^y:?_? & a_b   { _ gets bound to ^. Thus a_b evaluates to a^b }

A "_" is evaluated by the expression evaluator, but also by the macro evaluator. The latter is useful if the "_" has matched an operator that is very volatile, such as "&" and "|".

{?} (=!a:!b&!c):(=?left_?right) {match the "&"}
{?} '_
{!} =&              { It worked, the "_" is replaced by a "&". }
{?} get$(str$('$_),MEM,VAP):"=" ?op & !op { "freeze" and slice }
{!} &               { The operator is immobilised in a string. }

The "_" variable is always expanded BEFORE the left and right hand side operands are evaluated. That explains why new assignments in the operands do not result in unwanted side effects in the upper node with the "_".

⇑ =.,|&: +*^LD'$_.
⇑ Binary operators.
⇒ Recursion and the "_"-operator.

Recursion and the "_"-operator

In Bracmat functions are allowed to call themselves. Often this happens if the function's argument is split into a left subtree and a right subtree and the function is called with each subtree in turn as its argument. If the operator in between the subtrees is unknown, it is time-consuming to try all patterns ?+? ?*? ?$? ?'? etc. The _ operator circumvents this problem. It is a dummy operator that matches any other operator and expands to the operator with which it matched last time. Thereby preceding matches are "forgotten" : the _ operator is a global variable.

{?} ( reverse
    =   l,r
      .     (!arg:?l_?r)              { If arg is a compound expression ...}
          & (reverse$!r)_(reverse$!l) { ... swap the reversed operands. }
        | !arg                        { Leave atoms as they are. }
    )
{?} reverse$(Bill loves sweet Nancy. This is true)

⇑ =.,|&: +*^LD'$_.
⇑ Binary operators.
⇑ functions.
⇐ The dummy operator "_".
⇐ <function name>=<var1> [,<var2>, ...].<function body>.

Some often used control structures

In the left column control structures are written in pseudocode, in the right column the nearest equivalents are written in Bracmat.


    Sequence:
    a; b;                                  `!a & !b { !b must be evaluated, even if !a fails }

    Repetition:
    WHILE a DO b;                          whl'(!a & !b);
    DO b WHILE a;                          whl'(!b & !a);
    DO b UNTIL a;                          whl'(`!b & ~!a);
    FOR i := m TO n DO b;                  !m:?i & whl'(!i:~>!n & `!b & 1+!i:?i);

    Selection:
    IF a THEN b ELSE c;                    !a & `!b | !c;
    v := IF a THEN b ELSE c;               (!a & `!b | `!c) : ?v  {works even if !c fails}
    SELECT  a                              !a : (!c1 & `!b1 | !c2 & `!b2 |? & !bx);
    WHEN  c1 : b1
    WHEN  c2 : b2
    OTHERWISE  bx;                         { the ? matches anything }

    Branching:
    CALL a;                                !a;
    CALL b(x,y,z);                         b$(x,y,z);
    v := b(x,y,z);                         b$(x,y,z) : ?v;
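
As a worked instance of the FOR pattern in the table, the following sketch sums the integers from 1 to 10. The variable names total and i are arbitrary, and the loop body is written inline instead of being expanded from a variable b.

```bracmat
{?} 0:?total
{?} 1:?i & whl'(!i:~>10 & !total+!i:?total & 1+!i:?i) & !total
{!} 55
```

The loop ends when !i:~>10 fails, that is, as soon as i has become 11. The body needs no back quote here, because the addition and the assignment cannot fail.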

⇑ Binary operators.
⇑ Programming advice.
⇐ Binary operators in program flow.
⇐ Program flow.

The "nameless" functions $<expression> and '<expression>

Sometimes a variable predictably will evaluate to the same value repeatedly, for example in an inner loop or a pattern that repeatedly backtracks. In such situations macro substitution can improve performance by replacing the variable by its value at an early stage.

In Bracmat, a macro has the general form '<expression>. When '<expression> is evaluated, <expression> is searched for sub-expressions headed by the operator $, with empty lhs. Such sub-expressions are replaced, depending on what is found on the rhs of the $ operator.

After macro substitution has taken place, what remains is an expression of the form =<expression>. The = operator is a safeguard against evaluation of <expression>.

Macro substitution makes it possible to dynamically create unevaluated code and bind it to a variable.

{?} '($out):?my-fun-var
{?} !my-fun-var$(Hello world)
{?} '($out):(=?my-fun-alias)
{?} my-fun-alias$(Hello world)

Pattern matching can sometimes be made more efficient by using macro substitution, but the resulting code is harder to understand:

{?} ( 0:?count
    &   41 3 5 7 6 23 12 11 19
      :   ?
          %?`A
          ?
          ( %?`B                            {Each number pair [A,B] ... }
          & !A+!B:?C                        {is added only once, giving C. }
          &   '(? ()$(!count+1:?count&C) ?)
            : (=?rem)                       {C's value is hard-coded into rem}
          )
          !rem                              {which is the remaining pattern. }
    & out$(after !count "trials:" !A "+" !B "=" !C )
    )

after 16 trials: 5 + 7 = 12

In the same way, function code can be pieced together before it is ever executed.

{?} power=three
{?} ((!power : two & (=!arg^2)) | (=!arg^3)) : (=?abc)      {If power="two",
    abc is bound to !arg^2 (unevaluated). Otherwise, abc is bound to !arg^3.}
{?} '(.!arg + -1*$abc + 2) : (=?poly) {"poly" is the name of a new function
    that will return a value that depends on the current value of arg
    and on the value of power at the time when abc got its binding.}
{?} lst$poly { show poly's definition }
{?} poly$4
{!} -58

The macro construct '<expr> is useful if an expression has to be executed many times while parts of it remain constant, for example in nested loops:

Without macro construct (5 X 5 multiplication table) :

{?} 0 : ?m                    { initialise counter of outer loop }
{?} (outer = 1+!m : <6 : ?m   { code for outer loop : }
            & put$\n          { start output on new line }
            & 0 : ?n          { initialise counter of inner loop }
            & `!inner         { execute inner loop }
            & !outer)         { loop }
{?} (inner = 1+!n : <6 : ?n   { code for inner loop : }
           & put$(!m X !n "=" !m*!n ", ")
                              { the same !m is expanded 10 times }
           & !inner)          { loop }
{?} !outer

With macro construct :

{?} 0:?m
{?} (outer = 1+!m : <6 : ?m
            & '( 1+!n : <6 : ?n
               & put$($m X !n "=" ()$m*!n ",")
                              { !m is expanded only 2 times }
               & !inner
               ) : (=?inner)  { at each pass through the outer loop
                                the inner loop "inner" is defined anew }
            & put$\n
            & 0 : ?n
            & `!inner
            & !outer)
{?} !outer

⇑ Binary operators.
⇑ functions.
⇐ function evaluation.
⇒ macro evaluation.

Prefixes

program flow
pattern matching
prefixes combined with expressions
unary operators

⇑ BRACMAT .

Prefixes and program flow

Unlike other programming languages, Bracmat does not return the value of a variable or object member if we type its name. In Bracmat, variables and object members have to be told explicitly that we want their value, not their name. This is achieved with the ! and the !! prefixes in front of the variable name or object member name.


!<atom>  is replaced by the binding of <atom>
!!<atom> is replaced by the binding of the binding (after evaluation) of <atom>

Likewise !(<object name>.<member name>) is replaced by the binding of <object name>.<member name>

Bindings can be evaluated or unevaluated. In the latter case, the next step after expansion is the evaluation of the binding, unless expansion took place within a pattern.

{?} 2+3:?four     { bind 5 to "four" }
{?} !four         { evaluation has already taken place when four is expanded }

{?} 5=2+!four     { numbers are legal names. 5 is bound to 2 + !four}
{?} !5            { evaluation takes place immediately after expansion }

{?} sum=%+%       { define pattern "sum" }
{?} a+b+c:!sum    { is a+b+c a sum? After expansion, %+% is not evaluated }

The !! prefix is not used as often as the single !, but comes in handy if you want to pass a variable by name instead of by value.

{?} (check=one,two,criterion
      .     !arg:(?one,?criterion,?two)
          & !!criterion
          & TRUE
        | FALSE
    )
{?} is-greater-than = !one:>!two
{?} is-divisor-of = (div$(!two,!one)*!one):!two
{?} check$(3,is-greater-than,15)  { pass by name }
{?} check$(3,is-divisor-of,15)

Passing by name is used here to postpone the evaluation of the second argument until it has arrived in the function check and the local variables "one" and "two" have been bound to the first and the third arguments, respectively.

Postponement of evaluation can also be achieved with the = and the ' operators.

{?} (chack=one,two,criterion
      .    !arg:(?one,(=?criterion),?two)
         & !criterion
         & TRUE
       | FALSE
    )
{?} is-greater-than == !one:>!two                  { an extra = }
{?} is-divisor-of ='((div$(!two,!one)*!one):!two)  { an extra ' }
{?} chack$(3,!is-greater-than,15)  { pass by value }
{?} chack$(3,!is-divisor-of,15)

⇑ Prefixes.

Prefixes and pattern matching

In patterns, atoms and expressions within parentheses may be preceded by prefixes that control the matching process.

! and !!: in front of a non-nil atom or an expression denoting a member of an object, cause expansion of the atom or the member to its direct or indirect binding. This binding is matched with the subject.
`: causes backtracking if the pattern did not successfully unify with a non-trivial element of the subject-list. A list is an expression consisting of terms (+ operator), factors (* operator) or words (white-space operator). Trivial elements are 0 in a sum, 1 in a product and a word without characters in a list of words. In non-sophisticated patterns, ` simply means : unify with at most one non-trivial element. Zero non-trivial elements are allowed, in which case unification takes place with an implicit trivial element: Bracmat sees 0's everywhere in a sum, 1's in a product and zero length words in a sentence.
?: unifies with anything. If ? is followed by a non-nil atom denoting a variable or an expression denoting a member of an object, then the matched part of the subject is captured by this variable or member. In other words, pattern matching can have assignment as a side-effect.
@: unifies only with atoms.
%: causes the match to succeed only with one or more non-trivial elements of the subject-list. (Exception: in combination with the [ prefix.)
< and >: unify only with atoms that are "less" or "greater" than the atom following the < or > prefix.
#: unifies only with rational numbers.
/: unifies only with non-integer rational numbers.
~: constrains the match to subjects that are not equal to the atom following the ~ prefix.
[: position prefix. Must be followed by an expression that evaluates to a number (for example [4 or [(!pos+3) ) or by a variable having a question mark, as in [?pos. In the first case, the pattern cannot succeed unless the element following the [ element is at the indicated position. The [ element itself does not occupy a position; it sits in front of the indicated position. The second form is for querying the current position. Position 0 is the start of the subject. Positive positions count from the beginning of the subject, negative positions from the end. Position -1 is the position following the last element. (When combined with % the meaning is different.)

The above prefixes may be combined. The order in which they are input by the user is irrelevant; Bracmat keeps prefixes in this order : [ ~ / # < > % @ ` ? ! !!

Repeating prefixes in front of the same atom does not convey a new meaning to the pattern, except for the ! and the ~ prefixes. More than one ! is interpreted as the !! prefix. An odd number of ~ is treated as a single ~, an even number thereof is treated as none. A ~ in front of other prefixes negates the first of them. The most useful combinations are:

?!: in front of an atom causes the atom to be expanded to its binding. This binding is treated as a variable name.
?!!: is like ?!, but expands two levels deep (with an evaluation of the first level expansion), instead of one.
<>: is like a solitary ~.
/<>5/6: unifies only with non-integer rational numbers unequal to 5/6.
~<: means "greater or equal" ("not less").
~>: means "less or equal".
~<>: means "not different", i.e. "the same, in some sense". Strings are compared case insensitive. This applies to the full Unicode table, but defaults to ASCII and the upper 128 characters in the ISO8859-1 (Latin 1) character set if the characters are not UTF-8 encoded.
~#: does not unify with rational numbers.
~/: does not unify with non-integer rational numbers.
~/#: unifies only with integer numbers.
~/#<9: unifies only with integer numbers less than 9.
~/#<>0: unifies only with non-zero integer numbers.
~@: unifies only with non-atomic expressions.
~`: backtrack immediately.
[%: The current subject is stored in the variable 'sjt' and the expression carrying this prefix combination is evaluated. If the evaluation succeeds, the match succeeds and vice versa. The subject can be a trivial element of the subject list.

Many of these combinations can be combined further, e.g. ~/#?!! accepts only an integer number and binds it to the indirect binding of the atom following the prefixes.
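
A few one-line matches may help to fix the meaning of the most common prefixes. They are meant as sketches for experimentation at the prompt; the comments restate the behaviour described in this section, and the variable names are arbitrary.

```bracmat
{?} 7:#                  { succeeds: 7 is a rational number }
{?} 7:/                  { fails: 7 is an integer }
{?} 1/2:/                { succeeds: 1/2 is a non-integer rational number }
{?} 5:~<5                { succeeds: 5 is not less than 5 }
{?} (a b):@              { fails: a b is not an atom }
{?} a b c:?x ?y          { succeeds with x empty: ? may match zero elements }
{?} a b c:%?x ?y         { succeeds with x bound to a: % demands at least one element }
{?} a b c:? b ?z         { succeeds with z bound to c }
```

After each successful match, typing !x, !y or !z shows what was captured.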

If you want to match pattern !pat one or more times (this is often written as {pat}+ ), use the complex pattern (? !pat|`). Likewise, if you want to match !pat zero or more times ( {pat}* ), use (|? !pat|`). These patterns should not be the last sub-pattern or precede a sub-pattern that is static and fixes the end point of the repeating sequence, because the correct working of the repeating patterns depends on repeated backtracking from following sub-patterns. Bracmat may be optimized to skip such backtracking and jump to the 'right' end position if that is fixed by the next sub-pattern.

As a last resort, you can add a pattern like () or (&) or (|) or (:), which match with an empty list only (assuming that the connecting nodes are spaces, otherwise use 0 in the case of a sum and 1 in the case of a product). Such patterns don't fix the next position. Example:

{?} a a a c c:(? a|`) (|? b|`) (? c|`) (&) { {a}+ {b}* {c}+ }

The following expression succeeds, because the subpattern doesn't confront the substring aaak.

{?} @(aaakamcccc:(? a|`) m (|? b|`) (? c|`)) (&)

An empty string before the m has the effect that Bracmat doesn't optimize the backtracking process away.

{?} @(aaakamcccc:(? a|`) () m (|? b|`) (? c|`)) (&)

⇑ Prefixes.
⇑ Programming advice.
⇐ Binary operators in pattern matching.
⇒ Escaping operator in patterns.

minus sign

The minus sign "-" has its normal arithmetic meaning only when used as a unary operator in front of a rational number or the imaginary number "i".

If a product contains both a rational number and the number "i", the "i" takes precedence in accepting a minus sign:


                   -7*i*a

is evaluated to    7*-i*a.

The advantage of having both "i" and "-i" becomes clear by considering the following:


                   (-1*i)^1/3

evaluates to       (-i)^1/3,

which is written as -i^1/3. As expected, this is the complex conjugate of


                    i^1/3.

If Bracmat did not have a separate representation for -i, then


                   (-1*i)^1/3
would evaluate to  i,

(because i^3 is equal to -i), which means that Bracmat would not consider (-1*i)^1/3 and i^1/3 as complex conjugates.

The transcendental numbers e and pi do not accept arithmetic minus signs.

⇑ Prefixes.

strings or atoms

A string in Bracmat is the same as an "atom". If you envisage a Bracmat expression as a tree-like structure, atoms or strings are to be found in the leaves. In Bracmat terminology, an empty leaf is syntactically represented by <nil>. <nil> is not an <atom> proper, but an <atom-or-nil>. So not every leaf contains an atom. On the other hand, leaves may contain other things besides <atoms>, such as prefixes.

In Bracmat, atoms are less accessible than trees. Therefore there are some ways to convert atoms to trees and back.

Conversion between an atom and its constituent characters:
- get$(<atom>,MEM,VAP) puts every character in the literal <atom> in its own leaf in a tree, which has space-operators in every node.
- str$<tree> does more or less the inverse.
Conversion between an atom and executable Bracmat code:
- get$(<atom>,MEM) literally "reads" an atom as though it is a file with Bracmat expressions.
- lst$(<variable>,MEM) "writes" the expression that is bound to <variable> to an atom.
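
The first pair of conversions can be tried out as follows. This is a minimal sketch; the word hello is an arbitrary example.

```bracmat
{?} get$(hello,MEM,VAP)      { one leaf per character }
{!} h e l l o
{?} str$(h e l l o)          { glue the leaves together again }
{!} hello
```

Between the two steps the characters can be handled with ordinary pattern matching on a list of words.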

Atoms can be used as names for variables, functions, files, etc. Often they are used as literals, such as mathematical symbols or text.

Most characters which the computer knows of can be members of an atom. Only the first seven characters in the ASCII character set are forbidden. These characters are used by the system. In most cases you don't need quotation marks in order to get a string of characters into an atom. You do need them if you want parentheses, operators or prefixes to be part of an atom. Some special characters have to be preceded by a back slash:

\a: attention (bell)
\b: backspace
\t: tab
\n: new line
\v: vertical tab
\f: form feed
\r: carriage return
\\: back slash
\": double quote

If you precede a string with the prefix @, then back slashes are treated as normal characters. E.g. sys$@"C:\dos\edit". Instead of the tab and new line characters above, you may enter tabs and new lines by pressing the tab and the return key, respectively.

Examples:

{?} this is a "tree" with\nsix leafs
{?} (this is a "tree" with
      seven leafs)
{?} "this" has 4 characters and "" (nil) none
{?} "this is an \"atom\" with 36 characters"
{?} "this string\nno verb"
{?} "this string

no verb either"

{?} "if zero equals one, I hang up" = "1:0&get$(\")y\",MEM)"
{?} get$(!"if zero equals one, I hang up",MEM)

⇑ BRACMAT .

Symbols

literals
variables

⇑ BRACMAT .

Literals

In Bracmat, symbols have only literal meaning, unless we explicitly state that we want a symbol to behave like a programming variable. Contrary to most computer languages, Bracmat evaluates an expression with literals not by expanding these literals to their associated values (if they have any) and computing with these values until a result is obtained, but by rearranging and transforming the expression until a stable form is reached.

{?} a + a
{?} i*i
{?} e^(19/2*pi*i)

In Bracmat, the context of a symbol decides whether it is treated as a variable or as a literal. So it is not necessary to "kill" a variable in order to use its symbol as a literal, the two uses live peacefully together.

{?} i=2   { variable i is bound to the literal "2" }
{?} !i^2  { the associated value of i is squared }
{?} i^2   { the literal "i" (a special one, like "pi" and "e") is squared }
{?} 7 = prime { the variable 7 is bound to the literal "prime"}
{?} 7 is !7 { the symbol 7 is used as both a literal and a variable}

⇑ Symbols.

Variables

Variables are represented by <atom>'s, but not all <atom>'s are variables. The context of a symbol determines whether it is a variable or not :

the left operand of the = operator, unless this operand has zero length.
the atom following the ! and !! prefixes
within a pattern, a non-zero length atom following the ? prefix
the left operand of the $ and ' operators
the right operand of the $ operator in macro constructs (e.g. '(1+$a) ).

⇑ Symbols.

The four evaluators

In Bracmat, a binary operator may have four different effects, depending on the context of the operator. For each of these contexts there is one evaluator. Of these four evaluators, the macro evaluator is relatively unimportant. The four evaluators are :

the expression evaluator, which takes care of the transformations of expressions.
the match evaluator, which handles the unification of pattern expressions with subject expressions.
the macro evaluator, which merely substitutes certain parts of an expression.
the archivist, which doesn't do anything but keep expressions alive.

The expression evaluator is the first evaluator that a newly input expression is confronted with. If necessary, it delegates tasks to one of the other three evaluators. The match evaluator can only delegate tasks to the expression evaluator and to the archivist. The macro evaluator can only delegate tasks to the expression evaluator. The archivist doesn't delegate any tasks to other evaluators.

A cross link, that is, the point in an expression where one evaluator hands over to another, is in most cases a binary operator. The exceptions to this rule are in the context of the match evaluator: some (combinations of) prefixes involve the expansion of a chain of variable bindings and all but the last subexpansion demand the expression evaluator. In the scheme below, you'll find the "current" evaluator in the left column and the successor evaluators in the top row. A cross link is represented by the relevant operator or prefixes. If the change of evaluator only applies to the left (right) operand of the cross link operator, the symbol "l" ("r") is used. If the transition depends on the left operand being <nil>, the symbol "n" is used.

            expression       match            macro            archivist
expression                   :r               n'r              =r 'r
match       &r $ 'l ?! !!                     n'r              =r 'r
macro       $r
archivist

⇑ BRACMAT .

Programming advice

program flow
pattern matching
data structures
debugging

⇑ BRACMAT .

Debugging

If a program written in the Bracmat language doesn't work properly, the same debugging protocol applies as with other programming languages :

Test extensively, above all with absurd and trivial input, in order to locate the pain in the many limbs of your program code.
Create watch points by inserting out$ instructions at sensible places (entry and exit points of functions, branches, before and after assignments).
If you are in doubt whether Bracmat has interpreted your program in the way you intended, use lst$(<function-name>,<file-name>) and inspect the code that is output into <file-name>.
Errors that are easily made are :
- Forgetting that Bracmat may see trivial elements (0 terms, 1 factors, zero length words) at places where this is not what you intend. Are all % ` and @ prefixes in place?
- Forgetting the grouping of operators. a b c : ?%x ?%y is grouped as (a b c):(?%x ?%y) , but a,b,c : ?%x,?%y is grouped as a,b,(c:?%x),?%y . Remember that = and . have very low priorities, often making a pair of parentheses necessary.
- Having a misconceived idea about Bracmat's backtracking mechanism. Unlike other languages with backtracking capabilities, Bracmat does not offer suspend/resume cycles. An expression embodying alternatives does not successively produce each alternative on every evaluation.
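
The grouping pitfall with the comma operator can be avoided by parenthesising both subject and pattern. A minimal sketch, with arbitrary variable names x and y:

```bracmat
{?} (a,b,c):(?x,?y)      { parentheses make a,b,c the subject and ?x,?y the pattern }
{?} !x
{!} a
{?} !y
{!} b,c
```

Because the comma groups to the right, the subject is read as a,(b,c), so ?y receives the remaining list b,c.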

⇑ Programming advice.
⇒ using out$ as debugging aid.

using out$ as debugging aid

The best aid in finding out what a program does is the out$ function. The following code is part of a function that computes n! .


  loop = !k+1 : ?k          { increment k }
              : <!n         { compare (old) k+1 with n; if not less, stop }
       & !fac*!k : ?fac     { multiply fac with k }
       & !loop              { repeat until k = n }

Outside patterns out$ is most easily used. Inside patterns, if you want to inspect a variable that has just been assigned a new value, you use the & operator to temporarily escape into the non-pattern world. If you want to add extra text to the output, remember that all of out$'s argument is returned.


  loop = out$!k+1 : (?k & out$(k is !k)) {show k before and after increment}
                  : <!n                  {but before comparison with n}
       & out$("new fac is:" (!fac*!k:?fac)) {show fac after computation}
       & out$(still need !n+-1*!k loops) {you don't always need quotation marks}
       & !loop

Now an example that is faulty. The purpose is to find two equal words in a sentence. This expression succeeds, but finds nothing:


  (He loves her and she loves him : (? ?a ? !a ?) & out$(!a is occurring twice))

Check what is unified with "? ?a". To do so, put a variable after the first "?" and insert an output action after each sub-pattern.


  (He loves her and she loves him
      : ((?x & out$(x is !x))   {output x after unification}
        (?a & out$(a is !a))    {output a after unification}
        ? !a ?)                 {the remainder of the pattern}
      & out$(!a is occurring twice)
  )

To find a repeated word, the program would have to backtrack several times until ?a is unified with "loves". Instead, the match succeeds at once with ?a unified with the omnipresent zero-length word. A % sign prevents this. A back quote ` helps speed things up, since it avoids multi-word assignments and forces immediate backtracking.


  (He loves her and she loves him
      : ((?x & out$(x is !x))  { Watch the number of words in ?x grow ... }
        (%`?a & out$(a is !a)) { while ?a moves towards "loves". }
        ? !a ?)                { There backtracking stops }
      & out$(!a is occurring twice) { and the message is output. }
  )

⇑ Programming advice.
⇑ Predefined functions.
⇐ Debugging.
⇐ out$<expression>.
⇒ using dbg' as debugging aid.

using dbg' as debugging aid

Some programming errors can be found with the built-in dbg function. The argument of the dbg function is evaluated with an internal debugging flag set. While this flag is set, Bracmat issues warnings about suspicious code.

It is important that the argument is not evaluated before being passed to the dbg function. For that reason dbg is called with the ' operator rather than with $.
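
A minimal sketch of such a call, assuming a function square defined as in the section on defining functions. Because of the ' operator, the expression square$5 reaches dbg unevaluated and is then evaluated with the debugging flag set. Which warnings appear, if any, depends on the code being inspected.

  {?} square=.!arg^2     { the function to be inspected }
  {?} dbg'(square$5)     { evaluate the call with the debugging flag set }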

⇑ Programming advice.
⇑ Predefined functions.
⇐ using out$ as debugging aid.

functions

function evaluation
defining functions
nameless functions
lambda abstractions, currying
built-in functions
predefined, changeable functions

⇑ BRACMAT .

<function name>=<var1> [,<var2>, ...].<function body>

Definition of a function. <var1>,<var2>, etc. are explicitly declared local variables. A function is called by <function name>$<argument expression> or <function name>'<argument expression> , depending on whether <argument expression> must be evaluated ($) or not (') before it is passed to the function in the always present local variable "arg". The returned value of a function is simply the function body after it has been evaluated.

{?} square=.!arg^2 {definition}
{?} square$5            {call}

{?} (swap = a,b            {declare local variables a and b}
        .   (!arg:(?a,?b)) {dissect arg to find the "real" arguments}
         & (!b,!a))        {swap and return}
{?} swap$(I think,I guess)

In a match context, a function call creates a second local variable, "sjt", which holds the current subject. The value returned from a function in a match context is interpreted as a pattern by the match evaluator. However, if the function call fails, the pattern match is not attempted and fails as well. If the returned value is negated, the behaviour is not defined.

{?} ( like
    =
      .   sim$(!arg,!sjt):>9/10 & ?
        |   den$(sim$(!sjt,)):~<(den$(sim$(!arg,0)))
          & ~`
    )
{?} @( "Dogs and Cats are my enemies": ? like$cat ?)

Local variables in Bracmat are shallowly bound, dynamically scoped variables. This means that variables that are used in a function but not locally declared in that function are inherited from the (function or global) context from which the function is called, which in turn may inherit any undeclared variables from another calling context. This scheme contrasts with the lexical scoping used by most programming languages. It is efficient, but forgetting to declare a local variable can cause unexpected behaviour in conceptually unrelated code.
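
The inheritance described above can be sketched as follows. The function names show and caller and the variable x are hypothetical, chosen only for this illustration. The function show does not declare x, so it uses the x of whichever context calls it, here the local x of caller.

  {?} (show=.out$(x is !x))       { x is not declared in show }
  {?} (caller=x.7:?x&show$)       { x is local to caller }
  {?} caller$                     { show sees caller's x }

If show were called from a context in which x has a different value, it would output that value instead. This is the kind of coupling between unrelated pieces of code that an undeclared variable can cause.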

It is possible to declare a function inside another function. Always declare the name of an embedded function as a local variable.
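
A minimal sketch of this advice, with the hypothetical names outer and inner. Because inner is declared as a local variable of outer, defining it inside outer does not overwrite a variable or function named inner that may exist elsewhere.

  {?} ( outer
      =   inner                   { inner is local to outer }
        .   (inner=.!arg+1)       { define the embedded function }
          & inner$(inner$!arg)    { apply it twice to outer's argument }
      )
  {?} outer$3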

⇑ functions.
⇒ Recursion and the "_"-operator.

Lambda calculus, currying

The lambda abstraction (λx.x)y

translates to: /('(x.$x))$y

Bracmat's implementation of lambda calculus is a variant of Bracmat's macro substitution. The expression /('(x.$x))

evaluates to itself, not to something like: /(=(x.foo))

(assuming that the variable x had the value 'foo'). In contrast, the same expression without the leading slash ('(x.$x))

evaluates to: =x.foo

The rhs of the '$' operator must be an atom.

In a lambda abstraction: /('(x.$x,$y))

'$x' is a bound variable and '$y' is a free variable. The expression '$x' is replaced by a value only if x is the variable of the lambda abstraction in which '$x' occurs or of a lambda abstraction that contains it, as in /('(x.(/('(y.($x) ($y)))$aap)))$noot

which evaluates to: noot aap

No Bracmat variables come into play, not even 'arg'. Thus, in the example above the value 'noot' is bound in ($x), but never assigned to a variable 'x'.

The expression: /('(x.(/('(x.($x) ($x)))$aap)))$noot

evaluates to

    aap aap

⇑ functions.

Built-in functions

sel access array element
alc allocate memory (low level)
arg return program (command line) argument
asc convert character to internal representation
chr convert internal representation to character
chu convert Unicode codepoint to UTF-8 character
clk CPU seconds since start of session
d2x convert decimal number to hexadecimal number
dbg debugging aid
den denominator
div quotient
fil file I/O (low-level)
flg split expression into prefixes and expression without prefixes
glf opposite of flg: combines prefixes and expression
fre free allocated memory (low level)
get get input (from file,keyboard or memory)
low convert to lower case
lst list un-evaluated value of variable(s)
mem list existing variable names
mod remainder
new create new object as a copy of another object
pee get value from address (peek) (low level)
pok put value at address (poke) (low level)
put write output
ren rename file or directory or move file
rmv remove file
rev string reverse
sim similarity between two atoms
str stringize expression into atom
swi software interrupt (low level)
sys command line shell
tbl establish array size
upp convert to upper case
utf convert UTF-8 character to Unicode codepoint
whl while loop
x2d convert hexadecimal number to decimal number
"?" assign to nameless variable

⇑ functions.

<index>$<array name>

<array name> and <index> should both evaluate to atoms. <array name> may be preceded by prefixes, such as ? or !. Indexing starts at 0 and is done modulo(size-of-array). Negative values count from the upper end of the array. The chosen index remains in force until a new indexing function is evaluated.

{?} tbl$(array,4)              { declare array[0..3] }
{?} a-value : 2 $ ?array       { array[2] := a-value }
{?} array = another-value      { array[2] := another-value }
{?} !array : -1 $ ?array       { array[3] := another-value }
{?} 45 : 1$?array              { array[1] := 45 }
{?} 2'!array : 3'!array        { are array[2] and array[3] equal?
                                 Notice use of ' instead of $. }