X-Git-Url: https://git.distorted.org.uk/~mdw/sod/blobdiff_plain/ae0f15ee8427fa91cfd1945bfded847032cb8a25..7292d6e1bc85cca454f64353722ba78a9263f257:/doc/syntax.tex diff --git a/doc/syntax.tex b/doc/syntax.tex index 560dc47..18b46e1 100644 --- a/doc/syntax.tex +++ b/doc/syntax.tex @@ -26,51 +26,6 @@ \chapter{Module syntax} \label{ch:syntax} %%%-------------------------------------------------------------------------- -\section{Notation} \label{sec:syntax.notation} - -Fortunately, Sod is syntactically quite simple. The notation is slightly -unusual in order to make the presentation shorter and easier to read. - -Anywhere a simple nonterminal name $x$ may appear in the grammar, an -\emph{indexed} nonterminal $x[a_1, \ldots, a_n]$ may also appear. On the -left-hand side of a production rule, the indices $a_1$, \ldots, $a_n$ are -variables which vary over all nonterminal and terminal symbols, and the -variables may also appear on the right-hand side in place of a nonterminal. -Such a rule stands for a family of rules, in each variable is replaced by -each possible simple nonterminal or terminal symbol. - -The letter $\epsilon$ denotes the empty nonterminal -\begin{quote} - \syntax{$\epsilon$ ::=} -\end{quote} - -The following indexed productions are used throughout the grammar, some often -enough that they deserve special notation. -\begin{itemize} -\item @[$x$@] abbreviates @$[x]$, denoting an optional occurrence - of $x$: - \begin{quote} - \syntax{@[$x$@] ::= $[x]$ ::= $\epsilon$ @! $x$} - \end{quote} -\item $x^*$ abbreviates @$[x]$, denoting a sequence of zero or - more occurrences of $x$: - \begin{quote} - \syntax{$x^*$ ::= $[x]$ ::= - $\epsilon$ @! $[x]$ $x$} - \end{quote} -\item $x^+$ abbreviates @$[x]$, denoting a sequence of zero or - more occurrences of $x$: - \begin{quote} - \syntax{$x^+$ ::= $[x]$ ::= $[x]$ $x$} - \end{quote} -\item @$[x]$ denotes a sequence of one or more occurrences of $x$ - separated by commas: - \begin{quote} - \syntax{$[x]$ ::= $x$ @! $[x]$ "," $x$} - \end{quote} -\end{itemize} - -%%%-------------------------------------------------------------------------- \section{Lexical syntax} \label{sec:syntax.lex} Whitespace and comments are discarded. The remaining characters are @@ -196,16 +151,16 @@ binary. However, length and signedness indicators are not permitted. ::= any character other than "*" or "/" - ::= "//" @^* + ::= "/\,/" @^* ::= a newline character ::= any character other than newline \end{grammar} -Comments are exactly as in C99: both traditional block comments `\texttt{/*} -\dots\ \texttt{*/}' and \Cplusplus-style `\texttt{//} \dots' comments are -permitted and ignored. +Comments are exactly as in C99: both traditional block comments `@|/*| \dots\ +@|*/|' and \Cplusplus-style `@|/\,/| \dots' comments are permitted and +ignored. \subsection{Special nonterminals} \label{sec:syntax.lex.special} @@ -407,14 +362,15 @@ though the only operators currently defined act on integer values only. \subsubsection{The expression evaluator} \begin{grammar} - ::= | "+" | "-" + ::= | "+" | "--" ::= | "*" | "/" - ::= | "+" | "-" + ::= | "+" | "--" ::= | | | +\alt "<" ">" \alt "?" \alt "(" ")" \end{grammar} @@ -451,15 +407,22 @@ recognized. \alt "bool" | "_Bool" \alt "imaginary" | "_Imaginary" | "complex" | "_Complex" \alt +\alt \alt ::= | "const" | "volatile" | "restrict" + ::= @^+ + ::= - "(" @^+ ")" + "(" ")" ::= "atomic" | "_Atomic" + ::= "(" ")" + + ::= "alignas" "_Alignas" + ::= \end{grammar} @@ -475,7 +438,8 @@ defined in the built-in module. Declaration specifiers may appear in any order. However, not all combinations are permitted. A declaration specifier must consist of zero or -more @s, and one of the following, up to reordering. +more @s, zero or more @s, and one of the +following, up to reordering. \begin{itemize} \item @ \item @ @@ -500,29 +464,31 @@ All of these have their usual C meanings. \subsubsection{Declarators} \begin{grammar} -$[k]$ ::= @^* $[k]$ +$[k, a]$ ::= @^* $[k, a]$ -$[k]$ ::= $k$ -\alt "(" $[k]$ ")" -\alt $[k]$ @ +$[k, a]$ ::= $k$ +\alt "(" $[k, a]$ ")" +\alt $[k, a]$ @$[a]$ ::= "*" @^* - ::= "[" "]" -\alt "(" ")" +$[a]$ ::= "[" "]" +\alt "(" $a$ ")" - ::= $\epsilon$ | "..." -\alt $[\mbox{@}]$ @["," "..."@] + ::= $\epsilon$ | "\dots" +\alt $[\mbox{@}]$ @["," "\dots"@] ::= @^+ - ::= $[\epsilon]$ + ::= $[\epsilon, \mbox{@}]$ ::= $[\mbox{@ @! $\epsilon$}]$ - ::= $[\mbox{@}]$ + ::= + $[\mbox{@ @! $\epsilon$}, \mbox{@}]$ - ::= "." + ::= + $[\mbox{@}, \mbox{@}]$ \end{grammar} The declarator syntax is taken from C, but with some differences. @@ -537,6 +503,24 @@ The declarator syntax is taken from C, but with some differences. The remaining differences are (I hope) a matter of presentation rather than substance. +There is additional syntax to support messages and methods which accept +keyword arguments. + +\begin{grammar} + ::= @["=" @] + + ::= + @[$[\mbox{@}]$@] + "?" @[$[\mbox{@}]$@] + + ::= @! + + ::= "." + +$[k]$ ::= + $[k, \mbox{@}]$ +\end{grammar} + \subsection{Class definitions} \label{sec:syntax.module.class} @@ -553,17 +537,17 @@ substance. A @ informs Sod that an @ will be used to name a class which is currently undefined. Forward declarations are necessary in order to resolve certain kinds of circularity. For example, -\begin{listing} -class Sub; +\begin{prog} +class Sub; \\+ -class Super : SodObject { - Sub *sub; -}; +class Super : SodObject \{ \\ \ind + Sub *sub; \-\\ +\}; \\+ -class Sub : Super { - /* ... */ -}; -\end{listing} +class Sub : Super \{ \\ \ind + /* \dots\ */ \-\\ +\}; +\end{prog} \subsubsection{Full class definitions} \begin{grammar} @@ -576,6 +560,8 @@ class Sub : Super { ::= \alt +\alt +\alt \alt \alt \end{grammar} @@ -589,7 +575,10 @@ names, to distinguish them from other kinds of identifiers. The @$[\mbox{@}]$ names the direct superclasses for the new class. It is an error if any of these @s does not name a defined -class. +class. The superclass list is required, and must not be empty; listing +@|SodObject| as your class's superclass is a good choice if nothing else +seems suitable. It's not possible to define a \emph{root class} in the Sod +language: you must use Lisp to do this, and it's quite involved. The @ provide additional information. The standard class properties are as follows. @@ -634,28 +623,28 @@ functions are fine. An @, if present, is treated as if a separate @ containing the slot name and initializer were present. For example, -\begin{listing} -[nick = eg] -class Example : Super { - int foo = 17; -}; -\end{listing} +\begin{prog} +[nick = eg] \\ +class Example : Super \{ \\ \ind + int foo = 17; \-\\ +\}; +\end{prog} means the same as -\begin{listing} -[nick = eg] -class Example : Super { - int foo; - eg.foo = 17; -}; -\end{listing} +\begin{prog} +[nick = eg] \\ +class Example : Super \{ \\ \ind + int foo; \\ + eg.foo = 17; \-\\ +\}; +\end{prog} \subsubsection{Initializer items} \begin{grammar} ::= @["class"@] $[\mbox{@}]$ ";" - ::= "=" + ::= @["=" @] - :: "{" "}" | + :: \end{grammar} An @ provides an initial value for one or more slots. If @@ -666,15 +655,28 @@ The first component of the @ must be the nickname of one of the class's superclasses (including itself); the second must be the name of a slot defined in that superclass. -The initializer has one of two forms. -\begin{itemize} -\item A @ enclosed in braces denotes an aggregate initializer. - This is suitable for initializing structure, union or array slots. -\item A @ \emph{not} beginning with an open brace is a `bare' - initializer, and continues until the next @`,' or @`;' which is not within - nested brackets. Bare initializers are suitable for initializing scalar - slots, such as pointers or integers, and strings. -\end{itemize} +An @|initarg| property may be set on an instance slot initializer (or a +direct slot definition). See \xref{sec:concepts.lifecycle.birth} for the +details. An initializer item must have either an @|initarg| property, or an +initializer expression, or both. + +Each class may define at most one initializer item with an explicit +initializer expression for a given slot. + +\subsubsection{Initarg items} +\begin{grammar} + ::= + "initarg" + @^+ + $[\mbox{@}]$ ";" +\end{grammar} + +\subsubsection{Fragment items} +\begin{grammar} + ::= "{" "}" + + ::= "init" | "teardown" +\end{grammar} \subsubsection{Message items} \begin{grammar}