Lexical Elements¶

This public entry explains which characters, keywords, pragmas, and literal forms truST accepts before parsing starts.

Use the full spec below when you need the exact token-level rules, keyword tables, or lexer-facing IEC deviations.

Lexical Elements¶

IEC 61131-3 Edition 3.0 (2013) - Section 6.1, 6.2, 6.3

This specification defines the lexical elements for the trust-syntax lexer.

1. Character Set (Table 1, Section 6.1.1)¶

The character set is based on ISO/IEC 10646:2012 (Unicode).

No.	Description	Notes
1	ISO/IEC 10646	Standard requires Unicode support; trust-lsp currently supports ASCII only (DEV-013)
2a	Lower case characters	a-z
2b	Number sign	`#` (used in typed literals; Siemens SCL also uses `#identifier` local references, DEV-034)
2c	Dollar sign	`$` (used in string escapes)

Case Sensitivity Rule: When lower-case letters are supported, the case of letters shall NOT be significant in language elements, except: - Within comments - Within string literals - Within variables of type STRING and WSTRING

2. Identifiers (Table 2, Section 6.1.2)¶

An identifier is a string of letters, digits, and underscores which shall begin with a letter or underscore character.

Rules¶

Case insensitivity: abcd, ABCD, and aBCd shall be interpreted identically
Underscore significance: A_BCD and AB_CD are different identifiers
Multiple underscores forbidden:
__LIM_SW5 (leading double underscore) - INVALID
LIM__SW5 (embedded double underscore) - INVALID
Trailing underscores forbidden: LIM_SW5_ - INVALID
Minimum uniqueness: At least 6 characters of uniqueness shall be supported
Maximum length: Implementer specific

Implementation note (DEV-013): trust-lsp currently validates identifiers using ASCII-only rules (A-Z, a-z, 0-9, _) with ASCII case-folding, and does not accept Unicode identifiers as allowed by IEC 61131-3 §6.1.1–6.1.2 (Table 1–2).

Features¶

No.	Description	Examples
1	Upper case letters and numbers	`IW215`, `QX75`, `IDENT`
2	Upper/lower case, numbers, embedded underscore	`LIM_SW_5`, `LimSw5`, `abcd`, `ab_Cd`
3	Upper/lower case, numbers, leading or embedded underscore	`_MAIN`, `_12V7`

3. Keywords (Section 6.1.3)¶

Keywords are unique combinations of characters utilized as individual syntactic elements.

Rules¶

Keywords shall not contain embedded spaces
Case of characters shall NOT be significant (e.g., FOR and for are equivalent)
Keywords shall not be used for other purposes (e.g., variable names)

Complete Keyword List¶

Data Types¶

BOOL, SINT, INT, DINT, LINT, USINT, UINT, UDINT, ULINT
REAL, LREAL
BYTE, WORD, DWORD, LWORD
STRING, WSTRING, CHAR, WCHAR
TIME, LTIME, DATE, LDATE, TIME_OF_DAY, TOD, LTIME_OF_DAY, LTOD
DATE_AND_TIME, DT, LDATE_AND_TIME, LDT

Generic Types¶

ANY, ANY_DERIVED, ANY_ELEMENTARY, ANY_MAGNITUDE, ANY_NUM
ANY_REAL, ANY_INT, ANY_UNSIGNED, ANY_SIGNED, ANY_DURATION
ANY_BIT, ANY_CHARS, ANY_STRING, ANY_CHAR, ANY_DATE

Variable Declarations¶

VAR, VAR_INPUT, VAR_OUTPUT, VAR_IN_OUT, VAR_TEMP
VAR_GLOBAL, VAR_EXTERNAL, VAR_ACCESS, VAR_CONFIG
END_VAR
CONSTANT, RETAIN, NON_RETAIN
AT

Type Declarations¶

TYPE, END_TYPE, STRUCT, END_STRUCT, ARRAY, OF

Program Organization Units¶

FUNCTION, END_FUNCTION
FUNCTION_BLOCK, END_FUNCTION_BLOCK
PROGRAM, END_PROGRAM
CLASS, END_CLASS
INTERFACE, END_INTERFACE
METHOD, END_METHOD
PROPERTY, END_PROPERTY
NAMESPACE, END_NAMESPACE
USING

Property keywords are defined in IEC 61131-3 Ed.3 §6.6.5 (Table 50).

OOP Keywords¶

EXTENDS, IMPLEMENTS, OVERRIDE, FINAL, ABSTRACT
THIS, SUPER
PUBLIC, PRIVATE, PROTECTED, INTERNAL

Property Accessors¶

GET, END_GET
SET, END_SET

Accessor keywords are defined in IEC 61131-3 Ed.3 §6.6.5 (Table 50).

Control Flow¶

IF, THEN, ELSIF, ELSE, END_IF
CASE, OF, END_CASE
FOR, TO, BY, DO, END_FOR
WHILE, END_WHILE
REPEAT, UNTIL, END_REPEAT
EXIT, CONTINUE, RETURN
JMP

JMP is defined in IEC 61131-3 Ed.3 Table 72.

Operators¶

AND, OR, XOR, NOT, MOD

Boolean Literals¶

TRUE, FALSE

Reference Operations¶

REF, REF_TO, NULL

Configuration¶

CONFIGURATION, END_CONFIGURATION
RESOURCE, END_RESOURCE, ON
TASK, WITH

SFC Elements¶

STEP, END_STEP, INITIAL_STEP
TRANSITION, END_TRANSITION, FROM, TO
ACTION, END_ACTION

SFC profile note: truST reserves the IEC SFC keywords and ships a visual SFC editor, but textual SFC body syntax is not specified in the Structured Text parser. Use the visual-editor documentation for current authoring scope; see docs/public/develop/visual-editors/sfc.md and docs/IEC_DEVIATIONS.md (DEV-020).

Special¶

EN, ENO
R_EDGE, F_EDGE
READ_ONLY, READ_WRITE

Implementation Extensions (Reserved Keywords)¶

The following keywords are reserved by truST but are not part of the IEC 61131-3 keyword list. They are implemented as extensions and documented in docs/IEC_DEVIATIONS.md:

VAR_STAT
PERSISTENT
POINTER
UNION, END_UNION
NEW, __NEW, __DELETE
ADR, SIZEOF
TEST_PROGRAM, END_TEST_PROGRAM
TEST_FUNCTION_BLOCK, END_TEST_FUNCTION_BLOCK

VAR_STAT is a shipped vendor-extension keyword. Runtime semantics are recorded in docs/IEC_DEVIATIONS.md: function statics persist across calls, method statics persist per instance and per method, and PROGRAM/FUNCTION_BLOCK/CLASS VAR_STAT behaves as ordinary instance storage.

4. White Space (Section 6.1.4)¶

White space characters (space, tab, newline, etc.) may be inserted anywhere except: - Within keywords - Within literals - Within enumerated values - Within identifiers - Within directly represented variables - Within delimiter combinations (e.g., :=, (*)

5. Comments (Table 3, Section 6.1.5)¶

No.	Type	Syntax	Example
1	Single-line	`//...`	`X:= 13; // comment`
2a	Multi-line	`(* ... *)`	`(* multi-line comment *)`
2b	Multi-line (alt)	`/* ... */`	`/* multi-line comment */`
3a	Nested	`(* ... (* ... ) ... )`	`(* (* NESTED ) )`
3b	Nested (alt)	`/* ... /* ... / ... /`	`/* /* NESTED / /`

Rules¶

Single-line comments end at line feed, newline, form feed, or carriage return
In single-line comments, (*, *), /*, */ have no special meaning
In multi-line comments, // has no special meaning
Comments are permitted anywhere spaces are allowed, except within string literals
Comments have no syntactic or semantic significance - treated as white space
Nested comments must use matching pairs

6. Pragmas (Table 4, Section 6.2)¶

Pragmas are delimited by curly brackets { and }.

No.	Description	Examples
1	Pragma	`{VERSION 2.0}`, `{AUTHOR JHC}`, `{x:= 256, y:= 384}`

Rules¶

Syntax and semantics of pragma contents are Implementer specific
Pragmas are permitted anywhere spaces are allowed, except within string literals

7. Numeric Literals (Table 5, Section 6.3.2)¶

No.	Type	Examples	Notes
1	Integer	`-12`, `0`, `123_4`, `+986`	Decimal
2	Real	`0.0`, `0.4560`, `3.14159_26`	With decimal point
3	Real with exponent	`-1.34E-12`, `1.0E+6`, `1.234e6`	Scientific notation
4	Binary	`2#1111_1111`, `2#1110_0000`	Base 2
5	Octal	`8#377`, `8#340`	Base 8 (DEPRECATED)
6	Hexadecimal	`16#FF`, `16#ff`, `16#E0`	Base 16
7	Boolean (numeric)	`0`, `1`
8	Boolean (keyword)	`FALSE`, `TRUE`
9	Typed literal	`INT#-123`, `WORD#16#AFF`, `BOOL#TRUE`	Type prefix with `#`

Rules¶

Underscores _ between digits are not significant and can be used as separators
No other use of underscores in numeric literals is allowed
Real literals are distinguished by presence of a decimal point
Exponents indicate power of ten
Based numbers (2#, 8#, 16#) shall NOT contain leading sign + or -
Based numbers are interpreted as bit string literals
For base 16, letters A-F (or a-f) represent decimal 10-15
Octal literals (8#) are DEPRECATED

8. Character String Literals (Table 6, Section 6.3.3)¶

Single-byte Strings (using single quotes)¶

No.	Description	Example
1a	Empty string	`''`
1b	Single character (CHAR)	`'A'`
1c	Space character	`' '`
1d	Single quote in string	`'$''`
1e	Double quote in string	`'"'`
1f	Escape sequences	`'$R$L'`
1g	Hex character (2 digits)	`'$0A'`

Double-byte Strings (using double quotes)¶

No.	Description	Example
2a	Empty string	`""`
2b	Single character (WCHAR)	`"A"`
2c	Space character	`" "`
2d	Single quote in string	`"'"`
2e	Double quote in string	`"$""`
2f	Escape sequences	`"$R$L"`
2h	Hex character (4 digits)	`"$00C4"`

Typed String Literals¶

No.	Description	Example
3a	Typed string	`STRING#'OK'`
3b	Typed character	`CHAR#'X'`
4a	Typed double-byte string	`WSTRING#"OK"`
4b	Typed double-byte character	`WCHAR#"X"`
4c	Typed double-byte string (single quotes)	`WSTRING#'OK'`
4d	Typed double-byte character (single quotes)	`WCHAR#'X'`

9. Escape Sequences (Table 7, Section 6.3.3)¶

Two-character combinations beginning with dollar sign $:

No.	Meaning	Combination
1	Dollar sign	`$$`
2	Single quote	`$'`
3	Line feed	`$L` or `$l`
4	Newline	`$N` or `$n`
5	Form feed (page)	`$P` or `$p`
6	Carriage return	`$R` or `$r`
7	Tab	`$T` or `$t`
8	Double quote	`$"`

Notes: - $' is only valid inside single-quoted strings - $" is only valid inside double-quoted strings - $N (newline) provides implementation-independent line ending

10. Duration Literals (Table 8, Section 6.3.4)¶

Time Unit Abbreviations¶

Abbrev.	Meaning
d	Day
h	Hour
m	Minute
s	Second
ms	Millisecond
us	Microsecond (no μ available)
ns	Nanosecond

Prefixes¶

Short	Long
`T#`	`TIME#`
`LT#`	`LTIME#`

Examples¶

No.	Description	Examples
2a	Without underscore (short)	`T#14ms`, `T#-14ms`, `T#14.7h`, `t#14.7d`
2b	Without underscore (long)	`TIME#14ms`, `TIME#-14ms`
3a	With underscore (short)	`t#25h_15m`, `t#5d_14h_12m_18s_3.5ms`
3b	With underscore (long)	`TIME#25h_15m`, `LTIME#5d_14h_12m_18s_3.5ms`

Rules¶

Duration literals are delimited by T#, TIME#, LT#, or LTIME#
Units can be in upper or lower case
Underscore separators can be used between units
The least significant unit can use real notation (e.g., 3.5ms)
"Overflow" of most significant unit is permitted (e.g., T#25h_15m)
Both positive and negative values are allowed

11. Date and Time Literals (Table 9, Section 6.3.5)¶

Date Literals¶

No.	Type	Prefix	Example
1a	Date (long)	`DATE#` or `date#`	`DATE#1984-06-25`
1b	Date (short)	`D#`	`D#1984-06-25`
2a	Long date (long)	`LDATE#`	`LDATE#2012-02-29`
2b	Long date (short)	`LD#`	`LD#1984-06-25`

Time of Day Literals¶

No.	Type	Prefix	Example
3a	Time of day (long)	`TIME_OF_DAY#`	`TIME_OF_DAY#15:36:55.36`
3b	Time of day (short)	`TOD#`	`TOD#15:36:55.36`
4a	Long time of day (short)	`LTOD#`	`LTOD#15:36:55.36`
4b	Long time of day (long)	`LTIME_OF_DAY#`	`LTIME_OF_DAY#15:36:55.36`

Date and Time Literals¶

No.	Type	Prefix	Example
5a	Date and time (long)	`DATE_AND_TIME#`	`DATE_AND_TIME#1984-06-25-15:36:55.360227400`
5b	Date and time (short)	`DT#`	`DT#1984-06-25-15:36:55.360_227_400`
6a	Long date and time (long)	`LDATE_AND_TIME#`	`LDATE_AND_TIME#1984-06-25-15:36:55.360_227_400`
6b	Long date and time (short)	`LDT#`	`LDT#1984-06-25-15:36:55.360_227_400`

Format¶

Date format: YYYY-MM-DD
Time format: HH:MM:SS[.fraction]
Combined format: YYYY-MM-DD-HH:MM:SS[.fraction]
Underscores can separate fraction digits

Implementation Notes for trust-syntax¶

Lexer Error Conditions¶

Invalid identifier (double underscore, trailing underscore)
Unterminated string literal
Unterminated comment
Invalid escape sequence in string
Invalid numeric literal format
Invalid duration/date format

Lexical Elements¶

Lexical Elements¶

1. Character Set (Table 1, Section 6.1.1)¶

2. Identifiers (Table 2, Section 6.1.2)¶

Rules¶

Features¶

3. Keywords (Section 6.1.3)¶

Rules¶

Complete Keyword List¶

Data Types¶

Generic Types¶

Variable Declarations¶

Type Declarations¶

Program Organization Units¶

OOP Keywords¶

Property Accessors¶

Control Flow¶

Operators¶

Boolean Literals¶

Reference Operations¶

Configuration¶

SFC Elements¶

Special¶

Implementation Extensions (Reserved Keywords)¶

4. White Space (Section 6.1.4)¶

5. Comments (Table 3, Section 6.1.5)¶

Rules¶

6. Pragmas (Table 4, Section 6.2)¶

Rules¶

7. Numeric Literals (Table 5, Section 6.3.2)¶

Rules¶

8. Character String Literals (Table 6, Section 6.3.3)¶

Single-byte Strings (using single quotes)¶

Double-byte Strings (using double quotes)¶

Typed String Literals¶

9. Escape Sequences (Table 7, Section 6.3.3)¶

10. Duration Literals (Table 8, Section 6.3.4)¶

Time Unit Abbreviations¶

Prefixes¶

Examples¶

Rules¶

11. Date and Time Literals (Table 9, Section 6.3.5)¶

Date Literals¶

Time of Day Literals¶

Date and Time Literals¶

Format¶

Implementation Notes for trust-syntax¶

Token Categories¶

Lexer Error Conditions¶