Manpages

NAME

router − zmailer message routing daemon

SYNOPSIS

router [ −diksSVW ] [ −f configfile ] [ −n #routers ] [ −o zmshoptions ] [ −r routerdirloops ] [ −t traceflag ] [ −L logfile ] [ −P postoffice ]

DESCRIPTION

The router daemon makes all decisions affecting the processing of messages in ZMailer.

A mail message is submitted by placing it in a file in the POSTOFFICE/router directory. The router frequently scans this directory for new files and will lock and process them as it finds them. The result is a message control file that gets linked into the POSTOFFICE/scheduler and POSTOFFICE/transport directories for use by the scheduler(8) in the next step of message processing. The original message file is then moved to the POSTOFFICE/queue directory.

The router’s behaviour is controlled by a configuration file read at startup. It is really a zmsh(1) script that uses facilities provided builtin to the router.

OPTIONS

Invoking router without any arguments will do nothing (except make it reads its configuration file and promptly exit). The normal startup method is to run the zmailer(1) script, as in zmailer router. This will start the router as a daemon and kill the previous incarnation of the router.

−d

detach and run as a daemon.

−f configfile

overrides the default configuration file MAILSHARE/router.cf.

−i

run interactively, presenting a zmsh session with the configuration file preloaded.

−k

kill the currently running router by sending it a SIGTERM signal.

−L logfile

overrides the default log file location LOGDIR/router.

−m memtracefile

For debug purposes when compiled with XMEM option, not in general production setup.

−N

don’t syslog normal routing operation information, syslog only errors..

−n #routers

starts the specified number of parallel router processes. The default is a single router process.

−o zmshoptions

sets the option string passed on the the internal zmsh invocation. The default is -O. Note that the leading ’-’ is mandatory. See zmsh(1) for the available options.

−P postoffice

specifies an alternate POSTOFFICE directory.

−r routerdirloops

Process routerdirloops messages out of any alternate router directories, then check to see if any higher-priority jobs have been created. If undefined and alternate router directories are in use, the router will clear the entire directory before returning. See below and zmailer(3) about Z-Environment and ROUTERDIRS.

−S

Can be used to turn off non-serious syslogging.

−s

Turns stability-flag off and on. Without this flag, the search of new jobs will be done with (sometimes) timeconsuming care of organizing the job files into time order.

−t traceflag

sets trace options, one per -t switch, even before the configuration file is loaded. This is otherwise equivalent to the builtin trace command. The currently known options are: assign, bind, compare, db, final, functions, matched, memory, on, regexp, resolv, rewrite, router, and sequencer.

−V

print version message and run interactively.

−W

Warn about syntax errors in the RFC-822 headers when this option is on. Without this, bad headers are shown in their original form, not by rendering them thru any formatting RFC-822 engine. This deviates from old behaviour of the ZMailer major way!

To restart a router daemon:

router -dk

To test an address, start up an interactive session:

router -i

or if the ZMailer sendmail(8) is installed:

sendmail -bt

Then just use the pre-defined functions.

CONFIGURATION FILE

The router configuration file must provide some services so the message processing can proceed:

-

There must be a way to translate an arbitrary syntactically valid address into a specification for how to deliver for that address. This is routing and must be implemented by a configuration file function with the name: router.

-

There must be a way to make policy decisions about what to do with an address in the context of the particular message it came from. The most typical kind of policy decision is how to rewrite addresses in the message header for the immediate destination of a message envelope recipient address. Policy at this level is implemented by a configuration file function called crossbar.

-

There must be a way to specify what should happen when message processing cannot continue due to a temporary resource unavailability (e.g., when the nameserver is acting up). There is a shell variable, defer, which may be set internally in the database lookup routines in the router. In case of a temporary resource failure, the variable will be set to a well-defined non-null string related to the failure. Temporary failures encountered during message header address rewriting may be dealt with by the header_defer function.

The interface specifications of these items exhaust the expectations the router has of its configuration file. The router contains useful builtin functions that will aid in implementing the required functionality. These functions are described in the section on BUILTIN FUNCTIONS below.

Optionally the router may provide services to other programs. In particular, the smtpserver(8) program relies on the router to do address validation and verification when it is asked (and enabled) to provide this service during an SMTP conversation. The expected name for this function is server.

The following are the interface specifications for the functions mentioned above:
router
address attributes

This function is applied to all message envelope addresses. The first argument to this function is a syntactically valid (understood by the parser) address. The second argument is a symbol whose value is a property list for the address. The property list consists of alternating keys and values, and is modified using the lreplace function.

The router function returns a list of address groups. Each address group is a list of mutually exclusive address tuples, i.e. delivering to one is equivalent to delivering to any other destination address in the group. An address tuple is a list of 4 elements (another name for this is a quad), which in order are:

-

the delivery channel, designating a transport agent.

-

the next host to be delivered to (an uninterpreted channel parameter).

-

the address to present the next host.

-

the symbol whose value is a property list for this address.

By convention, if either the channel or host is irrelevant, it is given as "-".

For example, these are possible parameters and return value:

z# g0=(privilege 0 type recipient)
z# router rayan g0
(((local - rayan g0)))

This function also handles any alias expansion that may be necessary. The following example shows the expansion of a single address to multiple independent recipients:

z# router root g0
(((local - ken g0)) ((local - rayan g0)))

The router function is free to change the privilege of the address, or to add any other information to the property list for use by the crossbar function.

Note: in the current version of ZMailer, only the router knows about mutually exclusive addresses. Therefore all quads must be the lone element of their address group.

If a resource deferral occurs during processing (e.g., the nameserver is busy or broken), the global variable defer will be set to a non-null string indicating the problem. This string is in a format interpreted by the hold transport agent when it sees a host name in a destination address.

crossbar from to

This function controls policy decisions that require context knowledge. It rewrites the next-address portion of an address quad to the proper form, and determines which style of message header rewriting is appropriate. The arguments are sender and recipient address quads, as returned by the router function. The return value is a list of three elements:

-

the name of a function that will be used to rewrite all message header addresses

-

rewritten sender address quad

-

rewritten recipient address quad

The message header address rewriting function will be called with a single argument, the address in original form, and returns a string argument, the address rewritten.

header_defer address

This function rewrites a message header address that might not have been rewritten properly due to a resource deferral. In that case the defer variable will be set by the router during execution of the message header address rewriting function specified by the crossbar function. To avoid repeatedly having to check for this at the end of such rewriting functions, this mechanism is provided as a convenience. Any resource deferral during the execution of this function is ignored. Typically header_defer would just quote the address and make it relative to the local host.

server key ...

This function is not used by the router while processing mail. It is called from the smtpserver when synchronous address validation is required. Since this function may need services provided by other parts of the normal configuration file, it is included by convention. The first argument is a keyword which describes the desired service, followed by parameters to that service. The known keywords are:

init

which should be invoked before any other service.

to

verifies its single parameter as a valid SMTP RCPT TO address.

from

verifies its single parameter as a valid SMTP MAIL FROM address.

verify

verifies its single parameter as a resolvable address.

expand

prints the alias expansion, if any, of the single address parameter.

SMTP error codes and text may be echo’ed directly, and multiline output is handled by a filter in the smtpserver.

BUILTIN FUNCTIONS

The following builtin functions are provided in the router that do not exist in zmsh:
Functions that take a list argument:
channel

returns the channel (1st) component of an address quad.

host

returns the host (2nd) component of an address quad.

user

returns the next-address (3rd) component of an address quad.

attributes

returns the property list symbol (4th) component of an address quad.

Normal functions that take string parameters and return strings:

daemon

starts the router running in daemon mode, scanning the POSTOFFICE/router directory every few seconds for message files to process. This function is invoked automatically by other code in the router program and has no other purpose. router has a bit more complex directory semantics, than can be seen from above. See zmailer(3) for details.

basename pathname [ suffix ]

prints the base filename of the pathname. If a suffix is given and matches the filename, the suffix too is stripped from the filename.

db { add|remove|flush|owner|print|toc } [ database [ key [ value ] ] ]

is the access function to the database facilities in the router. The keyword arguments are:

add

add a key,value entry to the database, if possible.

remove

remove a key entry from the database, if possible.

flush

remove all entries from the database, if possible.

owner

print the account name of the owner of the database, if possible. This is usually determined by the files associated with the database.

print

print all entries of the database, if possible.

toc

print a table of defined relations and their associated information. This table has five (5) columns, in order: the name of the relation, its type and subtype, cache entries and maximum size, flags, and associated files. See the relation function for more information.

The keywords may be abbreviated to their smallest unique prefix (usually a single character).

erraddron [ file ]

specifies a filename to append all address parsing error messages to. If there is no argument given, the logging is stopped. This is primarily for curious postmasters or other collectors of address trivia.

filepriv file [ uid ]

prints the numeric user id of the least privileged account that can modify the specified file. This is determined by an approximation that pessimistically assumes that any file or directory writable by group or others is insecure, and optimistically assumes that it is enough to check a file and its parent directory instead of all the way to the filesystem root. The reason for the latter is that if grandparent directories are insecure, the system is likely to have just as bad potential problems as can be created by using mail to run processes with forged powers (besides, doing the full check would be quite expensive).

If a second argument is given it is the numeric user id to assume for the file. This means only the parent directory will be checked for non-writability and for having the same (or a 0) uid.

gensym

generates and prints a new symbol name in the sequence g0 to gN every time it is called. The sequence is reset and any symbol values destroyed after the router has processed a message. This function is used to generate new symbols, to hold attached address property lists, during alias expansion.

groupmembers groupname

prints the accounts that are listed as members of a group in the system groups file, one per line. Note that accounts with the same login group id in but that are not listed in the groups file will not appear in this list.

homedirectory user

prints the home directory of the specified account.

hostname [ name ]

with an argument this sets the router’s idea of the system hostname, else the name as retrieved from the system is printed. The router has no preconceived notion of what the hostname is, so Message-Id and Received headers will only be generated if a hostname has been set using this function.

lappend varname $values..

Looks up named variable, and gives up if does not find it.

The variable data content is considered to be a simple list, and the values are appended to it as a chain.

listaddresses [ -e error-address ] [ -E errors-to-address ] [ -c
comment
]

filters an RFC822 address list on standard input to produce one normal form (no non-address tokens) address per line on its output. This function can be used to parse the alias file or .forward files or similar. If an error-address is specified, any syntax errors at list parsing will cause a report to be mailed to the given address. If a comment is specified, it will be inserted in the error report. If an errors occurs while messages are being delivered, the ’errors-to-address’ can be used to force error message destination elsewere than to the default ’sender’ of the message.

listexpand [ -c comment ] [ -e erroraddress ] [ -E errors-to ]
$attribute $localpart $origaddr
< listfile

implements the most common pipeline where listaddresses was used with more efficient memory consumption handling.

login2uid login

Prints the uid associated with the specified account name, if any. A side-effect is to add the GECOS name field of the account to the fullname in-core database, to add the login name to uid mapping to the pwnam in-core database, and to add the uid to login name mapping to the pwuid in-core database.

lreplace listvarname fieldindex $newvalue

This modifies the content of named listvarname (property-list) variable by replacing:
with numeric index

the value of indexed element; say "1" would replace second data item in simple chain. If the index goes beyond the chain, the new value is added at the tail of the chain.

with property-element name

the value of possibly existing property-element pair (say: "name1" "value1" "name2" "value2") is replaced with a new one. If the name can not be found, name and its value are appended to the variable.

printaliases [ -v ] [ -o indexoutputfile ] file

reads RFC822 syntax header lines from the specified file, parses them assuming contents must be an address list, and sorts and prints the header lines with all addresses in normal form. If the −o switch is specified, each header line will also generate a header TAB byteoffset line in the indexfile. Comments are allowed; they extend from the character ’#’ at the beginning of a line, or after an address, to the end of line. This function is used by the newaliases(1) program to generate the aliases database from a source file.

process messagefile

is the protocol switch function. It is called by the daemon function to process a message found in the POSTOFFICE/router directory. This function will in turn call an internal protocol-specific function which knows the syntax and semantics of the message file. The current version knows about messages submitted using the MSG_RFC822 parameter to mail_open(3). For that case, the protocol function is called rfc822. router has a bit more complex directory semantics, than is stated above. See zmailer(3) for details. Although the process function is provided built in, it is usually overridden by a defined function in the router configuration file.

recipient

is a boolean function that returns the value of the statement "executing a header rewriting function and the address is a recipient address in a message header".

recase [ -u | -l | -p ] string

is a case-mapping function that prints the parameter string in either all-uppercase, all-lowercase, or capitalized (pretty).

relation -t dbtype[,subtype] [ -f file -e# -s# -bilmnNu% -d driver ]
name

informs the router of the existence of a database, and how to access it. It also creates a builtin function with the specified name, which is used to retrieve the value associated with a key in the database. The options are:
−t dbtype

is one of the known types of databases, currently:

incore

is a database maintained in virtual memory (using splay trees). This type should not be used for any database that must periodically be flushed, since not all occupied memory can be freed.

unordered

is a file with key-value pairs on every line, separated by whitespace. (See about "−m"-option!)

ordered

is a file with key-value pairs on every line, separated by whitespace, sorted by key. (See about "−m"-option!)

hostsfile

is the hosts(5) file.

bind

is the BIND implementation of a Domain Name System resolver. The subtype for this type is the name of a Resource Record type in the IN class.

btree

Is SleepyCat BSD DB version 2.x B-TREE database.

bhash

Is SleepyCat BSD DB version 2.x HASH database.

ndbm

is the new DBM library. The BSD4.4 has a thing called db, which is different thing, but it can be used in place of ndbm via its interface library. (The BSD4.4-db does have only one database file, not two, like ndbm does.)

gdbm

is the GNU implementation of the new DBM library. Note: GDBM uses ONE file, which is named exactly as you parametrize it, this is unlike NDBM, which appends .dir and .pag to the supplied name!

dbm

is the old DBM library. There can be only one DBM open at the time, and this system keeps them all open all the time... Avoid if you can!

yp

is the Network Information Service from Sun Microsystems Inc. (Latter renamed to be NIS, the still newer NIS+ is not supported)

header

is an database type used to store RFC822 header semantics information. It is unlikely to be used for anything else. See the HEADERS section below.

−f file

is a file associated with the database, typically the file containing the data, or the basename of DBM files or something similarly relevant to the database access routines.

−e#

is the default time-to-live on cached information. When the information has been in the cache for this many seconds, it is discarded. The default is 0.

−s#

sets the cache size to the specified number of entries. The default is usually 10, depending on the database type.

−b

if the key exists in the database, return the key as the value.

−i

if the key exists, its value is a byte offset into a file named by the subtype for this database. The value then becomes the concatenation of the data on the lines following that offset which start with whitespace. This is used for the aliases file.

−l

map all keys to lowercase before searching.

−m

check for file content modification before every access. Reopen the file when a change is detected. This option is used when the router should discover changes to a database underfoot so it need not be restarted to use new data. This is warmly recommended on relations which use unordered, or ordered datasets (aliases, routes, ...), and especially if the system is configured to use mmap(2) facility.

Updateing such databases should preferrably use mv command to move a new version of the database in place of the old one. ( do not use copy! )

−n

if the key exists in the database and the value is null or a list, return the key as value. Otherwise return the value retrieved, if any.

−N

Negative Cache, if the key is not found from the backend DB, place the result into the cache with configured TTL. (See −e option.)

−u

map all keys to uppercase before searching.

−d driver

specifies a search driver that allows searching for structured keys using special knowledge. The argument to this option must be a known driver.

Currently known drivers are:
pathalias

The lookup sequence for ’’foo.bar.edu’’ is:

foo.bar.edu
.foo.bar.edu
.bar.edu
.edu
.

pathalias.nodot

The lookup sequence for ’’foo.bar.edu’’ is:

bar.edu
edu

longestmatch

The lookup sequence for ’’foo.bar.edu’’ is:

foo.bar.edu
.bar.edu
.edu
.

−%

Marks that the database results containing ’%0’ thru ’%9’ are to be replaced with positional arguments to the database call (via name).

The ’%0’ is always the full key, ’%1’ may be "wildcarded" portion of the key in case any of the driver routines is used, and the original full key does not give a match.

Nominally ’%1’ thru ’%9’ are positional options to name:

result=$(name $keyval $opt1 $opt2 ... $opt9)

However like is above mentioned, "wildcarding" lookup drivers may change the indices making "$opt1" above to be ’%2’, and eight option would become ’%9’ making ninth inaccessible.

name

is the name bound to the lookup function of defined database.

rfc822 messagefile

This function controls the parsing and processing of a message file in RFC822/976 format. It is called by the process function.

rfc822date

prints the current time in RFC822 format.

runas user function [ arguments... ]

changes the current effective user id of the router process to that given (which may be numeric or an account name), then runs the specified function with the specified arguments, then switches the effective user id of the process back (to root).

sender

is a boolean function that returns the value of the statement "executing a header rewriting function and the address is a sender address in a message header".

rfc822syntax address

This is a simple interface to the address parser. If the command line argument is a syntactically valid RFC822 address, this command is silent and returns 0 as exit status. If there is a parse error, a verbose error message is printed to stdout and the function returns a non-0 exit status.

squirrel { [-]event }

sets the kinds of events that cause a message to be copied into the POSTOFFICE/postman directory. The events are: breakin, badheader, illheader, nochannel, nosender. Whether or not a ’-’ is necessary for an event depends on the current state of the event’s flag. The usage message will indicate what to do to toggle the event flag.

stability { on | off }

determines whether the router will process incoming messages in arrival order (when on), or in random order determined by position in the router directory. The router will by default do the first queue scan in stable mode, and subsequent scans in unstable mode. The name of this command is the name for a similar characteristic of sorting algorithms.

trace key1 ... keyN

Enables tracing of the specified items. The valid keywords are:

all

turns on all tracing options. You only do this to test the I/O capabilities of your system.

except

flips the sense of tokens following this one.

The rtrace routine at the default router scripts is defined as:

trace all except rfc822 regexp

assign

print shell variable assignments.

bind

prints various information from the code that calls the DNS resolver.

compare

print sift statement pattern-selector comparisons.

db

print database lookups, including cache search and update information.

final

prints the message envelope information after processing each message.

functions

print shell function calls and return values, with nesting indicated by indentation.

matched

print sift statement pattern-selector matches.

on

same as functions.

regexp

prints regular expression matching execution.

resolv

turns on the RES_DEBUG flag in the BIND resolver library, and prints various information from the code that calls the DNS resolver.

rewrite

prints the tokenized addresses sent through the message header address rewriting functions.

router

prints the tokenized addresses sent through the router function.

sequencer

prints procedural steps taken during message processing.

memory

prints memory allocation information after each message.

uid2login uid

Prints the first account name associated with a specified user id, if any, or uid#uid if no account exists with that user id. It has the same side-effects as the login2uid function.

untrace key1 ... keyN

Complement of the trace function.

In addition any defined relations will create a builtin function with the same name as the relation.

GLOBAL VARIABLES

The following shell variables are defined by the router:

defer

is set by the database lookup routines to indicate a temporary failure of some kind. The value assigned to this variable is a valid host parameter to the hold channel’s transport agent.

envelopeinfo

is set by the rfc822 function as soon as possible after the message has been validated, and before the first call to the router shell function. It is a property list of at least the following elements:

file

The message file name.

message-id

The message-id.

uid

The user id of the owner of the message file.

gid

The group id of the owner of the message file.

size

The message size (header + body) in bytes.

headersize

The message header size in bytes.

bodysize

The message body size in bytes.

now

The time the router started processing the message (in seconds since epoch).

delay

The number of seconds the message had been queued.

resent

A "yes" or "no" indicating whether Resent- headers exist.

trusted

A "yes" or "no" indicating whether file owner is trusted.

Furthermore every envelope header without address semantics will be added to the list, typically:

with

The RFC822 Received header "with" value.

via

The RFC822 Received header "via" value.

rcvdfrom

The host of the last MTA to touch the message.

etc.

HEADERS

The predefined database headers is used to specify to the RFC822 message header parser which headers it should pay attention to and what their semantics are. The database keys are lowercased header field names. The values are strings of three fields separated by a colon (’:’) character. The first field is the name of the semantic rule in the parser that should be used to parse the header. The second field is either Sender or Recipient or null (indicated by a -), corresponding to whether the header contains addresses and if so which type. The third field is used for a Resent flag, which should be given if the header has a proper Resent- prefix, otherwise null.

There are two kinds of entries in the headers database: mandatory, and optional. The mandatory entries are for those headers that are absolutely necessary for the proper relaying of mail. Typically these are purely address headers. The optional entries are for headers that are not essential to the relaying of mail, but whose semantics are in fact specified in RFC822. For example, here is a representative sample of the contents of the database:

You may decide to add or remove header definitions from this database. This is done using the normal database interface function, db. For example, if you want to disable the automatic checking (and rewriting) of Date message headers, you would do "db delete headers date". Such actions should never be done lightly, since it will likely cause violation of the RFC822 protocol when transferring mail to other mailers.

SIGNALS

SIGHUP:

execution of the shell trap handler is deferred until a sequence point. This makes it easier to do log rollovers entirely in the configuration file, without fear of data corruption.

SIGTERM:

exit cleanly (immediately if idle, otherwise after finishing with the message being processed).

Other signals may be handled by shell traps.

Z-ENVIRONMENT VARIABLES

MAILVAR
MAILSHARE

these two tell the directories where the configuration files and routing databases are located at. The MAILVAR files are host dependent, while MAILSHARE files can be common to all of the campus.

NROUTERS

tells how many parallel routers there may be running.

NOBODY

this tells the cornerstone of the system security -- who is nobody. It can tell either a numeric userid, or account name.

LOGDIR

defines location of log files. Example: LOGDIR=/var/log/mail

LOGLEVEL

** document missing **

POSTOFFICE

defines directory where all POSTOFFICE functions are under.
Example: POSTOFFICE=/var/spool/postoffice

ROUTERDIRS

defines a ’:’ separated list of alternate router directories. If these are defined at all, they must exist, if alternate queueing priority mechanism is desired to be used.
Example: ROUTERDIRS=router1:router2:router3:router4

SCHEDULERDIRHASH

Carries a numeric value of ’’1’’ or ’’2’’ (if defined at all), which will then override possible ’’-H’’ option at the scheduler. Existence of this ZENV-variable tells the router to send messages directly to the scheduler’s hash subdirectories, thus eliminating a few directory operations which the scheduler would otherwise do, and at the same time limiting the size of the directory files.

SYSLOGFLG

** document missing **

FILES

SEE ALSO

router(8), mailq(1), zmailer(3), zmsh(1)

AUTHOR

This program authored and copyright by:
Rayan Zachariassen <rayan [AT] cs.edu>
A plenty of changes and further developement by:
Matti Aarnio <mea [AT] nic.fi>