NFRCP

Section: NFRCP (1)
Updated: $Date: 2009/08/30 13:33:26 $
Index Return to Main Contents

NAME

nfrcp - Copies files between a local and a remote host or between two remote hosts

SYNOPSIS

nfrcp [-prv] source destination

DESCRIPTION

nfrcp is a version of the standard unix rcp command that communicates over its own sockets and uses a multithreaded reader and writer to speed up file transfers across fast networks. See 'man rcp' for more details. It also calculates checksums of transferred files and can store these checksums in a database for later verification.

All the hosts mentioned in the command arguments must have nfrcp installed on them.

OPTIONS

-p: The -p option causes to attempt to preserve (duplicate) in its copies the modification times and modes of the source files, ignoring the umask. By default, the mode and owner of file2 are preserved if it already existed; otherwise the mode of the source file modified by the umask(2) on the destination host is used.
-r: If any of the source files are directories, nfrcp copies each subtree rooted at that name; in this case the destination must be a directory.
-v: Verbose output. Reports various details about the transfer. This happens at both ends of the transfer. On the remote end it will attempt to write a log file as the remote user. See FILES below.
-n: Inhibit calculation of checksums.

DESCRIPTION

See 'man rcp' for details on the source and destination arguments. nfrcp uses much of the same protocol that ordinary rcp uses, except that instead of copying the files via the socket connecting the local and remote ends, it just exchanges a small amount of handshaking data over this connection and copies the file over sockets that each end creates. In the case where the remote nfrcp is proxied through some other host (e.g. a front-end node to a cluster that does some load balancing by redirecting logins to another host), this avoids sending traffic through the host in the middle. It also allows setting of the socket send-buffer size, which must be done before a connection is set up.

If the configuration file (see below) specifies a db-root, nfrcp will attempt to store and retrieve checksums for transferred files in an sqlite database. The database entries are keyed by file inode and modification time, so they should be immune to file renames within a filesystem. Ideally the checksums would be stored iin extended attributes of the files themselves instead of in a database, but not all filesystems (most notably QFS) support extended attributes.

When sending a file, nfrcp will first try to retrieve its checksum from the local database. If there is no database entry for a source file (either because there is no database at the sending end, or no database record can be found for the file), a checksum is generated from the file data at the sending end. This checksum is compared against the file as it is received by the other host, and recorded in the database if one exists at the receiving end. If a received checksum does not match the received file, an error is generated.

CONFIGURATION

The optional configuration file /etc/nfrcp.conf contains option-value pairs, separated by whitespace, one pair per line, that are used to alter the behaviour of nfrcp.

hostname my.preferred.interface.hostname

When set on the sending end, this forces the receiver to connect to the sender via the specified hostname, and avoids the peer-detection heuristic described below. Set this to the hostname or doted-decimal ip address of the host's preferred interface if required (see below). This string will be sent by the sending nfrcp to the receiving end as part of the handshake, and the receiving end will then try to connect to that hostname or ip address, so it must make sense to the receiving nfrcp.

If this is not set (e.g. where there is no configuration file), the receiving nfrcp will attempt to work out whether there is a firewall or rsh-proxy between it and the sender. If there is, the receiver will attempt to connect to the hostname that the sender sent in the handshake (as returned by gethostname() on the sender). If there isn't, the receiver will attempt to contact the sender via the address used in the rsh (rcmd()) connection. The firewall/proxy detection works by having the sender send the ip address and port of its end of the rsh connection in the handshake. If the receiver sees the handshake coming from the address and port mentioned in the handshake, it can be reasonably sure there is no NAT or rsh-proxy in between. That said, it's possible to construct a NAT setup whereby the rsh connection goes through untranslated, but other things don't. For these weird cases, override the connection address by setting the hostname in the config file.

db-root /path/to/checksum/database/root/directory

If this appears, file checksums will be written to a database rooted at the specified path.

hash-name sha1|md5|....

The OpenSSL algorithm name of the checksum to use.

RESTRICTIONS

The nfrcp command is confused by output generated by commands in a .cshrc file on the remote host.

If a user writes to a file that is recorded by nfrcp in its database, the database entry is effectively hidden by virtue of the file mtime changing. If the user conspires to keep the mtime unchanged, nfrcp will report a checksum error when retrieving the file, but the file will still be copied as normal.

FILES

/etc/hosts.equiv: Specifies remote hosts from which users can execute commands on the local host (provided these users have an account on the local host).
$HOME/.rhosts: Specifies remote users who can use a local user account.
/tmp/nfrcp-$UID.log: When the -v option is used, the remote end sends all its verbose output to this file. If you run more than one 'nfrcp -v ...' involving the same remote host simultaneously, you will only see the log from the last one.
/etc/nfrcp.conf: Configuration file. See above.

NOTES

nfrcp must be in the user's PATH on the remote end. i.e. 'rsh hostname which nfrcp' must find nfrcp.

The checksum database has the following features:

1. Checksums are indexed by file inode and mtime. This menas files can be renamed within a filesystem without losing their checksums.

2. By using a daemon-less database (sqlite), database access is controlled by standard unix file permissions. nfrcp needs to be setuid root so it can call rcmd() (just like rcp ) but should also be setgid to protect access to the checksum database. The checksum database root directory should only be writable by the group that owns the nfrcp executable.

Each database entry records the inode number, modification time, writer uid, checksum, and checksum state of a file and some associated timestamps.

3. File pathnames are not stored. They are not needed due to the inode indexing, and might present an information disclosure risk if they were.

This document was created by man2html, using the manual pages.
Time: 12:36:11 GMT, August 30, 2009

NFRCP

NAME

SYNOPSIS

DESCRIPTION

OPTIONS

DESCRIPTION

CONFIGURATION

RESTRICTIONS

FILES

NOTES

SEE ALSO

Index