FTP client
Currently implements:
The client is written in asynchronous style (using [root:Uq_engines]).
The interface of this module was changed in Ocamlnet-3.3. Before this
release, the module was marked as "experimental". This is no longer
the case, as the interface has been updated, and the missing features
have been added (e.g. STOR
support).
Something went wrong, often on socket level
The server violates the FTP specification
A timeout on the control or data connection (this is a fatal error)
The command state:
`Not_connected
: Not connected.`Init
: Just connected, no greeting message arrived yet`Success
: Got the greeting message/last command was successful`Proto_error
: currently unused`Temp_failure
: Last command was not successful, and the code
was between 400 and 499`Perm_failure
: Last command was not successful, and the code
was between 500 and 599`Rename_seq
: Used instead of `Success
after the RNFR command`Restart_seq
: Used instead of `Success
after the REST command`User_pass_seq
: Used instead of `Success
after the USER command
when a password must be typed in`User_acct_seq
: Used instead of `Success
after the USER command
when an account ID must be typed in`Pass_acct_seq
: Used instead of `Success
after the PASS command
when an account iD must be typed in`Preliminary
: a reply with code 100 to 199. There will be another
final reply for the commandThe port of the data connection: `Active
means that the server
initiates the data connection to the listening client, and in the
case of `Passive
the client initiates the data connection to the
listening server. The string argument is the IP address as dotted
quad, the int argument is the port number, and the descriptor
is the listening master socket.
The form_code
has a meaning when FTP is used to print files:
`Non_print
: This is not the case.`Telnet
: Telnet control codes are used for vertical movement`ASA
: ASA (Fortran) control codes are used for vertical movementThe representation of the transferred file:
`ASCII
: An ASCII variant is used, i.e. the server sends files in
ASCII encoding with CR/LF as end-of-line marker. Supported by all
servers.`EBCDIC
: An EBCDIC variant is used, i.e. the server sends files in
EBCDIC encoding with NEL as end-of-line marker`Image
: The file is transferred in its original binary
representation. Supported by all servers."Local" representations are not supported.
This FTP client does not recode the files such that they match the selected representation. When files are downloaded, they are stored as they are received. When files are uploaded, they are sent as they are. The user of this client must do recodings when necessary (the class Ftp_data_endpoint.data_converter may be useful for this).
If no representation is selected, FTP servers assume `ASCII None
as default.
The client supports two structures:
`File_structure
: Files are simply contiguous streams of bytes`Record_structure
: Files are sequences of records. FTP does
not make a difference between variable and fixed length records.
It is not forbidden that the records are themselves structured
into lines, in fact it can happen that end-of-line markers are
contained in binary records. Operating systems that support
record-structured files often store text files in this format, i.e.
every line is stored in its own record, without end-of-line marker.
If record structure is selected by a STRU command, it is recommended
to use the classes Ftp_data_endpoint.out_record_channel and
Ftp_data_endpoint.in_record_channel for the local representation
of the files, otherwise the records may be incorrectly mapped
to the local conventions.Page-structured files (i.e. indexed files) are not supported.
If no structure is selected, FTP servers will assume file structure as default.
The transmission mode selects how the data are encoded in the data connection.
`Stream_mode
: This is the simple format that is responsible for
all the failed FTP downloads. It is supported by all FTP servers,
actually, you cannot assume a better transmission mode from an
unknown FTP server. It is unreliable because it cannot distinguish
between a transmission failure and the regular end-of-file condition.`Block_mode
: This is an improved format using frames to protect
the transmitted data. Unfortunately, almost no FTP server supports
it.Both modes are compatible with both structures, i.e. you can transfer a record-structured file in stream mode and a flat file in block mode. However, in practice this is not the case. Servers that only know flat files are likely to only support stream mode, and servers implementing record structure imply that block transfers base on the record format. So the advice is to use stream mode for flat files, and block mode for record files.
#
cmd_state
| : cmd_state | ; | (* | the command state | *) |
#
ftp_connected
| : bool | ; | (* | whether connected with the server | *) |
#
ftp_data_conn
| : bool | ; | (* | whether there is a clean data conn | *) |
#
ftp_user
| : string option | ; | (* | successfully sent user identifier | *) |
#
ftp_password
| : string option | ; | (* | successfully sent password | *) |
#
ftp_account
| : string option | ; | (* | successfully sent account identifier | *) |
#
ftp_logged_in
| : bool | ; | (* | whether the user is logged in | *) |
#
ftp_port
| : port | ; | (* | the selected port | *) |
#
ftp_repr
| : representation | ; | (* | the selected representation | *) |
#
ftp_structure
| : structure | ; | (* | the selected structure | *) |
#
ftp_trans
| : transmission_mode | ; | (* | the selected trans mode | *) |
#
ftp_dir
| : string list | ; | (* | The current directory, expressed as list of CWD changes minus CDUP changes. This is only reasonable if CWD does not include slashes. The list is in reverse order, i.e. deepest directory first. | *) |
#
ftp_features
| : (string * string option) list option | ; | (* | The list of features returned by the last FEAT command.
None means that no FEAT command was yet tried.
Some [] means that there are no features (either FEAT
returned an empty list, or the FEAT command is not implemented
by the server). Otherwise the list enumerates pairs
(label,param) where label is the case-sensitive feature
label and param the optional parameter. There is no
defined order for the list of features. | *) |
#
ftp_options
| : (string * string option) list | ; | (* | Remembers the OPTS commands sent to the server. The list
enumerates pairs (command,optionparam) , where command
is the uppercase command name the option refers to. Only
the last negotiated optionparam for the command is remembered. | *) |
The ftp_state reflects the knowledge of the client about what has been agreed upon with the server.
An FTP command. Not all commands are implemented by all servers.
Reply code plus text
Run another command as engine. The command is first started when the previous command has terminated. The protocol interpreter does not check whether this command is allowed in the current ftp_state or not.
When the command is done, the engine transitions to
`Done(st,r)
where st
is the state after the command, and r
is the final reply of the server.
Due to the FTP specification there may be several replies for
a command: First, zero or more replies with cmd_state = `Preliminary
,
and then exactly one reply with a final state. The preliminary replies
can be intercepted with the prelim
callback.
Sends immediately an ABOR
command, even when a data transfer is
in progress.
TODO - not yet implemented
Starts the event system; same as Unixqueue.run
The state in the sense of Uq_engines
. Possible values are:
`Working _
: The control connection is still active. The int
argument is currently meaningless.`Done()
: The control connection has been terminated.`Error e
: A violation of the FTP protocol happened, or another
exception e
occurred`Aborted
: The abort
method was calledShuts any active connection immediately down, and changes the
state of the engine to `Aborted
.
Whether the queue is empty
Whether `EPSV
or `EPRT
are required instead of
`PASV
and `PORT
, respectively. This is first set after
connecting to a server (i.e. when the IP address family is known).
Before, it is always false
.
Feature tests
The following methods are first set when the `FEAT
command is run.
Use feat_method
to do so. Until then, always false
is returned.
Whether TVFS filenames are supported
Whether the `MDTM
command is supported. Note that `MDTM
is sometimes
even supported even if the server does not provide the `FEAT
command
to test for this feature.
Whether the `SIZE
command is supported. Note that `SIZE
is sometimes
even supported even if the server does not provide the `FEAT
command
to test for this feature.
Whether the `MLST
and `MLSD
commands are supported
All available facts for `MLST
and `MLSD
The enabled facts for `MLST
and `MLSD
Whether the UTF-8 extension is understood by the server (RFC 2640)
The client protocol interpreter...
An ftp_method
is a small procedure doing some task
These exceptions may be raised during execution by the FTP method. The int is the unexpected FTP control code and the string the corresponding text. A temporary failure has a code between 400 and 499, and a permanent failure has a code between 500 and 599.
This method connects to the host
This FTP method logs the user
in. get_password
is called when
the FTP server asks for the password (may be skipped). get_account
is called when the server asks for the account ID (may be skipped).
This FTP method walks to the target directory:
`File name
: The name
is interpreted as slash-separated path.
It is always interpreted relative to the home directory of the
user (i.e. the directory after login), even if it begins with a
slash. The FTP command walks to the directory containing name
.`Dir name
: The FTP command walks to the directory name
(same
syntax as for `File
).`Stay
: The FTP command does nothing (stays in the current directory).There are several methods how to specify filenames:
`NVFS name
: The "Network Virtual File System" is the normal way
of accessing FTP files. The client walks into the directory containing
the file using CWD
and CDUP
commands, and calls the file operation
from this directory. For simplicity, this client interprets slashes
in name
as path component separators. The FTP server will never
see these slashes.`TVFS name
: The optional "Trivial Network File System" avoids the
CWD
and CDUP
commands. The tagged name
is normalized (double
slashed removed etc.), and is passed to the server as-is. Before using
the faster TVFS one should check whether it is supported (feature
"TVFS"). Note that even for TVFS there are no special files "."
and ".."!`Verbatim name
: The string name
is passed to the server without
transforming it in any way.This FTP method walks to the right directory and gets file
from
the server. The file is stored in the local_receiver
that can be
obtained by calling the store
function. The selected representation
remains unchanged.
This FTP method walks to the right directory and puts file
to
the server. The file is taken from the local_sender
that can be
obtained by calling the store
function. The selected representation
remains unchanged.
meth
selects the method to use (default `STOR
).
Requests a certain structure for future file transfers
Requests a certain mode for future file transfers
Renames the file_from
into file_to
.
Both file names must be of the same type, either `NVFS
or `Verbatim
.
If `NVFS
, both names must be in the same directory.
Lists the contents of the directory dir
using the LIST command.
The representation
must not be `Image
.
Lists the contents of the directory dir
using the NLST command
The representation
must not be `Image
.
Returns the filenames contained in the output of `NLST
Determines the date and time of the last modification of file
.
On success, process_result
is called.
Determines the size of file
. On success, process_result
is called.
The size depends on representation
.
Get the list of feature tokens (see also Ftp_client.ftp_state.ftp_features)
A file entry (name, facts)
. The facts are given as pairs
(factname,value)
where factname
is always lowercase.
For parsers for common facts see below.
Get the file entry for file
.
Gets the entries for this directory.
Types:
`File
: entry refers to file`Cdir
: entry refers to the directory being listed`Pdir
: entry is a parent of the directory being listed`Dir
: entry refers to directory`Other
: entry is neither file nor directoryPermissions:
`Append
: append to file possible`Create
: file can be created in this dir`Delete
: file or dir can be deleted`Enter
: dir can be entered`Rename
: file or dir can be renamed`List
: dir can be listed`Mkdir
: subdir can be created in this dir`Delete_member
: a file or dir can be deleted in this directory`Read
: a file can be retrieved`Write
: a file can be storedThe following functions extract commonly used facts from entries.
They may raise Not_found
.
The curerent protocol interpreter. It is allowed that a different pi
is created when a new connection is opened.
The pi
is first available after running the first method. The
method fails if pi
is unavailable.
Configures a timeout for both the control and the data connection. This must be done before connecting to a server.
Sets that a SOCKS version 5 proxy is used at this host and port. There is no support for authentication at the proxy. Only passive mode is supported.
Aborts all current activities if any, and re-initializes the client
The ftp client is a user session that may even span several connections. However, only one server is connected at once.
To download a single flat file from a server:
let buffer = Buffer.create 1000 in
let ch = new Netchannels.output_buffer buffer in
let client = new ftp_client() in
client # exec (connect_method ~host:"127.0.0.1" ());
client # exec (login_method ~user:"foo"
~get_password:(fun () -> "password")
~get_account:(fun () -> "foo") ());
client # exec (get_method ~file:(`NVFS "path/to/file")
~representation:`Image
~store:(fun _ -> `File_structure ch) ());
The file is stored in buffer
. By using a different netchannel, it
can be stored whereever wanted.
To download a record-structured text file, use a store
like:
let ch = (as above) in
let rec_ch = new Ftp_data_endpoint.write_out_record_channel
~repr:(`ASCII_unix `Enc_iso88591)
ch
...
... ~store:(fun _ -> `Record_structure rec_ch)
Here, the end-of-record is transcoded to newline. Note that the ASCII
variant (`Enc_iso88591
) is ignored by write_out_record_channel
.
Open: How to select record structure using an FTP method.
Character conversions: To convert an EBCDIC file to ASCII, use something like
let ch = (as above) in
let converter = new Ftp_data_endpoint.data_converter
~fromrepr:(`EBCDIC `Enc_cp1047)
~torepr:(`ASCII_unix `Enc_iso88591) in
let ch_ebcdic = new Netchannels.output_filter converter ch in
...
... ~representation:(`EBCDIC None)
... ~store:(fun _ -> `File_structure ch_ebcdic)
The class data_converter
also performs the transformation of the
end-of-line convention, unlike the similar class
Netconversion.conversion_pipe
.