NAME

Net::Server::HTTP - very basic Net::Server based HTTP server class

TEST ONE LINER

perl -e 'use base qw(Net::Server::HTTP); main->run(port => 8080)'
# will start up an echo server

SYNOPSIS

use base qw(Net::Server::HTTP);
__PACKAGE__->run;

sub process_http_request {
    my $self = shift;

    print "Content-type: text/html\n\n";
    print "<form method=post action=/bam><input type=text name=foo><input type=submit></form>\n";

    require Data::Dumper;
    local $Data::Dumper::Sortkeys = 1;

    require CGI;
    my $form = {};
    my $q = CGI->new; $form->{$_} = $q->param($_) for $q->param;

    print "<pre>".Data::Dumper->Dump([\%ENV, $form], ['*ENV', 'form'])."</pre>";
}

DESCRIPTION

Even though Net::Server::HTTP doesn't fall into the normal parallel of the other Net::Server flavors, handling HTTP requests is an often requested feature and is a standard and simple protocol.

Net::Server::HTTP begins with base type MultiType defaulting to Net::Server::PreFork. It is easy to change it to any of the other Net::Server flavors by passing server_type => $other_flavor in the server configuration. The port has also been defaulted to port 80 - but could easily be changed to another through the server configuration. You can also very easily add ssl by including, proto=>"ssl" and provide a SSL_cert_file and SSL_key_file.

For example, here is a basic server that will bind to all interfaces, will speak both HTTP on port 8080 as well as HTTPS on 8443, and will speak both IPv4, as well as IPv6 if it is available.

use base qw(Net::Server::HTTP);

__PACKAGE__->run(
    port  => [8080, "8443/ssl"],
    ipv   => '*', # IPv6 if available
    SSL_key_file  => '/my/key',
    SSL_cert_file => '/my/cert',
);

METHODS

_init_access_log

Used to open and initialize any requested access_log (see access_log_file and access_log_format).

_tie_client_stdout

Used to initialize automatic response header parsing.

process_http_request

Will be passed the client handle, and will have STDOUT and STDIN tied to the client.

During this method, the %ENV will have been set to a standard CGI style environment. You will need to be sure to print the Content-type header. This is one change from the other standard Net::Server base classes.

During this method you can read from %ENV and STDIN just like a normal HTTP request in other web servers. You can print to STDOUT and Net::Server will handle the header negotiation for you.

Note: Net::Server::HTTP has no concept of document root or script aliases or default handling of static content. That is up to the consumer of Net::Server::HTTP to work out.

Net::Server::HTTP comes with a basic %ENV display installed as the default process_http_request method.

process_request

This method has been overridden in Net::Server::HTTP - you should not use it while using Net::Server::HTTP. This overridden method parses the environment and sets up request alarms and handles dying failures. It calls process_http_request once the request is ready and headers have been parsed.

request_denied_hook

This method has been overridden to call send_400. This is new behavior. To get the previous behavior (where the client was closed without any indication), simply provide

process_headers

Used to read in the incoming headers and set the ENV.

_init_http_request_info

Called at the end of process_headers. Initializes the contents of http_request_info.

http_request_info

Returns a hashref of information specific to the current request. This information will be used for logging later on.

send_status

Takes an HTTP status, an optional message, optional body, and optional generate_body flag. Sends out the correct headers.

$self->send_status(500);
$self->send_status(500, 'Internal Server Error');
$self->send_status(500, 'Internal Server Error', "<h1>Internal Server Error</h1><p>Msg</p>");
$self->send_status(500, undef, undef, ['Msg']);
send_400

Calls send_status with 400 and the passed arguments as generate_body.

send_500

Calls send_status with 500 and the argument passed to send_500.

c<log_http_request>

Called at the end of post_process_request. The default method looks for the default access_log_format and checks if logging was initialized during _init_access_log. If both of these exist, the http_request_info is formatted using http_log_format and the result is logged.

http_log_format

Takes a format string, and request_info and returns a formatted string. The format should follow the apache mod_log_config specification. As in the mod_log_config specification, backslashes, quotes should be escaped with backslashes and you may also include \n and \t characters as well.

The following is a listing of the available parameters as well as sample output based on a very basic HTTP server.

%%                %                 # a percent
%a                ::1               # remote ip
%A                ::1               # local ip
%b                83                # response size (- if 0) Common Log Format
%B                83                # response size
%{bar}C           baz               # value of cookie by that name
%D                916               # elapsed in microseconds
%{HTTP_COOKIE}e   bar=baz           # value of %ENV by that name
%f                -                 # filename - unused
%h                ::1               # remote host if lookups are on, remote ip otherwise
%H                http              # request protocol
%{Host}i          localhost:8080    # request header by that name
%I                336               # bytes received including headers
%l                -                 # remote logname - unsused
%m                GET               # request method
%n                Just a note       # http_note by that name
%{Content-type}o  text/html         # output header by that name
%O                189               # response size including headers
%p                8080              # server port
%P                22999             # pid - does not support %{tid}P
q                 ?hello=there      # query_string including ? (- otherwise)
r                 GET /bam?hello=there HTTP/1.1      # the first line of the request
%s                200               # response status
%u                -                 # remote user - unused
%U                /bam              # request path (no query string)
%t                [06/Jun/2012:12:14:21 -0600]       # http_log_time standard format
%t{%F %T %z}t     [2012-06-06 12:14:21 -0600]        # http_log_time with format
%T                0                 # elapsed time in seconds
%v                localhost:8080    # http_log_vhost - partial implementation
%V                localhost:8080    # http_log_vhost - partial implementation
%X                -                 # Connection completed and is 'close' (-)

Additionally, the log parsing allows for the following formats.

%>s               200               # status of last request
%<s               200               # status of original request
%400a             -                 # remote ip if status is 400
%!400a            ::1               # remote ip if status is not 400
%!200a            -                 # remote ip if status is not 200

There are few bits not completely implemented:

> and <    # There is no internal redirection
%I         # The answer to this is based on header size and Content-length
             instead of the more correct actual number of bytes read though
             in common cases those would be the same.
%X         # There is no Connection keepalive in the default server.
%v and %V  # There are no virtual hosts in the default HTTP server.
%{tid}P    # The default servers are not threaded.

See the access_log_format option for how to set a different format as well as to see the default string.

exec_cgi

Allow for calling an external script as a CGI. This will use IPC::Open3 to fork a new process and read/write from it.

use base qw(Net::Server::HTTP);
__PACKAGE__->run;

sub process_http_request {
    my $self = shift;

    if ($ENV{'PATH_INFO'} && $ENV{'PATH_INFO'} =~ s{^ (/foo) (?= $ | /) }{}x) {
       $ENV{'SCRIPT_NAME'} = $1;
       my $file = "/var/www/cgi-bin/foo"; # assuming this exists
       return $self->exec_cgi($file);
    }

    print "Content-type: text/html\n\n";
    print "<a href=/foo>Foo</a>";
}

At this first release, the parent server is not tracking the child script which may cause issues if the script is running when a HUP is received.

http_log_time

Used to implement the %t format.

http_log_env

Used to implement the %e format.

Used to implement the %C format.

http_log_header_in

used to implement the %i format.

http_log_note

Used to implement the %n format.

http_note

Takes a key and an optional value. If passed a key and value, sets the note for that key. Always returns the value. These notes currently only are used for %{key}n output format.

http_log_header_out

Used to implement the %o format.

http_log_pid

Used to implement the %P format.

http_log_vhost

Used to implement the %v and %V formats.

http_log_constat

Used to implement the %X format.

exec_trusted_perl

Allow for calling an external perl script. This method will still fork, but instead of using IPC::Open3, it simply requires the perl script. That means that the running script will be able to make use of any shared memory. It also means that the STDIN/STDOUT/STDERR handles the script is using are those directly bound by the server process.

use base qw(Net::Server::HTTP);
__PACKAGE__->run;

sub process_http_request {
    my $self = shift;

    if ($ENV{'PATH_INFO'} && $ENV{'PATH_INFO'} =~ s{^ (/foo) (?= $ | /) }{}x) {
       $ENV{'SCRIPT_NAME'} = $1;
       my $file = "/var/www/cgi-bin/foo"; # assuming this exists
       return $self->exec_trusted_perl($file);
    }

    print "Content-type: text/html\n\n";
    print "<a href=/foo>Foo</a>";
}

At this first release, the parent server is not tracking the child script which may cause issues if the script is running when a HUP is received.

exec_fork_hook

This method is called after the fork of exec_trusted_perl and exec_cgi hooks. It is passed the pid (0 if the child) and the file being ran. Note, that the hook will not be called from the child during exec_cgi.

http_dispatch

Called if the default process_http_request and process_request methods have not been overridden and app configuration parameters have been passed. In this case this replaces the default echo server. You can also enable this subsystem for your own direct use by setting enable_dispatch to true during configuration. See the app configuration item. It will be passed a dispatch qr (regular expression) generated during _check_dispatch, and a dispatch table. The qr will be applied to path_info. This mechanism could be used to augment Net::Server::HTTP with document root and virtual host capabilities.

OPTIONS

In addition to the command line arguments of the Net::Server base classes you can also set the following options.

max_header_size

Defaults to 100_000. Maximum number of bytes to read while parsing headers.

server_revision

Defaults to Net::Server::HTTP/$Net::Server::VERSION.

timeout_header

Defaults to 15 - number of seconds to wait for parsing headers.

timeout_idle

Defaults to 60 - number of seconds a request can be idle before the request is closed.

access_log_file

Defaults to undef. If true, this represents the location of where the access log should be written to. If a special value of STDERR or /dev/stderr is passed, the access log entry will be written to the same location as the ERROR log. If a special value of STDOUT or /dev/stdout is passed, the access log entry will be written to standard out.

access_log_function

Can take a coderef or method name to call when an log_http_request method is called. Will be passed the formatted log access log message.

Note that functions depending upon stdout will not function during Net::Server::HTTP process_request because stdout is always tied for the client (and not restored after running).

access_log_format

Should be a valid apache log format that will be passed to http_log_format. See the http_log_format method for more information.

The default value is the NCSA extended/combined log format:

'%h %l %u %t \"%r\" %>s %b \"%{Referer}i\" \"%{User-Agent}i\"'
app

Takes one or more items and registers them for dispatch. Arguments may be supplied as an arrayref containing a location/target pairs, a hashref containing a location/target pairs, a bare code ref that will use "/" as the location and the codref as the target, a string with a space indicating "location target", a string containing "location=target", or finally a string that will be used as both location and target. For items passed as an arrayref or hashref, the target may be a coderef which will be called and should handle the request. In all other cases the target should be a valid executable suitable for passing to exec_cgi.

The locations will be added in the order that they are configured. They will be added to a regular expression which will be applied to the incoming PATH_INFO string. If the match is successful, the $ENV{'SCRIPT_NAME'} will be set to the matched portion and the matched portion will be removed from $ENV{'PATH_INFO'}.

Once an app has been passed, it is necessary for the server to listen on /. Therefore if "/" has not been specifically configured for dispatch, the first found dispatch target will also be used to handle "/".

For convenience, if the log_level is 2 or greater, the dispatch table is output to the log.

This mechanism is left as a generic mechanism suitable for overriding by servers meant to handle more complex dispatch. At the moment there is no handling of virtual hosts. At some point we will add in the default ability to play static content and likely for the ability to configure virtual hosts - or that may have to wait for a third party module.

app => "/home/paul/foo.cgi",
  # Dispatch: /home/paul/foo.cgi => home/paul/foo.cgi
  # Dispatch: / => home/paul/foo.cgi (default)


app => "../../foo.cgi",
app => "./bar.cgi",
app => "baz ./bar.cgi",
app => "bim=./bar.cgi",
  # Dispatch: /foo.cgi => ../../foo.cgi
  # Dispatch: /bar.cgi => ./bar.cgi
  # Dispatch: /baz => ./bar.cgi
  # Dispatch: /bim => ./bar.cgi
  # Dispatch: / => ../../foo.cgi (default)


app => "../../foo.cgi",
app => "/=./bar.cgi",
  # Dispatch: /foo.cgi => ../../foo.cgi
  # Dispatch: / => ./bar.cgi

# you could also do this on the commandline
net-server HTTP app ../../foo.cgi app /=./bar.cgi

# extended options when configured from code

Net::Server::HTTP->run(app => { # loses order of matching
  '/' => sub { ... },
  '/foo' => sub { ... },
  '/bar' => '/path/to/some.cgi',
});

Net::Server::HTTP->run(app => [
  '/' => sub { ... },
  '/foo' => sub { ... },
  '/bar' => '/path/to/some.cgi',
]);
default_content_type

Default is text/html. Set on any responses that have not yet passed a content-type

allow_body_on_all_statuses

By default content-type and printing a body are not allowed on 204, 304, or 1xx statuses. Set this flag to automatically send a content-type on those statuses as well.

TODO

Add support for writing out HTTP/1.1.

AUTHOR

Paul T. Seamons paul@seamons.com

THANKS

See Net::Server

SEE ALSO

Please see also Net::Server::Fork, Net::Server::INET, Net::Server::PreFork, Net::Server::PreForkSimple, Net::Server::MultiType, Net::Server::Single Net::Server::SIG Net::Server::Daemonize Net::Server::Proto