Contents:
The Perl Debugger
Common Goofs for Novices
Efficiency
Programming with Style
Distribution and Installation
Perl Poetry
History Made Practical
Did you ever have a junk drawer? You know, one of those drawers where you put everything important enough to keep (like the spare key to the back door), but not important enough to have a place of its own (like the back door itself).
Well, this chapter is the junk drawer of the book. We stuffed many important (and a few not-so-important) things in this chapter. Read on.
First of all, have you tried using the -w switch?
If you invoke Perl with the -d switch, your script runs under the Perl debugger. This works like an interactive Perl environment, prompting for debugger commands that let you examine source code, set breakpoints, dump out your function-call stack, change the values of variables, and so on. Any command not recognized by the debugger[ 1 ] is directly executed ( eval 'd) as Perl code in the current package.[ 2 ] This is so wonderfully convenient that you often fire up the debugger all by itself just to test out Perl constructs interactively to see what they do. Here's a common way to get that:
[1] Leading whitespace before a command would cause the debugger to think it's not a command for it, but rather for Perl, so be careful not to do that.
[2] The debugger uses the DB package for its own state information.
perl -d -e 42
In Perl, the debugger is not a separate program as it usually is in a typical programming environment. Instead, the -d flag tells the compiler to insert source information into the parse trees it's about to hand off to the interpreter. That means your code must first compile correctly for the debugger to work on it. Then when the interpreter starts up, it pre-loads a Perl library file containing the debugger itself.
The debugger understands the following commands:
h
[
command
]Prints out a help message.
If you supply another debugger command as an argument to the
h
command, it prints out the description for just that command. The command "
h h
" produces a more compact help listing designed to fit on one screen. If the output of the
h
command (or any command, for that matter) scrolls past your screen, just precede the command with a leading pipe symbol so it's run through your pager:
DB<1> |h
p
expr
Same as "
print DB::OUT
expr
" in the current package. In particular, since this is just Perl's own
print
function, this means that nested data structures and objects are not dumped, unlike with the
x
command. The
DB::OUT
handle is opened to
/dev/tty
(or perhaps an editor window) no matter where standard output may have been redirected to.
x
expr
Evals its expression in a list context and dumps out the result in a pretty-printed fashion. Nested data structures are printed out recursively, unlike with the print command above.
V
[
pkg
[
vars
]]
Display all (or some) variables in package (defaulting to the main package) using a data pretty-printer. (Hashes show their keys and values so you see what's what, control characters are made printable, nested data structures print out in a legible fashion, and so on.) Make sure you type the identifiers without a type specifier such as
$
or
@
, like this:
V DB filename line
In place of a variable name, you can use
~
pattern
or
!
pattern
to print existing variables whose names either match or don't match the specified regular expression.
X
[
vars
]
T
Produce a stack backtrace. See below for details on its output.
s
[
expr
]Single step. Executes until it reaches the beginning of another statement, descending into subroutine calls. If an expression is supplied that includes function calls, it, too, will be single-stepped.
n
Next. Executes over subroutine calls, until it reaches the beginning of the next statement at this same level.
<CR>
Repeat last
n
or
s
command.
c
[
line
]Continue, optionally inserting a one-time-only breakpoint at the specified line.
l
l
min+incr
l
min-max
List lines
min
through
max
.
l
line
List a single line.
l
subname
List first few lines from subroutine.
-
w
[
line
]
List window (a few lines) around the given
line
, or the current one if no
line
is supplied.
Return debugger pointer to the last-executed line and print it out.
f
filename
pattern
/
pattern
?
L
S
[[!]
pattern
]
List subroutine names matching (or not matching with "!")
pattern
. If no
pattern
is given, all subroutines are listed.
t
t
expr
Trace through execution of
expr
.
b
[
line
] [
condition
]
Set a breakpoint at
line
. If
line
is omitted, set a breakpoint on the line that is about to be executed.
condition
, if given, is evaluated each time the statement is reached, and a breakpoint is taken only if
condition
is true. Breakpoints may only be set on lines that begin an executable statement. Conditions don't use
if
:
b 237 $x > 30 b 33 /pattern/i
b
subname
[
condition
]Set a (possibly conditional) breakpoint at the first line of the named subroutine.
d
[
line
]
Delete a breakpoint at the specified
line
. If
line
is omitted, deletes the breakpoint on the line that is about to be executed.
D
a
[
line
]
command
Set an action to be done before the
line
is executed. The sequence of steps taken by the debugger is:
Check for a breakpoint at this line.
Print the line if necessary (tracing).
Do any actions associated with that line.
Prompt the user if at a breakpoint or in single-step.
Evaluate the line.
For example, this will print out
$foo
every time line 53 is passed:
a 53 print "DB FOUND $foo\n"
A
Delete all installed actions.
O
[
opt
[=
val
]]
Set or query values of options.
val
defaults to 1.
opt
can be abbreviated to the shortest unique string, which is why some options are uppercase and others are lowercase. Options are:
Option | Value |
---|---|
recallCommand ShellBang
|
The characters used to recall command or spawn shell. By default, these are both set to "!" (see below). |
pager
|
Program to use for output of pager-piped commands (those beginning with a |
PrintRet
|
Enables printing of return value after |
frame
|
Enables printing messages on entry and exit from subroutines. |
The following options affect what happens with
V
,
X
, and
x
commands:
Option | Value |
---|---|
arrayDepth hashDepth
|
Print only to depth |
compactDump veryCompact
|
Change style of array and hash dump. |
globPrint
|
Whether to print contents of globs. |
DumpDBFiles
|
Dump arrays holding debugged files. |
DumpPackages
|
Dump symbol tables of packages. |
quote HighBit undefPrint
|
Change style of string dump. |
tkRunning
|
Run Tk while prompting (with ReadLine).[ 3 ] |
signalLevel warnLevel dieLevel
|
Level of verbosity. |
[3] A Perl application is usually frozen when sitting at the debugger prompt. Tk support keeps the event loop of Tk running while reading the prompt.
During startup, options are initialized from
$ENV{PERLDB_OPTS}
. You can put additional initialization options
TTY
,
noTTY
,
ReadLine
, and
NonStop
there. Here's an example using the
$ENV{PERLDB_OPTS}
variable:
$ PERLDB_OPTS="N f=2" perl -d myprogram
This will run the script
myprogram
without human intervention, printing out the call tree with entry and exit points. Note that "
N f=2
" is equivalent to "
NonStop=1 frame=2
".
command
Set an action to happen before every debugger prompt. A multi-line
command
may be entered by backslashing the newlines.
command
should be Perl code.
command
Set an action to happen after the prompt when you've just given a command to return to executing the script. A multi-line
command
may be entered by backslashing the newlines.
command
should be Perl code.
number
number
Redo
number
'th-to-last command.
pattern
Redo last command that started with
pattern
. See "
O recallCommand
", too.
cmd
Run
cmd
in a subprocess (which will read from
DB::IN
, write to
DB::OUT
). See "
O shellBang
", too.
H -
number
Display last
number
commands. Only commands longer than one character are listed. If
number
is omitted, lists them all.
q
or
^D
R
Restart the debugger by exec ing a new session. It tries to maintain your history across this, but internal settings and command line options may be lost.
dbcmd
dbcmd
Same as |
dbcmd
but
DB::OUT
is temporarily
select
ed as well. Often used with commands that would otherwise produce long output, such as
|V main
= [
alias value
]
command
Execute
command
as a Perl statement. A semicolon is not needed at the end.
If you have any compile-time executable statements (code within a
BEGIN
block or a
use
statement), they will not be stopped by the debugger, although
require
s will.
The debugger prompt is something like:
DB<8>
or even:
DB<<17>>
where that number is the command number. A
csh
-like history mechanism allows you to access previous commands by number. For example,
!17
would repeat command number 17. The number of angle brackets indicates the depth of the debugger. You get more than one set of brackets, for example, if you're already at a breakpoint and then print out the result of a function call that itself also has a breakpoint.
If you want to enter a multi-line command, such as a subroutine definition with several statements, you may escape the newline that would normally end the debugger command with a backslash. Here's an example:
DB<1> for (1..4) { \ cont: print "ok\n"; \ cont: } ok ok ok ok
Note that this business of escaping a newline is specific to interactive commands typed into the debugger.
Let's say you want to fire up the debugger on a little program of yours (let's call it
camel_flea
), and stop it as soon as it gets down to a function named
infested
. Here's how you'd do that:
shell_prompt%
perl -d camel_flea
Stack dump during die enabled outside of evals. Loading DB routines from perl5db.pl patch level 0.94 Emacs support available. Enter h or `h h' for help. main::(camel_flea:3): $a = 1; DB<1>
The debugger halts your program right before the first run-time executable statement (but see above regarding compile-time statements) and asks you to enter a command. Contrary to popular expectations, whenever the debugger stops to show you a line of code, it displays the line it's about to execute, not the one it just executed.
Now, you'd like to stop as soon as your program gets to the
infested
function, so you enter a breakpoint there like so:
DB<1> b infested DB<2> c
The debugger now continues until it hits that function, at which point it does this:
main::infested(camel_flea:12): my bugs;
It might be nice to look at a window of source code around the breakpoint, so you use the
w
command:
DB<2> w 9: } 10: 11: sub infested { 12==>b my $bugs; 13: return 3.5; 14: } DB<2>
As you see, your current line is line 12, and it has a breakpoint on it.
Now, you'd like to see who called whom, so you ask for a stack backtrace:
DB<2> T $ = main::infested called from file `Ambulation.pm' line 10 @ = Ambulation::legs(1, 2, 3, 4) called from file `camel_flea' line 7 $ = main::pests('bactrian', 4) called from file `camel_flea' line 4
The left-hand character up there (
$
or
@
) tells whether the function was called in a scalar or list context (we bet you can tell which is which). There are three lines because you were three functions deep when you ran the stack backtrace. Here's what each line means:
Line number one says you were in the function
main::infested
when you ran the stack dump. It tells you the function was called in a scalar context from line 10 of the file
Ambulation.pm
. It also shows that it was called without any arguments whatsoever, meaning it was called as
&infested
.
Line number two shows that the function
Ambulation::legs
was called in a list context from the
camel_flea
file with four arguments.
Line number three shows that
main::pests
was called in a scalar context, also from
camel_flea
, but from line 4.
Limited control over the Perl debugger can also be managed from within your Perl script itself. You might do this, for example, to set an automatic breakpoint at a certain subroutine whenever a particular program is run under the debugger. Setting
$DB::single
to 1 will stop at the next statement as though you'd used the debugger's
s
command. If you set
$DB::single
to 2, it's equivalent to having just typed the
n
command. The
$DB::trace
variable can be set to 1 to simulate having typed the
t
command.
To modify the debugger, copy
perl5db.pl
from the Perl library to another file and modify it as necessary. You'll also want to set your
PERL5DB
environment variable to say something like this:
BEGIN { require "myperl5db.pl" }
You can do some customization by setting up a .perldb file with initialization code. For instance, you could make aliases like these (the last one is one people expect to be there):
$DB::alias{'len'} = 's/^len(.*)/p length($1)/'; $DB::alias{'stop'} = 's/^stop (at|in)/b/'; $DB::alias{'ps'} = 's/^ps\b/p scalar /'; $DB::alias{'quit'} = 's/^quit\b.*/exit/';
As shipped, the only command-line history mechanism supplied is a simplistic one that checks for leading exclamation points. This is fine for casual use. However, if you install the Term::ReadKey and Term::ReadLine modules from CPAN, you will have full editing capabilities much like GNU readline (3) provides. Look for these in the modules/by-module/Term directory on CPAN.
If you have GNU emacs installed on your system, it can interact with the Perl debugger to provide an integrated software development environment reminiscent of its interactions with C debuggers.
Perl is also delivered with a start file for making emacs act like a syntax-directed editor that understands (some of) Perl's syntax. Look in the emacs/ directory of the Perl source distribution.
(Historically, a similar setup for interacting with vi and the X11 window system had also been available, but at the time of this writing, no debugger support for vi currently exists.)
When you call the
caller
function from package DB, Perl sets the
@DB::args
array to the arguments that stack frame was called with. It also maintains other magical internal variables, such as
@DB::dbline
, an array of the source code lines for the currently selected (with the debugger's
f
command) file. Perl effectively inserts a call to the function
DB::DB
(
linenum
) in front of every place that can have a breakpoint. Instead of a subroutine call it calls
DB::sub
, setting
$DB::sub
to the name of the called subroutine. It also inserts a
BEGIN {require 'perl5db.pl'}
before the first line, since no subroutine call is possible until
&DB::sub
is defined (for subroutines defined outside this file). In fact, the same is true if
$DB::deep
(how many levels of recursion deep into the debugger you are) is not defined.
At the start, the debugger reads your config file (
./.perldb
or
~/.perldb
under UNIX), which can set important options. This file may define a subroutine
&afterinit
to be executed after the debugger is initialized.
After the config file is processed, the debugger consults the environment variable PERLDB_OPTS and parses it as arguments to the
O
opt=val
debugger command.
The following options can only be specified at startup. To set them in your config file, call
&parse_options("
opt
=
val
")
.
The TTY to use for debugging I/O.
If set, goes in
NonStop
mode. On an interrupt, if TTY is not set, it uses the value of
noTTY
or
/tmp/perldbtty$$
to find TTY using
Term::Rendezvous
. The current variant is to have the name of TTY in this file.
If false, a dummy ReadLine is used so that you can debug ReadLine applications.
If true, no interaction is performed until an interrupt.
File or pipe to print line number info to. If it's a pipe, then a short, emacs -like message is used. Example config file:
&parse_options("NonStop=1 LineInfo=db.out"); sub afterinit { $trace = 1; }
The script will run without human intervention, putting trace information into the file
db.out
. (If you interrupt it, you had better reset
LineInfo
to something "interactive"!)
If your program exit s or die s, so too does the debugger.
You cannot get the stack frame information or otherwise debug functions that were not compiled by Perl, such as C or C++ extensions.
If you alter your @_ arguments in a subroutine (such as with shift or pop ), the stack backtrace will not show the original values.
If you wish to supply an alternative debugger for Perl to run, just invoke your script with the
-d:
module
switch. One of the most popular alternative debuggers for Perl is DProf, the Perl profiler. As of this writing, DProf was not included with the standard Perl distribution, but it is expected to be included "real soon now."
Meanwhile, you can fetch the Devel::DProf module from CPAN. Assuming it's properly installed on your system, you can use it to profile the Perl program in mycode.pl by typing:
perl -d:DProf mycode.pl
When the script terminates, the profiler will dump the profile information to a file called tmon.out . A tool like dprofpp (also supplied with the Devel::DProf package) interprets the profile.