4 @c @include configdoc.texi
12 * Ld:: The GNU linker.
18 This file documents the GNU linker LD.
20 Copyright (C) 1991, 1992, 1993 Free Software Foundation, Inc.
22 Permission is granted to make and distribute verbatim copies of
23 this manual provided the copyright notice and this permission notice
24 are preserved on all copies.
26 Permission is granted to copy and distribute modified versions of this
27 manual under the conditions for verbatim copying, provided also that
28 the entire resulting derived work is distributed under the terms of a
29 permission notice identical to this one.
31 Permission is granted to copy and distribute translations of this manual
32 into another language, under the above conditions for modified versions.
35 Permission is granted to process this file through Tex and print the
36 results, provided the printed document carries copying permission
37 notice identical to this one except for the removal of this paragraph
38 (this paragraph not being relevant to the printed manual).
44 @setchapternewpage odd
45 @settitle Using LD, the GNU linker
48 @subtitle The GNU linker
50 @subtitle @code{ld} version 2
52 @author Steve Chamberlain and Roland Pesch
53 @author Cygnus Support
58 \hfill Cygnus Support\par
59 \hfill steve\@cygnus.com, pesch\@cygnus.com\par
60 \hfill {\it Using LD, the GNU linker}\par
61 \hfill Edited by Jeffrey Osier (jeffrey\@cygnus.com), March 1993.\par
63 \global\parindent=0pt % Steve likes it this way.
66 @vskip 0pt plus 1filll
67 Copyright @copyright{} 1991, 1992, 1993 Free Software Foundation, Inc.
69 Permission is granted to make and distribute verbatim copies of
70 this manual provided the copyright notice and this permission notice
71 are preserved on all copies.
73 Permission is granted to copy and distribute modified versions of this
74 manual under the conditions for verbatim copying, provided also that
75 the entire resulting derived work is distributed under the terms of a
76 permission notice identical to this one.
78 Permission is granted to copy and distribute translations of this manual
79 into another language, under the above conditions for modified versions.
82 @c FIXME: Talk about importance of *order* of args, cmds to linker!
87 This file documents the GNU linker ld.
91 * Invocation:: Invocation
92 * Commands:: Command Language
94 * Machine Dependent:: Machine Dependent Features
98 * H8/300:: ld and the H8/300
101 * i960:: ld and the Intel 960 family
104 @ifclear SingleFormat
107 @c Following blank line required for remaining bug in makeinfo conds/menus
109 * MRI:: MRI Compatible Script Files
118 @cindex what is this?
119 @code{ld} combines a number of object and archive files, relocates
120 their data and ties up symbol references. Usually the last step in
121 compiling a program is to run @code{ld}.
123 @code{ld} accepts Linker Command Language files written in
124 a superset of AT&T's Link Editor Command Language syntax,
125 to provide explicit and total control over the linking process.
127 @ifclear SingleFormat
128 This version of @code{ld} uses the general purpose BFD libraries
129 to operate on object files. This allows @code{ld} to read, combine, and
130 write object files in many different formats---for example, COFF or
131 @code{a.out}. Different formats may be linked together to produce any
132 available kind of object file. @xref{BFD} for a list of formats
133 supported on various architectures.
136 Aside from its flexibility, the GNU linker is more helpful than other
137 linkers in providing diagnostic information. Many linkers abandon
138 execution immediately upon encountering an error; whenever possible,
139 @code{ld} continues executing, allowing you to identify other errors
140 (or, in some cases, to get an output file in spite of the error).
145 The GNU linker @code{ld} is meant to cover a broad range of situations,
146 and to be as compatible as possible with other linkers. As a result,
147 you have many choices to control its behavior.
151 * Options:: Command Line Options
152 * Environment:: Environment Variables
156 @section Command Line Options
161 Here is a summary of the options you can use on the @code{ld} command
164 @c FIXME! -relax only avail h8/300, i960. Conditionals screwed in examples.
166 ld [-o @var{output} ] @var{objfile}@dots{}
167 [ -A@var{architecture} ] [ -b @var{input-format} ] [ -Bstatic ]
168 [ -c @var{MRI-commandfile} ] [ -d | -dc | -dp ]
169 [ -defsym @var{symbol}=@var{expression} ]
170 [ -e @var{entry} ] [ -F ] [ -F @var{format} ]
171 [ -format @var{input-format} ] [ -g ] [ -G @var{size} ] [ -i ]
172 [ -l@var{ar} ] [ -L@var{searchdir} ] [ -M ] [ -m @var{emulation} ]
173 [ -N | -n ] [ -noinhibit-exec ] [ -R @var{filename} ]
174 [ -relax ] [ -r | -Ur ] [ -S ] [ -s ] [ -T @var{commandfile} ]
175 [ -Ttext @var{textorg} ] [ -Tdata @var{dataorg} ] [ -Tbss @var{bssorg} ]
176 [ -t ] [ -u @var{sym}] [-V] [-v] [ -X ] [ -x ] [ -y@var{symbol} ]
177 [ @{ @var{script} @} ]
180 This plethora of command-line options may seem intimidating, but in
181 actual practice few of them are used in any particular context.
182 @cindex standard Unix system
183 For instance, a frequent use of @code{ld} is to link standard Unix
184 object files on a standard, supported Unix system. On such a system, to
185 link a file @code{hello.o}:
188 ld -o @var{output} /lib/crt0.o hello.o -lc
191 This tells @code{ld} to produce a file called @var{output} as the
192 result of linking the file @code{/lib/crt0.o} with @code{hello.o} and
193 the library @code{libc.a}, which will come from the standard search
194 directories. (See the discussion of the @samp{-l} option below.)
196 The command-line options to @code{ld} may be specified in any order, and
197 may be repeated at will. Repeating most options with a
198 different argument will either have no further effect, or override prior
199 occurrences (those further to the left on the command line) of that
202 @ifclear SingleFormat
203 The exceptions---which may meaningfully be used more than once---are
204 @samp{-A}, @samp{-b} (or its synonym @samp{-format}), @samp{-defsym},
205 @samp{-L}, @samp{-l}, @samp{-R}, and @samp{-u}.
208 The exceptions---which may meaningfully be used more than once---are
209 @samp{-A}, @samp{-defsym}, @samp{-L}, @samp{-l}, @samp{-R}, and @samp{-u}.
213 The list of object files to be linked together, shown as @var{objfile}@dots{},
214 may follow, precede, or be mixed in with command-line options, except that
215 an @var{objfile} argument may not be placed between an option and
218 Usually the linker is invoked with at least one object file, but other
219 forms of binary input files can also be specified with @samp{-l},
220 @samp{-R}, and the script command language. If @emph{no} binary input
221 files at all are specified, the linker does not produce any output, and
222 issues the message @samp{No input files}.
224 Option arguments must either follow the option letter without intervening
225 whitespace, or be given as separate arguments immediately following the
226 option that requires them.
229 @item @var{objfile}@dots{}
230 The object files to be linked.
233 @cindex architectures
235 @item -A@var{architecture}
236 In the current release of @code{ld}, this option is useful only for the
237 Intel 960 family of architectures. In that @code{ld} configuration, the
238 @var{architecture} argument identifies the particular architecture in
239 the 960 family, enabling some safeguards and modifying the
240 archive-library search path. @xref{i960,,@code{ld} and the Intel 960
241 family}, for details.
243 Future releases of @code{ld} may support similar functionality for
244 other architecture families.
247 @ifclear SingleFormat
248 @cindex binary input format
249 @kindex -b @var{format}
251 @item -b @var{input-format}
253 Specify the binary format for input object files that follow this option
254 on the command line. You don't usually need to specify this, as
255 @code{ld} is configured to expect as a default input format the most
256 usual format on each machine. @var{input-format} is a text string, the
257 name of a particular format supported by the BFD libraries.
258 @w{@samp{-format @var{input-format}}} has the same effect. @xref{BFD}.
260 You may want to use this option if you are linking files with an unusual
261 binary format. You can also use @samp{-b} to switch formats explicitly (when
262 linking object files of different formats), by including
263 @samp{-b @var{input-format}} before each group of object files in a
266 The default format is taken from the environment variable
271 You can also define the input
272 format from a script, using the command @code{TARGET}; see @ref{Other
278 Ignored. This option is accepted for command-line compatibility with
281 @kindex -c @var{MRI-cmdfile}
282 @cindex compatibility, MRI
283 @item -c @var{MRI-commandfile}
284 For compatibility with linkers produced by MRI, @code{ld} accepts script
285 files written in an alternate, restricted command language, described in
286 @ref{MRI,,MRI Compatible Script Files}. Introduce MRI script files with
287 the option @samp{-c}; use the @samp{-T} option to run linker
288 scripts written in the general-purpose @code{ld} scripting language.
289 If @var{MRI-cmdfile} does not exist, @code{ld} looks for it in the directories
290 specified by any @samp{-L} options.
292 @cindex common allocation
299 These three options are equivalent; multiple forms are supported for
300 compatibility with other linkers. They
301 assign space to common symbols even if a relocatable output file is
302 specified (with @samp{-r}). The script command
303 @code{FORCE_COMMON_ALLOCATION} has the same effect. @xref{Other
306 @cindex symbols, from command line
307 @kindex -defsym @var{symbol}=@var{exp}
308 @item -defsym @var{symbol}=@var{expression}
309 Create a global symbol in the output file, containing the absolute
310 address given by @var{expression}. You may use this option as many
311 times as necessary to define multiple symbols in the command line. A
312 limited form of arithmetic is supported for the @var{expression} in this
313 context: you may give a hexadecimal constant or the name of an existing
314 symbol, or use @code{+} and @code{-} to add or subtract hexadecimal
315 constants or symbols. If you need more elaborate expressions, consider
316 using the linker command language from a script (@pxref{Assignment, ,
317 Assignment: Symbol Definitions}). @emph{Note:} there should be no
318 white space between @var{symbol}, the equals sign (``@key{=}''), and
321 @cindex entry point, from command line
322 @kindex -e @var{entry}
324 Use @var{entry} as the explicit symbol for beginning execution of your
325 program, rather than the default entry point. @xref{Entry Point}, for a
326 discussion of defaults and other ways of specifying the
329 @ifclear SingleFormat
332 @itemx -F@var{format}
333 Ignored. Some older linkers used this option throughout a compilation
334 toolchain for specifying object-file format for both input and output
335 object files. The mechanisms @code{ld} uses for this purpose (the
336 @samp{-b} or @samp{-format} options for input files, the @code{TARGET}
337 command in linker scripts for output files, the @code{GNUTARGET}
338 environment variable) are more flexible, but @code{ld} accepts the
339 @samp{-F} option for compatibility with scripts written to call the old
343 @item -format @var{input-format}
344 Synonym for @samp{-b @var{input-format}}.
349 Ignored. Provided for compatibility with other tools.
354 @itemx -G @var{value}
355 Set the maximum size of objects to be optimized using the GP register to
356 @var{size} under MIPS ECOFF. Ignored for other object file formats.
359 @cindex incremental link
361 Perform an incremental link (same as option @samp{-r}).
363 @cindex archive files, from cmd line
366 Add archive file @var{ar} to the list of files to link. This
367 option may be used any number of times. @code{ld} will search its
368 path-list for occurrences of @code{lib@var{ar}.a} for every @var{ar}
371 @cindex search directory, from cmd line
373 @item -L@var{searchdir}
374 Add path @var{searchdir} to the list of paths that @code{ld} will search
375 for archive libraries and @code{ld} control scripts. You may use this
376 option any number of times.
379 The default set of paths searched (without being specified with
380 @samp{-L}) depends on which emulation mode @code{ld} is using, and in
381 some cases also on how it was configured. @xref{Environment}.
384 The paths can also be specified in a link script with the
385 @code{SEARCH_DIR} command.
390 Print (to the standard output) a link map---diagnostic information
391 about where symbols are mapped by @code{ld}, and information on global
392 common storage allocation.
395 @kindex -m @var{emulation}
396 @item -m@var{emulation}
397 @itemx -m @var{emulation}
398 Emulate the @var{emulation} linker. You can list the available
399 emulations with the @samp{-V} option. The
400 default is the system for which you configured @code{ld}.
403 @cindex read/write from cmd line
406 Set the text and data sections to be readable and writable. Also, do
407 not page-align the data segment. If the output format supports Unix
408 style magic numbers, mark the output as @code{OMAGIC}.
412 @cindex read-only text
414 Set the text segment to be read only, and mark the output as
415 @code{NMAGIC} if possible.
417 @item -noinhibit-exec
418 @cindex output file after errors
419 @kindex -noinhibit-exec
420 Retain the executable output file whenever it is still usable.
421 Normally, the linker will not produce an output file if it encounters
422 errors during the link process; it exits without writing an output file
423 when it issues any error whatsoever.
425 @item -o @var{output}
426 @kindex -o @var{output}
427 @cindex naming the output file
428 Use @var{output} as the name for the program produced by @code{ld}; if this
429 option is not specified, the name @file{a.out} is used by default. The
430 script command @code{OUTPUT} can also specify the output file name.
432 @item -R @var{filename}
433 @kindex -R @var{file}
434 @cindex symbol-only input
435 On some platforms, this option performs global optimizations
436 that become possible when the linker resolves addressing in the
437 program, such as relaxing address modes and synthesizing new
438 instructions in the output object file.
442 @cindex synthesizing linker
443 @cindex relaxing addressing modes
444 An option with machine dependent effects. Currently this option is only
445 supported on the H8/300.
447 @xref{H8/300,,@code{ld} and the H8/300}.
450 On some platforms, use option performs global optimizations that
451 become possible when the linker resolves addressing in the program, such
452 as relaxing address modes and synthesizing new instructions in the
455 On platforms where this is not supported, @samp{-relax} is accepted, but
460 @cindex relocatable output
462 Generate relocatable output---i.e., generate an output file that can in
463 turn serve as input to @code{ld}. This is often called @dfn{partial
464 linking}. As a side effect, in environments that support standard Unix
465 magic numbers, this option also sets the output file's magic number to
468 If this option is not specified, an absolute file is produced. When
469 linking C++ programs, this option @emph{will not} resolve references to
470 constructors; to do that, use @samp{-Ur}.
472 This option does the same as @code{-i}.
476 @cindex strip debugger symbols
477 Omit debugger symbol information (but not all symbols) from the output file.
481 @cindex strip all symbols
482 Omit all symbol information from the output file.
484 @item @{ @var{script} @}
485 @kindex @{ @var{script} @}
486 @cindex scripts on command line
487 You can, if you wish, include a script of linker commands directly in
488 the command line instead of referring to it via an input file. When the
489 character @samp{@{} occurs on the command line, the linker switches to
490 interpreting the command language until the end of the list of commands
491 is reached; the end is indicated with a closing brace @samp{@}}.
492 @code{ld} does not recognize other command-line options while parsing
493 the script. @xref{Commands}, for a description of the command language.
495 @item -Tbss @var{bssorg}
496 @kindex -Tbss @var{bssorg}
497 @itemx -Tdata @var{dataorg}
498 @kindex -Tdata @var{dataorg}
499 @itemx -Ttext @var{textorg}
500 @kindex -Ttext @var{textorg}
501 @cindex segment origins, cmd line
502 Use @var{org} as the starting address for---respectively---the
503 @code{bss}, @code{data}, or the @code{text} segment of the output file.
504 @var{org} must be a single hexadecimal integer;
505 for compatibility with other linkers, you may omit the leading
506 @samp{0x} usually associated with hexadecimal values.
508 @item -T @var{commandfile}
509 @itemx -T@var{commandfile}
510 @kindex -T @var{script}
512 Read link commands from the file
513 @var{commandfile}. These commands completely override @code{ld}'s
514 default link format (rather than adding to it); @var{commandfile} must
515 specify everything necessary to describe the target format.
517 If @var{commandfile} does not exist, @code{ld} looks for it in the directories
518 specified by any @samp{-L} options.
520 You may also include a script of link commands directly in the command
521 line by bracketing it between @samp{@{} and @samp{@}}.
526 @cindex input files, displaying
527 Print the names of the input files as @code{ld} processes them.
531 @cindex undefined symbol
532 Force @var{sym} to be entered in the output file as an undefined symbol.
533 Doing this may, for example, trigger linking of additional modules from
534 standard libraries. @samp{-u} may be repeated with different option
535 arguments to enter additional undefined symbols.
536 @c Nice idea, but no such command: This option is equivalent
537 @c to the @code{EXTERN} linker command.
542 For anything other than C++ programs, this option is equivalent to
543 @samp{-r}: it generates relocatable output---i.e., an output file that can in
544 turn serve as input to @code{ld}. When linking C++ programs, @samp{-Ur}
545 @emph{will} resolve references to constructors, unlike @samp{-r}.
550 Display the version number for @code{ld} and list the supported emulations.
551 Print which input files can and can not be opened.
556 Display the version number for @code{ld}.
557 Print which input files can and can not be opened.
561 @cindex local symbols, deleting
562 @cindex L, deleting symbols beginning
563 If @samp{-s} or @samp{-S} is also specified, delete only local symbols
564 beginning with @samp{L}.
568 @cindex deleting local symbols
569 If @samp{-s} or @samp{-S} is also specified, delete all local symbols,
570 not just those beginning with @samp{L}.
573 @kindex -y@var{symbol}
574 @cindex symbol tracing
575 Print the name of each linked file in which @var{symbol} appears. This
576 option may be given any number of times. On many systems it is necessary
577 to prepend an underscore.
579 This option is useful when you have an undefined symbol in your link but
580 don't know where the reference is coming from.
585 @section Environment Variables
587 You can change the behavior of @code{ld} with the environment
588 variable @code{GNUTARGET}.
591 @cindex default input format
592 @code{GNUTARGET} determines the input-file object format if you don't
593 use @samp{-b} (or its synonym @samp{-format}). Its value should be one
594 of the BFD names for an input format (@pxref{BFD}). If there is no
595 @code{GNUTARGET} in the environment, @code{ld} uses the natural format
596 of the host. If @code{GNUTARGET} is set to @code{default} then BFD attempts to discover the
597 input format by examining binary input files; this method often
598 succeeds, but there are potential ambiguities, since there is no method
599 of ensuring that the magic number used to specify object-file formats is
600 unique. However, the configuration procedure for BFD on each system
601 places the conventional format for that system first in the search-list,
602 so ambiguities are resolved in favor of convention.
606 @chapter Command Language
608 @cindex command files
609 The command language provides explicit control over the link process,
610 allowing complete specification of the mapping between the linker's
611 input files and its output. It controls:
620 addresses of sections
622 placement of common blocks
625 You may supply a command file (also known as a link script) to the
626 linker either explicitly through the @samp{-T} option, or implicitly as
627 an ordinary file. If the linker opens a file which it cannot recognize
628 as a supported object or archive format, it tries to interpret the file
631 You can also include a script directly on the @code{ld} command line,
632 delimited by the characters @samp{@{} and @samp{@}}.
635 * Scripts:: Linker Scripts
636 * Expressions:: Expressions
637 * MEMORY:: MEMORY Command
638 * SECTIONS:: SECTIONS Command
639 * Entry Point:: The Entry Point
640 * Other Commands:: Other Commands
644 @section Linker Scripts
645 The @code{ld} command language is a collection of statements; some are
646 simple keywords setting a particular option, some are used to select and
647 group input files or name output files; and two statement
648 types have a fundamental and pervasive impact on the linking process.
650 @cindex fundamental script commands
651 @cindex commands, fundamental
652 @cindex output file layout
653 @cindex layout of output file
654 The most fundamental command of the @code{ld} command language is the
655 @code{SECTIONS} command (@pxref{SECTIONS}). Every meaningful command
656 script must have a @code{SECTIONS} command: it specifies a
657 ``picture'' of the output file's layout, in varying degrees of detail.
658 No other command is required in all cases.
660 The @code{MEMORY} command complements @code{SECTIONS} by describing the
661 available memory in the target architecture. This command is optional;
662 if you don't use a @code{MEMORY} command, @code{ld} assumes sufficient
663 memory is available in a contiguous block for all output.
667 You may include comments in linker scripts just as in C: delimited
668 by @samp{/*} and @samp{*/}. As in C, comments are syntactically
669 equivalent to whitespace.
673 @cindex expression syntax
675 Many useful commands involve arithmetic expressions. The syntax for
676 expressions in the command language is identical to that of C
677 expressions, with the following features:
680 All expressions evaluated as integers and
681 are of ``long'' or ``unsigned long'' type.
683 All constants are integers.
685 All of the C arithmetic operators are provided.
687 You may reference, define, and create global variables.
689 You may call special purpose built-in functions.
693 * Integers:: Integers
694 * Symbols:: Symbol Names
695 * Location Counter:: The Location Counter
696 * Operators:: Operators
697 * Evaluation:: Evaluation
698 * Assignment:: Assignment: Defining Symbols
699 * Built-ins:: Built-In Functions
704 @cindex integer notation
705 @cindex octal integers
706 An octal integer is @samp{0} followed by zero or more of the octal
707 digits (@samp{01234567}).
712 @cindex decimal integers
713 A decimal integer starts with a non-zero digit followed by zero or
714 more digits (@samp{0123456789}).
719 @cindex hexadecimal integers
721 A hexadecimal integer is @samp{0x} or @samp{0X} followed by one or
722 more hexadecimal digits chosen from @samp{0123456789abcdefABCDEF}.
727 @cindex negative integers
728 To write a negative integer, use
729 the prefix operator @samp{-}; @pxref{Operators}.
734 @cindex scaled integers
735 @cindex K and M integer suffixes
736 @cindex M and K integer suffixes
737 @cindex suffixes for integers
738 @cindex integer suffixes
739 Additionally the suffixes @code{K} and @code{M} may be used to scale a
743 @c END TEXI2ROFF-KILL
744 @code{1024} or @code{1024*1024}
748 ${\rm 1024}$ or ${\rm 1024}^2$
750 @c END TEXI2ROFF-KILL
751 respectively. For example, the following all refer to the same quantity:
760 @subsection Symbol Names
763 @cindex quoted symbol names
765 Unless quoted, symbol names start with a letter, underscore, point or
766 hyphen and may include any letters, underscores, digits, points,
767 and minus signs. Unquoted symbol names must not conflict with any
768 keywords. You can specify a symbol which contains odd characters or has
769 the same name as a keyword, by surrounding the symbol name in double quotes:
772 "with a space" = "also with a space" + 10;
775 @node Location Counter
776 @subsection The Location Counter
779 @cindex location counter
780 @cindex current output location
781 The special linker variable @dfn{dot} @samp{.} always contains the
782 current output location counter. Since the @code{.} always refers to
783 a location in an output section, it must always appear in an
784 expression within a @code{SECTIONS} command. The @code{.} symbol
785 may appear anywhere that an ordinary symbol is allowed in an
786 expression, but its assignments have a side effect. Assigning a value
787 to the @code{.} symbol will cause the location counter to be moved.
789 This may be used to create holes in the output section. The location
790 counter may never be moved backwards.
805 In the previous example, @code{file1} is located at the beginning of the
806 output section, then there is a 1000 byte gap. Then @code{file2}
807 appears, also with a 1000 byte gap following before @code{file3} is
808 loaded. The notation @samp{= 0x1234} specifies what data to write in
809 the gaps (@pxref{Section Options}).
812 @subsection Operators
813 @cindex Operators for arithmetic
814 @cindex arithmetic operators
815 @cindex precedence in expressions
816 The linker recognizes the standard C set of arithmetic operators, with
817 the standard bindings and precedence levels:
820 @c END TEXI2ROFF-KILL
822 precedence associativity Operators Notes
828 5 left == != > < <= >=
834 11 right &= += -= *= /= (2)
839 (2) @xref{Assignment}
844 %"lispnarrowing" is the extra indent used generally for @example
845 \hskip\lispnarrowing\vbox{\offinterlineskip
848 {\vrule#&\strut\hfil\ #\ \hfil&\vrule#&\strut\hfil\ #\ \hfil&\vrule#&\strut\hfil\ {\tt #}\ \hfil&\vrule#\cr
849 height2pt&\omit&&\omit&&\omit&\cr
850 &Precedence&& Associativity &&{\rm Operators}&\cr
851 height2pt&\omit&&\omit&&\omit&\cr
853 height2pt&\omit&&\omit&&\omit&\cr
855 % '176 is tilde, '~' in tt font
856 &1&&left&&\qquad- \char'176\ !\qquad\dag&\cr
860 &5&&left&&== != > < <= >=&\cr
866 &11&&right&&\qquad\&= += -= *= /=\qquad\ddag&\cr
868 height2pt&\omit&&\omit&&\omit&\cr}
873 @obeylines@parskip=0pt@parindent=0pt
874 @dag@quad Prefix operators.
875 @ddag@quad @xref{Assignment}.
878 @c END TEXI2ROFF-KILL
881 @subsection Evaluation
883 @cindex lazy evaluation
884 @cindex expression evaluation order
885 The linker uses ``lazy evaluation'' for expressions; it only calculates
886 an expression when absolutely necessary. The linker needs the value of
887 the start address, and the lengths of memory regions, in order to do any
888 linking at all; these values are computed as soon as possible when the
889 linker reads in the command file. However, other values (such as symbol
890 values) are not known or needed until after storage allocation. Such
891 values are evaluated later, when other information (such as the sizes of
892 output sections) is available for use in the symbol assignment
896 @subsection Assignment: Defining Symbols
897 @cindex assignment in scripts
898 @cindex symbol definition, scripts
899 @cindex variables, defining
900 You may create global symbols, and assign values (addresses) to global
901 symbols, using any of the C assignment operators:
904 @item @var{symbol} = @var{expression} ;
905 @itemx @var{symbol} &= @var{expression} ;
906 @itemx @var{symbol} += @var{expression} ;
907 @itemx @var{symbol} -= @var{expression} ;
908 @itemx @var{symbol} *= @var{expression} ;
909 @itemx @var{symbol} /= @var{expression} ;
912 Two things distinguish assignment from other operators in @code{ld}
916 Assignment may only be used at the root of an expression;
917 @samp{a=b+3;} is allowed, but @samp{a+b=3;} is an error.
922 You must place a trailing semicolon (``@key{;}'') at the end of an
923 assignment statement.
926 Assignment statements may appear:
929 as commands in their own right in an @code{ld} script; or
931 as independent statements within a @code{SECTIONS} command; or
933 as part of the contents of a section definition in a
934 @code{SECTIONS} command.
937 The first two cases are equivalent in effect---both define a symbol with
938 an absolute address. The last case defines a symbol whose address is
939 relative to a particular section (@pxref{SECTIONS}).
941 @cindex absolute and relocatable symbols
942 @cindex relocatable and absolute symbols
943 @cindex symbols, relocatable and absolute
944 When a linker expression is evaluated and assigned to a variable, it is
945 given either an absolute or a relocatable type. An absolute expression
946 type is one in which the symbol contains the value that it will have in
947 the output file, a relocatable expression type is one in which the
948 value is expressed as a fixed offset from the base of a section.
950 The type of the expression is controlled by its position in the script
951 file. A symbol assigned within a section definition is created relative
952 to the base of the section; a symbol assigned in any other place is
953 created as an absolute symbol. Since a symbol created within a
954 section definition is relative to the base of the section, it
955 will remain relocatable if relocatable output is requested. A symbol
956 may be created with an absolute value even when assigned to within a
957 section definition by using the absolute assignment function
958 @code{ABSOLUTE}. For example, to create an absolute symbol whose address
959 is the last byte of an output section named @code{.data}:
965 _edata = ABSOLUTE(.) ;
970 The linker tries to put off the evaluation of an assignment until all
971 the terms in the source expression are known (@pxref{Evaluation}). For
972 instance, the sizes of sections cannot be known until after allocation,
973 so assignments dependent upon these are not performed until after
974 allocation. Some expressions, such as those depending upon the location
975 counter @dfn{dot}, @samp{.} must be evaluated during allocation. If the
976 result of an expression is required, but the value is not available,
977 then an error results. For example, a script like the following
980 text 9+this_isnt_constant :
985 @kindex Non constant expression
987 will cause the error message ``@code{Non constant expression for initial
991 @subsection Built-In Functions
992 @cindex functions in expression language
993 The command language includes a number of built-in
994 functions for use in link script expressions.
996 @item ABSOLUTE(@var{exp})
997 @kindex ABSOLUTE(@var{exp})
998 @cindex expression, absolute
999 Return the absolute (non-relocatable, as opposed to non-negative) value
1000 of the expression @var{exp}. Primarily useful to assign an absolute
1001 value to a symbol within a section definition, where symbol values are
1002 normally section-relative.
1004 @item ADDR(@var{section})
1005 @kindex ADDR(@var{section})
1006 @cindex section address
1007 Return the absolute address of the named @var{section}. Your script must
1008 previously have defined the location of that section. In the following
1009 example, @code{symbol_1} and @code{symbol_2} are assigned identical
1015 start_of_output_1 = ABSOLUTE(.);
1020 symbol_1 = ADDR(.output1);
1021 symbol_2 = start_of_output_1;
1026 @item ALIGN(@var{exp})
1027 @kindex ALIGN(@var{exp})
1028 @cindex rounding up location counter
1029 Return the result of the current location counter (@code{.}) aligned to
1030 the next @var{exp} boundary. @var{exp} must be an expression whose
1031 value is a power of two. This is equivalent to
1033 (. + @var{exp} - 1) & ~(@var{exp} - 1)
1036 @code{ALIGN} doesn't change the value of the location counter---it just
1037 does arithmetic on it. As an example, to align the output @code{.data}
1038 section to the next @code{0x2000} byte boundary after the preceding
1039 section and to set a variable within the section to the next
1040 @code{0x8000} boundary after the input sections:
1043 .data ALIGN(0x2000): @{
1045 variable = ALIGN(0x8000);
1050 The first use of @code{ALIGN} in this example specifies the location of
1051 a section because it is used as the optional @var{start} attribute of a
1052 section definition (@pxref{Section Options}). The second use simply
1053 defines the value of a variable.
1055 The built-in @code{NEXT} is closely related to @code{ALIGN}.
1057 @item DEFINED(@var{symbol})
1058 @kindex DEFINED(@var{symbol})
1059 @cindex symbol defaults
1060 Return 1 if @var{symbol} is in the linker global symbol table and is
1061 defined, otherwise return 0. You can use this function to provide default
1062 values for symbols. For example, the following command-file fragment shows how
1063 to set a global symbol @code{begin} to the first location in the
1064 @code{.text} section---but if a symbol called @code{begin} already
1065 existed, its value is preserved:
1069 begin = DEFINED(begin) ? begin : . ;
1075 @item NEXT(@var{exp})
1076 @kindex NEXT(@var{exp})
1077 @cindex unallocated address, next
1078 Return the next unallocated address that is a multiple of @var{exp}.
1079 This function is closely related to @code{ALIGN(@var{exp})}; unless you
1080 use the @code{MEMORY} command to define discontinuous memory for the
1081 output file, the two functions are equivalent.
1083 @item SIZEOF(@var{section})
1084 @kindex SIZEOF(@var{section})
1085 @cindex section size
1086 Return the size in bytes of the named @var{section}, if that section has
1087 been allocated. In the following example, @code{symbol_1} and
1088 @code{symbol_2} are assigned identical values:
1089 @c What does it return if the section hasn't been allocated? 0?
1097 symbol_1 = .end - .start ;
1098 symbol_2 = SIZEOF(.output);
1103 @item SIZEOF_HEADERS
1104 @kindex SIZEOF_HEADERS
1106 @itemx sizeof_headers
1107 @kindex sizeof_headers
1108 Return the size in bytes of the output file's headers. You can use this number
1109 as the start address of the first section, if you choose, to facilitate
1115 @section MEMORY Command
1117 @cindex regions of memory
1118 @cindex discontinuous memory
1119 @cindex allocating memory
1120 The linker's default configuration permits allocation of all available memory.
1121 You can override this configuration by using the @code{MEMORY} command. The
1122 @code{MEMORY} command describes the location and size of blocks of
1123 memory in the target. By using it carefully, you can describe which
1124 memory regions may be used by the linker, and which memory regions it
1125 must avoid. The linker does not shuffle sections to fit into the
1126 available regions, but does move the requested sections into the correct
1127 regions and issue errors when the regions become too full.
1129 The command files may contain at most one use of the @code{MEMORY}
1130 command; however, you can define as many blocks of memory within it as
1131 you wish. The syntax is:
1136 @var{name} (@var{attr}) : ORIGIN = @var{origin}, LENGTH = @var{len}
1142 @cindex naming memory regions
1143 is a name used internally by the linker to refer to the region. Any
1144 symbol name may be used. The region names are stored in a separate
1145 name space, and will not conflict with symbols, file names or section
1146 names. Use distinct names to specify multiple regions.
1148 @cindex memory region attributes
1149 is an optional list of attributes, permitted for compatibility with the
1150 AT&T linker but not used by @code{ld} beyond checking that the
1151 attribute list is valid. Valid attribute lists must be made up of the
1152 characters ``@code{LIRWX}''. If you omit the attribute list, you may
1153 omit the parentheses around it as well.
1158 is the start address of the region in physical memory. It is
1159 an expression that must evaluate to a constant before
1160 memory allocation is performed. The keyword @code{ORIGIN} may be
1161 abbreviated to @code{org} or @code{o}.
1166 is the size in bytes of the region (an expression).
1167 The keyword @code{LENGTH} may be abbreviated to @code{len} or @code{l}.
1170 For example, to specify that memory has two regions available for
1171 allocation---one starting at 0 for 256 kilobytes, and the other
1172 starting at @code{0x40000000} for four megabytes:
1177 rom : ORIGIN = 0, LENGTH = 256K
1178 ram : org = 0x40000000, l = 4M
1182 Once you have defined a region of memory named @var{mem}, you can direct
1183 specific output sections there by using a command ending in
1184 @samp{>@var{mem}} within the @code{SECTIONS} command (@pxref{Section
1185 Options}). If the combined output sections directed to a region are too
1186 big for the region, the linker will issue an error message.
1189 @section SECTIONS Command
1191 The @code{SECTIONS} command controls exactly where input sections are
1192 placed into output sections, their order and to which output sections
1195 You may use at most one @code{SECTIONS} command in a commands file,
1196 but you can have as many statements within it as you wish. Statements
1197 within the @code{SECTIONS} command can do one of three things:
1200 define the entry point;
1202 assign a value to a symbol;
1204 describe the placement of a named output section, and what input
1205 sections make it up.
1208 The first two possibilities---defining the entry point, and defining
1209 symbols---can also be done outside the @code{SECTIONS} command:
1210 @pxref{Entry Point}, @pxref{Assignment}. They are permitted here as
1211 well for your convenience in reading the script, so that symbols or the
1212 entry point can be defined at meaningful points in your output-file
1215 When no @code{SECTIONS} command is specified, the default action
1216 of the linker is to place each input section into an identically named
1217 output section in the order that the sections are first encountered in
1218 the input files; if all input sections are present in the first file,
1219 for example, the order of sections in the output file will match the
1220 order in the first input file.
1223 * Section Definition:: Section Definitions
1224 * Section Contents:: Section Contents
1225 * Section Options:: Optional Section Attributes
1228 @node Section Definition
1229 @subsection Section Definitions
1230 @cindex section definition
1231 The most frequently used statement in the @code{SECTIONS} command is
1232 the @dfn{section definition}, which you can use to specify the
1233 properties of an output section: its location, alignment, contents,
1234 fill pattern, and target memory region. Most of
1235 these specifications are optional; the simplest form of a section
1244 @cindex naming output sections
1246 @var{secname} is the name of the output section, and @var{contents} a
1247 specification of what goes there---for example, a list of input files or
1248 sections of input files. As you might assume, the whitespace shown is
1249 optional. You do need the colon @samp{:} and the braces @samp{@{@}},
1252 @var{secname} must meet the constraints of your output format. In
1253 formats which only support a limited number of sections, such as
1254 @code{a.out}, the name must be one of the names supported by the format
1255 (@code{a.out}, for example, allows only @code{.text}, @code{.data} or
1256 @code{.bss}). If the output format supports any number of sections, but
1257 with numbers and not names (as is the case for Oasys), the name should be
1258 supplied as a quoted numeric string. A section name may consist of any
1259 sequence characters, but any name which does not conform to the standard
1260 @code{ld} symbol name syntax must be quoted.
1261 @xref{Symbols, , Symbol Names}.
1263 @node Section Contents
1264 @subsection Section Contents
1265 @cindex contents of a section
1266 In a section definition, you can specify the contents of an output section by
1267 listing particular object files, by listing particular input-file
1268 sections, or by a combination of the two. You can also place arbitrary
1269 data in the section, and define symbols relative to the beginning of the
1272 The @var{contents} of a section definition may include any of the
1273 following kinds of statement. You can include as many of these as you
1274 like in a single section definition, separated from one another by
1278 @item @var{filename}
1279 @kindex @var{filename}
1280 @cindex input files, section defn
1281 @cindex files, including in output sections
1282 You may simply name a particular input file to be placed in the current
1283 output section; @emph{all} sections from that file are placed in the
1284 current section definition. To specify a list of particular files by
1287 .data : @{ afile.o bfile.o cfile.o @}
1290 The example also illustrates that multiple statements can be included in
1291 the contents of a section definition, since each file name is a separate
1294 If the file name has already been mentioned in another section
1295 definition, with an explicit section name list, then only those sections
1296 which have not yet been allocated are used.
1298 @item @var{filename}( @var{section} )
1299 @itemx @var{filename}( @var{section}, @var{section}, @dots{} )
1300 @itemx @var{filename}( @var{section} @var{section} @dots{} )
1301 @kindex @var{filename}(@var{section})
1302 @cindex files and sections, section defn
1303 You can name one or more sections from your input files, for
1304 insertion in the current output section. If you wish to specify a list
1305 of input-file sections inside the parentheses, you may separate the
1306 section names by either commas or whitespace.
1308 @item * (@var{section})
1309 @itemx * (@var{section}, @var{section}, @dots{})
1310 @itemx * (@var{section} @var{section} @dots{}
1311 @cindex input sections to output section
1312 @kindex *(@var{section})
1313 Instead of explicitly naming particular input files in a link control
1314 script, you can refer to @emph{all} files from the @code{ld} command
1315 line: use @samp{*} instead of a particular file name before the
1316 parenthesized input-file section list.
1318 For example, to copy sections @code{1} through @code{4} from an Oasys file
1319 into the @code{.text} section of an @code{a.out} file, and sections @code{13}
1320 and @code{14} into the @code{.data} section:
1333 If you have already explicitly included some files by name, @samp{*}
1334 refers to all @emph{remaining} files---those whose places in the output
1335 file have not yet been defined.
1337 @item [ @var{section} ]
1338 @itemx [ @var{section}, @var{section}, @dots{} ]
1339 @itemx [ @var{section} @var{section} @dots{} ]
1340 @kindex [ @var{sections} ]
1341 This is an alternate notation to specify named sections from all
1342 unallocated input files; its effect is exactly the same as that of
1343 @samp{* (@var{section}@dots{})}
1345 @item @var{filename}@code{( COMMON )}
1348 @cindex uninitialized data
1349 @cindex commons in output
1350 Specify where in your output file to place uninitialized data
1351 with this notation. @code{*(COMMON)} by itself refers to all
1352 uninitialized data from all input files (so far as it is not yet
1353 allocated); @var{filename}@code{(COMMON)} refers to uninitialized data
1354 from a particular file. Both are special cases of the general
1355 mechanisms for specifying where to place input-file sections:
1356 @code{ld} permits you to refer to uninitialized data as if it
1357 were in an input-file section named @code{COMMON}, regardless of the
1358 input file's format.
1361 For example, the following command script arranges the output file into
1362 three consecutive sections, named @code{.text}, @code{.data}, and
1363 @code{.bss}, taking the input for each from the correspondingly named
1364 sections of all the input files:
1367 .text : @{ *(.text) @}
1368 .data : @{ *(.data) @}
1369 .bss : @{ *(.bss) *(COMMON) @}
1373 The following example reads all of the sections from file @code{all.o}
1374 and places them at the start of output section @code{outputa} which
1375 starts at location @code{0x10000}. All of section @code{.input1} from
1376 file @code{foo.o} follows immediately, in the same output section. All
1377 of section @code{.input2} from @code{foo.o} goes into output section
1378 @code{outputb}, followed by section @code{.input1} from @code{foo1.o}.
1379 All of the remaining @code{.input1} and @code{.input2} sections from any
1380 files are written to output section @code{outputc}.
1402 There are still more kinds of statements permitted in the contents of
1403 output section definitions. The foregoing statements permitted you to
1404 arrange, in your output file, data originating from your input files.
1405 You can also place data directly in an output section from the link
1406 command script. Most of these additional statements involve
1407 expressions; @pxref{Expressions}. Although these statements are shown
1408 separately here for ease of presentation, no such segregation is needed
1409 within a section definition in the @code{SECTIONS} command; you can
1410 intermix them freely with any of the statements we've just described.
1413 @item CREATE_OBJECT_SYMBOLS
1414 @kindex CREATE_OBJECT_SYMBOLS
1415 @cindex input filename symbols
1416 @cindex filename symbols
1417 Create a symbol for each input file
1418 in the current section, set to the address of the first byte of
1419 data written from the input file. For instance, with @code{a.out}
1420 files it is conventional to have a symbol for each input file. You can
1421 accomplish this by defining the output @code{.text} section as follows:
1426 CREATE_OBJECT_SYMBOLS
1428 _etext = ALIGN(0x2000);
1434 If @code{objsym} is a file containing this script, and @code{a.o},
1435 @code{b.o}, @code{c.o}, and @code{d.o} are four input files with
1436 contents like the following---
1446 @samp{ld -M sample a.o b.o c.o d.o} would create a map like this,
1447 containing symbols matching the object file names:
1449 00000000 A __DYNAMIC
1452 00002020 T _afunction
1455 00002038 T _bfunction
1458 00002050 T _cfunction
1461 00002068 T _dfunction
1471 @item @var{symbol} = @var{expression} ;
1472 @kindex @var{symbol} = @var{expression} ;
1473 @itemx @var{symbol} @var{f}= @var{expression} ;
1474 @kindex @var{symbol} @var{f}= @var{expression} ;
1475 @var{symbol} is any symbol name (@pxref{Symbols}). ``@var{f}=''
1476 refers to any of the operators @code{&= += -= *= /=} which combine
1477 arithmetic and assignment.
1479 @cindex assignment, in section defn
1480 When you assign a value to a symbol within a particular section
1481 definition, the value is relative to the beginning of the section
1482 (@pxref{Assignment}). If you write
1487 .data : @{ @dots{} rel = 14 ; @dots{} @}
1488 abs2 = 14 + ADDR(.data);
1492 @c FIXME: Try above example!
1494 @code{abs} and @code{rel} do not have the same value; @code{rel} has the
1495 same value as @code{abs2}.
1497 @item BYTE(@var{expression})
1498 @kindex BYTE(@var{expression})
1499 @itemx SHORT(@var{expression})
1500 @kindex SHORT(@var{expression})
1501 @itemx LONG(@var{expression})
1502 @kindex LONG(@var{expression})
1503 @cindex direct output
1504 By including one of these three statements in a section definition, you
1505 can explicitly place one, two, or four bytes (respectively) at the
1506 current address of that section.
1508 @ifclear SingleFormat
1509 Multiple-byte quantities are represented in whatever byte order is
1510 appropriate for the output file format (@pxref{BFD}).
1513 @item FILL(@var{expression})
1514 @kindex FILL(@var{expression})
1515 @cindex holes, filling
1516 @cindex unspecified memory
1517 Specifies the ``fill pattern'' for the current section. Any otherwise
1518 unspecified regions of memory within the section (for example, regions
1519 you skip over by assigning a new value to the location counter @samp{.})
1520 are filled with the two least significant bytes from the
1521 @var{expression} argument. A @code{FILL} statement covers memory
1522 locations @emph{after} the point it occurs in the section definition; by
1523 including more than one @code{FILL} statement, you can have different
1524 fill patterns in different parts of an output section.
1527 @node Section Options
1528 @subsection Optional Section Attributes
1529 @cindex section defn, full syntax
1530 Here is the full syntax of a section definition, including all the
1536 @var{secname} @var{start} BLOCK(@var{align}) (NOLOAD) : @{ @var{contents} @} =@var{fill} >@var{region}
1541 @var{secname} and @var{contents} are required. @xref{Section
1542 Definition}, and @pxref{Section Contents} for details on @var{contents}.
1543 The remaining elements---@var{start}, @code{BLOCK(@var{align)}},
1544 @code{(NOLOAD)} @code{=@var{fill}}, and @code{>@var{region}}---are all
1549 @cindex start address, section
1550 @cindex section start
1551 @cindex section address
1552 You can force the output section to be loaded at a specified address by
1553 specifying @var{start} immediately following the section name.
1554 @var{start} can be represented as any expression. The following
1555 example generates section @var{output} at location
1560 output 0x40000000: @{
1567 @item BLOCK(@var{align})
1568 @kindex BLOCK(@var{align})
1569 @cindex section alignment
1570 @cindex aligning sections
1571 You can include @code{BLOCK()} specification to advance
1572 the location counter @code{.} prior to the beginning of the section, so
1573 that the section will begin at the specified alignment. @var{align} is
1578 @cindex prevent unnecessary loading
1579 Use @samp{(NOLOAD)} to prevent a section from being loaded into memory
1580 each time it is accessed. For example, in the script sample below, the
1581 @code{ROM} segment is addressed at memory location @samp{0} and does not
1582 need to be loaded into each object file:
1585 ROM 0 (NOLOAD) : @{ @dots{} @}
1592 @cindex section fill pattern
1593 @cindex fill pattern, entire section
1595 @code{=@var{fill}} in a section definition specifies the initial fill
1596 value for that section.
1597 You may use any expression to specify @var{fill}.
1598 Any unallocated holes in the current output
1599 section when written to the output file will be filled with the two
1600 least significant bytes of the value, repeated as necessary. You can
1601 also change the fill value with a @code{FILL} statement in the
1602 @var{contents} of a section definition.
1605 @kindex >@var{region}
1606 @cindex section, assigning to memory region
1607 @cindex memory regions and sections
1608 Assign this section to a previously defined region of memory.
1614 @section The Entry Point
1615 @kindex ENTRY(@var{symbol})
1616 @cindex start of execution
1617 @cindex first instruction
1618 The linker command language includes a command specifically for
1619 defining the first executable instruction in an output file (its
1620 @dfn{entry point}). Its argument is a symbol name:
1625 Like symbol assignments, the @code{ENTRY} command may be placed either
1626 as an independent command in the command file, or among the section
1627 definitions within the @code{SECTIONS} command---whatever makes the most
1628 sense for your layout.
1630 @cindex entry point, defaults
1631 @code{ENTRY} is only one of several ways of choosing the entry point.
1632 You may indicate it in any of the following ways (shown in descending
1633 order of priority: methods higher in the list override methods lower down).
1636 the @samp{-e} @var{entry} command-line option;
1638 the @code{ENTRY(@var{symbol}} command in a linker control script;
1640 the value of the symbol @code{start}, if present;
1642 the value of the symbol @code{_main}, if present;
1644 the address of the first byte of the @code{.text} section, if present;
1646 The address @code{0}.
1649 For example, you can use these rules to generate an entry point with an
1650 assignment statement: if no symbol @code{start} is defined within your
1651 input files, you can simply define it, assigning it an appropriate
1658 The example shows an absolute address, but you can use any expression.
1659 For example, if your input object files use some other symbol-name
1660 convention for the entry point, you can just assign the value of
1661 whatever symbol contains the start address to @code{start}:
1663 start = other_symbol ;
1666 @node Other Commands
1667 @section Other Commands
1668 The command language includes a number of other commands that you can
1669 use for specialized purposes. They are similar in purpose to
1670 command-line options.
1677 These keywords were used in some older linkers to request a particular
1678 math subroutine library. @code{ld} doesn't use the keywords, assuming
1679 instead that any necessary subroutines are in libraries specified using
1680 the general mechanisms for linking to archives; but to permit the use of
1681 scripts that were written for the older linkers, the keywords
1682 @code{FLOAT} and @code{NOFLOAT} are accepted and ignored.
1684 @item FORCE_COMMON_ALLOCATION
1685 @kindex FORCE_COMMON_ALLOCATION
1686 @cindex common allocation
1687 This command has the same effect as the @samp{-d} command-line option:
1688 to make @code{ld} assign space to common symbols even if a relocatable
1689 output file is specified (@samp{-r}).
1691 @item INPUT ( @var{file}, @var{file}, @dots{} )
1692 @kindex INPUT ( @var{files} )
1693 @itemx INPUT ( @var{file} @var{file} @dots{} )
1694 @cindex binary input files
1695 Use this command to include binary input files in the link, without
1696 including them in a particular section definition. Files specified this
1697 way are treated identically to object files listed on the command line.
1700 @item MAP ( @var{name} )
1701 @kindex MAP ( @var{name} )
1702 @c MAP(...) appears to look for an F in the arg, ignoring all other
1703 @c chars; if it finds one, it sets "map_option_f" to true. But nothing
1704 @c checks map_option_f. Apparently a stub for the future...
1707 @item OUTPUT ( @var{filename} )
1708 @kindex OUTPUT ( @var{filename} )
1709 @cindex naming the output file
1710 Use this command to name the link output file @var{filename}. The
1711 effect of @code{OUTPUT(@var{filename})} is identical to the effect of
1712 @w{@samp{-o @var{filename}}}, and whichever is encountered last will
1713 control the name actually used to name the output file. In particular,
1714 you can use this command to supply a default output-file name other than
1717 @ifclear SingleFormat
1718 @item OUTPUT_ARCH ( @var{bfdname} )
1719 @kindex OUTPUT_ARCH ( @var{bfdname} )
1720 @cindex machine architecture, output
1721 Specify a particular output machine architecture, with one of the names
1722 used by the BFD back-end routines (@pxref{BFD}). This command is often
1723 unnecessary; the architecture is most often set implicitly by either the
1724 system BFD configuration or as a side effect of the @code{OUTPUT_FORMAT}
1727 @item OUTPUT_FORMAT ( @var{bfdname} )
1728 @kindex OUTPUT_FORMAT ( @var{bfdname} )
1729 @cindex format, output file
1730 Specify a particular output format, with one of the names used by the
1731 BFD back-end routines (@pxref{BFD}). This selection will only affect
1732 the output file; the related command @code{TARGET} affects primarily
1736 @item SEARCH_DIR ( @var{path} )
1737 @kindex SEARCH_DIR ( @var{path} )
1738 @cindex path for libraries
1739 @cindex search path, libraries
1740 Add @var{path} to the list of paths where @code{ld} looks for
1741 archive libraries. @code{SEARCH_DIR(@var{path})} has the same
1742 effect as @samp{-L@var{path}} on the command line.
1744 @item STARTUP ( @var{filename} )
1745 @kindex STARTUP ( @var{filename} )
1746 @cindex first input file
1747 Ensure that @var{filename} is the first input file used in the link
1750 @ifclear SingleFormat
1751 @item TARGET ( @var{format} )
1752 @cindex input file format
1753 @kindex TARGET ( @var{format} )
1754 Change the input-file object code format (like the command-line option
1755 @samp{-b} or its synonym @samp{-format}). The argument @var{format} is
1756 one of the strings used by BFD to name binary formats. In the current
1757 @code{ld} implementation, if @code{TARGET} is specified but
1758 @code{OUTPUT_FORMAT} is not, the last @code{TARGET} argument is also
1759 used as the default format for the @code{ld} output file.
1763 If you don't use the @code{TARGET} command, @code{ld} uses the value of
1764 the environment variable @code{GNUTARGET}, if available, to select the
1765 output file format. If that variable is also absent, @code{ld} uses
1766 the default format configured for your machine in the BFD libraries.
1771 @node Machine Dependent
1772 @chapter Machine Dependent Features
1774 @cindex machine dependencies
1775 @code{ld} has additional features on some platforms; the following
1776 sections describe them. Machines where @code{ld} has no additional
1777 functionality are not listed.
1780 * H8/300:: @code{ld} and the H8/300
1781 * i960:: @code{ld} and the Intel 960 family
1785 @c FIXME! This could use @up/@down, but there seems to be a conflict
1786 @c between those and node-defaulting.
1792 @section @code{ld} and the H8/300
1794 @cindex H8/300 support
1795 For the H8/300, @code{ld} can perform these global optimizations when
1796 you specify the @samp{-relax} command-line option.
1799 @item relaxing address modes
1800 @cindex relaxing on H8/300
1801 @code{ld} finds all @code{jsr} and @code{jmp} instructions whose
1802 targets are within eight bits, and turns them into eight-bit
1803 program-counter relative @code{bsr} and @code{bra} instructions,
1806 @item synthesizing instructions
1807 @cindex synthesizing on H8/300
1808 @c FIXME: specifically mov.b, or any mov instructions really?
1809 @code{ld} finds all @code{mov.b} instructions which use the
1810 sixteen-bit absolute address form, but refer to the top
1811 page of memory, and changes them to use the eight-bit address form.
1812 (That is: the linker turns @samp{mov.b @code{@@}@var{aa}:16} into
1813 @samp{mov.b @code{@@}@var{aa}:8} whenever the address @var{aa} is in the
1814 top page of memory).
1826 @section @code{ld} and the Intel 960 family
1828 @cindex i960 support
1830 You can use the @samp{-A@var{architecture}} command line option to
1831 specify one of the two-letter names identifying members of the 960
1832 family; the option specifies the desired output target, and warns of any
1833 incompatible instructions in the input files. It also modifies the
1834 linker's search strategy for archive libraries, to support the use of
1835 libraries specific to each particular architecture, by including in the
1836 search loop names suffixed with the string identifying the architecture.
1838 For example, if your @code{ld} command line included @w{@samp{-ACA}} as
1839 well as @w{@samp{-ltry}}, the linker would look (in its built-in search
1840 paths, and in any paths you specify with @samp{-L}) for a library with
1851 The first two possibilities would be considered in any event; the last
1852 two are due to the use of @w{@samp{-ACA}}.
1854 You can meaningfully use @samp{-A} more than once on a command line, since
1855 the 960 architecture family allows combination of target architectures; each
1856 use will add another pair of name variants to search for when @w{@samp{-l}}
1857 specifies a library.
1863 @ifclear SingleFormat
1868 @cindex object file management
1869 The linker accesses object and archive files using the BFD libraries.
1870 These libraries allow the linker to use the same routines to operate on
1871 object files whatever the object file format. A different object file
1872 format can be supported simply by creating a new BFD back end and adding
1873 it to the library. You can use @code{objdump -i}
1874 (@pxref{objdump,,objdump,binutils.info,The GNU Binary Utilities}) to
1875 list all the formats available for each architecture under BFD. This
1876 was the list of formats, and of architectures supported for each format,
1877 as of the time this manual was prepared:
1878 @cindex formats available
1879 @cindex architectures available
1881 BFD header file version 0.18
1883 (header big endian, data big endian)
1889 (header big endian, data big endian)
1895 (header big endian, data little endian)
1898 (header little endian, data little endian)
1901 (header big endian, data big endian)
1904 (header big endian, data big endian)
1907 (header little endian, data little endian)
1910 (header big endian, data little endian)
1913 (header little endian, data little endian)
1916 (header big endian, data big endian)
1919 (header big endian, data big endian)
1922 (header big endian, data big endian)
1925 (header little endian, data little endian)
1928 (header big endian, data big endian)
1940 (header little endian, data little endian)
1952 (header big endian, data big endian)
1964 (header big endian, data big endian)
1977 @cindex BFD requirements
1978 @cindex requirements for BFD
1979 As with most implementations, BFD is a compromise between
1980 several conflicting requirements. The major factor influencing
1981 BFD design was efficiency: any time used converting between
1982 formats is time which would not have been spent had BFD not
1983 been involved. This is partly offset by abstraction payback; since
1984 BFD simplifies applications and back ends, more time and care
1985 may be spent optimizing algorithms for a greater speed.
1987 One minor artifact of the BFD solution which you should bear in
1988 mind is the potential for information loss. There are two places where
1989 useful information can be lost using the BFD mechanism: during
1990 conversion and during output. @xref{BFD information loss}.
1993 * BFD outline:: How it works: an outline of BFD
1994 * BFD information loss:: Information Loss
1995 * Mechanism:: Mechanism
1999 @section How it works: an outline of BFD
2000 @cindex opening object files
2001 When an object file is opened, BFD subroutines automatically
2002 determine the format of the input object file, and build a descriptor in
2003 memory with pointers to routines that will be used to access elements of
2004 the object file's data structures.
2006 As different information from the the object files is required,
2007 BFD reads from different sections of the file and processes them.
2008 For example, a very common operation for the linker is processing symbol
2009 tables. Each BFD back end provides a routine for converting
2010 between the object file's representation of symbols and an internal
2011 canonical format. When the linker asks for the symbol table of an object
2012 file, it calls through the memory pointer to the BFD
2013 back end routine which reads and converts the table into a canonical
2014 form. The linker then operates upon the common form. When the link is
2015 finished and the linker writes the symbol table of the output file,
2016 another BFD back end routine is called which takes the newly
2017 created symbol table and converts it into the chosen output format.
2019 @node BFD information loss
2020 @section Information Loss
2021 @emph{Information can be lost during output.} The output formats
2022 supported by BFD do not provide identical facilities, and
2023 information which may be described in one form has nowhere to go in
2024 another format. One example of this is alignment information in
2025 @code{b.out}. There is nowhere in an @code{a.out} format file to store
2026 alignment information on the contained data, so when a file is linked
2027 from @code{b.out} and an @code{a.out} image is produced, alignment
2028 information will not propagate to the output file. (The linker will
2029 still use the alignment information internally, so the link is performed
2032 Another example is COFF section names. COFF files may contain an
2033 unlimited number of sections, each one with a textual section name. If
2034 the target of the link is a format which does not have many sections (e.g.,
2035 @code{a.out}) or has sections without names (e.g., the Oasys format) the
2036 link cannot be done simply. You can circumvent this problem by
2037 describing the desired input-to-output section mapping with the command
2040 @emph{Information can be lost during canonicalization.} The BFD
2041 internal canonical form of the external formats is not exhaustive; there
2042 are structures in input formats for which there is no direct
2043 representation internally. This means that the BFD back ends
2044 cannot maintain all possible data richness through the transformation
2045 between external to internal and back to external formats.
2047 This limitation is only a problem when using the linker to read one
2048 format and write another. Each BFD back end is responsible for
2049 maintaining as much data as possible, and the internal BFD
2050 canonical form has structures which are opaque to the BFD core,
2051 and exported only to the back ends. When a file is read in one format,
2052 the canonical form is generated for BFD and the linker. At the
2053 same time, the back end saves away any information which would otherwise
2054 be lost. If the data is then written back in the same format, the back
2055 end routine will be able to use the canonical form provided by the
2056 BFD core as well as the information it prepared earlier. Since
2057 there is a great deal of commonality between back ends,
2058 there is no information lost when
2059 linking big endian COFF to little endian COFF, or from @code{a.out} to
2060 @code{b.out}. When a mixture of formats is linked, the information is
2061 only lost from the files whose format differs from the destination.
2065 The greatest potential for loss of information occurs when there is the least
2066 overlap between the information provided by the source format, that
2067 stored by the canonical format, and that needed by the
2068 destination format. A brief description of the canonical form may help
2069 you understand which kinds of data you can count on preserving across
2071 @cindex BFD canonical format
2072 @cindex internal object-file format
2076 Information on target machine architecture, particular implementation,
2077 and format type are stored on a per-file basis. Other information
2078 includes a demand pagable bit and a write protected bit.
2079 Information like Unix magic numbers is not stored here---only the magic
2080 numbers' meaning, so a @code{ZMAGIC} file would have both the demand pagable
2081 bit and the write protected text bit set.
2083 The byte order of the target is stored on a per-file basis, so that big-
2084 and little-endian object files may be linked with one another.
2087 Each section in the input file contains the name of the section, the
2088 original address in the object file, various options, size and alignment
2089 information and pointers into other BFD data structures.
2092 Each symbol contains a pointer to the object file which originally
2093 defined it, its name, its value, and various option bits. When a
2094 BFD back end reads in a symbol table, the back end relocates all
2095 symbols to make them relative to the base of the section where they were
2096 defined. Doing this ensures that each symbol points to its containing
2097 section. Each symbol also has a varying amount of hidden
2098 private data for the BFD back end. Since the symbol points to the
2099 original file, the private data format for that symbol is accessible.
2100 @code{ld} can operate on a collection of symbols of wildly different
2101 formats without problems.
2103 Normal global and simple local symbols are maintained on output, so an
2104 output file (no matter its format) will retain symbols pointing to
2105 functions and to global, static, and common variables. Some symbol
2106 information is not worth retaining; in @code{a.out}, type information is
2107 stored in the symbol table as long symbol names. This information would
2108 be useless to most COFF debuggers and may be thrown away with
2109 appropriate command line switches. (The GNU debugger @code{gdb} does
2110 support @code{a.out} style debugging information in COFF).
2112 There is one word of type information within the symbol, so if the
2113 format supports symbol type information within symbols (for example, COFF,
2114 IEEE, Oasys) and the type is simple enough to fit within one word
2115 (nearly everything but aggregates), the information will be preserved.
2117 @item relocation level
2118 Each canonical BFD relocation record contains a pointer to the symbol to
2119 relocate to, the offset of the data to relocate, the section the data
2120 is in, and a pointer to a relocation type descriptor. Relocation is
2121 performed by passing messages through the relocation type
2122 descriptor and the symbol pointer. Therefore, relocations can be performed
2123 on output data using a relocation method that is only available in one of the
2124 input formats. For instance, Oasys provides a byte relocation format.
2125 A relocation record requesting this relocation type would point
2126 indirectly to a routine to perform this, so the relocation may be
2127 performed on a byte being written to a COFF file, even though 68k COFF
2128 has no such relocation type.
2129 @c FIXME why specific reference to 68K above?
2132 Object formats can contain, for debugging purposes, some form of mapping
2133 between symbols, source line numbers, and addresses in the output file.
2134 These addresses have to be relocated along with the symbol information.
2135 Each symbol with an associated list of line number records points to the
2136 first record of the list. The head of a line number list consists of a
2137 pointer to the symbol, which allows finding out the address of the
2138 function whose line number is being described. The rest of the list is
2139 made up of pairs: offsets into the section and line numbers. Any format
2140 which can simply derive this information can pass it successfully
2141 between formats (COFF, IEEE and Oasys).
2146 @appendix MRI Compatible Script Files
2147 @cindex MRI compatibility
2148 To aid users making the transition to @sc{gnu} @code{ld} from the MRI
2149 linker, @code{ld} can use MRI compatible linker scripts as an
2150 alternative to the more general-purpose linker scripting language
2151 described in @ref{Commands,,Command Language}. MRI compatible linker
2152 scripts have a much simpler command set than the scripting language
2153 otherwise used with @code{ld}. @sc{gnu} @code{ld} supports the most
2154 commonly used MRI linker commands; these commands are described here.
2156 You can specify a file containing an MRI-compatible script using the
2157 @samp{-c} command-line option.
2159 Each command in an MRI-compatible script occupies its own line; each
2160 command line starts with the keyword that identifies the command (though
2161 blank lines are also allowed for punctuation). If a line of an
2162 MRI-compatible script begins with an unrecognized keyword, @code{ld}
2163 issues a warning message, but continues processing the script.
2165 Lines beginning with @samp{*} are comments.
2167 You can write these commands using all upper-case letters, or all
2168 lower case; for example, @samp{chip} is the same as @samp{CHIP}.
2169 The following list shows only the upper-case form of each command.
2172 @item ABSOLUTE @var{secname}
2173 @item ABSOLUTE @var{secname}, @var{secname}, @dots{} @var{secname}
2174 @cindex @code{ABSOLUTE} (MRI)
2175 Normally, @code{ld} includes in the output file all sections from all
2176 the input files. However, in an MRI-compatible script, you can use the
2177 @code{ABSOLUTE} command to restrict the sections that will be present in
2178 your output program. If the @code{ABSOLUTE} command is used at all in a
2179 script, then only the sections named explicitly in @code{ABSOLUTE}
2180 commands will appear in the linker output. You can still use other
2181 input sections (whatever you select on the command line, or using
2182 @code{LOAD}) to resolve addresses in the output file.
2184 @item ALIAS @var{out-secname}, @var{in-secname}
2185 @cindex @code{ALIAS} (MRI)
2186 Use this command to place the data from input section @var{in-secname}
2187 in a section called @var{out-secname} in the linker output file.
2189 @var{in-secname} may be an integer.
2191 @item BASE @var{expression}
2192 @cindex @code{BASE} (MRI)
2193 Use the value of @var{expression} as the lowest address (other than
2194 absolute addresses) in the output file.
2196 @item CHIP @var{expression}
2197 @itemx CHIP @var{expression}, @var{expression}
2198 @cindex @code{CHIP} (MRI)
2199 This command does nothing; it is accepted only for compatibility.
2202 @cindex @code{END} (MRI)
2203 This command does nothing whatever; it's only accepted for compatibility.
2205 @item FORMAT @var{output-format}
2206 @cindex @code{FORMAT} (MRI)
2207 Similar to the @code{OUTPUT_FORMAT} command in the more general linker
2208 language, but restricted to one of these output formats:
2211 S-records, if @var{output-format} is @samp{S}
2214 IEEE, if @var{output-format} is @samp{IEEE}
2217 COFF (the @samp{coff-m68k} variant in BFD), if @var{output-format} is
2221 @item LIST @var{anything}@dots{}
2222 @cindex @code{LIST} (MRI)
2223 Print (to the standard output file) a link map, as produced by the
2224 @code{ld} command-line option @samp{-M}.
2226 The keyword @code{LIST} may be followed by anything on the
2227 same line, with no change in its effect.
2229 @item LOAD @var{filename}
2230 @item LOAD @var{filename}, @var{filename}, @dots{} @var{filename}
2231 @cindex @code{LOAD} (MRI)
2232 Include one or more object file @var{filename} in the link; this has the
2233 same effect as specifying @var{filename} directly on the @code{ld}
2236 @item NAME @var{output-name}
2237 @cindex @code{NAME} (MRI)
2238 @var{output-name} is the name for the program produced by @code{ld}; the
2239 MRI-compatible command @code{NAME} is equivalent to the command-line
2240 option @samp{-o} or the general script language command @code{OUTPUT}.
2242 @item ORDER @var{secname}, @var{secname}, @dots{} @var{secname}
2243 @itemx ORDER @var{secname} @var{secname} @var{secname}
2244 @cindex @code{ORDER} (MRI)
2245 Normally, @code{ld} orders the sections in its output file in the
2246 order in which they first appear in the input files. In an MRI-compatible
2247 script, you can override this ordering with the @code{ORDER} command. The
2248 sections you list with @code{ORDER} will appear first in your output
2249 file, in the order specified.
2251 @item PUBLIC @var{name}=@var{expression}
2252 @itemx PUBLIC @var{name},@var{expression}
2253 @itemx PUBLIC @var{name} @var{expression}
2254 @cindex @code{PUBLIC} (MRI)
2255 Supply a value (@var{expression}) for external symbol
2256 @var{name} used in the linker input files.
2258 @item SECT @var{secname}, @var{expression}
2259 @itemx SECT @var{secname}=@var{expression}
2260 @itemx SECT @var{secname} @var{expression}
2261 @cindex @code{SECT} (MRI)
2262 You can use any of these three forms of the @code{SECT} command to
2263 specify the start address (@var{expression}) for section @var{secname}.
2264 If you have more than one @code{SECT} statement for the same
2265 @var{secname}, only the @emph{first} sets the start address.
2275 % I think something like @colophon should be in texinfo. In the
2277 \long\def\colophon{\hbox to0pt{}\vfill
2278 \centerline{The body of this manual is set in}
2279 \centerline{\fontname\tenrm,}
2280 \centerline{with headings in {\bf\fontname\tenbf}}
2281 \centerline{and examples in {\tt\fontname\tentt}.}
2282 \centerline{{\it\fontname\tenit\/} and}
2283 \centerline{{\sl\fontname\tensl\/}}
2284 \centerline{are used for emphasis.}\vfill}