ld/ld.texinfo

   1 \input texinfo
   2 @setfilename ld.info
   3 @syncodeindex ky cp
   4 @c @include configdoc.texi
   5
   6 @c @smallbook
   7 @c @cropmarks
   8
   9 @ifinfo
  10 @format
  11 START-INFO-DIR-ENTRY
  12 * Ld::                          The GNU linker.
  13 END-INFO-DIR-ENTRY
  14 @end format
  15 @end ifinfo
  16
  17 @ifinfo
  18 This file documents the GNU linker LD.
  19
  20 Copyright (C) 1991, 1992, 1993 Free Software Foundation, Inc.
  21
  22 Permission is granted to make and distribute verbatim copies of
  23 this manual provided the copyright notice and this permission notice
  24 are preserved on all copies.
  25
  26 Permission is granted to copy and distribute modified versions of this
  27 manual under the conditions for verbatim copying, provided also that
  28 the entire resulting derived work is distributed under the terms of a
  29 permission notice identical to this one.
  30
  31 Permission is granted to copy and distribute translations of this manual
  32 into another language, under the above conditions for modified versions.
  33
  34 @ignore
  35 Permission is granted to process this file through Tex and print the
  36 results, provided the printed document carries copying permission
  37 notice identical to this one except for the removal of this paragraph
  38 (this paragraph not being relevant to the printed manual).
  39
  40 @end ignore
  41 @end ifinfo
  42 @iftex
  43 @finalout
  44 @setchapternewpage odd
  45 @settitle Using LD, the GNU linker
  46 @titlepage
  47 @title Using ld
  48 @subtitle The GNU linker
  49 @sp 1
  50 @subtitle @code{ld} version 2
  51 @subtitle March 1993
  52 @author Steve Chamberlain and Roland Pesch
  53 @author Cygnus Support
  54 @page
  55
  56 @tex
  57 {\parskip=0pt
  58 \hfill Cygnus Support\par
  59 \hfill steve\@cygnus.com, pesch\@cygnus.com\par
  60 \hfill {\it Using LD, the GNU linker}\par
  61 \hfill Edited by Jeffrey Osier (jeffrey\@cygnus.com), March 1993.\par
  62 }
  63 \global\parindent=0pt % Steve likes it this way.
  64 @end tex
  65
  66 @vskip 0pt plus 1filll
  67 Copyright @copyright{} 1991, 1992, 1993 Free Software Foundation, Inc.
  68
  69 Permission is granted to make and distribute verbatim copies of
  70 this manual provided the copyright notice and this permission notice
  71 are preserved on all copies.
  72
  73 Permission is granted to copy and distribute modified versions of this
  74 manual under the conditions for verbatim copying, provided also that
  75 the entire resulting derived work is distributed under the terms of a
  76 permission notice identical to this one.
  77
  78 Permission is granted to copy and distribute translations of this manual
  79 into another language, under the above conditions for modified versions.
  80 @end titlepage
  81 @end iftex
  82 @c FIXME: Talk about importance of *order* of args, cmds to linker!
  83
  84 @ifinfo
  85 @node Top
  86 @top Using ld
  87 This file documents the GNU linker ld.
  88
  89 @menu
  90 * Overview::                    Overview
  91 * Invocation::                  Invocation
  92 * Commands::                    Command Language
  93 @ifset GENERIC
  94 * Machine Dependent::           Machine Dependent Features
  95 @end ifset
  96 @ifclear GENERIC
  97 @ifset H8300
  98 * H8/300::                      ld and the H8/300
  99 @end ifset
 100 @ifset I960
 101 * i960::                        ld and the Intel 960 family
 102 @end ifset
 103 @end ifclear
 104 @ifclear SingleFormat
 105 * BFD::                         BFD
 106 @end ifclear
 107 @c Following blank line required for remaining bug in makeinfo conds/menus
 108
 109 * MRI::                         MRI Compatible Script Files
 110 * Index::                       Index
 111 @end menu
 112 @end ifinfo
 113
 114 @node Overview
 115 @chapter Overview
 116
 117 @cindex GNU linker
 118 @cindex what is this?
 119 @code{ld} combines a number of object and archive files, relocates
 120 their data and ties up symbol references. Usually the last step in
 121 compiling a program is to run @code{ld}.
 122
 123 @code{ld} accepts Linker Command Language files written in
 124 a superset of AT&T's Link Editor Command Language syntax,
 125 to provide explicit and total control over the linking process.
 126
 127 @ifclear SingleFormat
 128 This version of @code{ld} uses the general purpose BFD libraries
 129 to operate on object files. This allows @code{ld} to read, combine, and
 130 write object files in many different formats---for example, COFF or
 131 @code{a.out}.  Different formats may be linked together to produce any
 132 available kind of object file.  @xref{BFD} for a list of formats
 133 supported on various architectures.
 134 @end ifclear
 135
 136 Aside from its flexibility, the GNU linker is more helpful than other
 137 linkers in providing diagnostic information.  Many linkers abandon
 138 execution immediately upon encountering an error; whenever possible,
 139 @code{ld} continues executing, allowing you to identify other errors
 140 (or, in some cases, to get an output file in spite of the error).
 141
 142 @node Invocation
 143 @chapter Invocation
 144
 145 The GNU linker @code{ld} is meant to cover a broad range of situations,
 146 and to be as compatible as possible with other linkers.  As a result,
 147 you have many choices to control its behavior.
 148
 149 @ifset UsesEnvVars
 150 @menu
 151 * Options::                     Command Line Options
 152 * Environment::                 Environment Variables
 153 @end menu
 154
 155 @node Options
 156 @section Command Line Options
 157 @end ifset
 158
 159 @cindex command line
 160 @cindex options
 161 Here is a summary of the options you can use on the @code{ld} command
 162 line:
 163
 164 @c FIXME!  -relax only avail h8/300, i960.  Conditionals screwed in examples.
 165 @smallexample
 166 ld [-o @var{output} ] @var{objfile}@dots{}
 167   [ -A@var{architecture} ]  [ -b @var{input-format} ]  [ -Bstatic ]
 168   [ -c @var{MRI-commandfile} ]  [ -d | -dc | -dp ]
 169   [ -defsym @var{symbol}=@var{expression} ]
 170   [ -e @var{entry} ]  [ -F ]  [ -F @var{format} ]
 171   [ -format @var{input-format} ]  [ -g ]  [ -G @var{size} ]  [ -i ]
 172   [ -l@var{ar} ]  [ -L@var{searchdir} ]  [ -M ]  [ -m @var{emulation} ]
 173   [ -N | -n ]  [ -noinhibit-exec ]  [ -R @var{filename} ]
 174   [ -relax ]  [ -r | -Ur ]  [ -S ]  [ -s ]  [ -T @var{commandfile} ]
 175   [ -Ttext @var{textorg} ]  [ -Tdata @var{dataorg} ]  [ -Tbss @var{bssorg} ]
 176   [ -t ]  [ -u @var{sym}]  [-V]  [-v]  [ -X ]  [ -x ]  [ -y@var{symbol} ]
 177   [ @{ @var{script} @} ]
 178 @end smallexample
 179
 180 This plethora of command-line options may seem intimidating, but in
 181 actual practice few of them are used in any particular context.
 182 @cindex standard Unix system
 183 For instance, a frequent use of @code{ld} is to link standard Unix
 184 object files on a standard, supported Unix system.  On such a system, to
 185 link a file @code{hello.o}:
 186
 187 @example
 188 ld -o @var{output} /lib/crt0.o hello.o -lc
 189 @end example
 190
 191 This tells @code{ld} to produce a file called @var{output} as the
 192 result of linking the file @code{/lib/crt0.o} with @code{hello.o} and
 193 the library @code{libc.a}, which will come from the standard search
 194 directories.  (See the discussion of the @samp{-l} option below.)
 195
 196 The command-line options to @code{ld} may be specified in any order, and
 197 may be repeated at will.  Repeating most options with a
 198 different argument will either have no further effect, or override prior
 199 occurrences (those further to the left on the command line) of that
 200 option.
 201
 202 @ifclear SingleFormat
 203 The exceptions---which may meaningfully be used more than once---are
 204 @samp{-A}, @samp{-b} (or its synonym @samp{-format}), @samp{-defsym},
 205 @samp{-L}, @samp{-l}, @samp{-R}, and @samp{-u}.
 206 @end ifclear
 207 @ifset SingleFormat
 208 The exceptions---which may meaningfully be used more than once---are
 209 @samp{-A}, @samp{-defsym}, @samp{-L}, @samp{-l}, @samp{-R}, and @samp{-u}.
 210 @end ifset
 211
 212 @cindex object files
 213 The list of object files to be linked together, shown as @var{objfile}@dots{},
 214 may follow, precede, or be mixed in with command-line options, except that
 215 an @var{objfile} argument may not be placed between an option and
 216 its argument.
 217
 218 Usually the linker is invoked with at least one object file, but other
 219 forms of binary input files can also be specified with @samp{-l},
 220 @samp{-R}, and the script command language.  If @emph{no} binary input
 221 files at all are specified, the linker does not produce any output, and
 222 issues the message @samp{No input files}.
 223
 224 Option arguments must either follow the option letter without intervening
 225 whitespace, or be given as separate arguments immediately following the
 226 option that requires them.
 227
 228 @table @code
 229 @item @var{objfile}@dots{}
 230 The object files to be linked.
 231
 232 @ifset I960
 233 @cindex architectures
 234 @kindex -A@var{arch}
 235 @item -A@var{architecture}
 236 In the current release of @code{ld}, this option is useful only for the
 237 Intel 960 family of architectures.  In that @code{ld} configuration, the
 238 @var{architecture} argument identifies the particular architecture in
 239 the 960 family, enabling some safeguards and modifying the
 240 archive-library search path.  @xref{i960,,@code{ld} and the Intel 960
 241 family}, for details.
 242
 243 Future releases of @code{ld} may support similar functionality for
 244 other architecture families.
 245 @end ifset
 246
 247 @ifclear SingleFormat
 248 @cindex binary input format
 249 @kindex -b @var{format}
 250 @cindex input format
 251 @item -b @var{input-format}
 252 @cindex input format
 253 Specify the binary format for input object files that follow this option
 254 on the command line.  You don't usually need to specify this, as
 255 @code{ld} is configured to expect as a default input format the most
 256 usual format on each machine.  @var{input-format} is a text string, the
 257 name of a particular format supported by the BFD libraries.
 258 @w{@samp{-format @var{input-format}}} has the same effect.  @xref{BFD}.
 259
 260 You may want to use this option if you are linking files with an unusual
 261 binary format.  You can also use @samp{-b} to switch formats explicitly (when
 262 linking object files of different formats), by including
 263 @samp{-b @var{input-format}} before each group of object files in a
 264 particular format.
 265
 266 The default format is taken from the environment variable
 267 @code{GNUTARGET}.
 268 @ifset UsesEnvVars
 269 @xref{Environment}.
 270 @end ifset
 271 You can also define the input
 272 format from a script, using the command @code{TARGET}; see @ref{Other
 273 Commands}.
 274 @end ifclear
 275
 276 @kindex -Bstatic
 277 @item -Bstatic
 278 Ignored.  This option is accepted for command-line compatibility with
 279 the SunOS linker.
 280
 281 @kindex -c @var{MRI-cmdfile}
 282 @cindex compatibility, MRI
 283 @item -c @var{MRI-commandfile}
 284 For compatibility with linkers produced by MRI, @code{ld} accepts script
 285 files written in an alternate, restricted command language, described in
 286 @ref{MRI,,MRI Compatible Script Files}.  Introduce MRI script files with
 287 the option @samp{-c}; use the @samp{-T} option to run linker
 288 scripts written in the general-purpose @code{ld} scripting language.
 289 If @var{MRI-cmdfile} does not exist, @code{ld} looks for it in the directories
 290 specified by any @samp{-L} options.
 291
 292 @cindex common allocation
 293 @kindex -d
 294 @item -d
 295 @kindex -dc
 296 @itemx -dc
 297 @kindex -dp
 298 @itemx -dp
 299 These three options are equivalent; multiple forms are supported for
 300 compatibility with other linkers.  They
 301 assign space to common symbols even if a relocatable output file is
 302 specified (with @samp{-r}).  The script command
 303 @code{FORCE_COMMON_ALLOCATION} has the same effect.  @xref{Other
 304 Commands}.
 305
 306 @cindex symbols, from command line
 307 @kindex -defsym @var{symbol}=@var{exp}
 308 @item -defsym @var{symbol}=@var{expression}
 309 Create a global symbol in the output file, containing the absolute
 310 address given by @var{expression}.  You may use this option as many
 311 times as necessary to define multiple symbols in the command line.  A
 312 limited form of arithmetic is supported for the @var{expression} in this
 313 context: you may give a hexadecimal constant or the name of an existing
 314 symbol, or use @code{+} and @code{-} to add or subtract hexadecimal
 315 constants or symbols.  If you need more elaborate expressions, consider
 316 using the linker command language from a script (@pxref{Assignment, ,
 317 Assignment: Symbol Definitions}).  @emph{Note:}  there should be no
 318 white space between @var{symbol}, the equals sign (``@key{=}''), and
 319 @var{expression}.
 320
 321 @cindex entry point, from command line
 322 @kindex -e @var{entry}
 323 @item -e @var{entry}
 324 Use @var{entry} as the explicit symbol for beginning execution of your
 325 program, rather than the default entry point. @xref{Entry Point}, for a
 326 discussion of defaults and other ways of specifying the
 327 entry point.
 328
 329 @ifclear SingleFormat
 330 @kindex -F
 331 @item -F
 332 @itemx -F@var{format}
 333 Ignored.  Some older linkers used this option throughout a compilation
 334 toolchain for specifying object-file format for both input and output
 335 object files.  The mechanisms @code{ld} uses for this purpose (the
 336 @samp{-b} or @samp{-format} options for input files, the @code{TARGET}
 337 command in linker scripts for output files, the @code{GNUTARGET}
 338 environment variable) are more flexible, but @code{ld} accepts the
 339 @samp{-F} option for compatibility with scripts written to call the old
 340 linker.
 341
 342 @kindex -format
 343 @item -format @var{input-format}
 344 Synonym for @samp{-b @var{input-format}}.
 345 @end ifclear
 346
 347 @kindex -g
 348 @item -g
 349 Ignored.  Provided for compatibility with other tools.
 350
 351 @kindex -G
 352 @cindex object size
 353 @item -G@var{value}
 354 @itemx -G @var{value}
 355 Set the maximum size of objects to be optimized using the GP register to
 356 @var{size} under MIPS ECOFF.  Ignored for other object file formats.
 357
 358 @kindex -i
 359 @cindex incremental link
 360 @item -i
 361 Perform an incremental link (same as option @samp{-r}).
 362
 363 @cindex archive files, from cmd line
 364 @kindex -l@var{ar}
 365 @item -l@var{ar}
 366 Add archive file @var{ar} to the list of files to link.  This
 367 option may be used any number of times.  @code{ld} will search its
 368 path-list for occurrences of @code{lib@var{ar}.a} for every @var{ar}
 369 specified.
 370
 371 @cindex search directory, from cmd line
 372 @kindex -L@var{dir}
 373 @item -L@var{searchdir}
 374 Add path @var{searchdir} to the list of paths that @code{ld} will search
 375 for archive libraries and @code{ld} control scripts.  You may use this
 376 option any number of times.
 377
 378 @ifset UsesEnvVars
 379 The default set of paths searched (without being specified with
 380 @samp{-L}) depends on which emulation mode @code{ld} is using, and in
 381 some cases also on how it was configured.  @xref{Environment}.
 382 @end ifset
 383
 384 The paths can also be specified in a link script with the
 385 @code{SEARCH_DIR} command.
 386
 387 @cindex link map
 388 @kindex -M
 389 @item -M
 390 Print (to the standard output) a link map---diagnostic information
 391 about where symbols are mapped by @code{ld}, and information on global
 392 common storage allocation.
 393
 394 @cindex emulation
 395 @kindex -m @var{emulation}
 396 @item -m@var{emulation}
 397 @itemx -m @var{emulation}
 398 Emulate the @var{emulation} linker.  You can list the available
 399 emulations with the @samp{-V} option.  The
 400 default is the system for which you configured @code{ld}.
 401
 402 @kindex -N
 403 @cindex read/write from cmd line
 404 @kindex OMAGIC
 405 @item -N
 406 Set the text and data sections to be readable and writable.  Also, do
 407 not page-align the data segment.  If the output format supports Unix
 408 style magic numbers, mark the output as @code{OMAGIC}.
 409
 410 @item -n
 411 @kindex -n
 412 @cindex read-only text
 413 @kindex NMAGIC
 414 Set the text segment to be read only, and mark the output as
 415 @code{NMAGIC} if possible.
 416
 417 @item -noinhibit-exec
 418 @cindex output file after errors
 419 @kindex -noinhibit-exec
 420 Retain the executable output file whenever it is still usable.
 421 Normally, the linker will not produce an output file if it encounters
 422 errors during the link process; it exits without writing an output file
 423 when it issues any error whatsoever.
 424
 425 @item -o @var{output}
 426 @kindex -o @var{output}
 427 @cindex naming the output file
 428 Use @var{output} as the name for the program produced by @code{ld}; if this
 429 option is not specified, the name @file{a.out} is used by default.  The
 430 script command @code{OUTPUT} can also specify the output file name.
 431
 432 @item -R @var{filename}
 433 @kindex -R @var{file}
 434 @cindex symbol-only input
 435 On some platforms, this option performs global optimizations
 436 that become possible when the linker resolves addressing in the
 437 program, such as relaxing address modes and synthesizing new
 438 instructions in the output object file.
 439
 440 @item -relax
 441 @kindex -relax
 442 @cindex synthesizing linker
 443 @cindex relaxing addressing modes
 444 An option with machine dependent effects.  Currently this option is only
 445 supported on the H8/300.
 446 @ifset H8300
 447 @xref{H8/300,,@code{ld} and the H8/300}.
 448 @end ifset
 449
 450 On some platforms, use option performs global optimizations that
 451 become possible when the linker resolves addressing in the program, such
 452 as relaxing address modes and synthesizing new instructions in the
 453 output object file.
 454
 455 On platforms where this is not supported, @samp{-relax} is accepted, but
 456 ignored.
 457
 458 @item -r
 459 @cindex partial link
 460 @cindex relocatable output
 461 @kindex -r
 462 Generate relocatable output---i.e., generate an output file that can in
 463 turn serve as input to @code{ld}.  This is often called @dfn{partial
 464 linking}.  As a side effect, in environments that support standard Unix
 465 magic numbers, this option also sets the output file's magic number to
 466 @code{OMAGIC}.
 467 @c ; see @code{-N}.
 468 If this option is not specified, an absolute file is produced.  When
 469 linking C++ programs, this option @emph{will not} resolve references to
 470 constructors; to do that, use @samp{-Ur}.
 471
 472 This option does the same as @code{-i}.
 473
 474 @item -S
 475 @kindex -S
 476 @cindex strip debugger symbols
 477 Omit debugger symbol information (but not all symbols) from the output file.
 478
 479 @item -s
 480 @kindex -s
 481 @cindex strip all symbols
 482 Omit all symbol information from the output file.
 483
 484 @item @{ @var{script} @}
 485 @kindex @{ @var{script} @}
 486 @cindex scripts on command line
 487 You can, if you wish, include a script of linker commands directly in
 488 the command line instead of referring to it via an input file.  When the
 489 character @samp{@{} occurs on the command line, the linker switches to
 490 interpreting the command language until the end of the list of commands
 491 is reached; the end is indicated with a closing brace @samp{@}}.
 492 @code{ld} does not recognize other command-line options while parsing
 493 the script.  @xref{Commands}, for a description of the command language.
 494
 495 @item -Tbss @var{bssorg}
 496 @kindex -Tbss @var{bssorg}
 497 @itemx -Tdata @var{dataorg}
 498 @kindex -Tdata @var{dataorg}
 499 @itemx -Ttext @var{textorg}
 500 @kindex -Ttext @var{textorg}
 501 @cindex segment origins, cmd line
 502 Use @var{org} as the starting address for---respectively---the
 503 @code{bss}, @code{data}, or the @code{text} segment of the output file.
 504 @var{org} must be a single hexadecimal integer;
 505 for compatibility with other linkers, you may omit the leading
 506 @samp{0x} usually associated with hexadecimal values.
 507
 508 @item -T @var{commandfile}
 509 @itemx -T@var{commandfile}
 510 @kindex -T @var{script}
 511 @cindex script files
 512 Read link commands from the file
 513 @var{commandfile}.  These commands completely override @code{ld}'s
 514 default link format (rather than adding to it); @var{commandfile} must
 515 specify everything necessary to describe the target format.
 516 @xref{Commands}.
 517 If @var{commandfile} does not exist, @code{ld} looks for it in the directories
 518 specified by any @samp{-L} options.
 519
 520 You may also include a script of link commands directly in the command
 521 line by bracketing it between @samp{@{} and @samp{@}}.
 522
 523 @item -t
 524 @kindex -t
 525 @cindex verbose
 526 @cindex input files, displaying
 527 Print the names of the input files as @code{ld} processes them.
 528
 529 @item -u @var{sym}
 530 @kindex -u @var{sym}
 531 @cindex undefined symbol
 532 Force @var{sym} to be entered in the output file as an undefined symbol.
 533 Doing this may, for example, trigger linking of additional modules from
 534 standard libraries.  @samp{-u} may be repeated with different option
 535 arguments to enter additional undefined symbols.
 536 @c Nice idea, but no such command: This option is equivalent
 537 @c to the @code{EXTERN} linker command.
 538
 539 @item -Ur
 540 @kindex -Ur
 541 @cindex constructors
 542 For anything other than C++ programs, this option is equivalent to
 543 @samp{-r}: it generates relocatable output---i.e., an output file that can in
 544 turn serve as input to @code{ld}.  When linking C++ programs, @samp{-Ur}
 545 @emph{will} resolve references to constructors, unlike @samp{-r}.
 546
 547 @item -V
 548 @kindex -V
 549 @cindex version
 550 Display the version number for @code{ld} and list the supported emulations.
 551 Print which input files can and can not be opened.
 552
 553 @item -v
 554 @kindex -v
 555 @cindex version
 556 Display the version number for @code{ld}.
 557 Print which input files can and can not be opened.
 558
 559 @item -X
 560 @kindex -X
 561 @cindex local symbols, deleting
 562 @cindex L, deleting symbols beginning
 563 If @samp{-s} or @samp{-S} is also specified, delete only local symbols
 564 beginning with @samp{L}.
 565
 566 @item -x
 567 @kindex -x
 568 @cindex deleting local symbols
 569 If @samp{-s} or @samp{-S} is also specified, delete all local symbols,
 570 not just those beginning with @samp{L}.
 571
 572 @item -y@var{symbol}
 573 @kindex -y@var{symbol}
 574 @cindex symbol tracing
 575 Print the name of each linked file in which @var{symbol} appears.  This
 576 option may be given any number of times.  On many systems it is necessary
 577 to prepend an underscore.
 578
 579 This option is useful when you have an undefined symbol in your link but
 580 don't know where the reference is coming from.
 581 @end table
 582
 583 @ifset UsesEnvVars
 584 @node Environment
 585 @section Environment Variables
 586
 587 You can change the behavior of @code{ld} with the environment
 588 variable @code{GNUTARGET}.
 589
 590 @kindex GNUTARGET
 591 @cindex default input format
 592 @code{GNUTARGET} determines the input-file object format if you don't
 593 use @samp{-b} (or its synonym @samp{-format}).  Its value should be one
 594 of the BFD names for an input format (@pxref{BFD}).  If there is no
 595 @code{GNUTARGET} in the environment, @code{ld} uses the natural format
 596 of the host. If @code{GNUTARGET} is set to @code{default} then BFD attempts to discover the
 597 input format by examining binary input files; this method often
 598 succeeds, but there are potential ambiguities, since there is no method
 599 of ensuring that the magic number used to specify object-file formats is
 600 unique.  However, the configuration procedure for BFD on each system
 601 places the conventional format for that system first in the search-list,
 602 so ambiguities are resolved in favor of convention.
 603 @end ifset
 604
 605 @node Commands
 606 @chapter Command Language
 607
 608 @cindex command files
 609 The command language provides explicit control over the link process,
 610 allowing complete specification of the mapping between the linker's
 611 input files and its output.  It controls:
 612 @itemize @bullet
 613 @item
 614 input files
 615 @item
 616 file formats
 617 @item
 618 output file format
 619 @item
 620 addresses of sections
 621 @item
 622 placement of common blocks
 623 @end itemize
 624
 625 You may supply a command file (also known as a link script) to the
 626 linker either explicitly through the @samp{-T} option, or implicitly as
 627 an ordinary file. If the linker opens a file which it cannot recognize
 628 as a supported object or archive format, it tries to interpret the file
 629 as a command file.
 630
 631 You can also include a script directly on the @code{ld} command line,
 632 delimited by the characters @samp{@{} and @samp{@}}.
 633
 634 @menu
 635 * Scripts::                     Linker Scripts
 636 * Expressions::                 Expressions
 637 * MEMORY::                      MEMORY Command
 638 * SECTIONS::                    SECTIONS Command
 639 * Entry Point::                 The Entry Point
 640 * Other Commands::              Other Commands
 641 @end menu
 642
 643 @node Scripts
 644 @section Linker Scripts
 645 The @code{ld} command language is a collection of statements; some are
 646 simple keywords setting a particular option, some are used to select and
 647 group input files or name output files; and two statement
 648 types have a fundamental and pervasive impact on the linking process.
 649
 650 @cindex fundamental script commands
 651 @cindex commands, fundamental
 652 @cindex output file layout
 653 @cindex layout of output file
 654 The most fundamental command of the @code{ld} command language is the
 655 @code{SECTIONS} command (@pxref{SECTIONS}).  Every meaningful command
 656 script must have a @code{SECTIONS} command: it specifies a
 657 ``picture'' of the output file's layout, in varying degrees of detail.
 658 No other command is required in all cases.
 659
 660 The @code{MEMORY} command complements @code{SECTIONS} by describing the
 661 available memory in the target architecture.  This command is optional;
 662 if you don't use a @code{MEMORY} command, @code{ld} assumes sufficient
 663 memory is available in a contiguous block for all output.
 664 @xref{MEMORY}.
 665
 666 @cindex comments
 667 You may include comments in linker scripts just as in C: delimited
 668 by @samp{/*} and @samp{*/}.  As in C, comments are syntactically
 669 equivalent to whitespace.
 670
 671 @node Expressions
 672 @section Expressions
 673 @cindex expression syntax
 674 @cindex arithmetic
 675 Many useful commands involve arithmetic expressions.  The syntax for
 676 expressions in the command language is identical to that of C
 677 expressions, with the following features:
 678 @itemize @bullet
 679 @item
 680 All expressions evaluated as integers and
 681 are of ``long'' or ``unsigned long'' type.
 682 @item
 683 All constants are integers.
 684 @item
 685 All of the C arithmetic operators are provided.
 686 @item
 687 You may reference, define, and create global variables.
 688 @item
 689 You may call special purpose built-in functions.
 690 @end itemize
 691
 692 @menu
 693 * Integers::                    Integers
 694 * Symbols::                     Symbol Names
 695 * Location Counter::            The Location Counter
 696 * Operators::                   Operators
 697 * Evaluation::                  Evaluation
 698 * Assignment::                  Assignment: Defining Symbols
 699 * Built-ins::                   Built-In Functions
 700 @end menu
 701
 702 @node Integers
 703 @subsection Integers
 704 @cindex integer notation
 705 @cindex octal integers
 706 An octal integer is @samp{0} followed by zero or more of the octal
 707 digits (@samp{01234567}).
 708 @example
 709 _as_octal = 0157255;
 710 @end example
 711
 712 @cindex decimal integers
 713 A decimal integer starts with a non-zero digit followed by zero or
 714 more digits (@samp{0123456789}).
 715 @example
 716 _as_decimal = 57005;
 717 @end example
 718
 719 @cindex hexadecimal integers
 720 @kindex 0x
 721 A hexadecimal integer is @samp{0x} or @samp{0X} followed by one or
 722 more hexadecimal digits chosen from @samp{0123456789abcdefABCDEF}.
 723 @example
 724 _as_hex = 0xdead;
 725 @end example
 726
 727 @cindex negative integers
 728 To write a negative integer, use
 729 the prefix operator @samp{-}; @pxref{Operators}.
 730 @example
 731 _as_neg = -57005;
 732 @end example
 733
 734 @cindex scaled integers
 735 @cindex K and M integer suffixes
 736 @cindex M and K integer suffixes
 737 @cindex suffixes for integers
 738 @cindex integer suffixes
 739 Additionally the suffixes @code{K} and @code{M} may be used to scale a
 740 constant by
 741 @c TEXI2ROFF-KILL
 742 @ifinfo
 743 @c END TEXI2ROFF-KILL
 744 @code{1024} or @code{1024*1024}
 745 @c TEXI2ROFF-KILL
 746 @end ifinfo
 747 @tex
 748 ${\rm 1024}$ or ${\rm 1024}^2$
 749 @end tex
 750 @c END TEXI2ROFF-KILL
 751 respectively. For example, the following all refer to the same quantity:
 752
 753 @example
 754         _fourk_1 = 4K;
 755         _fourk_2 = 4096;
 756         _fourk_3 = 0x1000;
 757 @end example
 758
 759 @node Symbols
 760 @subsection Symbol Names
 761 @cindex symbol names
 762 @cindex names
 763 @cindex quoted symbol names
 764 @kindex "
 765 Unless quoted, symbol names start with a letter, underscore, point or
 766 hyphen and may include any letters, underscores, digits, points,
 767 and minus signs.  Unquoted symbol names must not conflict with any
 768 keywords.  You can specify a symbol which contains odd characters or has
 769 the same name as a keyword, by surrounding the symbol name in double quotes:
 770 @example
 771         "SECTION" = 9;
 772         "with a space" = "also with a space" + 10;
 773 @end example
 774
 775 @node Location Counter
 776 @subsection The Location Counter
 777 @kindex .
 778 @cindex dot
 779 @cindex location counter
 780 @cindex current output location
 781 The special linker variable @dfn{dot} @samp{.} always contains the
 782 current output location counter. Since the @code{.} always refers to
 783 a location in an output section, it must always appear in an
 784 expression within a @code{SECTIONS} command. The @code{.} symbol
 785 may appear anywhere that an ordinary symbol is allowed in an
 786 expression, but its assignments have a side effect. Assigning a value
 787 to the @code{.} symbol will cause the location counter to be moved.
 788 @cindex holes
 789 This may be used to create holes in the output section.  The location
 790 counter may never be moved backwards.
 791 @example
 792 SECTIONS
 793 @{
 794         output :
 795         @{
 796         file1(.text)
 797         . = . + 1000;
 798         file2(.text)
 799         . += 1000;
 800         file3(.text)
 801         @} = 0x1234;
 802 @}
 803 @end example
 804 @noindent
 805 In the previous example, @code{file1} is located at the beginning of the
 806 output section, then there is a 1000 byte gap.  Then @code{file2}
 807 appears, also with a 1000 byte gap following before @code{file3} is
 808 loaded.  The notation @samp{= 0x1234} specifies what data to write in
 809 the gaps (@pxref{Section Options}).
 810
 811 @node Operators
 812 @subsection Operators
 813 @cindex Operators for arithmetic
 814 @cindex arithmetic operators
 815 @cindex precedence in expressions
 816 The linker recognizes  the standard C set of arithmetic operators, with
 817 the standard bindings and precedence levels:
 818 @c TEXI2ROFF-KILL
 819 @ifinfo
 820 @c END TEXI2ROFF-KILL
 821 @example
 822 precedence      associativity   Operators                Notes
 823 (highest)
 824 1               left            !  -  ~                  (1)
 825 2               left            *  /  %
 826 3               left            +  -
 827 4               left            >>  <<
 828 5               left            ==  !=  >  <  <=  >=
 829 6               left            &
 830 7               left            |
 831 8               left            &&
 832 9               left            ||
 833 10              right           ? :
 834 11              right           &=  +=  -=  *=  /=       (2)
 835 (lowest)
 836 @end example
 837 Notes:
 838 (1) Prefix operators
 839 (2) @xref{Assignment}
 840 @c TEXI2ROFF-KILL
 841 @end ifinfo
 842 @tex
 843 \vskip \baselineskip
 844 %"lispnarrowing" is the extra indent used generally for @example
 845 \hskip\lispnarrowing\vbox{\offinterlineskip
 846 \hrule
 847 \halign
 848 {\vrule#&\strut\hfil\ #\ \hfil&\vrule#&\strut\hfil\ #\ \hfil&\vrule#&\strut\hfil\ {\tt #}\ \hfil&\vrule#\cr
 849 height2pt&\omit&&\omit&&\omit&\cr
 850 &Precedence&&  Associativity  &&{\rm Operators}&\cr
 851 height2pt&\omit&&\omit&&\omit&\cr
 852 \noalign{\hrule}
 853 height2pt&\omit&&\omit&&\omit&\cr
 854 &highest&&&&&\cr
 855 % '176 is tilde, '~' in tt font
 856 &1&&left&&\qquad-          \char'176\      !\qquad\dag&\cr
 857 &2&&left&&*          /        \%&\cr
 858 &3&&left&&+          -&\cr
 859 &4&&left&&>>         <<&\cr
 860 &5&&left&&==         !=       >      <      <=      >=&\cr
 861 &6&&left&&\&&\cr
 862 &7&&left&&|&\cr
 863 &8&&left&&{\&\&}&\cr
 864 &9&&left&&||&\cr
 865 &10&&right&&?        :&\cr
 866 &11&&right&&\qquad\&=      +=       -=     *=     /=\qquad\ddag&\cr
 867 &lowest&&&&&\cr
 868 height2pt&\omit&&\omit&&\omit&\cr}
 869 \hrule}
 870 @end tex
 871 @iftex
 872 {
 873 @obeylines@parskip=0pt@parindent=0pt
 874 @dag@quad Prefix operators.
 875 @ddag@quad @xref{Assignment}.
 876 }
 877 @end iftex
 878 @c END TEXI2ROFF-KILL
 879
 880 @node Evaluation
 881 @subsection Evaluation
 882
 883 @cindex lazy evaluation
 884 @cindex expression evaluation order
 885 The linker uses ``lazy evaluation'' for expressions; it only calculates
 886 an expression when absolutely necessary.  The linker needs the value of
 887 the start address, and the lengths of memory regions, in order to do any
 888 linking at all; these values are computed as soon as possible when the
 889 linker reads in the command file.  However, other values (such as symbol
 890 values) are not known or needed until after storage allocation.  Such
 891 values are evaluated later, when other information (such as the sizes of
 892 output sections) is available for use in the symbol assignment
 893 expression.
 894
 895 @node Assignment
 896 @subsection Assignment: Defining Symbols
 897 @cindex assignment in scripts
 898 @cindex symbol definition, scripts
 899 @cindex variables, defining
 900 You may create global symbols, and assign values (addresses) to global
 901 symbols, using any of the C assignment operators:
 902
 903 @table @code
 904 @item @var{symbol}  = @var{expression} ;
 905 @itemx @var{symbol} &= @var{expression} ;
 906 @itemx @var{symbol} += @var{expression} ;
 907 @itemx @var{symbol} -= @var{expression} ;
 908 @itemx @var{symbol} *= @var{expression} ;
 909 @itemx @var{symbol} /= @var{expression} ;
 910 @end table
 911
 912 Two things distinguish assignment from other operators in @code{ld}
 913 expressions.
 914 @itemize @bullet
 915 @item
 916 Assignment may only be used at the root of an expression;
 917 @samp{a=b+3;} is allowed, but @samp{a+b=3;} is an error.
 918
 919 @kindex ;
 920 @cindex semicolon
 921 @item
 922 You must place a trailing semicolon (``@key{;}'') at the end of an
 923 assignment statement.
 924 @end itemize
 925
 926 Assignment statements may appear:
 927 @itemize @bullet
 928 @item
 929 as commands in their own right in an @code{ld} script; or
 930 @item
 931 as independent statements within a @code{SECTIONS} command; or
 932 @item
 933 as part of the contents of a section definition in a
 934 @code{SECTIONS} command.
 935 @end itemize
 936
 937 The first two cases are equivalent in effect---both define a symbol with
 938 an absolute address.  The last case defines a symbol whose address is
 939 relative to a particular section (@pxref{SECTIONS}).
 940
 941 @cindex absolute and relocatable symbols
 942 @cindex relocatable and absolute symbols
 943 @cindex symbols, relocatable and absolute
 944 When a linker expression is evaluated and assigned to a variable, it is
 945 given either an absolute or a relocatable type. An absolute expression
 946 type is one in which the symbol contains the value that it will have in
 947 the output file, a relocatable expression type is one in which the
 948 value is expressed as a fixed offset from the base of a section.
 949
 950 The type of the expression is controlled by its position in the script
 951 file.  A symbol assigned within a section definition is created relative
 952 to the base of the section; a symbol assigned in any other place is
 953 created as an absolute symbol. Since a symbol created within a
 954 section definition is relative to the base of the section, it
 955 will remain relocatable if relocatable output is requested.  A symbol
 956 may be created with an absolute value even when assigned to within a
 957 section definition by using the absolute assignment function
 958 @code{ABSOLUTE}.  For example, to create an absolute symbol whose address
 959 is the last byte of an output section named @code{.data}:
 960 @example
 961 SECTIONS@{ @dots{}
 962 .data :
 963         @{
 964                 *(.data)
 965                 _edata = ABSOLUTE(.) ;
 966         @}
 967 @dots{} @}
 968 @end example
 969
 970 The linker tries to put off the evaluation of an assignment until all
 971 the terms in the source expression are known (@pxref{Evaluation}).  For
 972 instance, the sizes of sections cannot be known until after allocation,
 973 so assignments dependent upon these are not performed until after
 974 allocation. Some expressions, such as those depending upon the location
 975 counter @dfn{dot}, @samp{.} must be evaluated during allocation. If the
 976 result of an expression is required, but the value is not available,
 977 then an error results.  For example, a script like the following
 978 @example
 979 SECTIONS @{ @dots{}
 980       text 9+this_isnt_constant :
 981                 @{ @dots{}
 982                 @}
 983 @dots{} @}
 984 @end example
 985 @kindex Non constant expression
 986 @noindent
 987 will cause the error message ``@code{Non constant expression for initial
 988 address}''.
 989
 990 @node Built-ins
 991 @subsection Built-In Functions
 992 @cindex functions in expression language
 993 The command language includes a number of built-in
 994 functions for use in link script expressions.
 995 @table @code
 996 @item ABSOLUTE(@var{exp})
 997 @kindex ABSOLUTE(@var{exp})
 998 @cindex expression, absolute
 999 Return the absolute (non-relocatable, as opposed to non-negative) value
1000 of the expression @var{exp}.  Primarily useful to assign an absolute
1001 value to a symbol within a section definition, where symbol values are
1002 normally section-relative.
1003
1004 @item ADDR(@var{section})
1005 @kindex ADDR(@var{section})
1006 @cindex section address
1007 Return the absolute address of the named @var{section}.  Your script must
1008 previously have defined the location of that section. In the following
1009 example, @code{symbol_1} and @code{symbol_2} are assigned identical
1010 values:
1011 @example
1012 SECTIONS@{ @dots{}
1013         .output1 :
1014                 @{
1015                 start_of_output_1 = ABSOLUTE(.);
1016                 @dots{}
1017                 @}
1018         .output :
1019                 @{
1020                 symbol_1 = ADDR(.output1);
1021                 symbol_2 = start_of_output_1;
1022                 @}
1023 @dots{} @}
1024 @end example
1025
1026 @item ALIGN(@var{exp})
1027 @kindex ALIGN(@var{exp})
1028 @cindex rounding up location counter
1029 Return the result of the current location counter (@code{.}) aligned to
1030 the next @var{exp} boundary.  @var{exp} must be an expression whose
1031 value is a power of two.  This is equivalent to
1032 @example
1033 (. + @var{exp} - 1) & ~(@var{exp} - 1)
1034 @end example
1035
1036 @code{ALIGN} doesn't change the value of the location counter---it just
1037 does arithmetic on it.  As an example, to align the output @code{.data}
1038 section to the next @code{0x2000} byte boundary after the preceding
1039 section and to set a variable within the section to the next
1040 @code{0x8000} boundary after the input sections:
1041 @example
1042 SECTIONS@{ @dots{}
1043         .data ALIGN(0x2000): @{
1044                 *(.data)
1045                 variable = ALIGN(0x8000);
1046         @}
1047 @dots{} @}
1048 @end example
1049 @noindent
1050 The first use of @code{ALIGN} in this example specifies the location of
1051 a section because it is used as the optional @var{start} attribute of a
1052 section definition (@pxref{Section Options}).  The second use simply
1053 defines the value of a variable.
1054
1055 The built-in @code{NEXT} is closely related to @code{ALIGN}.
1056
1057 @item DEFINED(@var{symbol})
1058 @kindex DEFINED(@var{symbol})
1059 @cindex symbol defaults
1060 Return 1 if @var{symbol} is in the linker global symbol table and is
1061 defined, otherwise return 0.  You can use this function to provide default
1062 values for symbols.  For example, the following command-file fragment shows how
1063 to set a global symbol @code{begin} to the first location in the
1064 @code{.text} section---but if a symbol called @code{begin} already
1065 existed, its value is preserved:
1066 @smallexample
1067 SECTIONS@{ @dots{}
1068         .text : @{
1069                 begin = DEFINED(begin) ? begin : . ;
1070                 @dots{}
1071         @}
1072 @dots{} @}
1073 @end smallexample
1074
1075 @item NEXT(@var{exp})
1076 @kindex NEXT(@var{exp})
1077 @cindex unallocated address, next
1078 Return the next unallocated address that is a multiple of @var{exp}.
1079 This function is closely related to @code{ALIGN(@var{exp})}; unless you
1080 use the @code{MEMORY} command to define discontinuous memory for the
1081 output file, the two functions are equivalent.
1082
1083 @item SIZEOF(@var{section})
1084 @kindex SIZEOF(@var{section})
1085 @cindex section size
1086 Return the size in bytes of the named @var{section}, if that section has
1087 been allocated.  In the following example, @code{symbol_1} and
1088 @code{symbol_2} are assigned identical values:
1089 @c What does it return if the section hasn't been allocated?  0?
1090 @example
1091 SECTIONS@{ @dots{}
1092         .output @{
1093                 .start = . ;
1094                 @dots{}
1095                 .end = . ;
1096                 @}
1097         symbol_1 = .end - .start ;
1098         symbol_2 = SIZEOF(.output);
1099 @dots{} @}
1100
1101 @end example
1102
1103 @item SIZEOF_HEADERS
1104 @kindex SIZEOF_HEADERS
1105 @cindex header size
1106 @itemx sizeof_headers
1107 @kindex sizeof_headers
1108 Return the size in bytes of the output file's headers.  You can use this number
1109 as the start address of the first section, if you choose, to facilitate
1110 paging.
1111
1112 @end table
1113
1114 @node MEMORY
1115 @section MEMORY Command
1116 @kindex MEMORY
1117 @cindex regions of memory
1118 @cindex discontinuous memory
1119 @cindex allocating memory
1120 The linker's default configuration permits allocation of all available memory.
1121 You can override this configuration by using the @code{MEMORY} command.  The
1122 @code{MEMORY} command describes the location and size of blocks of
1123 memory in the target.  By using it carefully, you can describe which
1124 memory regions may be used by the linker, and which memory regions it
1125 must avoid.  The linker does not shuffle sections to fit into the
1126 available regions, but does move the requested sections into the correct
1127 regions and issue errors when the regions become too full.
1128
1129 The command files may contain at most one use of the @code{MEMORY}
1130 command; however, you can define as many blocks of memory within it as
1131 you wish.  The syntax is:
1132
1133 @example
1134 MEMORY
1135       @{
1136        @var{name} (@var{attr}) : ORIGIN = @var{origin}, LENGTH = @var{len}
1137        @dots{}
1138       @}
1139 @end example
1140 @table @code
1141 @item @var{name}
1142 @cindex naming memory regions
1143 is a name used internally by the linker to refer to the region. Any
1144 symbol name may be used.  The region names are stored in a separate
1145 name space, and will not conflict with symbols, file names or section
1146 names.  Use distinct names to specify multiple regions.
1147 @item (@var{attr})
1148 @cindex memory region attributes
1149 is an optional list of attributes, permitted for compatibility with the
1150 AT&T linker but not used by @code{ld} beyond checking that the
1151 attribute list is valid.  Valid attribute lists must be made up of the
1152 characters ``@code{LIRWX}''.  If you omit the attribute list, you may
1153 omit the parentheses around it as well.
1154 @item @var{origin}
1155 @kindex ORIGIN =
1156 @kindex o =
1157 @kindex org =
1158 is the start address of the region in physical memory.  It is
1159 an expression that must evaluate to a constant before
1160 memory allocation is performed. The keyword @code{ORIGIN} may be
1161 abbreviated to @code{org} or @code{o}.
1162 @item @var{len}
1163 @kindex LENGTH =
1164 @kindex len =
1165 @kindex l =
1166 is the size in bytes of the region (an expression).
1167 The keyword @code{LENGTH} may be abbreviated to @code{len} or @code{l}.
1168 @end table
1169
1170 For example, to specify that memory has two regions available for
1171 allocation---one starting at 0 for 256 kilobytes, and the other
1172 starting at @code{0x40000000} for four megabytes:
1173
1174 @example
1175 MEMORY
1176         @{
1177         rom : ORIGIN = 0, LENGTH = 256K
1178         ram : org = 0x40000000, l = 4M
1179         @}
1180 @end example
1181
1182 Once you have defined a region of memory named @var{mem}, you can direct
1183 specific output sections there by using a command ending in
1184 @samp{>@var{mem}} within the @code{SECTIONS} command (@pxref{Section
1185 Options}).  If the combined output sections directed to a region are too
1186 big for the region, the linker will issue an error message.
1187
1188 @node SECTIONS
1189 @section SECTIONS Command
1190 @kindex SECTIONS
1191 The @code{SECTIONS} command controls exactly where input sections are
1192 placed into output sections, their order and to which output sections
1193 they are allocated.
1194
1195 You may use at most one @code{SECTIONS} command in a commands file,
1196 but you can have as many statements within it as you wish.  Statements
1197 within the @code{SECTIONS} command can do one of three things:
1198 @itemize @bullet
1199 @item
1200 define the entry point;
1201 @item
1202 assign a value to a symbol;
1203 @item
1204 describe the placement of a named output section, and what input
1205 sections make it up.
1206 @end itemize
1207
1208 The first two possibilities---defining the entry point, and defining
1209 symbols---can also be done outside the @code{SECTIONS} command:
1210 @pxref{Entry Point}, @pxref{Assignment}.  They are permitted here as
1211 well for your convenience in reading the script, so that symbols or the
1212 entry point can be defined at meaningful points in your output-file
1213 layout.
1214
1215 When no @code{SECTIONS} command is specified, the default action
1216 of the linker is to place each input section into an identically named
1217 output section in the order that the sections are first encountered in
1218 the input files; if all input sections are present in the first file,
1219 for example, the order of sections in the output file will match the
1220 order in the first input file.
1221
1222 @menu
1223 * Section Definition::          Section Definitions
1224 * Section Contents::            Section Contents
1225 * Section Options::             Optional Section Attributes
1226 @end menu
1227
1228 @node Section Definition
1229 @subsection Section Definitions
1230 @cindex section definition
1231 The most frequently used statement in the @code{SECTIONS} command is
1232 the @dfn{section definition}, which you can use to specify the
1233 properties of an output section: its location, alignment, contents,
1234 fill pattern, and target memory region.  Most of
1235 these specifications are optional; the simplest form of a section
1236 definition is
1237 @example
1238 SECTIONS @{ @dots{}
1239 @var{secname} : @{
1240                  @var{contents}
1241                 @}
1242 @dots{} @}
1243 @end example
1244 @cindex naming output sections
1245 @noindent
1246 @var{secname} is the name of the output section, and @var{contents} a
1247 specification of what goes there---for example, a list of input files or
1248 sections of input files.  As you might assume, the whitespace shown is
1249 optional.  You do need the colon @samp{:} and the braces @samp{@{@}},
1250 however.
1251
1252 @var{secname} must meet the constraints of your output format.  In
1253 formats which only support a limited number of sections, such as
1254 @code{a.out}, the name must be one of the names supported by the format
1255 (@code{a.out}, for example, allows only @code{.text}, @code{.data} or
1256 @code{.bss}). If the output format supports any number of sections, but
1257 with numbers and not names (as is the case for Oasys), the name should be
1258 supplied as a quoted numeric string.  A section name may consist of any
1259 sequence characters, but any name which does not conform to the standard
1260 @code{ld} symbol name syntax must be quoted.
1261 @xref{Symbols, , Symbol Names}.
1262
1263 @node Section Contents
1264 @subsection Section Contents
1265 @cindex contents of a section
1266 In a section definition, you can specify the contents of an output section by
1267 listing particular object files, by listing particular input-file
1268 sections, or by a combination of the two.  You can also place arbitrary
1269 data in the section, and define symbols relative to the beginning of the
1270 section.
1271
1272 The @var{contents} of a section definition may include any of the
1273 following kinds of statement.  You can include as many of these as you
1274 like in a single section definition, separated from one another by
1275 whitespace.
1276
1277 @table @code
1278 @item @var{filename}
1279 @kindex @var{filename}
1280 @cindex input files, section defn
1281 @cindex files, including in output sections
1282 You may simply name a particular input file to be placed in the current
1283 output section; @emph{all} sections from that file are placed in the
1284 current section definition. To specify a list of particular files by
1285 name:
1286 @example
1287 .data : @{ afile.o bfile.o cfile.o @}
1288 @end example
1289 @noindent
1290 The example also illustrates that multiple statements can be included in
1291 the contents of a section definition, since each file name is a separate
1292 statement.
1293
1294 If the file name has already been mentioned in another section
1295 definition, with an explicit section name list, then only those sections
1296 which have not yet been allocated are used.
1297
1298 @item @var{filename}( @var{section} )
1299 @itemx @var{filename}( @var{section}, @var{section}, @dots{} )
1300 @itemx @var{filename}( @var{section} @var{section} @dots{} )
1301 @kindex @var{filename}(@var{section})
1302 @cindex files and sections, section defn
1303 You can name one or more sections from your input files, for
1304 insertion in the current output section.  If you wish to specify a list
1305 of input-file sections inside the parentheses, you may separate the
1306 section names by either commas or whitespace.
1307
1308 @item * (@var{section})
1309 @itemx * (@var{section}, @var{section}, @dots{})
1310 @itemx * (@var{section} @var{section} @dots{}
1311 @cindex input sections to output section
1312 @kindex *(@var{section})
1313 Instead of explicitly naming particular input files in a link control
1314 script, you can refer to @emph{all} files from the @code{ld} command
1315 line: use @samp{*} instead of a particular file name before the
1316 parenthesized input-file section list.
1317
1318 For example, to copy sections @code{1} through @code{4} from an Oasys file
1319 into the @code{.text} section of an @code{a.out} file, and sections @code{13}
1320 and @code{14} into the @code{.data} section:
1321 @example
1322 SECTIONS @{
1323         .text :@{
1324                 *("1" "2" "3" "4")
1325         @}
1326
1327         .data :@{
1328                 *("13" "14")
1329         @}
1330 @}
1331 @end example
1332
1333 If you have already explicitly included some files by name, @samp{*}
1334 refers to all @emph{remaining} files---those whose places in the output
1335 file have not yet been defined.
1336
1337 @item [ @var{section} ]
1338 @itemx [ @var{section}, @var{section}, @dots{} ]
1339 @itemx [ @var{section} @var{section} @dots{} ]
1340 @kindex [ @var{sections} ]
1341 This is an alternate notation to specify named sections from all
1342 unallocated input files; its effect is exactly the same as that of
1343 @samp{* (@var{section}@dots{})}
1344
1345 @item @var{filename}@code{( COMMON )}
1346 @itemx ( COMMON )
1347 @kindex ( COMMON )
1348 @cindex uninitialized data
1349 @cindex commons in output
1350 Specify where in your output file to place uninitialized data
1351 with this notation.  @code{*(COMMON)} by itself refers to all
1352 uninitialized data from all input files (so far as it is not yet
1353 allocated); @var{filename}@code{(COMMON)} refers to uninitialized data
1354 from a particular file.  Both are special cases of the general
1355 mechanisms for specifying where to place input-file sections:
1356 @code{ld} permits you to refer to uninitialized data as if it
1357 were in an input-file section named @code{COMMON}, regardless of the
1358 input file's format.
1359 @end table
1360
1361 For example, the following command script arranges the output file into
1362 three consecutive sections, named @code{.text}, @code{.data}, and
1363 @code{.bss}, taking the input for each from the correspondingly named
1364 sections of all the input files:
1365 @example
1366 SECTIONS @{
1367   .text : @{ *(.text) @}
1368   .data : @{ *(.data) @}
1369   .bss :  @{ *(.bss)  *(COMMON) @}
1370 @}
1371 @end example
1372
1373 The following example reads all of the sections from file @code{all.o}
1374 and places them at the start of output section @code{outputa} which
1375 starts at location @code{0x10000}. All of section @code{.input1} from
1376 file @code{foo.o} follows immediately, in the same output section.  All
1377 of section @code{.input2} from @code{foo.o} goes into output section
1378 @code{outputb}, followed by section @code{.input1} from @code{foo1.o}.
1379 All of the remaining @code{.input1} and @code{.input2} sections from any
1380 files are written to output section @code{outputc}.
1381
1382 @example
1383 SECTIONS @{
1384         outputa 0x10000 :
1385                 @{
1386                 all.o
1387                 foo.o (.input1)
1388                 @}
1389         outputb :
1390                 @{
1391                 foo.o (.input2)
1392                 foo1.o (.input1)
1393                 @}
1394         outputc :
1395                 @{
1396                 *(.input1)
1397                 *(.input2)
1398                 @}
1399 @}
1400 @end example
1401
1402 There are still more kinds of statements permitted in the contents of
1403 output section definitions.  The foregoing statements permitted you to
1404 arrange, in your output file, data originating from your input files.
1405 You can also place data directly in an output section from the link
1406 command script.  Most of these additional statements involve
1407 expressions; @pxref{Expressions}.  Although these statements are shown
1408 separately here for ease of presentation, no such segregation is needed
1409 within a section definition in the @code{SECTIONS} command; you can
1410 intermix them freely with any of the statements we've just described.
1411
1412 @table @code
1413 @item CREATE_OBJECT_SYMBOLS
1414 @kindex CREATE_OBJECT_SYMBOLS
1415 @cindex input filename symbols
1416 @cindex filename symbols
1417 Create a symbol for each input file
1418 in the current section, set to the address of the first byte of
1419 data written from the input file.  For instance, with @code{a.out}
1420 files it is conventional to have a symbol for each input file.  You can
1421 accomplish this by defining the output @code{.text} section as follows:
1422 @example
1423 SECTIONS @{
1424         .text 0x2020 :
1425                  @{
1426                 CREATE_OBJECT_SYMBOLS
1427                 *(.text)
1428                 _etext = ALIGN(0x2000);
1429                 @}
1430         @dots{}
1431 @}
1432 @end example
1433
1434 If @code{objsym} is a file containing this script, and @code{a.o},
1435 @code{b.o}, @code{c.o}, and @code{d.o} are four input files with
1436 contents like the following---
1437 @example
1438 /* a.c */
1439
1440 afunction() @{ @}
1441 int adata=1;
1442 int abss;
1443 @end example
1444
1445 @noindent
1446 @samp{ld -M sample a.o b.o c.o d.o} would create a map like this,
1447 containing symbols matching the object file names:
1448 @example
1449 00000000 A __DYNAMIC
1450 00004020 B _abss
1451 00004000 D _adata
1452 00002020 T _afunction
1453 00004024 B _bbss
1454 00004008 D _bdata
1455 00002038 T _bfunction
1456 00004028 B _cbss
1457 00004010 D _cdata
1458 00002050 T _cfunction
1459 0000402c B _dbss
1460 00004018 D _ddata
1461 00002068 T _dfunction
1462 00004020 D _edata
1463 00004030 B _end
1464 00004000 T _etext
1465 00002020 t a.o
1466 00002038 t b.o
1467 00002050 t c.o
1468 00002068 t d.o
1469 @end example
1470
1471 @item @var{symbol} = @var{expression} ;
1472 @kindex @var{symbol} = @var{expression} ;
1473 @itemx @var{symbol} @var{f}= @var{expression} ;
1474 @kindex @var{symbol} @var{f}= @var{expression} ;
1475 @var{symbol} is any symbol name (@pxref{Symbols}).  ``@var{f}=''
1476 refers to any of the operators @code{&= += -= *= /=} which combine
1477 arithmetic and assignment.
1478
1479 @cindex assignment, in section defn
1480 When you assign a value to a symbol within a particular section
1481 definition, the value is relative to the beginning of the section
1482 (@pxref{Assignment}).  If you write
1483 @example
1484 SECTIONS @{
1485   abs = 14 ;
1486   @dots{}
1487   .data : @{ @dots{} rel = 14 ; @dots{} @}
1488   abs2 = 14 + ADDR(.data);
1489   @dots{}
1490 @}
1491 @end example
1492 @c FIXME:  Try above example!
1493 @noindent
1494 @code{abs} and @code{rel} do not have the same value; @code{rel} has the
1495 same value as @code{abs2}.
1496
1497 @item BYTE(@var{expression})
1498 @kindex BYTE(@var{expression})
1499 @itemx SHORT(@var{expression})
1500 @kindex SHORT(@var{expression})
1501 @itemx LONG(@var{expression})
1502 @kindex LONG(@var{expression})
1503 @cindex direct output
1504 By including one of these three statements in a section definition, you
1505 can explicitly place one, two, or four bytes (respectively) at the
1506 current address of that section.
1507
1508 @ifclear SingleFormat
1509 Multiple-byte quantities are represented in whatever byte order is
1510 appropriate for the output file format (@pxref{BFD}).
1511 @end ifclear
1512
1513 @item FILL(@var{expression})
1514 @kindex FILL(@var{expression})
1515 @cindex holes, filling
1516 @cindex unspecified memory
1517 Specifies the ``fill pattern'' for the current section.  Any otherwise
1518 unspecified regions of memory within the section (for example, regions
1519 you skip over by assigning a new value to the location counter @samp{.})
1520 are filled with the two least significant bytes from the
1521 @var{expression} argument.  A @code{FILL} statement covers memory
1522 locations @emph{after} the point it occurs in the section definition; by
1523 including more than one @code{FILL} statement, you can have different
1524 fill patterns in different parts of an output section.
1525 @end table
1526
1527 @node Section Options
1528 @subsection Optional Section Attributes
1529 @cindex section defn, full syntax
1530 Here is the full syntax of a section definition, including all the
1531 optional portions:
1532
1533 @smallexample
1534 SECTIONS @{
1535 @dots{}
1536 @var{secname} @var{start} BLOCK(@var{align}) (NOLOAD) : @{ @var{contents} @} =@var{fill} >@var{region}
1537 @dots{}
1538 @}
1539 @end smallexample
1540
1541 @var{secname} and @var{contents} are required.  @xref{Section
1542 Definition}, and @pxref{Section Contents} for details on @var{contents}.
1543 The remaining elements---@var{start}, @code{BLOCK(@var{align)}},
1544 @code{(NOLOAD)} @code{=@var{fill}}, and @code{>@var{region}}---are all
1545 optional.
1546
1547 @table @code
1548 @item @var{start}
1549 @cindex start address, section
1550 @cindex section start
1551 @cindex section address
1552 You can force the output section to be loaded at a specified address by
1553 specifying @var{start} immediately following the section name.
1554 @var{start} can be represented as any expression. The following
1555 example generates section @var{output} at location
1556 @code{0x40000000}:
1557 @example
1558 SECTIONS @{
1559         @dots{}
1560         output 0x40000000: @{
1561                @dots{}
1562           @}
1563         @dots{}
1564 @}
1565 @end example
1566
1567 @item BLOCK(@var{align})
1568 @kindex BLOCK(@var{align})
1569 @cindex section alignment
1570 @cindex aligning sections
1571 You can include @code{BLOCK()} specification to advance
1572 the location counter @code{.} prior to the beginning of the section, so
1573 that the section will begin at the specified alignment.  @var{align} is
1574 an expression.
1575
1576 @item (NOLOAD)
1577 @kindex NOLOAD
1578 @cindex prevent unnecessary loading
1579 Use @samp{(NOLOAD)} to prevent a section from being loaded into memory
1580 each time it is accessed.  For example, in the script sample below, the
1581 @code{ROM} segment is addressed at memory location @samp{0} and does not
1582 need to be loaded into each object file:
1583 @example
1584 SECTIONS @{
1585         ROM  0  (NOLOAD)  : @{ @dots{} @}
1586         @dots{}
1587 @}
1588 @end example
1589
1590 @item =@var{fill}
1591 @kindex =@var{fill}
1592 @cindex section fill pattern
1593 @cindex fill pattern, entire section
1594 Including
1595 @code{=@var{fill}} in a section definition specifies the initial fill
1596 value for that section.
1597 You may use any expression to specify @var{fill}.
1598 Any unallocated holes in the current output
1599 section when written to the output file will be filled with the two
1600 least significant bytes of the value, repeated as necessary.  You can
1601 also change the fill value with a @code{FILL} statement in the
1602 @var{contents} of a section definition.
1603
1604 @item >@var{region}
1605 @kindex >@var{region}
1606 @cindex section, assigning to memory region
1607 @cindex memory regions and sections
1608 Assign this section to a previously defined region of memory.
1609 @xref{MEMORY}.
1610
1611 @end table
1612
1613 @node Entry Point
1614 @section The Entry Point
1615 @kindex ENTRY(@var{symbol})
1616 @cindex start of execution
1617 @cindex first instruction
1618 The linker command language includes a command specifically for
1619 defining the first executable instruction in an output file (its
1620 @dfn{entry point}).  Its argument is a symbol name:
1621 @example
1622 ENTRY(@var{symbol})
1623 @end example
1624
1625 Like symbol assignments, the @code{ENTRY} command may be placed either
1626 as an independent command in the command file, or among the section
1627 definitions within the @code{SECTIONS} command---whatever makes the most
1628 sense for your layout.
1629
1630 @cindex entry point, defaults
1631 @code{ENTRY} is only one of several ways of choosing the entry point.
1632 You may indicate it in any of the following ways (shown in descending
1633 order of priority: methods higher in the list override methods lower down).
1634 @itemize @bullet
1635 @item
1636 the @samp{-e} @var{entry} command-line option;
1637 @item
1638 the @code{ENTRY(@var{symbol}} command in a linker control script;
1639 @item
1640 the value of the symbol @code{start}, if present;
1641 @item
1642 the value of the symbol @code{_main}, if present;
1643 @item
1644 the address of the first byte of the @code{.text} section, if present;
1645 @item
1646 The address @code{0}.
1647 @end itemize
1648
1649 For example, you can use these rules to generate an entry point with an
1650 assignment statement: if no symbol @code{start} is defined within your
1651 input files, you can simply define it, assigning it an appropriate
1652 value---
1653 @example
1654 start = 0x2020;
1655 @end example
1656
1657 @noindent
1658 The example shows an absolute address, but you can use any expression.
1659 For example, if your input object files use some other symbol-name
1660 convention for the entry point, you can just assign the value of
1661 whatever symbol contains the start address to @code{start}:
1662 @example
1663 start = other_symbol ;
1664 @end example
1665
1666 @node Other Commands
1667 @section Other Commands
1668 The command language includes a number of other commands that you can
1669 use for specialized purposes.  They are similar in purpose to
1670 command-line options.
1671
1672 @table @code
1673 @item FLOAT
1674 @kindex FLOAT
1675 @itemx NOFLOAT
1676 @kindex NOFLOAT
1677 These keywords were used in some older linkers to request a particular
1678 math subroutine library.  @code{ld} doesn't use the keywords, assuming
1679 instead that any necessary subroutines are in libraries specified using
1680 the general mechanisms for linking to archives; but to permit the use of
1681 scripts that were written for the older linkers, the keywords
1682 @code{FLOAT} and @code{NOFLOAT} are accepted and ignored.
1683
1684 @item FORCE_COMMON_ALLOCATION
1685 @kindex FORCE_COMMON_ALLOCATION
1686 @cindex common allocation
1687 This command has the same effect as the @samp{-d} command-line option:
1688 to make @code{ld} assign space to common symbols even if a relocatable
1689 output file is specified (@samp{-r}).
1690
1691 @item INPUT ( @var{file}, @var{file}, @dots{} )
1692 @kindex INPUT ( @var{files} )
1693 @itemx INPUT ( @var{file} @var{file} @dots{} )
1694 @cindex binary input files
1695 Use this command to include binary input files in the link, without
1696 including them in a particular section definition.  Files specified this
1697 way are treated identically to object files listed on the command line.
1698
1699 @ignore
1700 @item MAP ( @var{name} )
1701 @kindex MAP ( @var{name} )
1702 @c MAP(...) appears to look for an F in the arg, ignoring all other
1703 @c chars; if it finds one, it sets "map_option_f" to true.  But nothing
1704 @c checks map_option_f.  Apparently a stub for the future...
1705 @end ignore
1706
1707 @item OUTPUT ( @var{filename} )
1708 @kindex OUTPUT ( @var{filename} )
1709 @cindex naming the output file
1710 Use this command to name the link output file @var{filename}.  The
1711 effect of @code{OUTPUT(@var{filename})} is identical to the effect of
1712 @w{@samp{-o @var{filename}}}, and whichever is encountered last will
1713 control the name actually used to name the output file.  In particular,
1714 you can use this command to supply a default output-file name other than
1715 @code{a.out}.
1716
1717 @ifclear SingleFormat
1718 @item OUTPUT_ARCH ( @var{bfdname} )
1719 @kindex OUTPUT_ARCH ( @var{bfdname} )
1720 @cindex machine architecture, output
1721 Specify a particular output machine architecture, with one of the names
1722 used by the BFD back-end routines (@pxref{BFD}).  This command is often
1723 unnecessary; the architecture is most often set implicitly by either the
1724 system BFD configuration or as a side effect of the @code{OUTPUT_FORMAT}
1725 command.
1726
1727 @item OUTPUT_FORMAT ( @var{bfdname} )
1728 @kindex OUTPUT_FORMAT ( @var{bfdname} )
1729 @cindex format, output file
1730 Specify a particular output format, with one of the names used by the
1731 BFD back-end routines (@pxref{BFD}).  This selection will only affect
1732 the output file; the related command @code{TARGET} affects primarily
1733 input files.
1734 @end ifclear
1735
1736 @item SEARCH_DIR ( @var{path} )
1737 @kindex SEARCH_DIR ( @var{path} )
1738 @cindex path for libraries
1739 @cindex search path, libraries
1740 Add @var{path} to the list of paths where @code{ld} looks for
1741 archive libraries.  @code{SEARCH_DIR(@var{path})} has the same
1742 effect as @samp{-L@var{path}} on the command line.
1743
1744 @item STARTUP ( @var{filename} )
1745 @kindex STARTUP ( @var{filename} )
1746 @cindex first input file
1747 Ensure that @var{filename} is the first input file used in the link
1748 process.
1749
1750 @ifclear SingleFormat
1751 @item TARGET ( @var{format} )
1752 @cindex input file format
1753 @kindex TARGET ( @var{format} )
1754 Change the input-file object code format (like the command-line option
1755 @samp{-b} or its synonym @samp{-format}).  The argument @var{format} is
1756 one of the strings used by BFD to name binary formats.  In the current
1757 @code{ld} implementation, if @code{TARGET} is specified but
1758 @code{OUTPUT_FORMAT} is not, the last @code{TARGET} argument is also
1759 used as the default format for the @code{ld} output file.
1760 @xref{BFD}.
1761
1762 @kindex GNUTARGET
1763 If you don't use the @code{TARGET} command, @code{ld} uses the value of
1764 the environment variable @code{GNUTARGET}, if available, to select the
1765 output file format.  If that variable is also absent, @code{ld} uses
1766 the default format configured for your machine in the BFD libraries.
1767 @end ifclear
1768 @end table
1769
1770 @ifset GENERIC
1771 @node Machine Dependent
1772 @chapter Machine Dependent Features
1773
1774 @cindex machine dependencies
1775 @code{ld} has additional features on some platforms; the following
1776 sections describe them.  Machines where @code{ld} has no additional
1777 functionality are not listed.
1778
1779 @menu
1780 * H8/300::                      @code{ld} and the H8/300
1781 * i960::                        @code{ld} and the Intel 960 family
1782 @end menu
1783 @end ifset
1784
1785 @c FIXME!  This could use @up/@down, but there seems to be a conflict
1786 @c         between those and node-defaulting.
1787 @ifset H8300
1788 @ifclear GENERIC
1789 @up
1790 @end ifclear
1791 @node H8/300
1792 @section @code{ld} and the H8/300
1793
1794 @cindex H8/300 support
1795 For the H8/300, @code{ld} can perform these global optimizations when
1796 you specify the @samp{-relax} command-line option.
1797
1798 @table @emph
1799 @item relaxing address modes
1800 @cindex relaxing on H8/300
1801 @code{ld} finds all @code{jsr} and @code{jmp} instructions whose
1802 targets are within eight bits, and turns them into eight-bit
1803 program-counter relative @code{bsr} and @code{bra} instructions,
1804 respectively.
1805
1806 @item synthesizing instructions
1807 @cindex synthesizing on H8/300
1808 @c FIXME: specifically mov.b, or any mov instructions really?
1809 @code{ld} finds all @code{mov.b} instructions which use the
1810 sixteen-bit absolute address form, but refer to the top
1811 page of memory, and changes them to use the eight-bit address form.
1812 (That is: the linker turns @samp{mov.b @code{@@}@var{aa}:16} into
1813 @samp{mov.b @code{@@}@var{aa}:8} whenever the address @var{aa} is in the
1814 top page of memory).
1815 @end table
1816 @ifclear GENERIC
1817 @down
1818 @end ifclear
1819 @end ifset
1820
1821 @ifset I960
1822 @ifclear GENERIC
1823 @up
1824 @end ifclear
1825 @node i960
1826 @section @code{ld} and the Intel 960 family
1827
1828 @cindex i960 support
1829
1830 You can use the @samp{-A@var{architecture}} command line option to
1831 specify one of the two-letter names identifying members of the 960
1832 family; the option specifies the desired output target, and warns of any
1833 incompatible instructions in the input files.  It also modifies the
1834 linker's search strategy for archive libraries, to support the use of
1835 libraries specific to each particular architecture, by including in the
1836 search loop names suffixed with the string identifying the architecture.
1837
1838 For example, if your @code{ld} command line included @w{@samp{-ACA}} as
1839 well as @w{@samp{-ltry}}, the linker would look (in its built-in search
1840 paths, and in any paths you specify with @samp{-L}) for a library with
1841 the names
1842
1843 @example
1844 try
1845 libtry.a
1846 tryca
1847 libtryca.a
1848 @end example
1849
1850 @noindent
1851 The first two possibilities would be considered in any event; the last
1852 two are due to the use of @w{@samp{-ACA}}.
1853
1854 You can meaningfully use @samp{-A} more than once on a command line, since
1855 the 960 architecture family allows combination of target architectures; each
1856 use will add another pair of name variants to search for when @w{@samp{-l}}
1857 specifies a library.
1858 @ifclear GENERIC
1859 @down
1860 @end ifclear
1861 @end ifset
1862
1863 @ifclear SingleFormat
1864 @node BFD
1865 @chapter BFD
1866
1867 @cindex back end
1868 @cindex object file management
1869 The linker accesses object and archive files using the BFD libraries.
1870 These libraries allow the linker to use the same routines to operate on
1871 object files whatever the object file format.  A different object file
1872 format can be supported simply by creating a new BFD back end and adding
1873 it to the library.  You can use @code{objdump -i}
1874 (@pxref{objdump,,objdump,binutils.info,The GNU Binary Utilities}) to
1875 list all the formats available for each architecture under BFD.  This
1876 was the list of formats, and of architectures supported for each format,
1877 as of the time this manual was prepared:
1878 @cindex formats available
1879 @cindex architectures available
1880 @example
1881 BFD header file version 0.18
1882 a.out-i386
1883  (header big endian, data big endian)
1884   m68k:68020
1885   a29k
1886   sparc
1887   i386
1888 a.out-sunos-big
1889  (header big endian, data big endian)
1890   m68k:68020
1891   a29k
1892   sparc
1893   i386
1894 b.out.big
1895  (header big endian, data little endian)
1896   i960:core
1897 b.out.little
1898  (header little endian, data little endian)
1899   i960:core
1900 coff-a29k-big
1901  (header big endian, data big endian)
1902   a29k
1903 coff-h8300
1904  (header big endian, data big endian)
1905   H8/300
1906 coff-i386
1907  (header little endian, data little endian)
1908   i386
1909 coff-Intel-big
1910  (header big endian, data little endian)
1911   i960:core
1912 coff-Intel-little
1913  (header little endian, data little endian)
1914   i960:core
1915 coff-m68k
1916  (header big endian, data big endian)
1917   m68k:68020
1918 coff-m88kbcs
1919  (header big endian, data big endian)
1920   m88k:88100
1921 ecoff-bigmips
1922  (header big endian, data big endian)
1923   mips
1924 ecoff-littlemips
1925  (header little endian, data little endian)
1926   mips
1927 elf-big
1928  (header big endian, data big endian)
1929   m68k:68020
1930   vax
1931   i960:core
1932   a29k
1933   sparc
1934   mips
1935   i386
1936   m88k:88100
1937   H8/300
1938   rs6000:6000
1939 elf-little
1940  (header little endian, data little endian)
1941   m68k:68020
1942   vax
1943   i960:core
1944   a29k
1945   sparc
1946   mips
1947   i386
1948   m88k:88100
1949   H8/300
1950   rs6000:6000
1951 ieee
1952  (header big endian, data big endian)
1953   m68k:68020
1954   vax
1955   i960:core
1956   a29k
1957   sparc
1958   mips
1959   i386
1960   m88k:88100
1961   H8/300
1962   rs6000:6000
1963 srec
1964  (header big endian, data big endian)
1965   m68k:68020
1966   vax
1967   i960:core
1968   a29k
1969   sparc
1970   mips
1971   i386
1972   m88k:88100
1973   H8/300
1974   rs6000:6000
1975 @end example
1976
1977 @cindex BFD requirements
1978 @cindex requirements for BFD
1979 As with most implementations, BFD is a compromise between
1980 several conflicting requirements. The major factor influencing
1981 BFD design was efficiency: any time used converting between
1982 formats is time which would not have been spent had BFD not
1983 been involved. This is partly offset by abstraction payback; since
1984 BFD simplifies applications and back ends, more time and care
1985 may be spent optimizing algorithms for a greater speed.
1986
1987 One minor artifact of the BFD solution which you should bear in
1988 mind is the potential for information loss.  There are two places where
1989 useful information can be lost using the BFD mechanism: during
1990 conversion and during output. @xref{BFD information loss}.
1991
1992 @menu
1993 * BFD outline::                 How it works: an outline of BFD
1994 * BFD information loss::        Information Loss
1995 * Mechanism::                   Mechanism
1996 @end menu
1997
1998 @node BFD outline
1999 @section How it works: an outline of BFD
2000 @cindex opening object files
2001 When an object file is opened, BFD subroutines automatically
2002 determine the format of the input object file, and build a descriptor in
2003 memory with pointers to routines that will be used to access elements of
2004 the object file's data structures.
2005
2006 As different information from the the object files is required,
2007 BFD reads from different sections of the file and processes them.
2008 For example, a very common operation for the linker is processing symbol
2009 tables.  Each BFD back end provides a routine for converting
2010 between the object file's representation of symbols and an internal
2011 canonical format. When the linker asks for the symbol table of an object
2012 file, it calls through the memory pointer to the BFD
2013 back end routine which reads and converts the table into a canonical
2014 form.  The linker then operates upon the common form. When the link is
2015 finished and the linker writes the symbol table of the output file,
2016 another BFD back end routine is called which takes the newly
2017 created symbol table and converts it into the chosen output format.
2018
2019 @node BFD information loss
2020 @section Information Loss
2021 @emph{Information can be lost during output.} The output formats
2022 supported by BFD do not provide identical facilities, and
2023 information which may be described in one form has nowhere to go in
2024 another format. One example of this is alignment information in
2025 @code{b.out}. There is nowhere in an @code{a.out} format file to store
2026 alignment information on the contained data, so when a file is linked
2027 from @code{b.out} and an @code{a.out} image is produced, alignment
2028 information will not propagate to the output file. (The linker will
2029 still use the alignment information internally, so the link is performed
2030 correctly).
2031
2032 Another example is COFF section names. COFF files may contain an
2033 unlimited number of sections, each one with a textual section name. If
2034 the target of the link is a format which does not have many sections (e.g.,
2035 @code{a.out}) or has sections without names (e.g., the Oasys format) the
2036 link cannot be done simply. You can circumvent this problem by
2037 describing the desired input-to-output section mapping with the command
2038 language.
2039
2040 @emph{Information can be lost during canonicalization.} The BFD
2041 internal canonical form of the external formats is not exhaustive; there
2042 are structures in input formats for which there is no direct
2043 representation internally.  This means that the BFD back ends
2044 cannot maintain all possible data richness through the transformation
2045 between external to internal and back to external formats.
2046
2047 This limitation is only a problem when using the linker to read one
2048 format and write another. Each BFD back end is responsible for
2049 maintaining as much data as possible, and the internal BFD
2050 canonical form has structures which are opaque to the BFD core,
2051 and exported only to the back ends. When a file is read in one format,
2052 the canonical form is generated for BFD and the linker. At the
2053 same time, the back end saves away any information which would otherwise
2054 be lost. If the data is then written back in the same format, the back
2055 end routine will be able to use the canonical form provided by the
2056 BFD core as well as the information it prepared earlier.  Since
2057 there is a great deal of commonality between back ends,
2058 there is no information lost when
2059 linking big endian COFF to little endian COFF, or from @code{a.out} to
2060 @code{b.out}.  When a mixture of formats is linked, the information is
2061 only lost from the files whose format differs from the destination.
2062
2063 @node Mechanism
2064 @section Mechanism
2065 The greatest potential for loss of information occurs when there is the least
2066 overlap between the information provided by the source format, that
2067 stored by the canonical format, and that needed by the
2068 destination format. A brief description of the canonical form may help
2069 you understand which kinds of data you can count on preserving across
2070 conversions.
2071 @cindex BFD canonical format
2072 @cindex internal object-file format
2073
2074 @table @emph
2075 @item files
2076 Information on target machine architecture, particular implementation,
2077 and format type are stored on a per-file basis. Other information
2078 includes a demand pagable bit and a write protected bit.
2079 Information like Unix magic numbers is not stored here---only the magic
2080 numbers' meaning, so a @code{ZMAGIC} file would have both the demand pagable
2081 bit and the write protected text bit set.
2082
2083 The byte order of the target is stored on a per-file basis, so that big-
2084 and little-endian object files may be linked with one another.
2085
2086 @item sections
2087 Each section in the input file contains the name of the section, the
2088 original address in the object file, various options, size and alignment
2089 information and pointers into other BFD data structures.
2090
2091 @item symbols
2092 Each symbol contains a pointer to the object file which originally
2093 defined it, its name, its value, and various option bits.  When a
2094 BFD back end reads in a symbol table, the back end relocates all
2095 symbols to make them relative to the base of the section where they were
2096 defined.  Doing this ensures that each symbol points to its containing
2097 section.  Each symbol also has a varying amount of hidden
2098 private data for the BFD back end.  Since the symbol points to the
2099 original file, the private data format for that symbol is accessible.
2100 @code{ld} can operate on a collection of symbols of wildly different
2101 formats without problems.
2102
2103 Normal global and simple local symbols are maintained on output, so an
2104 output file (no matter its format) will retain symbols pointing to
2105 functions and to global, static, and common variables.  Some symbol
2106 information is not worth retaining; in @code{a.out}, type information is
2107 stored in the symbol table as long symbol names. This information would
2108 be useless to most COFF debuggers and may be thrown away with
2109 appropriate command line switches. (The GNU debugger @code{gdb} does
2110 support @code{a.out} style debugging information in COFF).
2111
2112 There is one word of type information within the symbol, so if the
2113 format supports symbol type information within symbols (for example, COFF,
2114 IEEE, Oasys) and the type is simple enough to fit within one word
2115 (nearly everything but aggregates), the information will be preserved.
2116
2117 @item relocation level
2118 Each canonical BFD relocation record contains a pointer to the symbol to
2119 relocate to, the offset of the data to relocate, the section the data
2120 is in, and a pointer to a relocation type descriptor. Relocation is
2121 performed by passing messages through the relocation type
2122 descriptor and the symbol pointer. Therefore, relocations can be performed
2123 on output data using a relocation method that is only available in one of the
2124 input formats. For instance, Oasys provides a byte relocation format.
2125 A relocation record requesting this relocation type would point
2126 indirectly to a routine to perform this, so the relocation may be
2127 performed on a byte being written to a COFF file, even though 68k COFF
2128 has no such relocation type.
2129 @c FIXME why specific reference to 68K above?
2130
2131 @item line numbers
2132 Object formats can contain, for debugging purposes, some form of mapping
2133 between symbols, source line numbers, and addresses in the output file.
2134 These addresses have to be relocated along with the symbol information.
2135 Each symbol with an associated list of line number records points to the
2136 first record of the list.  The head of a line number list consists of a
2137 pointer to the symbol, which allows finding out the address of the
2138 function whose line number is being described. The rest of the list is
2139 made up of pairs: offsets into the section and line numbers. Any format
2140 which can simply derive this information can pass it successfully
2141 between formats (COFF, IEEE and Oasys).
2142 @end table
2143 @end ifclear
2144
2145 @node MRI
2146 @appendix MRI Compatible Script Files
2147 @cindex MRI compatibility
2148 To aid users making the transition to @sc{gnu} @code{ld} from the MRI
2149 linker, @code{ld} can use MRI compatible linker scripts as an
2150 alternative to the more general-purpose linker scripting language
2151 described in @ref{Commands,,Command Language}.  MRI compatible linker
2152 scripts have a much simpler command set than the scripting language
2153 otherwise used with @code{ld}.  @sc{gnu} @code{ld} supports the most
2154 commonly used MRI linker commands; these commands are described here.
2155
2156 You can specify a file containing an MRI-compatible script using the
2157 @samp{-c} command-line option.
2158
2159 Each command in an MRI-compatible script occupies its own line; each
2160 command line starts with the keyword that identifies the command (though
2161 blank lines are also allowed for punctuation).  If a line of an
2162 MRI-compatible script begins with an unrecognized keyword, @code{ld}
2163 issues a warning message, but continues processing the script.
2164
2165 Lines beginning with @samp{*} are comments.
2166
2167 You can write these commands using all upper-case letters, or all
2168 lower case; for example, @samp{chip} is the same as @samp{CHIP}.
2169 The following list shows only the upper-case form of each command.
2170
2171 @table @code
2172 @item ABSOLUTE @var{secname}
2173 @item ABSOLUTE @var{secname}, @var{secname}, @dots{} @var{secname}
2174 @cindex @code{ABSOLUTE} (MRI)
2175 Normally, @code{ld} includes in the output file all sections from all
2176 the input files.  However, in an MRI-compatible script, you can use the
2177 @code{ABSOLUTE} command to restrict the sections that will be present in
2178 your output program.  If the @code{ABSOLUTE} command is used at all in a
2179 script, then only the sections named explicitly in @code{ABSOLUTE}
2180 commands will appear in the linker output.  You can still use other
2181 input sections (whatever you select on the command line, or using
2182 @code{LOAD}) to resolve addresses in the output file.
2183
2184 @item ALIAS @var{out-secname}, @var{in-secname}
2185 @cindex @code{ALIAS} (MRI)
2186 Use this command to place the data from input section @var{in-secname}
2187 in a section called @var{out-secname} in the linker output file.
2188
2189 @var{in-secname} may be an integer.
2190
2191 @item BASE @var{expression}
2192 @cindex @code{BASE} (MRI)
2193 Use the value of @var{expression} as the lowest address (other than
2194 absolute addresses) in the output file.
2195
2196 @item CHIP @var{expression}
2197 @itemx CHIP @var{expression}, @var{expression}
2198 @cindex @code{CHIP} (MRI)
2199 This command does nothing; it is accepted only for compatibility.
2200
2201 @item END
2202 @cindex @code{END} (MRI)
2203 This command does nothing whatever; it's only accepted for compatibility.
2204
2205 @item FORMAT @var{output-format}
2206 @cindex @code{FORMAT} (MRI)
2207 Similar to the @code{OUTPUT_FORMAT} command in the more general linker
2208 language, but restricted to one of these output formats:
2209 @enumerate
2210 @item
2211 S-records, if @var{output-format} is @samp{S}
2212
2213 @item
2214 IEEE, if @var{output-format} is @samp{IEEE}
2215
2216 @item
2217 COFF (the @samp{coff-m68k} variant in BFD), if @var{output-format} is
2218 @samp{COFF}
2219 @end enumerate
2220
2221 @item LIST @var{anything}@dots{}
2222 @cindex @code{LIST} (MRI)
2223 Print (to the standard output file) a link map, as produced by the
2224 @code{ld} command-line option @samp{-M}.
2225
2226 The keyword @code{LIST} may be followed by anything on the
2227 same line, with no change in its effect.
2228
2229 @item LOAD @var{filename}
2230 @item LOAD @var{filename}, @var{filename}, @dots{} @var{filename}
2231 @cindex @code{LOAD} (MRI)
2232 Include one or more object file @var{filename} in the link; this has the
2233 same effect as specifying @var{filename} directly on the @code{ld}
2234 command line.
2235
2236 @item NAME @var{output-name}
2237 @cindex @code{NAME} (MRI)
2238 @var{output-name} is the name for the program produced by @code{ld}; the
2239 MRI-compatible command @code{NAME} is equivalent to the command-line
2240 option @samp{-o} or the general script language command @code{OUTPUT}.
2241
2242 @item ORDER @var{secname}, @var{secname}, @dots{} @var{secname}
2243 @itemx ORDER @var{secname} @var{secname} @var{secname}
2244 @cindex @code{ORDER} (MRI)
2245 Normally, @code{ld} orders the sections in its output file in the
2246 order in which they first appear in the input files.  In an MRI-compatible
2247 script, you can override this ordering with the @code{ORDER} command.  The
2248 sections you list with @code{ORDER} will appear first in your output
2249 file, in the order specified.
2250
2251 @item PUBLIC @var{name}=@var{expression}
2252 @itemx PUBLIC @var{name},@var{expression}
2253 @itemx PUBLIC @var{name} @var{expression}
2254 @cindex @code{PUBLIC} (MRI)
2255 Supply a value (@var{expression}) for external symbol
2256 @var{name} used in the linker input files.
2257
2258 @item SECT @var{secname}, @var{expression}
2259 @itemx SECT @var{secname}=@var{expression}
2260 @itemx SECT @var{secname} @var{expression}
2261 @cindex @code{SECT} (MRI)
2262 You can use any of these three forms of the @code{SECT} command to
2263 specify the start address (@var{expression}) for section @var{secname}.
2264 If you have more than one @code{SECT} statement for the same
2265 @var{secname}, only the @emph{first} sets the start address.
2266 @end table
2267
2268
2269 @node Index
2270 @unnumbered Index
2271
2272 @printindex cp
2273
2274 @tex
2275 % I think something like @colophon should be in texinfo.  In the
2276 % meantime:
2277 \long\def\colophon{\hbox to0pt{}\vfill
2278 \centerline{The body of this manual is set in}
2279 \centerline{\fontname\tenrm,}
2280 \centerline{with headings in {\bf\fontname\tenbf}}
2281 \centerline{and examples in {\tt\fontname\tentt}.}
2282 \centerline{{\it\fontname\tenit\/} and}
2283 \centerline{{\sl\fontname\tensl\/}}
2284 \centerline{are used for emphasis.}\vfill}
2285 \page\colophon
2286 % Blame: [email protected], 28mar91.
2287 @end tex
2288
2289
2290 @contents
2291 @bye
2292
2293