git.chise.org Git - chise/xemacs-chise.git.1/blob - man/xemacs/search.texi

   1
   2 @node Search, Fixit, Display, Top
   3 @chapter Searching and Replacement
   4 @cindex searching
   5
   6   Like other editors, Emacs has commands for searching for occurrences of
   7 a string.  The principal search command is unusual in that it is
   8 @dfn{incremental}: it begins to search before you have finished typing the
   9 search string.  There are also non-incremental search commands more like
  10 those of other editors.
  11
  12   Besides the usual @code{replace-string} command that finds all
  13 occurrences of one string and replaces them with another, Emacs has a fancy
  14 replacement command called @code{query-replace} which asks interactively
  15 which occurrences to replace.
  16
  17 @menu
  18 * Incremental Search::     Search happens as you type the string.
  19 * Non-Incremental Search:: Specify entire string and then search.
  20 * Word Search::            Search for sequence of words.
  21 * Regexp Search::          Search for match for a regexp.
  22 * Regexps::                Syntax of regular expressions.
  23 * Search Case::            To ignore case while searching, or not.
  24 * Replace::                Search, and replace some or all matches.
  25 * Other Repeating Search:: Operating on all matches for some regexp.
  26 @end menu
  27
  28 @node Incremental Search, Non-Incremental Search, Search, Search
  29 @section Incremental Search
  30
  31   An incremental search begins searching as soon as you type the first
  32 character of the search string.  As you type in the search string, Emacs
  33 shows you where the string (as you have typed it so far) is found.
  34 When you have typed enough characters to identify the place you want, you
  35 can stop.  Depending on what you do next, you may or may not need to
  36 terminate the search explicitly with a @key{RET}.
  37
  38 @c WideCommands
  39 @table @kbd
  40 @item C-s
  41 Incremental search forward (@code{isearch-forward}).
  42 @item C-r
  43 Incremental search backward (@code{isearch-backward}).
  44 @end table
  45
  46 @kindex C-s
  47 @kindex C-r
  48 @findex isearch-forward
  49 @findex isearch-backward
  50   @kbd{C-s} starts an incremental search.  @kbd{C-s} reads characters from
  51 the keyboard and positions the cursor at the first occurrence of the
  52 characters that you have typed.  If you type @kbd{C-s} and then @kbd{F},
  53 the cursor moves right after the first @samp{F}.  Type an @kbd{O}, and see
  54 the cursor move to after the first @samp{FO}.  After another @kbd{O}, the
  55 cursor is after the first @samp{FOO} after the place where you started the
  56 search.  Meanwhile, the search string @samp{FOO} has been echoed in the
  57 echo area.@refill
  58
  59   The echo area display ends with three dots when actual searching is going
  60 on.  When search is waiting for more input, the three dots are removed.
  61 (On slow terminals, the three dots are not displayed.)
  62
  63   If you make a mistake in typing the search string, you can erase
  64 characters with @key{DEL}.  Each @key{DEL} cancels the last character of the
  65 search string.  This does not happen until Emacs is ready to read another
  66 input character; first it must either find, or fail to find, the character
  67 you want to erase.  If you do not want to wait for this to happen, use
  68 @kbd{C-g} as described below.@refill
  69
  70   When you are satisfied with the place you have reached, you can type
  71 @key{RET} (or @key{C-m}), which stops searching, leaving the cursor where
  72 the search brought it.  Any command not specially meaningful in searches also
  73 stops the search and is then executed.  Thus, typing @kbd{C-a} exits the
  74 search and then moves to the beginning of the line.  @key{RET} is necessary
  75 only if the next command you want to type is a printing character,
  76 @key{DEL}, @key{ESC}, or another control character that is special
  77 within searches (@kbd{C-q}, @kbd{C-w}, @kbd{C-r}, @kbd{C-s}, or @kbd{C-y}).
  78
  79   Sometimes you search for @samp{FOO} and find it, but were actually
  80 looking for a different occurence of it.  To move to the next occurrence
  81 of the search string, type another @kbd{C-s}.  Do this as often as
  82 necessary.  If you overshoot, you can cancel some @kbd{C-s}
  83 characters with @key{DEL}.
  84
  85   After you exit a search, you can search for the same string again by
  86 typing just @kbd{C-s C-s}: the first @kbd{C-s} is the key that invokes
  87 incremental search, and the second @kbd{C-s} means ``search again''.
  88
  89   If the specified string is not found at all, the echo area displays
  90 the text @samp{Failing I-Search}.  The cursor is after the place where
  91 Emacs found as much of your string as it could.  Thus, if you search for
  92 @samp{FOOT}, and there is no @samp{FOOT}, the cursor may be after the
  93 @samp{FOO} in @samp{FOOL}.  At this point there are several things you
  94 can do.  If you mistyped the search string, correct it.  If you like the
  95 place you have found, you can type @key{RET} or some other Emacs command
  96 to ``accept what the search offered''.  Or you can type @kbd{C-g}, which
  97 removes from the search string the characters that could not be found
  98 (the @samp{T} in @samp{FOOT}), leaving those that were found (the
  99 @samp{FOO} in @samp{FOOT}).  A second @kbd{C-g} at that point cancels
 100 the search entirely, returning point to where it was when the search
 101 started.
 102
 103   If a search is failing and you ask to repeat it by typing another
 104 @kbd{C-s}, it starts again from the beginning of the buffer.  Repeating
 105 a failing backward search with @kbd{C-r} starts again from the end.  This
 106 is called @dfn{wrapping around}.  @samp{Wrapped} appears in the search
 107 prompt once this has happened.
 108
 109 @cindex quitting (in search)
 110   The @kbd{C-g} ``quit'' character does special things during searches;
 111 just what it does depends on the status of the search.  If the search has
 112 found what you specified and is waiting for input, @kbd{C-g} cancels the
 113 entire search.  The cursor moves back to where you started the search.  If
 114 @kbd{C-g} is typed when there are characters in the search string that have
 115 not been found---because Emacs is still searching for them, or because it
 116 has failed to find them---then the search string characters which have not
 117 been found are discarded from the search string.  The
 118 search is now successful and waiting for more input, so a second @kbd{C-g}
 119 cancels the entire search.
 120
 121   To search for a control character such as @kbd{C-s} or @key{DEL} or
 122 @key{ESC}, you must quote it by typing @kbd{C-q} first.  This function
 123 of @kbd{C-q} is analogous to its meaning as an Emacs command: it causes
 124 the following character to be treated the way a graphic character would
 125 normally be treated in the same context.
 126
 127  To search backwards, you can use @kbd{C-r} instead of @kbd{C-s} to
 128 start the search; @kbd{C-r} is the key that runs the command
 129 (@code{isearch-backward}) to search backward.  You can also use
 130 @kbd{C-r} to change from searching forward to searching backwards.  Do
 131 this if a search fails because the place you started was too far down in the
 132 file.  Repeated @kbd{C-r} keeps looking for more occurrences backwards.
 133 @kbd{C-s} starts going forward again.  You can cancel @kbd{C-r} in a
 134 search with @key{DEL}.
 135
 136   The characters @kbd{C-y} and @kbd{C-w} can be used in incremental search
 137 to grab text from the buffer into the search string.  This makes it
 138 convenient to search for another occurrence of text at point.  @kbd{C-w}
 139 copies the word after point as part of the search string, advancing
 140 point over that word.  Another @kbd{C-s} to repeat the search will then
 141 search for a string including that word.  @kbd{C-y} is similar to @kbd{C-w}
 142 but copies the rest of the current line into the search string.
 143
 144   The characters @kbd{M-p} and @kbd{M-n} can be used in an incremental
 145 search to recall things which you have searched for in the past.  A
 146 list of the last 16 things you have searched for is retained, and
 147 @kbd{M-p} and @kbd{M-n} let you cycle through that ring.
 148
 149 The character @kbd{M-@key{TAB}} does completion on the elements in
 150 the search history ring.  For example, if you know that you have
 151 recently searched for the string @code{POTATOE}, you could type
 152 @kbd{C-s P O M-@key{TAB}}.  If you had searched for other strings
 153 beginning with @code{PO} then you would be shown a list of them, and
 154 would need to type more to select one.
 155
 156   You can change any of the special characters in incremental search via
 157 the normal keybinding mechanism: simply add a binding to the
 158 @code{isearch-mode-map}.  For example, to make the character
 159 @kbd{C-b} mean ``search backwards'' while in isearch-mode, do this:
 160
 161 @example
 162 (define-key isearch-mode-map "\C-b" 'isearch-repeat-backward)
 163 @end example
 164
 165 These are the default bindings of isearch-mode:
 166
 167 @findex isearch-delete-char
 168 @findex isearch-exit
 169 @findex isearch-quote-char
 170 @findex isearch-repeat-forward
 171 @findex isearch-repeat-backward
 172 @findex isearch-yank-line
 173 @findex isearch-yank-word
 174 @findex isearch-abort
 175 @findex isearch-ring-retreat
 176 @findex isearch-ring-advance
 177 @findex isearch-complete
 178
 179 @kindex DEL (isearch-mode)
 180 @kindex RET (isearch-mode)
 181 @kindex C-q (isearch-mode)
 182 @kindex C-s (isearch-mode)
 183 @kindex C-r (isearch-mode)
 184 @kindex C-y (isearch-mode)
 185 @kindex C-w (isearch-mode)
 186 @kindex C-g (isearch-mode)
 187 @kindex M-p (isearch-mode)
 188 @kindex M-n (isearch-mode)
 189 @kindex M-TAB (isearch-mode)
 190
 191 @table @kbd
 192 @item DEL
 193 Delete a character from the incremental search string (@code{isearch-delete-char}).
 194 @item RET
 195 Exit incremental search (@code{isearch-exit}).
 196 @item C-q
 197 Quote special characters for incremental search (@code{isearch-quote-char}).
 198 @item C-s
 199 Repeat incremental search forward (@code{isearch-repeat-forward}).
 200 @item C-r
 201 Repeat incremental search backward (@code{isearch-repeat-backward}).
 202 @item C-y
 203 Pull rest of line from buffer into search string (@code{isearch-yank-line}).
 204 @item C-w
 205 Pull next word from buffer into search string (@code{isearch-yank-word}).
 206 @item C-g
 207 Cancels input back to what has been found successfully, or aborts the
 208 isearch (@code{isearch-abort}).
 209 @item M-p
 210 Recall the previous element in the isearch history ring
 211 (@code{isearch-ring-retreat}).
 212 @item M-n
 213 Recall the next element in the isearch history ring
 214 (@code{isearch-ring-advance}).
 215 @item M-@key{TAB}
 216 Do completion on the elements in the isearch history ring
 217 (@code{isearch-complete}).
 218
 219 @end table
 220
 221 Any other character which is normally inserted into a buffer when typed
 222 is automatically added to the search string in isearch-mode.
 223
 224 @subsection Slow Terminal Incremental Search
 225
 226   Incremental search on a slow terminal uses a modified style of display
 227 that is designed to take less time.  Instead of redisplaying the buffer at
 228 each place the search gets to, it creates a new single-line window and uses
 229 that to display the line the search has found.  The single-line window
 230 appears as soon as point gets outside of the text that is already
 231 on the screen.
 232
 233   When the search is terminated, the single-line window is removed.  Only
 234 at this time the window in which the search was done is redisplayed to show
 235 its new value of point.
 236
 237   The three dots at the end of the search string, normally used to indicate
 238 that searching is going on, are not displayed in slow style display.
 239
 240 @vindex search-slow-speed
 241   The slow terminal style of display is used when the terminal baud rate is
 242 less than or equal to the value of the variable @code{search-slow-speed},
 243 initially 1200.
 244
 245 @vindex search-slow-window-lines
 246   The number of lines to use in slow terminal search display is controlled
 247 by the variable @code{search-slow-window-lines}.  Its normal value is 1.
 248
 249 @node Non-Incremental Search, Word Search, Incremental Search, Search
 250 @section Non-Incremental Search
 251 @cindex non-incremental search
 252
 253   Emacs also has conventional non-incremental search commands, which require
 254 you type the entire search string before searching begins.
 255
 256 @table @kbd
 257 @item C-s @key{RET} @var{string} @key{RET}
 258 Search for @var{string}.
 259 @item C-r @key{RET} @var{string} @key{RET}
 260 Search backward for @var{string}.
 261 @end table
 262
 263   To do a non-incremental search, first type @kbd{C-s @key{RET}}
 264 (or @kbd{C-s C-m}).  This enters the minibuffer to read the search string.
 265 Terminate the string with @key{RET} to start the search.  If the string
 266 is not found, the search command gets an error.
 267
 268  By default, @kbd{C-s} invokes incremental search, but if you give it an
 269 empty argument, which would otherwise be useless, it invokes non-incremental
 270 search.  Therefore, @kbd{C-s @key{RET}} invokes non-incremental search.
 271 @kbd{C-r @key{RET}} also works this way.
 272
 273 @findex search-forward
 274 @findex search-backward
 275   Forward and backward non-incremental searches are implemented by the
 276 commands @code{search-forward} and @code{search-backward}.  You can bind
 277 these commands to keys.  The reason that incremental
 278 search is programmed to invoke them as well is that @kbd{C-s @key{RET}}
 279 is the traditional sequence of characters used in Emacs to invoke
 280 non-incremental search.
 281
 282  Non-incremental searches performed using @kbd{C-s @key{RET}} do
 283 not call @code{search-forward} right away.  They first check
 284 if the next character is @kbd{C-w}, which requests a word search.
 285 @ifinfo
 286 @xref{Word Search}.
 287 @end ifinfo
 288
 289 @node Word Search, Regexp Search, Non-Incremental Search, Search
 290 @section Word Search
 291 @cindex word search
 292
 293   Word search looks for a sequence of words without regard to how the
 294 words are separated.  More precisely, you type a string of many words,
 295 using single spaces to separate them, and the string is found even if
 296 there are multiple spaces, newlines or other punctuation between the words.
 297
 298   Word search is useful in editing documents formatted by text formatters.
 299 If you edit while looking at the printed, formatted version, you can't tell
 300 where the line breaks are in the source file.  Word search, allows you
 301 to search  without having to know the line breaks.
 302
 303 @table @kbd
 304 @item C-s @key{RET} C-w @var{words} @key{RET}
 305 Search for @var{words}, ignoring differences in punctuation.
 306 @item C-r @key{RET} C-w @var{words} @key{RET}
 307 Search backward for @var{words}, ignoring differences in punctuation.
 308 @end table
 309
 310   Word search is a special case of non-incremental search.  It is invoked
 311 with @kbd{C-s @key{RET} C-w} followed by the search string, which
 312 must always be terminated with another @key{RET}.  Being non-incremental, this
 313 search does not start until the argument is terminated.  It works by
 314 constructing a regular expression and searching for that.  @xref{Regexp
 315 Search}.
 316
 317  You can do a backward word search with @kbd{C-r @key{RET} C-w}.
 318
 319 @findex word-search-forward
 320 @findex word-search-backward
 321   Forward and backward word searches are implemented by the commands
 322 @code{word-search-forward} and @code{word-search-backward}.  You can
 323 bind these commands to keys.  The reason that incremental
 324 search is programmed to invoke them as well is that @kbd{C-s @key{RET} C-w}
 325 is the traditional Emacs sequence of keys for word search.
 326
 327 @node Regexp Search, Regexps, Word Search, Search
 328 @section Regular Expression Search
 329 @cindex regular expression
 330 @cindex regexp
 331
 332   A @dfn{regular expression} (@dfn{regexp}, for short) is a pattern that
 333 denotes a set of strings, possibly an infinite set.  Searching for matches
 334 for a regexp is a powerful operation that editors on Unix systems have
 335 traditionally offered.  In XEmacs, you can search for the next match for
 336 a regexp either incrementally or not.
 337
 338 @kindex M-C-s
 339 @findex isearch-forward-regexp
 340 @findex isearch-backward-regexp
 341   Incremental search for a regexp is done by typing @kbd{M-C-s}
 342 (@code{isearch-forward-regexp}).  This command reads a search string
 343 incrementally just like @kbd{C-s}, but it treats the search string as a
 344 regexp rather than looking for an exact match against the text in the
 345 buffer.  Each time you add text to the search string, you make the regexp
 346 longer, and the new regexp is searched for.  A reverse regexp search command
 347 @code{isearch-backward-regexp} also exists, but no key runs it.
 348
 349   All of the control characters that do special things within an ordinary
 350 incremental search have the same functionality in incremental regexp search.
 351 Typing @kbd{C-s} or @kbd{C-r} immediately after starting a search
 352 retrieves the last incremental search regexp used:
 353 incremental regexp and non-regexp searches have independent defaults.
 354
 355 @findex re-search-forward
 356 @findex re-search-backward
 357   Non-incremental search for a regexp is done by the functions
 358 @code{re-search-forward} and @code{re-search-backward}.  You can invoke
 359 them with @kbd{M-x} or bind them to keys.  You can also call
 360 @code{re-search-forward} by way of incremental regexp search with
 361 @kbd{M-C-s @key{RET}}.
 362
 363 @node Regexps, Search Case, Regexp Search, Search
 364 @section Syntax of Regular Expressions
 365
 366 Regular expressions have a syntax in which a few characters are special
 367 constructs and the rest are @dfn{ordinary}.  An ordinary character is a
 368 simple regular expression which matches that character and nothing else.
 369 The special characters are @samp{$}, @samp{^}, @samp{.}, @samp{*},
 370 @samp{+}, @samp{?}, @samp{[}, @samp{]} and @samp{\}; no new special
 371 characters will be defined.  Any other character appearing in a regular
 372 expression is ordinary, unless a @samp{\} precedes it.@refill
 373
 374 For example, @samp{f} is not a special character, so it is ordinary, and
 375 therefore @samp{f} is a regular expression that matches the string @samp{f}
 376 and no other string.  (It does @i{not} match the string @samp{ff}.)  Likewise,
 377 @samp{o} is a regular expression that matches only @samp{o}.@refill
 378
 379 Any two regular expressions @var{a} and @var{b} can be concatenated.  The
 380 result is a regular expression which matches a string if @var{a} matches
 381 some amount of the beginning of that string and @var{b} matches the rest of
 382 the string.@refill
 383
 384 As a simple example, you can concatenate the regular expressions @samp{f}
 385 and @samp{o} to get the regular expression @samp{fo}, which matches only
 386 the string @samp{fo}.  To do something nontrivial, you
 387 need to use one of the following special characters:
 388
 389 @table @kbd
 390 @item .@: @r{(Period)}
 391 is a special character that matches any single character except a newline.
 392 Using concatenation, you can make regular expressions like @samp{a.b}, which
 393 matches any three-character string which begins with @samp{a} and ends with
 394 @samp{b}.@refill
 395
 396 @item *
 397 is not a construct by itself; it is a suffix, which means the
 398 preceding regular expression is to be repeated as many times as
 399 possible.  In @samp{fo*}, the @samp{*} applies to the @samp{o}, so
 400 @samp{fo*} matches one @samp{f} followed by any number of @samp{o}s.
 401 The case of zero @samp{o}s is allowed: @samp{fo*} does match
 402 @samp{f}.@refill
 403
 404 @samp{*} always applies to the @i{smallest} possible preceding
 405 expression.  Thus, @samp{fo*} has a repeating @samp{o}, not a
 406 repeating @samp{fo}.@refill
 407
 408 The matcher processes a @samp{*} construct by immediately matching
 409 as many repetitions as it can find.  Then it continues with the rest
 410 of the pattern.  If that fails, backtracking occurs, discarding some
 411 of the matches of the @samp{*}-modified construct in case that makes
 412 it possible to match the rest of the pattern.  For example, matching
 413 @samp{ca*ar} against the string @samp{caaar}, the @samp{a*} first
 414 tries to match all three @samp{a}s; but the rest of the pattern is
 415 @samp{ar} and there is only @samp{r} left to match, so this try fails.
 416 The next alternative is for @samp{a*} to match only two @samp{a}s.
 417 With this choice, the rest of the regexp matches successfully.@refill
 418
 419 @item +
 420 is a suffix character similar to @samp{*} except that it requires that
 421 the preceding expression be matched at least once.  For example,
 422 @samp{ca+r} will match the strings @samp{car} and @samp{caaaar}
 423 but not the string @samp{cr}, whereas @samp{ca*r} would match all
 424 three strings.@refill
 425
 426 @item ?
 427 is a suffix character similar to @samp{*} except that it can match the
 428 preceding expression either once or not at all.  For example,
 429 @samp{ca?r} will match @samp{car} or @samp{cr}; nothing else.
 430
 431 @item [ @dots{} ]
 432 @samp{[} begins a @dfn{character set}, which is terminated by a
 433 @samp{]}.  In the simplest case, the characters between the two form
 434 the set.  Thus, @samp{[ad]} matches either one @samp{a} or one
 435 @samp{d}, and @samp{[ad]*} matches any string composed of just
 436 @samp{a}s and @samp{d}s (including the empty string), from which it
 437 follows that @samp{c[ad]*r} matches @samp{cr}, @samp{car}, @samp{cdr},
 438 @samp{caddaar}, etc.@refill
 439
 440 You can include character ranges in a character set by writing two
 441 characters with a @samp{-} between them.  Thus, @samp{[a-z]} matches any
 442 lower-case letter.  Ranges may be intermixed freely with individual
 443 characters, as in @samp{[a-z$%.]}, which matches any lower-case letter
 444 or @samp{$}, @samp{%}, or period.
 445 @refill
 446
 447 Note that inside a character set the usual special characters are not
 448 special any more.  A completely different set of special characters
 449 exists inside character sets: @samp{]}, @samp{-}, and @samp{^}.@refill
 450
 451 To include a @samp{]} in a character set, you must make it the first
 452 character.  For example, @samp{[]a]} matches @samp{]} or @samp{a}.  To
 453 include a @samp{-}, write @samp{---}, which is a range containing only
 454 @samp{-}.  To include @samp{^}, make it other than the first character
 455 in the set.@refill
 456
 457 @item [^ @dots{} ]
 458 @samp{[^} begins a @dfn{complement character set}, which matches any
 459 character except the ones specified.  Thus, @samp{[^a-z0-9A-Z]}
 460 matches all characters @i{except} letters and digits.@refill
 461
 462 @samp{^} is not special in a character set unless it is the first
 463 character.  The character following the @samp{^} is treated as if it
 464 were first (@samp{-} and @samp{]} are not special there).
 465
 466 Note that a complement character set can match a newline, unless
 467 newline is mentioned as one of the characters not to match.
 468
 469 @item ^
 470 is a special character that matches the empty string, but only if at
 471 the beginning of a line in the text being matched.  Otherwise, it fails
 472 to match anything.  Thus, @samp{^foo} matches a @samp{foo} that occurs
 473 at the beginning of a line.
 474
 475 @item $
 476 is similar to @samp{^} but matches only at the end of a line.  Thus,
 477 @samp{xx*$} matches a string of one @samp{x} or more at the end of a line.
 478
 479 @item \
 480 does two things: it quotes the special characters (including
 481 @samp{\}), and it introduces additional special constructs.
 482
 483 Because @samp{\} quotes special characters, @samp{\$} is a regular
 484 expression that matches only @samp{$}, and @samp{\[} is a regular
 485 expression that matches only @samp{[}, and so on.@refill
 486 @end table
 487
 488 Note: for historical compatibility, special characters are treated as
 489 ordinary ones if they are in contexts where their special meanings make no
 490 sense.  For example, @samp{*foo} treats @samp{*} as ordinary since there is
 491 no preceding expression on which the @samp{*} can act.  It is poor practice
 492 to depend on this behavior; better to quote the special character anyway,
 493 regardless of where is appears.@refill
 494
 495 Usually, @samp{\} followed by any character matches only
 496 that character.  However, there are several exceptions: characters
 497 which, when preceded by @samp{\}, are special constructs.  Such
 498 characters are always ordinary when encountered on their own.  Here
 499 is a table of @samp{\} constructs.
 500
 501 @table @kbd
 502 @item \|
 503 specifies an alternative.
 504 Two regular expressions @var{a} and @var{b} with @samp{\|} in
 505 between form an expression that matches anything @var{a} or
 506 @var{b} matches.@refill
 507
 508 Thus, @samp{foo\|bar} matches either @samp{foo} or @samp{bar}
 509 but no other string.@refill
 510
 511 @samp{\|} applies to the largest possible surrounding expressions.  Only a
 512 surrounding @samp{\( @dots{} \)} grouping can limit the grouping power of
 513 @samp{\|}.@refill
 514
 515 Full backtracking capability exists to handle multiple uses of @samp{\|}.
 516
 517 @item \( @dots{} \)
 518 is a grouping construct that serves three purposes:
 519
 520 @enumerate
 521 @item
 522 To enclose a set of @samp{\|} alternatives for other operations.
 523 Thus, @samp{\(foo\|bar\)x} matches either @samp{foox} or @samp{barx}.
 524
 525 @item
 526 To enclose a complicated expression for the postfix @samp{*} to operate on.
 527 Thus, @samp{ba\(na\)*} matches @samp{bananana}, etc., with any (zero or
 528 more) number of @samp{na} strings.@refill
 529
 530 @item
 531 To mark a matched substring for future reference.
 532
 533 @end enumerate
 534
 535 This last application is not a consequence of the idea of a
 536 parenthetical grouping; it is a separate feature which happens to be
 537 assigned as a second meaning to the same @samp{\( @dots{} \)} construct
 538 because in practice there is no conflict between the two meanings.
 539 Here is an explanation:
 540
 541 @item \@var{digit}
 542 after the end of a @samp{\( @dots{} \)} construct, the matcher remembers the
 543 beginning and end of the text matched by that construct.  Then, later on
 544 in the regular expression, you can use @samp{\} followed by @var{digit}
 545 to mean ``match the same text matched the @var{digit}'th time by the
 546 @samp{\( @dots{} \)} construct.''@refill
 547
 548 The strings matching the first nine @samp{\( @dots{} \)} constructs appearing
 549 in a regular expression are assigned numbers 1 through 9 in order that the
 550 open-parentheses appear in the regular expression.  @samp{\1} through
 551 @samp{\9} may be used to refer to the text matched by the corresponding
 552 @samp{\( @dots{} \)} construct.
 553
 554 For example, @samp{\(.*\)\1} matches any newline-free string that is
 555 composed of two identical halves.  The @samp{\(.*\)} matches the first
 556 half, which may be anything, but the @samp{\1} that follows must match
 557 the same exact text.
 558
 559 @item \`
 560 matches the empty string, provided it is at the beginning
 561 of the buffer.
 562
 563 @item \'
 564 matches the empty string, provided it is at the end of
 565 the buffer.
 566
 567 @item \b
 568 matches the empty string, provided it is at the beginning or
 569 end of a word.  Thus, @samp{\bfoo\b} matches any occurrence of
 570 @samp{foo} as a separate word.  @samp{\bballs?\b} matches
 571 @samp{ball} or @samp{balls} as a separate word.@refill
 572
 573 @item \B
 574 matches the empty string, provided it is @i{not} at the beginning or
 575 end of a word.
 576
 577 @item \<
 578 matches the empty string, provided it is at the beginning of a word.
 579
 580 @item \>
 581 matches the empty string, provided it is at the end of a word.
 582
 583 @item \w
 584 matches any word-constituent character.  The editor syntax table
 585 determines which characters these are.
 586
 587 @item \W
 588 matches any character that is not a word-constituent.
 589
 590 @item \s@var{code}
 591 matches any character whose syntax is @var{code}.  @var{code} is a
 592 character which represents a syntax code: thus, @samp{w} for word
 593 constituent, @samp{-} for whitespace, @samp{(} for open-parenthesis,
 594 etc.  @xref{Syntax}.@refill
 595
 596 @item \S@var{code}
 597 matches any character whose syntax is not @var{code}.
 598 @end table
 599
 600   Here is a complicated regexp used by Emacs to recognize the end of a
 601 sentence together with any whitespace that follows.  It is given in Lisp
 602 syntax to enable you to distinguish the spaces from the tab characters.  In
 603 Lisp syntax, the string constant begins and ends with a double-quote.
 604 @samp{\"} stands for a double-quote as part of the regexp, @samp{\\} for a
 605 backslash as part of the regexp, @samp{\t} for a tab and @samp{\n} for a
 606 newline.
 607
 608 @example
 609 "[.?!][]\"')]*\\($\\|\t\\|  \\)[ \t\n]*"
 610 @end example
 611
 612 @noindent
 613 This regexp contains four parts: a character set matching
 614 period, @samp{?} or @samp{!}; a character set matching close-brackets,
 615 quotes or parentheses, repeated any number of times; an alternative in
 616 backslash-parentheses that matches end-of-line, a tab or two spaces; and
 617 a character set matching whitespace characters, repeated any number of
 618 times.
 619
 620 @node Search Case, Replace, Regexps, Search
 621 @section Searching and Case
 622
 623 @vindex case-fold-search
 624   All searches in Emacs normally ignore the case of the text they
 625 are searching through; if you specify searching for @samp{FOO},
 626 @samp{Foo} and @samp{foo} are also considered a match.  Regexps, and in
 627 particular character sets, are included: @samp{[aB]} matches @samp{a}
 628 or @samp{A} or @samp{b} or @samp{B}.@refill
 629
 630   If you want a case-sensitive search, set the variable
 631 @code{case-fold-search} to @code{nil}.  Then all letters must match
 632 exactly, including case. @code{case-fold-search} is a per-buffer
 633 variable; altering it affects only the current buffer, but
 634 there is a default value which you can change as well.  @xref{Locals}.
 635 You can also use @b{Case Sensitive Search} from the @b{Options} menu
 636 on your screen.
 637
 638 @node Replace, Other Repeating Search, Search Case, Search
 639 @section Replacement Commands
 640 @cindex replacement
 641 @cindex string substitution
 642 @cindex global substitution
 643
 644   Global search-and-replace operations are not needed as often in Emacs as
 645 they are in other editors, but they are available.  In addition to the
 646 simple @code{replace-string} command which is like that found in most
 647 editors, there is a @code{query-replace} command which asks you, for each
 648 occurrence of a pattern, whether to replace it.
 649
 650   The replace commands all replace one string (or regexp) with one
 651 replacement string.  It is possible to perform several replacements in
 652 parallel using the command @code{expand-region-abbrevs}.  @xref{Expanding
 653 Abbrevs}.
 654
 655 @menu
 656 * Unconditional Replace::  Replacing all matches for a string.
 657 * Regexp Replace::         Replacing all matches for a regexp.
 658 * Replacement and Case::   How replacements preserve case of letters.
 659 * Query Replace::          How to use querying.
 660 @end menu
 661
 662 @node Unconditional Replace, Regexp Replace, Replace, Replace
 663 @subsection Unconditional Replacement
 664 @findex replace-string
 665 @findex replace-regexp
 666
 667 @table @kbd
 668 @item M-x replace-string @key{RET} @var{string} @key{RET} @var{newstring} @key{RET}
 669 Replace every occurrence of @var{string} with @var{newstring}.
 670 @item M-x replace-regexp @key{RET} @var{regexp} @key{RET} @var{newstring} @key{RET}
 671 Replace every match for @var{regexp} with @var{newstring}.
 672 @end table
 673
 674   To replace every instance of @samp{foo} after point with @samp{bar},
 675 use the command @kbd{M-x replace-string} with the two arguments
 676 @samp{foo} and @samp{bar}.  Replacement occurs only after point: if you
 677 want to cover the whole buffer you must go to the beginning first.  By
 678 default, all occurrences up to the end of the buffer are replaced.  To
 679 limit replacement to part of the buffer, narrow to that part of the
 680 buffer before doing the replacement (@pxref{Narrowing}).
 681
 682   When @code{replace-string} exits, point is left at the last occurrence
 683 replaced.  The value of point when the @code{replace-string} command was
 684 issued is remembered on the mark ring; @kbd{C-u C-@key{SPC}} moves back
 685 there.
 686
 687   A numeric argument restricts replacement to matches that are surrounded
 688 by word boundaries.
 689
 690 @node Regexp Replace, Replacement and Case, Unconditional Replace, Replace
 691 @subsection Regexp Replacement
 692
 693   @code{replace-string} replaces exact matches for a single string.  The
 694 similar command @code{replace-regexp} replaces any match for a specified
 695 pattern.
 696
 697   In @code{replace-regexp}, the @var{newstring} need not be constant.  It
 698 can refer to all or part of what is matched by the @var{regexp}.  @samp{\&}
 699 in @var{newstring} stands for the entire text being replaced.
 700 @samp{\@var{d}} in @var{newstring}, where @var{d} is a digit, stands for
 701 whatever matched the @var{d}'th parenthesized grouping in @var{regexp}.
 702 For example,@refill
 703
 704 @example
 705 M-x replace-regexp @key{RET} c[ad]+r @key{RET} \&-safe @key{RET}
 706 @end example
 707
 708 @noindent
 709 would replace (for example) @samp{cadr} with @samp{cadr-safe} and @samp{cddr}
 710 with @samp{cddr-safe}.
 711
 712 @example
 713 M-x replace-regexp @key{RET} \(c[ad]+r\)-safe @key{RET} \1 @key{RET}
 714 @end example
 715
 716 @noindent
 717 would perform exactly the opposite replacements.  To include a @samp{\}
 718 in the text to replace with, you must give @samp{\\}.
 719
 720 @node Replacement and Case, Query Replace, Regexp Replace, Replace
 721 @subsection Replace Commands and Case
 722
 723 @vindex case-replace
 724 @vindex case-fold-search
 725   If the arguments to a replace command are in lower case, the command
 726 preserves case when it makes a replacement.  Thus, the following command:
 727
 728 @example
 729 M-x replace-string @key{RET} foo @key{RET} bar @key{RET}
 730 @end example
 731
 732 @noindent
 733 replaces a lower-case @samp{foo} with a lower case @samp{bar}, @samp{FOO}
 734 with @samp{BAR}, and @samp{Foo} with @samp{Bar}.  If upper-case letters are
 735 used in the second argument, they remain upper-case every time that
 736 argument is inserted.  If upper-case letters are used in the first
 737 argument, the second argument is always substituted exactly as given, with
 738 no case conversion.  Likewise, if the variable @code{case-replace} is set
 739 to @code{nil}, replacement is done without case conversion.  If
 740 @code{case-fold-search} is set to @code{nil}, case is significant in
 741 matching occurrences of @samp{foo} to replace; also, case conversion of the
 742 replacement string is not done.
 743
 744 @node Query Replace,, Replacement and Case, Replace
 745 @subsection Query Replace
 746 @cindex query replace
 747
 748 @table @kbd
 749 @item M-% @var{string} @key{RET} @var{newstring} @key{RET}
 750 @itemx M-x query-replace @key{RET} @var{string} @key{RET} @var{newstring} @key{RET}
 751 Replace some occurrences of @var{string} with @var{newstring}.
 752 @item M-x query-replace-regexp @key{RET} @var{regexp} @key{RET} @var{newstring} @key{RET}
 753 Replace some matches for @var{regexp} with @var{newstring}.
 754 @end table
 755
 756 @kindex M-%
 757 @findex query-replace
 758   If you want to change only some of the occurrences of @samp{foo} to
 759 @samp{bar}, not all of them, you can use @code{query-replace} instead of
 760 @kbd{M-%}.  This command finds occurrences of @samp{foo} one by one,
 761 displays each occurrence, and asks you whether to replace it.  A numeric
 762 argument to @code{query-replace} tells it to consider only occurrences
 763 that are bounded by word-delimiter characters.@refill
 764
 765 @findex query-replace-regexp
 766   Aside from querying, @code{query-replace} works just like
 767 @code{replace-string}, and @code{query-replace-regexp} works
 768 just like @code{replace-regexp}.@refill
 769
 770   The things you can type when you are shown an occurrence of @var{string}
 771 or a match for @var{regexp} are:
 772
 773 @kindex SPC (query-replace)
 774 @kindex DEL (query-replace)
 775 @kindex , (query-replace)
 776 @kindex ESC (query-replace)
 777 @kindex . (query-replace)
 778 @kindex ! (query-replace)
 779 @kindex ^ (query-replace)
 780 @kindex C-r (query-replace)
 781 @kindex C-w (query-replace)
 782 @kindex C-l (query-replace)
 783
 784 @c WideCommands
 785 @table @kbd
 786 @item @key{SPC}
 787 to replace the occurrence with @var{newstring}.  This preserves case, just
 788 like @code{replace-string}, provided @code{case-replace} is non-@code{nil},
 789 as it normally is.@refill
 790
 791 @item @key{DEL}
 792 to skip to the next occurrence without replacing this one.
 793
 794 @item , @r{(Comma)}
 795 to replace this occurrence and display the result.  You are then
 796 prompted for another input character.  However, since the replacement has
 797 already been made, @key{DEL} and @key{SPC} are equivalent.  At this
 798 point, you can type @kbd{C-r} (see below) to alter the replaced text.  To
 799 undo the replacement, you can type @kbd{C-x u}.
 800 This exits the @code{query-replace}.  If you want to do further
 801 replacement you must use @kbd{C-x ESC} to restart (@pxref{Repetition}).
 802
 803 @item @key{ESC}
 804 to exit without doing any more replacements.
 805
 806 @item .@: @r{(Period)}
 807 to replace this occurrence and then exit.
 808
 809 @item !
 810 to replace all remaining occurrences without asking again.
 811
 812 @item ^
 813 to go back to the location of the previous occurrence (or what used to
 814 be an occurrence), in case you changed it by mistake.  This works by
 815 popping the mark ring.  Only one @kbd{^} in a row is allowed, because
 816 only one previous replacement location is kept during @code{query-replace}.
 817
 818 @item C-r
 819 to enter a recursive editing level, in case the occurrence needs to be
 820 edited rather than just replaced with @var{newstring}.  When you are
 821 done, exit the recursive editing level with @kbd{C-M-c} and the next
 822 occurrence will be displayed.  @xref{Recursive Edit}.
 823
 824 @item C-w
 825 to delete the occurrence, and then enter a recursive editing level as
 826 in @kbd{C-r}.  Use the recursive edit to insert text to replace the
 827 deleted occurrence of @var{string}.  When done, exit the recursive
 828 editing level with @kbd{C-M-c} and the next occurrence will be
 829 displayed.
 830
 831 @item C-l
 832 to redisplay the screen and then give another answer.
 833
 834 @item C-h
 835 to display a message summarizing these options, then give another
 836 answer.
 837 @end table
 838
 839   If you type any other character, Emacs exits the @code{query-replace}, and
 840 executes the character as a command.  To restart the @code{query-replace},
 841 use @kbd{C-x @key{ESC}}, which repeats the @code{query-replace} because it
 842 used the minibuffer to read its arguments.  @xref{Repetition, C-x ESC}.
 843
 844 @node Other Repeating Search,, Replace, Search
 845 @section Other Search-and-Loop Commands
 846
 847   Here are some other commands that find matches for a regular expression.
 848 They all operate from point to the end of the buffer.
 849
 850 @findex list-matching-lines
 851 @findex occur
 852 @findex count-matches
 853 @findex delete-non-matching-lines
 854 @findex delete-matching-lines
 855 @c grosscommands
 856 @table @kbd
 857 @item M-x occur
 858 Print each line that follows point and contains a match for the
 859 specified regexp.  A numeric argument specifies the number of context
 860 lines to print before and after each matching line; the default is
 861 none.
 862
 863 @kindex C-c C-c (Occur mode)
 864 The buffer @samp{*Occur*} containing the output serves as a menu for
 865 finding occurrences in their original context.  Find an occurrence
 866 as listed in @samp{*Occur*}, position point there, and type @kbd{C-c
 867 C-c}; this switches to the buffer that was searched and moves point to
 868 the original of the same occurrence.
 869
 870 @item M-x list-matching-lines
 871 Synonym for @kbd{M-x occur}.
 872
 873 @item M-x count-matches
 874 Print the number of matches following point for the specified regexp.
 875
 876 @item M-x delete-non-matching-lines
 877 Delete each line that follows point and does not contain a match for
 878 the specified regexp.
 879
 880 @item M-x delete-matching-lines
 881 Delete each line that follows point and contains a match for the
 882 specified regexp.
 883 @end table