@@ -82,12 +82,18 @@
 then it shall be a well-formed UTF-8 code unit sequence and
 it is decoded to produce a sequence of UCS scalar values
 that constitutes the sequence of elements of the translation character set.
+In the resulting sequence,
+each pair of characters in the input sequence consisting of
+\unicode{000d}{carriage return} followed by \unicode{000a}{line feed},
+as well as each
+\unicode{000d}{carriage return} not immediately followed by a \unicode{000a}{line feed},
+is replaced by a single new-line character.
 
 For any other kind of input file supported by the implementation,
 characters are mapped, in an
 \impldef{mapping physical source file characters to translation character set} manner,
-to a sequence of translation character set elements\iref{lex.charset}
-(introducing new-line characters for end-of-line indicators).
+to a sequence of translation character set elements\iref{lex.charset},
+representing end-of-line indicators as new-line characters.
 
 \item
 \indextext{line splicing}%
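The line-ending rule added in the first hunk can be illustrated with a short C++ sketch. This is not the standard's wording or any implementation's actual code: the function name `normalize_line_endings` is hypothetical, the input is assumed to be already decoded into UCS scalar values, and the new-line character is assumed to be represented as U+000A.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Hypothetical helper, named for illustration only.
// Replaces each CR LF pair and each lone CR with a single new-line
// character (assumed here to be U+000A). All other characters,
// including a lone LF, are copied unchanged.
std::u32string normalize_line_endings(const std::u32string& input) {
    std::u32string out;
    out.reserve(input.size());
    for (std::size_t i = 0; i < input.size(); ++i) {
        if (input[i] == U'\r') {
            // CR immediately followed by LF: consume the LF as part of the pair.
            if (i + 1 < input.size() && input[i + 1] == U'\n') {
                ++i;
            }
            out.push_back(U'\n');
        } else {
            out.push_back(input[i]);
        }
    }
    return out;
}

// A few cases covering both branches of the rule.
void line_ending_examples() {
    assert(normalize_line_endings(U"a\r\nb") == U"a\nb");  // CR LF pair
    assert(normalize_line_endings(U"a\rb") == U"a\nb");    // lone CR
    assert(normalize_line_endings(U"a\nb") == U"a\nb");    // lone LF is untouched
}
```

Note that two consecutive carriage returns yield two new-line characters under this reading, since neither is immediately followed by a line feed.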
@@ -308,10 +314,7 @@
 
 \begin{bnf}
 \nontermdef{n-char} \textnormal{one of}\br
- \terminal{A B C D E F G H I J K L M N O P Q R S T U V W X Y Z}\br
- \terminal{0 1 2 3 4 5 6 7 8 9}\br
- \textnormal{\unicode{002d}{hyphen-minus}}\br
- \textnormal{\unicode{0020}{space}}
+ \textnormal{any member of the translation character set except the \unicode{007d}{right curly bracket} or new-line character}
 \end{bnf}
 
 \begin{bnf}
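To make the effect of the `n-char` change concrete, the old and new rules can be written as two predicates. This is a sketch under stated assumptions: both function names are hypothetical, a translation character set element is modeled as a Unicode scalar value, and the new-line character is assumed to be U+000A.

```cpp
#include <cassert>

// Hypothetical predicates, named for illustration only.

// Old rule: uppercase Latin letters, decimal digits, hyphen-minus, space.
constexpr bool is_n_char_old(char32_t c) {
    return (c >= U'A' && c <= U'Z') || (c >= U'0' && c <= U'9') ||
           c == U'-' || c == U' ';
}

// New rule: any translation character set element (modeled here as any
// Unicode scalar value) except right curly bracket and new-line.
constexpr bool is_n_char_new(char32_t c) {
    return c <= 0x10FFFF && !(c >= 0xD800 && c <= 0xDFFF) &&
           c != U'}' && c != U'\n';
}

// Cases where the two rules agree and where they differ.
void n_char_examples() {
    assert(is_n_char_old(U'A') && is_n_char_new(U'A'));     // accepted by both
    assert(!is_n_char_old(U'a') && is_n_char_new(U'a'));    // newly accepted
    assert(!is_n_char_new(U'}') && !is_n_char_new(U'\n'));  // still excluded
}
```

Under the new rule the grammar accepts far more characters than before, so rejecting a sequence that does not form a valid name presumably falls to a later check rather than to the grammar itself.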