Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1 | gitattributes(5) |
| 2 | ================ |
| 3 | |
| 4 | NAME |
| 5 | ---- |
Junio C Hamano | c9cb517 | 2018-06-01 07:13:44 | [diff] [blame] | 6 | gitattributes - Defining attributes per path |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 7 | |
| 8 | SYNOPSIS |
| 9 | -------- |
Junio C Hamano | 17bd227 | 2008-09-01 06:11:17 | [diff] [blame] | 10 | $GIT_DIR/info/attributes, .gitattributes |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 11 | |
| 12 | |
| 13 | DESCRIPTION |
| 14 | ----------- |
| 15 | |
| 16 | A `gitattributes` file is a simple text file that gives |
| 17 | `attributes` to pathnames. |
| 18 | |
| 19 | Each line in `gitattributes` file is of form: |
| 20 | |
Junio C Hamano | 6b7d215 | 2019-04-16 12:51:15 | [diff] [blame] | 21 | pattern attr1 attr2 ... |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 22 | |
Junio C Hamano | d533bdb | 2009-02-25 09:56:58 | [diff] [blame] | 23 | That is, a pattern followed by an attributes list, |
Junio C Hamano | cbe0eb2 | 2017-02-27 23:37:49 | [diff] [blame] | 24 | separated by whitespaces. Leading and trailing whitespaces are |
| 25 | ignored. Lines that begin with '#' are ignored. Patterns |
| 26 | that begin with a double quote are quoted in C style. |
| 27 | When the pattern matches the path in question, the attributes |
| 28 | listed on the line are given to the path. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 29 | |
| 30 | Each attribute can be in one of these states for a given path: |
| 31 | |
| 32 | Set:: |
| 33 | |
| 34 | The path has the attribute with special value "true"; |
| 35 | this is specified by listing only the name of the |
| 36 | attribute in the attribute list. |
| 37 | |
| 38 | Unset:: |
| 39 | |
| 40 | The path has the attribute with special value "false"; |
| 41 | this is specified by listing the name of the attribute |
| 42 | prefixed with a dash `-` in the attribute list. |
| 43 | |
| 44 | Set to a value:: |
| 45 | |
| 46 | The path has the attribute with specified string value; |
| 47 | this is specified by listing the name of the attribute |
| 48 | followed by an equal sign `=` and its value in the |
| 49 | attribute list. |
| 50 | |
| 51 | Unspecified:: |
| 52 | |
Junio C Hamano | d533bdb | 2009-02-25 09:56:58 | [diff] [blame] | 53 | No pattern matches the path, and nothing says if |
Junio C Hamano | 22572e7 | 2007-04-25 05:59:25 | [diff] [blame] | 54 | the path has or does not have the attribute, the |
| 55 | attribute for the path is said to be Unspecified. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 56 | |
Junio C Hamano | d533bdb | 2009-02-25 09:56:58 | [diff] [blame] | 57 | When more than one pattern matches the path, a later line |
Junio C Hamano | 22572e7 | 2007-04-25 05:59:25 | [diff] [blame] | 58 | overrides an earlier line. This overriding is done per |
Junio C Hamano | 8100016 | 2018-03-21 19:54:50 | [diff] [blame] | 59 | attribute. |
| 60 | |
| 61 | The rules by which the pattern matches paths are the same as in |
| 62 | `.gitignore` files (see linkgit:gitignore[5]), with a few exceptions: |
| 63 | |
| 64 | - negative patterns are forbidden |
| 65 | |
| 66 | - patterns that match a directory do not recursively match paths |
| 67 | inside that directory (so using the trailing-slash `path/` syntax is |
| 68 | pointless in an attributes file; use `path/**` instead) |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 69 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 70 | When deciding what attributes are assigned to a path, Git |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 71 | consults `$GIT_DIR/info/attributes` file (which has the highest |
| 72 | precedence), `.gitattributes` file in the same directory as the |
Junio C Hamano | b4a9ede | 2009-04-08 07:36:32 | [diff] [blame] | 73 | path in question, and its parent directories up to the toplevel of the |
| 74 | work tree (the further the directory that contains `.gitattributes` |
Junio C Hamano | 3b4609d | 2010-09-30 00:04:34 | [diff] [blame] | 75 | is from the path in question, the lower its precedence). Finally |
| 76 | global and system-wide files are considered (they have the lowest |
| 77 | precedence). |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 78 | |
Junio C Hamano | 6631a09 | 2012-10-10 22:50:57 | [diff] [blame] | 79 | When the `.gitattributes` file is missing from the work tree, the |
| 80 | path in the index is used as a fall-back. During checkout process, |
| 81 | `.gitattributes` in the index is used and then the file in the |
| 82 | working tree is used as a fall-back. |
| 83 | |
Junio C Hamano | 99dc24a | 2008-03-27 23:49:50 | [diff] [blame] | 84 | If you wish to affect only a single repository (i.e., to assign |
Junio C Hamano | 3b4609d | 2010-09-30 00:04:34 | [diff] [blame] | 85 | attributes to files that are particular to |
| 86 | one user's workflow for that repository), then |
Junio C Hamano | 99dc24a | 2008-03-27 23:49:50 | [diff] [blame] | 87 | attributes should be placed in the `$GIT_DIR/info/attributes` file. |
| 88 | Attributes which should be version-controlled and distributed to other |
| 89 | repositories (i.e., attributes of interest to all users) should go into |
Junio C Hamano | 3b4609d | 2010-09-30 00:04:34 | [diff] [blame] | 90 | `.gitattributes` files. Attributes that should affect all repositories |
| 91 | for a single user should be placed in a file specified by the |
Junio C Hamano | 322c624 | 2015-03-23 21:32:46 | [diff] [blame] | 92 | `core.attributesFile` configuration option (see linkgit:git-config[1]). |
Junio C Hamano | 8edb4c7 | 2012-07-09 20:33:55 | [diff] [blame] | 93 | Its default value is $XDG_CONFIG_HOME/git/attributes. If $XDG_CONFIG_HOME |
| 94 | is either not set or empty, $HOME/.config/git/attributes is used instead. |
Junio C Hamano | 3b4609d | 2010-09-30 00:04:34 | [diff] [blame] | 95 | Attributes for all users on a system should be placed in the |
| 96 | `$(prefix)/etc/gitattributes` file. |
Junio C Hamano | 99dc24a | 2008-03-27 23:49:50 | [diff] [blame] | 97 | |
Junio C Hamano | cbe0eb2 | 2017-02-27 23:37:49 | [diff] [blame] | 98 | Sometimes you would need to override a setting of an attribute |
Junio C Hamano | 86d8436 | 2011-08-04 00:21:51 | [diff] [blame] | 99 | for a path to `Unspecified` state. This can be done by listing |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 100 | the name of the attribute prefixed with an exclamation point `!`. |
| 101 | |
| 102 | |
| 103 | EFFECTS |
| 104 | ------- |
| 105 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 106 | Certain operations by Git can be influenced by assigning |
Junio C Hamano | 2a8f6dc | 2007-07-09 08:48:38 | [diff] [blame] | 107 | particular attributes to a path. Currently, the following |
| 108 | operations are attributes-aware. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 109 | |
| 110 | Checking-out and checking-in |
| 111 | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ |
| 112 | |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 113 | These attributes affect how the contents stored in the |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 114 | repository are copied to the working tree files when commands |
Junio C Hamano | c9f11c2 | 2019-07-10 02:54:04 | [diff] [blame] | 115 | such as 'git switch', 'git checkout' and 'git merge' run. |
| 116 | They also affect how |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 117 | Git stores the contents you prepare in the working tree in the |
Junio C Hamano | 1aa40d2 | 2010-01-21 17:46:43 | [diff] [blame] | 118 | repository upon 'git add' and 'git commit'. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 119 | |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 120 | `text` |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 121 | ^^^^^^ |
| 122 | |
Junio C Hamano | 7ab2d20 | 2023-05-10 18:59:24 | [diff] [blame] | 123 | This attribute marks the path as a text file, which enables end-of-line |
| 124 | conversion: When a matching file is added to the index, the file's line |
| 125 | endings are normalized to LF in the index. Conversely, when the file is |
| 126 | copied from the index to the working directory, its line endings may be |
| 127 | converted from LF to CRLF depending on the `eol` attribute, the Git |
| 128 | config, and the platform (see explanation of `eol` below). |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 129 | |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 130 | Set:: |
| 131 | |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 132 | Setting the `text` attribute on a path enables end-of-line |
Junio C Hamano | 7ab2d20 | 2023-05-10 18:59:24 | [diff] [blame] | 133 | conversion on checkin and checkout as described above. Line endings |
| 134 | are normalized to LF in the index every time the file is checked in, |
| 135 | even if the file was previously added to Git with CRLF line endings. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 136 | |
| 137 | Unset:: |
| 138 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 139 | Unsetting the `text` attribute on a path tells Git not to |
Junio C Hamano | fcb740a | 2008-08-31 06:32:05 | [diff] [blame] | 140 | attempt any end-of-line conversion upon checkin or checkout. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 141 | |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 142 | Set to string value "auto":: |
| 143 | |
Junio C Hamano | 7ab2d20 | 2023-05-10 18:59:24 | [diff] [blame] | 144 | When `text` is set to "auto", Git decides by itself whether the file |
| 145 | is text or binary. If it is text and the file was not already in |
| 146 | Git with CRLF endings, line endings are converted on checkin and |
| 147 | checkout as described above. Otherwise, no conversion is done on |
| 148 | checkin or checkout. |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 149 | |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 150 | Unspecified:: |
| 151 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 152 | If the `text` attribute is unspecified, Git uses the |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 153 | `core.autocrlf` configuration variable to determine if the |
| 154 | file should be converted. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 155 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 156 | Any other value causes Git to act as if `text` has been left |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 157 | unspecified. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 158 | |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 159 | `eol` |
| 160 | ^^^^^ |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 161 | |
Junio C Hamano | 7ab2d20 | 2023-05-10 18:59:24 | [diff] [blame] | 162 | This attribute marks a path to use a specific line-ending style in the |
| 163 | working tree when it is checked out. It has effect only if `text` or |
| 164 | `text=auto` is set (see above), but specifying `eol` automatically sets |
| 165 | `text` if `text` was left unspecified. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 166 | |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 167 | Set to string value "crlf":: |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 168 | |
Junio C Hamano | 7ab2d20 | 2023-05-10 18:59:24 | [diff] [blame] | 169 | This setting converts the file's line endings in the working |
| 170 | directory to CRLF when the file is checked out. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 171 | |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 172 | Set to string value "lf":: |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 173 | |
Junio C Hamano | 7ab2d20 | 2023-05-10 18:59:24 | [diff] [blame] | 174 | This setting uses the same line endings in the working directory as |
| 175 | in the index when the file is checked out. |
| 176 | |
| 177 | Unspecified:: |
| 178 | |
| 179 | If the `eol` attribute is unspecified for a file, its line endings |
| 180 | in the working directory are determined by the `core.autocrlf` or |
| 181 | `core.eol` configuration variable (see the definitions of those |
| 182 | options in linkgit:git-config[1]). If `text` is set but neither of |
| 183 | those variables is, the default is `eol=crlf` on Windows and |
| 184 | `eol=lf` on all other platforms. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 185 | |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 186 | Backwards compatibility with `crlf` attribute |
| 187 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
| 188 | |
| 189 | For backwards compatibility, the `crlf` attribute is interpreted as |
| 190 | follows: |
| 191 | |
| 192 | ------------------------ |
| 193 | crlf text |
| 194 | -crlf -text |
| 195 | crlf=input eol=lf |
| 196 | ------------------------ |
| 197 | |
| 198 | End-of-line conversion |
| 199 | ^^^^^^^^^^^^^^^^^^^^^^ |
| 200 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 201 | While Git normally leaves file contents alone, it can be configured to |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 202 | normalize line endings to LF in the repository and, optionally, to |
| 203 | convert them to CRLF when files are checked out. |
| 204 | |
Junio C Hamano | e7808d7 | 2016-08-26 21:48:01 | [diff] [blame] | 205 | If you simply want to have CRLF line endings in your working directory |
| 206 | regardless of the repository you are working with, you can set the |
| 207 | config variable "core.autocrlf" without using any attributes. |
| 208 | |
| 209 | ------------------------ |
| 210 | [core] |
| 211 | autocrlf = true |
| 212 | ------------------------ |
| 213 | |
| 214 | This does not force normalization of text files, but does ensure |
| 215 | that text files that you introduce to the repository have their line |
| 216 | endings normalized to LF when they are added, and that files that are |
| 217 | already normalized in the repository stay normalized. |
| 218 | |
| 219 | If you want to ensure that text files that any contributor introduces to |
| 220 | the repository have their line endings normalized, you can set the |
| 221 | `text` attribute to "auto" for _all_ files. |
| 222 | |
| 223 | ------------------------ |
| 224 | * text=auto |
| 225 | ------------------------ |
| 226 | |
| 227 | The attributes allow a fine-grained control, how the line endings |
| 228 | are converted. |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 229 | Here is an example that will make Git normalize .txt, .vcproj and .sh |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 230 | files, ensure that .vcproj files have CRLF and .sh files have LF in |
| 231 | the working directory, and prevent .jpg files from being normalized |
| 232 | regardless of their content. |
| 233 | |
| 234 | ------------------------ |
Junio C Hamano | eed15af | 2016-07-25 22:49:42 | [diff] [blame] | 235 | * text=auto |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 236 | *.txt text |
Junio C Hamano | eed15af | 2016-07-25 22:49:42 | [diff] [blame] | 237 | *.vcproj text eol=crlf |
| 238 | *.sh text eol=lf |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 239 | *.jpg -text |
| 240 | ------------------------ |
| 241 | |
Junio C Hamano | e7808d7 | 2016-08-26 21:48:01 | [diff] [blame] | 242 | NOTE: When `text=auto` conversion is enabled in a cross-platform |
| 243 | project using push and pull to a central repository the text files |
| 244 | containing CRLFs should be normalized. |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 245 | |
Junio C Hamano | e7808d7 | 2016-08-26 21:48:01 | [diff] [blame] | 246 | From a clean working directory: |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 247 | |
| 248 | ------------------------------------------------- |
Junio C Hamano | e7808d7 | 2016-08-26 21:48:01 | [diff] [blame] | 249 | $ echo "* text=auto" >.gitattributes |
Junio C Hamano | 4886c77 | 2017-11-27 03:33:28 | [diff] [blame] | 250 | $ git add --renormalize . |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 251 | $ git status # Show files that will be normalized |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 252 | $ git commit -m "Introduce end-of-line normalization" |
| 253 | ------------------------------------------------- |
| 254 | |
| 255 | If any files that should not be normalized show up in 'git status', |
| 256 | unset their `text` attribute before running 'git add -u'. |
| 257 | |
| 258 | ------------------------ |
| 259 | manual.pdf -text |
| 260 | ------------------------ |
| 261 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 262 | Conversely, text files that Git does not detect can have normalization |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 263 | enabled manually. |
| 264 | |
| 265 | ------------------------ |
| 266 | weirdchars.txt text |
| 267 | ------------------------ |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 268 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 269 | If `core.safecrlf` is set to "true" or "warn", Git verifies if |
Junio C Hamano | 26e590a | 2008-02-17 03:53:51 | [diff] [blame] | 270 | the conversion is reversible for the current setting of |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 271 | `core.autocrlf`. For "true", Git rejects irreversible |
| 272 | conversions; for "warn", Git only prints a warning but accepts |
Junio C Hamano | 26e590a | 2008-02-17 03:53:51 | [diff] [blame] | 273 | an irreversible conversion. The safety triggers to prevent such |
| 274 | a conversion done to the files in the work tree, but there are a |
| 275 | few exceptions. Even though... |
| 276 | |
Junio C Hamano | 1aa40d2 | 2010-01-21 17:46:43 | [diff] [blame] | 277 | - 'git add' itself does not touch the files in the work tree, the |
Junio C Hamano | 26e590a | 2008-02-17 03:53:51 | [diff] [blame] | 278 | next checkout would, so the safety triggers; |
| 279 | |
Junio C Hamano | 1aa40d2 | 2010-01-21 17:46:43 | [diff] [blame] | 280 | - 'git apply' to update a text file with a patch does touch the files |
Junio C Hamano | 26e590a | 2008-02-17 03:53:51 | [diff] [blame] | 281 | in the work tree, but the operation is about text files and CRLF |
| 282 | conversion is about fixing the line ending inconsistencies, so the |
| 283 | safety does not trigger; |
| 284 | |
Junio C Hamano | 1aa40d2 | 2010-01-21 17:46:43 | [diff] [blame] | 285 | - 'git diff' itself does not touch the files in the work tree, it is |
| 286 | often run to inspect the changes you intend to next 'git add'. To |
Junio C Hamano | 26e590a | 2008-02-17 03:53:51 | [diff] [blame] | 287 | catch potential problems early, safety triggers. |
| 288 | |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 289 | |
Junio C Hamano | ccb8252 | 2018-05-08 07:52:09 | [diff] [blame] | 290 | `working-tree-encoding` |
| 291 | ^^^^^^^^^^^^^^^^^^^^^^^ |
| 292 | |
| 293 | Git recognizes files encoded in ASCII or one of its supersets (e.g. |
| 294 | UTF-8, ISO-8859-1, ...) as text files. Files encoded in certain other |
| 295 | encodings (e.g. UTF-16) are interpreted as binary and consequently |
| 296 | built-in Git text processing tools (e.g. 'git diff') as well as most Git |
| 297 | web front ends do not visualize the contents of these files by default. |
| 298 | |
| 299 | In these cases you can tell Git the encoding of a file in the working |
| 300 | directory with the `working-tree-encoding` attribute. If a file with this |
Junio C Hamano | 8ef91f3 | 2019-12-01 22:58:27 | [diff] [blame] | 301 | attribute is added to Git, then Git re-encodes the content from the |
Junio C Hamano | ccb8252 | 2018-05-08 07:52:09 | [diff] [blame] | 302 | specified encoding to UTF-8. Finally, Git stores the UTF-8 encoded |
| 303 | content in its internal data structure (called "the index"). On checkout |
Junio C Hamano | 8ef91f3 | 2019-12-01 22:58:27 | [diff] [blame] | 304 | the content is re-encoded back to the specified encoding. |
Junio C Hamano | ccb8252 | 2018-05-08 07:52:09 | [diff] [blame] | 305 | |
| 306 | Please note that using the `working-tree-encoding` attribute may have a |
| 307 | number of pitfalls: |
| 308 | |
| 309 | - Alternative Git implementations (e.g. JGit or libgit2) and older Git |
| 310 | versions (as of March 2018) do not support the `working-tree-encoding` |
| 311 | attribute. If you decide to use the `working-tree-encoding` attribute |
| 312 | in your repository, then it is strongly recommended to ensure that all |
| 313 | clients working with the repository support it. |
Junio C Hamano | 94d00a5 | 2018-10-30 07:34:40 | [diff] [blame] | 314 | + |
| 315 | For example, Microsoft Visual Studio resources files (`*.rc`) or |
| 316 | PowerShell script files (`*.ps1`) are sometimes encoded in UTF-16. |
| 317 | If you declare `*.ps1` as files as UTF-16 and you add `foo.ps1` with |
| 318 | a `working-tree-encoding` enabled Git client, then `foo.ps1` will be |
| 319 | stored as UTF-8 internally. A client without `working-tree-encoding` |
| 320 | support will checkout `foo.ps1` as UTF-8 encoded file. This will |
| 321 | typically cause trouble for the users of this file. |
| 322 | + |
Junio C Hamano | 6b7d215 | 2019-04-16 12:51:15 | [diff] [blame] | 323 | If a Git client that does not support the `working-tree-encoding` |
| 324 | attribute adds a new file `bar.ps1`, then `bar.ps1` will be |
Junio C Hamano | 94d00a5 | 2018-10-30 07:34:40 | [diff] [blame] | 325 | stored "as-is" internally (in this example probably as UTF-16). |
| 326 | A client with `working-tree-encoding` support will interpret the |
| 327 | internal contents as UTF-8 and try to convert it to UTF-16 on checkout. |
| 328 | That operation will fail and cause an error. |
Junio C Hamano | ccb8252 | 2018-05-08 07:52:09 | [diff] [blame] | 329 | |
| 330 | - Reencoding content to non-UTF encodings can cause errors as the |
| 331 | conversion might not be UTF-8 round trip safe. If you suspect your |
| 332 | encoding to not be round trip safe, then add it to |
| 333 | `core.checkRoundtripEncoding` to make Git check the round trip |
| 334 | encoding (see linkgit:git-config[1]). SHIFT-JIS (Japanese character |
| 335 | set) is known to have round trip issues with UTF-8 and is checked by |
| 336 | default. |
| 337 | |
| 338 | - Reencoding content requires resources that might slow down certain |
| 339 | Git operations (e.g 'git checkout' or 'git add'). |
| 340 | |
| 341 | Use the `working-tree-encoding` attribute only if you cannot store a file |
| 342 | in UTF-8 encoding and if you want Git to be able to process the content |
| 343 | as text. |
| 344 | |
| 345 | As an example, use the following attributes if your '*.ps1' files are |
| 346 | UTF-16 encoded with byte order mark (BOM) and you want Git to perform |
| 347 | automatic line ending conversion based on your platform. |
| 348 | |
| 349 | ------------------------ |
| 350 | *.ps1 text working-tree-encoding=UTF-16 |
| 351 | ------------------------ |
| 352 | |
| 353 | Use the following attributes if your '*.ps1' files are UTF-16 little |
| 354 | endian encoded without BOM and you want Git to use Windows line endings |
Junio C Hamano | decc731 | 2019-03-11 09:02:54 | [diff] [blame] | 355 | in the working directory (use `UTF-16LE-BOM` instead of `UTF-16LE` if |
Junio C Hamano | 23168ef | 2019-02-07 07:20:47 | [diff] [blame] | 356 | you want UTF-16 little endian with BOM). |
| 357 | Please note, it is highly recommended to |
Junio C Hamano | ccb8252 | 2018-05-08 07:52:09 | [diff] [blame] | 358 | explicitly define the line endings with `eol` if the `working-tree-encoding` |
| 359 | attribute is used to avoid ambiguity. |
| 360 | |
| 361 | ------------------------ |
| 362 | *.ps1 text working-tree-encoding=UTF-16LE eol=CRLF |
| 363 | ------------------------ |
| 364 | |
| 365 | You can get a list of all available encodings on your platform with the |
| 366 | following command: |
| 367 | |
| 368 | ------------------------ |
| 369 | iconv --list |
| 370 | ------------------------ |
| 371 | |
| 372 | If you do not know the encoding of a file, then you can use the `file` |
| 373 | command to guess the encoding: |
| 374 | |
| 375 | ------------------------ |
| 376 | file foo.ps1 |
| 377 | ------------------------ |
| 378 | |
| 379 | |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 380 | `ident` |
| 381 | ^^^^^^^ |
| 382 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 383 | When the attribute `ident` is set for a path, Git replaces |
Junio C Hamano | 6f1bbb7 | 2008-10-31 11:32:55 | [diff] [blame] | 384 | `$Id$` in the blob object with `$Id:`, followed by the |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 385 | 40-character hexadecimal blob object name, followed by a dollar |
| 386 | sign `$` upon checkout. Any byte sequence that begins with |
Junio C Hamano | 0430e3a | 2007-05-15 03:13:17 | [diff] [blame] | 387 | `$Id:` and ends with `$` in the worktree file is replaced |
| 388 | with `$Id$` upon check-in. |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 389 | |
| 390 | |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 391 | `filter` |
| 392 | ^^^^^^^^ |
| 393 | |
Junio C Hamano | 361c133 | 2007-11-14 12:17:22 | [diff] [blame] | 394 | A `filter` attribute can be set to a string value that names a |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 395 | filter driver specified in the configuration. |
| 396 | |
Junio C Hamano | 361c133 | 2007-11-14 12:17:22 | [diff] [blame] | 397 | A filter driver consists of a `clean` command and a `smudge` |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 398 | command, either of which can be left unspecified. Upon |
Junio C Hamano | 361c133 | 2007-11-14 12:17:22 | [diff] [blame] | 399 | checkout, when the `smudge` command is specified, the command is |
| 400 | fed the blob object from its standard input, and its standard |
| 401 | output is used to update the worktree file. Similarly, the |
| 402 | `clean` command is used to convert the contents of worktree file |
Junio C Hamano | f2f28b1 | 2016-10-31 21:41:58 | [diff] [blame] | 403 | upon checkin. By default these commands process only a single |
| 404 | blob and terminate. If a long running `process` filter is used |
| 405 | in place of `clean` and/or `smudge` filters, then Git can process |
| 406 | all blobs with a single filter command invocation for the entire |
| 407 | life of a single Git command, for example `git add --all`. If a |
| 408 | long running `process` filter is configured then it always takes |
| 409 | precedence over a configured single blob filter. See section |
| 410 | below for the description of the protocol used to communicate with |
| 411 | a `process` filter. |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 412 | |
Junio C Hamano | 90a5b7b | 2012-02-28 23:40:31 | [diff] [blame] | 413 | One use of the content filtering is to massage the content into a shape |
| 414 | that is more convenient for the platform, filesystem, and the user to use. |
| 415 | For this mode of operation, the key phrase here is "more convenient" and |
| 416 | not "turning something unusable into usable". In other words, the intent |
| 417 | is that if someone unsets the filter driver definition, or does not have |
| 418 | the appropriate filter program, the project should still be usable. |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 419 | |
Junio C Hamano | 90a5b7b | 2012-02-28 23:40:31 | [diff] [blame] | 420 | Another use of the content filtering is to store the content that cannot |
| 421 | be directly used in the repository (e.g. a UUID that refers to the true |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 422 | content stored outside Git, or an encrypted content) and turn it into a |
Junio C Hamano | 90a5b7b | 2012-02-28 23:40:31 | [diff] [blame] | 423 | usable form upon checkout (e.g. download the external content, or decrypt |
| 424 | the encrypted content). |
| 425 | |
| 426 | These two filters behave differently, and by default, a filter is taken as |
| 427 | the former, massaging the contents into more convenient shape. A missing |
| 428 | filter driver definition in the config, or a filter driver that exits with |
| 429 | a non-zero status, is not an error but makes the filter a no-op passthru. |
| 430 | |
| 431 | You can declare that a filter turns a content that by itself is unusable |
| 432 | into a usable content by setting the filter.<driver>.required configuration |
| 433 | variable to `true`. |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 434 | |
Junio C Hamano | 4886c77 | 2017-11-27 03:33:28 | [diff] [blame] | 435 | Note: Whenever the clean filter is changed, the repo should be renormalized: |
| 436 | $ git add --renormalize . |
| 437 | |
Junio C Hamano | 6a548ef | 2009-12-17 00:16:20 | [diff] [blame] | 438 | For example, in .gitattributes, you would assign the `filter` |
| 439 | attribute for paths. |
| 440 | |
| 441 | ------------------------ |
| 442 | *.c filter=indent |
| 443 | ------------------------ |
| 444 | |
| 445 | Then you would define a "filter.indent.clean" and "filter.indent.smudge" |
| 446 | configuration in your .git/config to specify a pair of commands to |
| 447 | modify the contents of C programs when the source files are checked |
| 448 | in ("clean" is run) and checked out (no change is made because the |
| 449 | command is "cat"). |
| 450 | |
| 451 | ------------------------ |
| 452 | [filter "indent"] |
| 453 | clean = indent |
| 454 | smudge = cat |
| 455 | ------------------------ |
| 456 | |
Junio C Hamano | 2db3e75 | 2010-09-03 21:33:06 | [diff] [blame] | 457 | For best results, `clean` should not alter its output further if it is |
| 458 | run twice ("clean->clean" should be equivalent to "clean"), and |
| 459 | multiple `smudge` commands should not alter `clean`'s output |
| 460 | ("smudge->smudge->clean" should be equivalent to "clean"). See the |
| 461 | section on merging below. |
| 462 | |
| 463 | The "indent" filter is well-behaved in this regard: it will not modify |
| 464 | input that is already correctly indented. In this case, the lack of a |
| 465 | smudge filter means that the clean filter _must_ accept its own output |
| 466 | without modifying it. |
| 467 | |
Junio C Hamano | 90a5b7b | 2012-02-28 23:40:31 | [diff] [blame] | 468 | If a filter _must_ succeed in order to make the stored contents usable, |
| 469 | you can declare that the filter is `required`, in the configuration: |
| 470 | |
| 471 | ------------------------ |
| 472 | [filter "crypt"] |
| 473 | clean = openssl enc ... |
| 474 | smudge = openssl enc -d ... |
| 475 | required |
| 476 | ------------------------ |
| 477 | |
Junio C Hamano | e0552e4 | 2011-01-06 00:34:47 | [diff] [blame] | 478 | Sequence "%f" on the filter command line is replaced with the name of |
| 479 | the file the filter is working on. A filter might use this in keyword |
| 480 | substitution. For example: |
| 481 | |
| 482 | ------------------------ |
| 483 | [filter "p4"] |
| 484 | clean = git-p4-filter --clean %f |
| 485 | smudge = git-p4-filter --smudge %f |
| 486 | ------------------------ |
| 487 | |
Junio C Hamano | e548d1b | 2016-08-08 22:26:49 | [diff] [blame] | 488 | Note that "%f" is the name of the path that is being worked on. Depending |
| 489 | on the version that is being filtered, the corresponding file on disk may |
| 490 | not exist, or may have different contents. So, smudge and clean commands |
| 491 | should not try to access the file on disk, but only act as filters on the |
| 492 | content provided to them on standard input. |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 493 | |
Junio C Hamano | f2f28b1 | 2016-10-31 21:41:58 | [diff] [blame] | 494 | Long Running Filter Process |
| 495 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
| 496 | |
| 497 | If the filter command (a string value) is defined via |
| 498 | `filter.<driver>.process` then Git can process all blobs with a |
| 499 | single filter invocation for the entire life of a single Git |
Junio C Hamano | 640779d | 2018-02-14 01:29:14 | [diff] [blame] | 500 | command. This is achieved by using the long-running process protocol |
| 501 | (described in technical/long-running-process-protocol.txt). |
Junio C Hamano | f2f28b1 | 2016-10-31 21:41:58 | [diff] [blame] | 502 | |
Junio C Hamano | 640779d | 2018-02-14 01:29:14 | [diff] [blame] | 503 | When Git encounters the first file that needs to be cleaned or smudged, |
| 504 | it starts the filter and performs the handshake. In the handshake, the |
| 505 | welcome message sent by Git is "git-filter-client", only version 2 is |
Junio C Hamano | 8ef91f3 | 2019-12-01 22:58:27 | [diff] [blame] | 506 | supported, and the supported capabilities are "clean", "smudge", and |
Junio C Hamano | 640779d | 2018-02-14 01:29:14 | [diff] [blame] | 507 | "delay". |
Junio C Hamano | f2f28b1 | 2016-10-31 21:41:58 | [diff] [blame] | 508 | |
| 509 | Afterwards Git sends a list of "key=value" pairs terminated with |
| 510 | a flush packet. The list will contain at least the filter command |
| 511 | (based on the supported capabilities) and the pathname of the file |
| 512 | to filter relative to the repository root. Right after the flush packet |
| 513 | Git sends the content split in zero or more pkt-line packets and a |
| 514 | flush packet to terminate content. Please note, that the filter |
| 515 | must not send any response before it received the content and the |
Junio C Hamano | 9e35abf | 2016-12-27 22:37:27 | [diff] [blame] | 516 | final flush packet. Also note that the "value" of a "key=value" pair |
| 517 | can contain the "=" character whereas the key would never contain |
| 518 | that character. |
Junio C Hamano | f2f28b1 | 2016-10-31 21:41:58 | [diff] [blame] | 519 | ------------------------ |
| 520 | packet: git> command=smudge |
| 521 | packet: git> pathname=path/testfile.dat |
| 522 | packet: git> 0000 |
| 523 | packet: git> CONTENT |
| 524 | packet: git> 0000 |
| 525 | ------------------------ |
| 526 | |
| 527 | The filter is expected to respond with a list of "key=value" pairs |
| 528 | terminated with a flush packet. If the filter does not experience |
| 529 | problems then the list must contain a "success" status. Right after |
| 530 | these packets the filter is expected to send the content in zero |
| 531 | or more pkt-line packets and a flush packet at the end. Finally, a |
| 532 | second list of "key=value" pairs terminated with a flush packet |
| 533 | is expected. The filter can change the status in the second list |
| 534 | or keep the status as is with an empty list. Please note that the |
| 535 | empty list must be terminated with a flush packet regardless. |
| 536 | |
| 537 | ------------------------ |
| 538 | packet: git< status=success |
| 539 | packet: git< 0000 |
| 540 | packet: git< SMUDGED_CONTENT |
| 541 | packet: git< 0000 |
| 542 | packet: git< 0000 # empty list, keep "status=success" unchanged! |
| 543 | ------------------------ |
| 544 | |
| 545 | If the result content is empty then the filter is expected to respond |
| 546 | with a "success" status and a flush packet to signal the empty content. |
| 547 | ------------------------ |
| 548 | packet: git< status=success |
| 549 | packet: git< 0000 |
| 550 | packet: git< 0000 # empty content! |
| 551 | packet: git< 0000 # empty list, keep "status=success" unchanged! |
| 552 | ------------------------ |
| 553 | |
| 554 | In case the filter cannot or does not want to process the content, |
| 555 | it is expected to respond with an "error" status. |
| 556 | ------------------------ |
| 557 | packet: git< status=error |
| 558 | packet: git< 0000 |
| 559 | ------------------------ |
| 560 | |
| 561 | If the filter experiences an error during processing, then it can |
| 562 | send the status "error" after the content was (partially or |
| 563 | completely) sent. |
| 564 | ------------------------ |
| 565 | packet: git< status=success |
| 566 | packet: git< 0000 |
| 567 | packet: git< HALF_WRITTEN_ERRONEOUS_CONTENT |
| 568 | packet: git< 0000 |
| 569 | packet: git< status=error |
| 570 | packet: git< 0000 |
| 571 | ------------------------ |
| 572 | |
| 573 | In case the filter cannot or does not want to process the content |
| 574 | as well as any future content for the lifetime of the Git process, |
| 575 | then it is expected to respond with an "abort" status at any point |
| 576 | in the protocol. |
| 577 | ------------------------ |
| 578 | packet: git< status=abort |
| 579 | packet: git< 0000 |
| 580 | ------------------------ |
| 581 | |
| 582 | Git neither stops nor restarts the filter process in case the |
| 583 | "error"/"abort" status is set. However, Git sets its exit code |
| 584 | according to the `filter.<driver>.required` flag, mimicking the |
| 585 | behavior of the `filter.<driver>.clean` / `filter.<driver>.smudge` |
| 586 | mechanism. |
| 587 | |
| 588 | If the filter dies during the communication or does not adhere to |
| 589 | the protocol then Git will stop the filter process and restart it |
| 590 | with the next file that needs to be processed. Depending on the |
| 591 | `filter.<driver>.required` flag Git will interpret that as error. |
| 592 | |
Junio C Hamano | 4b2a6c1 | 2017-08-22 18:32:37 | [diff] [blame] | 593 | Delay |
| 594 | ^^^^^ |
| 595 | |
| 596 | If the filter supports the "delay" capability, then Git can send the |
| 597 | flag "can-delay" after the filter command and pathname. This flag |
| 598 | denotes that the filter can delay filtering the current blob (e.g. to |
| 599 | compensate network latencies) by responding with no content but with |
| 600 | the status "delayed" and a flush packet. |
| 601 | ------------------------ |
| 602 | packet: git> command=smudge |
| 603 | packet: git> pathname=path/testfile.dat |
| 604 | packet: git> can-delay=1 |
| 605 | packet: git> 0000 |
| 606 | packet: git> CONTENT |
| 607 | packet: git> 0000 |
| 608 | packet: git< status=delayed |
| 609 | packet: git< 0000 |
| 610 | ------------------------ |
| 611 | |
| 612 | If the filter supports the "delay" capability then it must support the |
| 613 | "list_available_blobs" command. If Git sends this command, then the |
| 614 | filter is expected to return a list of pathnames representing blobs |
| 615 | that have been delayed earlier and are now available. |
| 616 | The list must be terminated with a flush packet followed |
| 617 | by a "success" status that is also terminated with a flush packet. If |
| 618 | no blobs for the delayed paths are available, yet, then the filter is |
| 619 | expected to block the response until at least one blob becomes |
| 620 | available. The filter can tell Git that it has no more delayed blobs |
| 621 | by sending an empty list. As soon as the filter responds with an empty |
| 622 | list, Git stops asking. All blobs that Git has not received at this |
| 623 | point are considered missing and will result in an error. |
| 624 | |
| 625 | ------------------------ |
| 626 | packet: git> command=list_available_blobs |
| 627 | packet: git> 0000 |
| 628 | packet: git< pathname=path/testfile.dat |
| 629 | packet: git< pathname=path/otherfile.dat |
| 630 | packet: git< 0000 |
| 631 | packet: git< status=success |
| 632 | packet: git< 0000 |
| 633 | ------------------------ |
| 634 | |
| 635 | After Git received the pathnames, it will request the corresponding |
| 636 | blobs again. These requests contain a pathname and an empty content |
| 637 | section. The filter is expected to respond with the smudged content |
| 638 | in the usual way as explained above. |
| 639 | ------------------------ |
| 640 | packet: git> command=smudge |
| 641 | packet: git> pathname=path/testfile.dat |
| 642 | packet: git> 0000 |
| 643 | packet: git> 0000 # empty content! |
| 644 | packet: git< status=success |
| 645 | packet: git< 0000 |
| 646 | packet: git< SMUDGED_CONTENT |
| 647 | packet: git< 0000 |
| 648 | packet: git< 0000 # empty list, keep "status=success" unchanged! |
| 649 | ------------------------ |
| 650 | |
| 651 | Example |
| 652 | ^^^^^^^ |
| 653 | |
Junio C Hamano | f2f28b1 | 2016-10-31 21:41:58 | [diff] [blame] | 654 | A long running filter demo implementation can be found in |
| 655 | `contrib/long-running-filter/example.pl` located in the Git |
| 656 | core repository. If you develop your own long running filter |
| 657 | process then the `GIT_TRACE_PACKET` environment variables can be |
| 658 | very helpful for debugging (see linkgit:git[1]). |
| 659 | |
| 660 | Please note that you cannot use an existing `filter.<driver>.clean` |
| 661 | or `filter.<driver>.smudge` command with `filter.<driver>.process` |
| 662 | because the former two use a different inter process communication |
| 663 | protocol than the latter one. |
| 664 | |
| 665 | |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 666 | Interaction between checkin/checkout attributes |
| 667 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
| 668 | |
| 669 | In the check-in codepath, the worktree file is first converted |
| 670 | with `filter` driver (if specified and corresponding driver |
| 671 | defined), then the result is processed with `ident` (if |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 672 | specified), and then finally with `text` (again, if specified |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 673 | and applicable). |
| 674 | |
| 675 | In the check-out codepath, the blob content is first converted |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 676 | with `text`, and then `ident` and fed to `filter`. |
Junio C Hamano | 469d60e | 2007-04-29 18:30:34 | [diff] [blame] | 677 | |
| 678 | |
Junio C Hamano | 2db3e75 | 2010-09-03 21:33:06 | [diff] [blame] | 679 | Merging branches with differing checkin/checkout attributes |
| 680 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
| 681 | |
| 682 | If you have added attributes to a file that cause the canonical |
| 683 | repository format for that file to change, such as adding a |
| 684 | clean/smudge filter or text/eol/ident attributes, merging anything |
| 685 | where the attribute is not in place would normally cause merge |
| 686 | conflicts. |
| 687 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 688 | To prevent these unnecessary merge conflicts, Git can be told to run a |
Junio C Hamano | 2db3e75 | 2010-09-03 21:33:06 | [diff] [blame] | 689 | virtual check-out and check-in of all three stages of a file when |
| 690 | resolving a three-way merge by setting the `merge.renormalize` |
| 691 | configuration variable. This prevents changes caused by check-in |
| 692 | conversion from causing spurious merge conflicts when a converted file |
| 693 | is merged with an unconverted file. |
| 694 | |
| 695 | As long as a "smudge->clean" results in the same output as a "clean" |
| 696 | even on files that are already smudged, this strategy will |
| 697 | automatically resolve all filter-related conflicts. Filters that do |
| 698 | not act in this way may cause additional merge conflicts that must be |
| 699 | resolved manually. |
| 700 | |
| 701 | |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 702 | Generating diff text |
| 703 | ~~~~~~~~~~~~~~~~~~~~ |
| 704 | |
Junio C Hamano | f7e47ee | 2008-11-02 18:16:45 | [diff] [blame] | 705 | `diff` |
| 706 | ^^^^^^ |
| 707 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 708 | The attribute `diff` affects how Git generates diffs for particular |
| 709 | files. It can tell Git whether to generate a textual patch for the path |
Junio C Hamano | 2836349 | 2008-11-14 08:26:31 | [diff] [blame] | 710 | or to treat the path as a binary file. It can also affect what line is |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 711 | shown on the hunk header `@@ -k,l +n,m @@` line, tell Git to use an |
| 712 | external command to generate the diff, or ask Git to convert binary |
Junio C Hamano | 2836349 | 2008-11-14 08:26:31 | [diff] [blame] | 713 | files to a text format before generating the diff. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 714 | |
| 715 | Set:: |
| 716 | |
| 717 | A path to which the `diff` attribute is set is treated |
| 718 | as text, even when they contain byte values that |
| 719 | normally never appear in text files, such as NUL. |
| 720 | |
| 721 | Unset:: |
| 722 | |
| 723 | A path to which the `diff` attribute is unset will |
Junio C Hamano | 2836349 | 2008-11-14 08:26:31 | [diff] [blame] | 724 | generate `Binary files differ` (or a binary patch, if |
| 725 | binary patches are enabled). |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 726 | |
| 727 | Unspecified:: |
| 728 | |
| 729 | A path to which the `diff` attribute is unspecified |
| 730 | first gets its contents inspected, and if it looks like |
Junio C Hamano | 8e61f1f | 2014-09-11 21:53:45 | [diff] [blame] | 731 | text and is smaller than core.bigFileThreshold, it is treated |
| 732 | as text. Otherwise it would generate `Binary files differ`. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 733 | |
Junio C Hamano | 9db95ca | 2007-04-23 07:27:05 | [diff] [blame] | 734 | String:: |
| 735 | |
Junio C Hamano | 2836349 | 2008-11-14 08:26:31 | [diff] [blame] | 736 | Diff is shown using the specified diff driver. Each driver may |
| 737 | specify one or more options, as described in the following |
| 738 | section. The options for the diff driver "foo" are defined |
| 739 | by the configuration variables in the "diff.foo" section of the |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 740 | Git config file. |
Junio C Hamano | 9db95ca | 2007-04-23 07:27:05 | [diff] [blame] | 741 | |
| 742 | |
Junio C Hamano | 2836349 | 2008-11-14 08:26:31 | [diff] [blame] | 743 | Defining an external diff driver |
| 744 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
Junio C Hamano | 9db95ca | 2007-04-23 07:27:05 | [diff] [blame] | 745 | |
| 746 | The definition of a diff driver is done in `gitconfig`, not |
| 747 | `gitattributes` file, so strictly speaking this manual page is a |
| 748 | wrong place to talk about it. However... |
| 749 | |
Junio C Hamano | 2836349 | 2008-11-14 08:26:31 | [diff] [blame] | 750 | To define an external diff driver `jcdiff`, add a section to your |
Junio C Hamano | 9db95ca | 2007-04-23 07:27:05 | [diff] [blame] | 751 | `$GIT_DIR/config` file (or `$HOME/.gitconfig` file) like this: |
| 752 | |
| 753 | ---------------------------------------------------------------- |
| 754 | [diff "jcdiff"] |
| 755 | command = j-c-diff |
| 756 | ---------------------------------------------------------------- |
| 757 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 758 | When Git needs to show you a diff for the path with `diff` |
Junio C Hamano | 9db95ca | 2007-04-23 07:27:05 | [diff] [blame] | 759 | attribute set to `jcdiff`, it calls the command you specified |
| 760 | with the above configuration, i.e. `j-c-diff`, with 7 |
| 761 | parameters, just like `GIT_EXTERNAL_DIFF` program is called. |
Junio C Hamano | f7c042d | 2008-06-06 22:50:53 | [diff] [blame] | 762 | See linkgit:git[1] for details. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 763 | |
Junio C Hamano | 9813725 | 2023-02-27 18:49:30 | [diff] [blame] | 764 | Setting the internal diff algorithm |
| 765 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
| 766 | |
| 767 | The diff algorithm can be set through the `diff.algorithm` config key, but |
| 768 | sometimes it may be helpful to set the diff algorithm per path. For example, |
| 769 | one may want to use the `minimal` diff algorithm for .json files, and the |
| 770 | `histogram` for .c files, and so on without having to pass in the algorithm |
| 771 | through the command line each time. |
| 772 | |
| 773 | First, in `.gitattributes`, assign the `diff` attribute for paths. |
| 774 | |
| 775 | ------------------------ |
| 776 | *.json diff=<name> |
| 777 | ------------------------ |
| 778 | |
| 779 | Then, define a "diff.<name>.algorithm" configuration to specify the diff |
| 780 | algorithm, choosing from `myers`, `patience`, `minimal`, or `histogram`. |
| 781 | |
| 782 | ---------------------------------------------------------------- |
| 783 | [diff "<name>"] |
| 784 | algorithm = histogram |
| 785 | ---------------------------------------------------------------- |
| 786 | |
| 787 | This diff algorithm applies to user facing diff output like git-diff(1), |
| 788 | git-show(1) and is used for the `--stat` output as well. The merge machinery |
| 789 | will not use the diff algorithm set through this method. |
| 790 | |
| 791 | NOTE: If `diff.<name>.command` is defined for path with the |
| 792 | `diff=<name>` attribute, it is executed as an external diff driver |
| 793 | (see above), and adding `diff.<name>.algorithm` has no effect, as the |
| 794 | algorithm is not passed to the external diff driver. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 795 | |
Junio C Hamano | 2a8f6dc | 2007-07-09 08:48:38 | [diff] [blame] | 796 | Defining a custom hunk-header |
| 797 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
| 798 | |
Junio C Hamano | 5e4d8de | 2008-09-19 06:27:02 | [diff] [blame] | 799 | Each group of changes (called a "hunk") in the textual diff output |
Junio C Hamano | 2a8f6dc | 2007-07-09 08:48:38 | [diff] [blame] | 800 | is prefixed with a line of the form: |
| 801 | |
| 802 | @@ -k,l +n,m @@ TEXT |
| 803 | |
Junio C Hamano | 5e4d8de | 2008-09-19 06:27:02 | [diff] [blame] | 804 | This is called a 'hunk header'. The "TEXT" portion is by default a line |
| 805 | that begins with an alphabet, an underscore or a dollar sign; this |
| 806 | matches what GNU 'diff -p' output uses. This default selection however |
| 807 | is not suited for some contents, and you can use a customized pattern |
| 808 | to make a selection. |
Junio C Hamano | 2a8f6dc | 2007-07-09 08:48:38 | [diff] [blame] | 809 | |
Junio C Hamano | 5e4d8de | 2008-09-19 06:27:02 | [diff] [blame] | 810 | First, in .gitattributes, you would assign the `diff` attribute |
Junio C Hamano | 2a8f6dc | 2007-07-09 08:48:38 | [diff] [blame] | 811 | for paths. |
| 812 | |
| 813 | ------------------------ |
| 814 | *.tex diff=tex |
| 815 | ------------------------ |
| 816 | |
Junio C Hamano | a476efa | 2008-10-10 15:31:42 | [diff] [blame] | 817 | Then, you would define a "diff.tex.xfuncname" configuration to |
Junio C Hamano | 2a8f6dc | 2007-07-09 08:48:38 | [diff] [blame] | 818 | specify a regular expression that matches a line that you would |
Junio C Hamano | 3bd66e9 | 2009-04-18 06:17:56 | [diff] [blame] | 819 | want to appear as the hunk header "TEXT". Add a section to your |
| 820 | `$GIT_DIR/config` file (or `$HOME/.gitconfig` file) like this: |
Junio C Hamano | 2a8f6dc | 2007-07-09 08:48:38 | [diff] [blame] | 821 | |
| 822 | ------------------------ |
| 823 | [diff "tex"] |
Junio C Hamano | a476efa | 2008-10-10 15:31:42 | [diff] [blame] | 824 | xfuncname = "^(\\\\(sub)*section\\{.*)$" |
Junio C Hamano | 2a8f6dc | 2007-07-09 08:48:38 | [diff] [blame] | 825 | ------------------------ |
| 826 | |
| 827 | Note. A single level of backslashes are eaten by the |
| 828 | configuration file parser, so you would need to double the |
| 829 | backslashes; the pattern above picks a line that begins with a |
Junio C Hamano | a638742 | 2007-08-25 03:54:27 | [diff] [blame] | 830 | backslash, and zero or more occurrences of `sub` followed by |
Junio C Hamano | 2a8f6dc | 2007-07-09 08:48:38 | [diff] [blame] | 831 | `section` followed by open brace, to the end of line. |
| 832 | |
| 833 | There are a few built-in patterns to make this easier, and `tex` |
| 834 | is one of them, so you do not have to write the above in your |
| 835 | configuration file (you still need to enable this with the |
Junio C Hamano | a76cb0d | 2008-08-13 20:04:10 | [diff] [blame] | 836 | attribute mechanism, via `.gitattributes`). The following built in |
| 837 | patterns are available: |
| 838 | |
Junio C Hamano | 4f07aec | 2012-09-25 19:08:44 | [diff] [blame] | 839 | - `ada` suitable for source code in the Ada language. |
| 840 | |
Junio C Hamano | df3d3cd | 2020-11-02 22:05:05 | [diff] [blame] | 841 | - `bash` suitable for source code in the Bourne-Again SHell language. |
| 842 | Covers a superset of POSIX shell function definitions. |
| 843 | |
Junio C Hamano | a76cb0d | 2008-08-13 20:04:10 | [diff] [blame] | 844 | - `bibtex` suitable for files with BibTeX coded references. |
| 845 | |
Junio C Hamano | 20d47e3 | 2009-01-26 06:36:02 | [diff] [blame] | 846 | - `cpp` suitable for source code in the C and C++ languages. |
| 847 | |
Junio C Hamano | 53ba6d0 | 2010-08-22 07:25:12 | [diff] [blame] | 848 | - `csharp` suitable for source code in the C# language. |
| 849 | |
Junio C Hamano | fedb1b4 | 2016-06-20 21:23:40 | [diff] [blame] | 850 | - `css` suitable for cascading style sheets. |
| 851 | |
Junio C Hamano | cb70539 | 2019-09-18 19:30:01 | [diff] [blame] | 852 | - `dts` suitable for devicetree (DTS) files. |
| 853 | |
Junio C Hamano | 8ef91f3 | 2019-12-01 22:58:27 | [diff] [blame] | 854 | - `elixir` suitable for source code in the Elixir language. |
| 855 | |
Junio C Hamano | 3b4609d | 2010-09-30 00:04:34 | [diff] [blame] | 856 | - `fortran` suitable for source code in the Fortran language. |
| 857 | |
Junio C Hamano | d7ed404 | 2015-08-03 19:43:00 | [diff] [blame] | 858 | - `fountain` suitable for Fountain documents. |
| 859 | |
Junio C Hamano | a25ccc4 | 2018-03-08 21:36:14 | [diff] [blame] | 860 | - `golang` suitable for source code in the Go language. |
| 861 | |
Junio C Hamano | f24b806 | 2008-09-09 01:34:39 | [diff] [blame] | 862 | - `html` suitable for HTML/XHTML documents. |
| 863 | |
Junio C Hamano | 675570c | 2008-09-13 06:07:28 | [diff] [blame] | 864 | - `java` suitable for source code in the Java language. |
Junio C Hamano | a76cb0d | 2008-08-13 20:04:10 | [diff] [blame] | 865 | |
Junio C Hamano | 67fef49 | 2022-03-27 17:31:23 | [diff] [blame] | 866 | - `kotlin` suitable for source code in the Kotlin language. |
| 867 | |
Junio C Hamano | 2b43cff | 2020-05-08 22:27:04 | [diff] [blame] | 868 | - `markdown` suitable for Markdown documents. |
| 869 | |
Junio C Hamano | 73c6486 | 2019-06-18 03:24:20 | [diff] [blame] | 870 | - `matlab` suitable for source code in the MATLAB and Octave languages. |
Junio C Hamano | 92887a3 | 2011-12-07 00:26:29 | [diff] [blame] | 871 | |
Junio C Hamano | a476efa | 2008-10-10 15:31:42 | [diff] [blame] | 872 | - `objc` suitable for source code in the Objective-C language. |
| 873 | |
Junio C Hamano | a76cb0d | 2008-08-13 20:04:10 | [diff] [blame] | 874 | - `pascal` suitable for source code in the Pascal/Delphi language. |
| 875 | |
Junio C Hamano | 9b6fafe | 2011-01-14 00:43:22 | [diff] [blame] | 876 | - `perl` suitable for source code in the Perl language. |
| 877 | |
Junio C Hamano | f24b806 | 2008-09-09 01:34:39 | [diff] [blame] | 878 | - `php` suitable for source code in the PHP language. |
| 879 | |
Junio C Hamano | 3f2f6e5 | 2008-08-21 10:34:22 | [diff] [blame] | 880 | - `python` suitable for source code in the Python language. |
| 881 | |
Junio C Hamano | a76cb0d | 2008-08-13 20:04:10 | [diff] [blame] | 882 | - `ruby` suitable for source code in the Ruby language. |
| 883 | |
Junio C Hamano | d0a9940 | 2019-06-21 18:58:41 | [diff] [blame] | 884 | - `rust` suitable for source code in the Rust language. |
| 885 | |
Junio C Hamano | 4078a55 | 2021-04-30 06:08:10 | [diff] [blame] | 886 | - `scheme` suitable for source code in the Scheme language. |
| 887 | |
Junio C Hamano | a76cb0d | 2008-08-13 20:04:10 | [diff] [blame] | 888 | - `tex` suitable for source code for LaTeX documents. |
Junio C Hamano | 2a8f6dc | 2007-07-09 08:48:38 | [diff] [blame] | 889 | |
| 890 | |
Junio C Hamano | 20d47e3 | 2009-01-26 06:36:02 | [diff] [blame] | 891 | Customizing word diff |
| 892 | ^^^^^^^^^^^^^^^^^^^^^ |
| 893 | |
Junio C Hamano | cc7636a | 2010-05-21 14:57:53 | [diff] [blame] | 894 | You can customize the rules that `git diff --word-diff` uses to |
Junio C Hamano | 20d47e3 | 2009-01-26 06:36:02 | [diff] [blame] | 895 | split words in a line, by specifying an appropriate regular expression |
| 896 | in the "diff.*.wordRegex" configuration variable. For example, in TeX |
| 897 | a backslash followed by a sequence of letters forms a command, but |
| 898 | several such commands can be run together without intervening |
Junio C Hamano | 3bd66e9 | 2009-04-18 06:17:56 | [diff] [blame] | 899 | whitespace. To separate them, use a regular expression in your |
| 900 | `$GIT_DIR/config` file (or `$HOME/.gitconfig` file) like this: |
Junio C Hamano | 20d47e3 | 2009-01-26 06:36:02 | [diff] [blame] | 901 | |
| 902 | ------------------------ |
| 903 | [diff "tex"] |
| 904 | wordRegex = "\\\\[a-zA-Z]+|[{}]|\\\\.|[^\\{}[:space:]]+" |
| 905 | ------------------------ |
| 906 | |
| 907 | A built-in pattern is provided for all languages listed in the |
| 908 | previous section. |
| 909 | |
| 910 | |
Junio C Hamano | 2836349 | 2008-11-14 08:26:31 | [diff] [blame] | 911 | Performing text diffs of binary files |
| 912 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
| 913 | |
| 914 | Sometimes it is desirable to see the diff of a text-converted |
| 915 | version of some binary files. For example, a word processor |
| 916 | document can be converted to an ASCII text representation, and |
| 917 | the diff of the text shown. Even though this conversion loses |
| 918 | some information, the resulting diff is useful for human |
| 919 | viewing (but cannot be applied directly). |
| 920 | |
| 921 | The `textconv` config option is used to define a program for |
| 922 | performing such a conversion. The program should take a single |
| 923 | argument, the name of a file to convert, and produce the |
| 924 | resulting text on stdout. |
| 925 | |
| 926 | For example, to show the diff of the exif information of a |
| 927 | file instead of the binary information (assuming you have the |
Junio C Hamano | 3bd66e9 | 2009-04-18 06:17:56 | [diff] [blame] | 928 | exif tool installed), add the following section to your |
| 929 | `$GIT_DIR/config` file (or `$HOME/.gitconfig` file): |
Junio C Hamano | 2836349 | 2008-11-14 08:26:31 | [diff] [blame] | 930 | |
| 931 | ------------------------ |
| 932 | [diff "jpg"] |
| 933 | textconv = exif |
| 934 | ------------------------ |
| 935 | |
| 936 | NOTE: The text conversion is generally a one-way conversion; |
| 937 | in this example, we lose the actual image contents and focus |
| 938 | just on the text data. This means that diffs generated by |
| 939 | textconv are _not_ suitable for applying. For this reason, |
| 940 | only `git diff` and the `git log` family of commands (i.e., |
| 941 | log, whatchanged, show) will perform text conversion. `git |
| 942 | format-patch` will never generate this output. If you want to |
| 943 | send somebody a text-converted diff of a binary file (e.g., |
| 944 | because it quickly conveys the changes you have made), you |
| 945 | should generate it separately and send it as a comment _in |
| 946 | addition to_ the usual binary diff that you might send. |
| 947 | |
Junio C Hamano | cb39aea | 2010-05-09 07:24:48 | [diff] [blame] | 948 | Because text conversion can be slow, especially when doing a |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 949 | large number of them with `git log -p`, Git provides a mechanism |
Junio C Hamano | cb39aea | 2010-05-09 07:24:48 | [diff] [blame] | 950 | to cache the output and use it in future diffs. To enable |
| 951 | caching, set the "cachetextconv" variable in your diff driver's |
| 952 | config. For example: |
| 953 | |
| 954 | ------------------------ |
| 955 | [diff "jpg"] |
| 956 | textconv = exif |
| 957 | cachetextconv = true |
| 958 | ------------------------ |
| 959 | |
| 960 | This will cache the result of running "exif" on each blob |
| 961 | indefinitely. If you change the textconv config variable for a |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 962 | diff driver, Git will automatically invalidate the cache entries |
Junio C Hamano | cb39aea | 2010-05-09 07:24:48 | [diff] [blame] | 963 | and re-run the textconv filter. If you want to invalidate the |
| 964 | cache manually (e.g., because your version of "exif" was updated |
| 965 | and now produces better output), you can remove the cache |
| 966 | manually with `git update-ref -d refs/notes/textconv/jpg` (where |
| 967 | "jpg" is the name of the diff driver, as in the example above). |
Junio C Hamano | 2836349 | 2008-11-14 08:26:31 | [diff] [blame] | 968 | |
Junio C Hamano | f93530e | 2011-05-25 23:59:59 | [diff] [blame] | 969 | Choosing textconv versus external diff |
| 970 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
| 971 | |
| 972 | If you want to show differences between binary or specially-formatted |
| 973 | blobs in your repository, you can choose to use either an external diff |
| 974 | command, or to use textconv to convert them to a diff-able text format. |
| 975 | Which method you choose depends on your exact situation. |
| 976 | |
| 977 | The advantage of using an external diff command is flexibility. You are |
| 978 | not bound to find line-oriented changes, nor is it necessary for the |
| 979 | output to resemble unified diff. You are free to locate and report |
| 980 | changes in the most appropriate way for your data format. |
| 981 | |
| 982 | A textconv, by comparison, is much more limiting. You provide a |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 983 | transformation of the data into a line-oriented text format, and Git |
Junio C Hamano | f93530e | 2011-05-25 23:59:59 | [diff] [blame] | 984 | uses its regular diff tools to generate the output. There are several |
| 985 | advantages to choosing this method: |
| 986 | |
| 987 | 1. Ease of use. It is often much simpler to write a binary to text |
| 988 | transformation than it is to perform your own diff. In many cases, |
| 989 | existing programs can be used as textconv filters (e.g., exif, |
| 990 | odt2txt). |
| 991 | |
| 992 | 2. Git diff features. By performing only the transformation step |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 993 | yourself, you can still utilize many of Git's diff features, |
Junio C Hamano | f93530e | 2011-05-25 23:59:59 | [diff] [blame] | 994 | including colorization, word-diff, and combined diffs for merges. |
| 995 | |
| 996 | 3. Caching. Textconv caching can speed up repeated diffs, such as those |
| 997 | you might trigger by running `git log -p`. |
| 998 | |
| 999 | |
Junio C Hamano | 9b6fafe | 2011-01-14 00:43:22 | [diff] [blame] | 1000 | Marking files as binary |
| 1001 | ^^^^^^^^^^^^^^^^^^^^^^^ |
| 1002 | |
| 1003 | Git usually guesses correctly whether a blob contains text or binary |
| 1004 | data by examining the beginning of the contents. However, sometimes you |
| 1005 | may want to override its decision, either because a blob contains binary |
| 1006 | data later in the file, or because the content, while technically |
| 1007 | composed of text characters, is opaque to a human reader. For example, |
Junio C Hamano | fb5ffde | 2014-11-04 22:38:56 | [diff] [blame] | 1008 | many postscript files contain only ASCII characters, but produce noisy |
Junio C Hamano | 9b6fafe | 2011-01-14 00:43:22 | [diff] [blame] | 1009 | and meaningless diffs. |
| 1010 | |
| 1011 | The simplest way to mark a file as binary is to unset the diff |
| 1012 | attribute in the `.gitattributes` file: |
| 1013 | |
| 1014 | ------------------------ |
| 1015 | *.ps -diff |
| 1016 | ------------------------ |
| 1017 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 1018 | This will cause Git to generate `Binary files differ` (or a binary |
Junio C Hamano | 9b6fafe | 2011-01-14 00:43:22 | [diff] [blame] | 1019 | patch, if binary patches are enabled) instead of a regular diff. |
| 1020 | |
| 1021 | However, one may also want to specify other diff driver attributes. For |
| 1022 | example, you might want to use `textconv` to convert postscript files to |
Junio C Hamano | fb5ffde | 2014-11-04 22:38:56 | [diff] [blame] | 1023 | an ASCII representation for human viewing, but otherwise treat them as |
Junio C Hamano | 9b6fafe | 2011-01-14 00:43:22 | [diff] [blame] | 1024 | binary files. You cannot specify both `-diff` and `diff=ps` attributes. |
| 1025 | The solution is to use the `diff.*.binary` config option: |
| 1026 | |
| 1027 | ------------------------ |
| 1028 | [diff "ps"] |
| 1029 | textconv = ps2ascii |
| 1030 | binary = true |
| 1031 | ------------------------ |
| 1032 | |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1033 | Performing a three-way merge |
| 1034 | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ |
| 1035 | |
Junio C Hamano | f7e47ee | 2008-11-02 18:16:45 | [diff] [blame] | 1036 | `merge` |
| 1037 | ^^^^^^^ |
| 1038 | |
Junio C Hamano | fa28cb1 | 2011-03-19 01:35:35 | [diff] [blame] | 1039 | The attribute `merge` affects how three versions of a file are |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1040 | merged when a file-level merge is necessary during `git merge`, |
Junio C Hamano | 27a128b | 2009-08-13 01:23:00 | [diff] [blame] | 1041 | and other commands such as `git revert` and `git cherry-pick`. |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1042 | |
| 1043 | Set:: |
| 1044 | |
| 1045 | Built-in 3-way merge driver is used to merge the |
Junio C Hamano | ba4b928 | 2008-07-06 05:20:31 | [diff] [blame] | 1046 | contents in a way similar to 'merge' command of `RCS` |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1047 | suite. This is suitable for ordinary text files. |
| 1048 | |
| 1049 | Unset:: |
| 1050 | |
| 1051 | Take the version from the current branch as the |
| 1052 | tentative merge result, and declare that the merge has |
Junio C Hamano | fa28cb1 | 2011-03-19 01:35:35 | [diff] [blame] | 1053 | conflicts. This is suitable for binary files that do |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1054 | not have a well-defined merge semantics. |
| 1055 | |
| 1056 | Unspecified:: |
| 1057 | |
| 1058 | By default, this uses the same built-in 3-way merge |
Junio C Hamano | fa28cb1 | 2011-03-19 01:35:35 | [diff] [blame] | 1059 | driver as is the case when the `merge` attribute is set. |
| 1060 | However, the `merge.default` configuration variable can name |
| 1061 | different merge driver to be used with paths for which the |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1062 | `merge` attribute is unspecified. |
| 1063 | |
Junio C Hamano | 9db95ca | 2007-04-23 07:27:05 | [diff] [blame] | 1064 | String:: |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1065 | |
| 1066 | 3-way merge is performed using the specified custom |
| 1067 | merge driver. The built-in 3-way merge driver can be |
| 1068 | explicitly specified by asking for "text" driver; the |
| 1069 | built-in "take the current branch" driver can be |
Junio C Hamano | 22572e7 | 2007-04-25 05:59:25 | [diff] [blame] | 1070 | requested with "binary". |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1071 | |
| 1072 | |
Junio C Hamano | f556fc2 | 2007-12-23 08:16:38 | [diff] [blame] | 1073 | Built-in merge drivers |
| 1074 | ^^^^^^^^^^^^^^^^^^^^^^ |
| 1075 | |
| 1076 | There are a few built-in low-level merge drivers defined that |
| 1077 | can be asked for via the `merge` attribute. |
| 1078 | |
| 1079 | text:: |
| 1080 | |
| 1081 | Usual 3-way file level merge for text files. Conflicted |
| 1082 | regions are marked with conflict markers `<<<<<<<`, |
| 1083 | `=======` and `>>>>>>>`. The version from your branch |
| 1084 | appears before the `=======` marker, and the version |
| 1085 | from the merged branch appears after the `=======` |
| 1086 | marker. |
| 1087 | |
| 1088 | binary:: |
| 1089 | |
| 1090 | Keep the version from your branch in the work tree, but |
| 1091 | leave the path in the conflicted state for the user to |
| 1092 | sort out. |
| 1093 | |
| 1094 | union:: |
| 1095 | |
| 1096 | Run 3-way file level merge for text files, but take |
| 1097 | lines from both versions, instead of leaving conflict |
| 1098 | markers. This tends to leave the added lines in the |
| 1099 | resulting file in random order and the user should |
| 1100 | verify the result. Do not use this if you do not |
| 1101 | understand the implications. |
| 1102 | |
| 1103 | |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1104 | Defining a custom merge driver |
| 1105 | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ |
| 1106 | |
Junio C Hamano | f556fc2 | 2007-12-23 08:16:38 | [diff] [blame] | 1107 | The definition of a merge driver is done in the `.git/config` |
| 1108 | file, not in the `gitattributes` file, so strictly speaking this |
| 1109 | manual page is a wrong place to talk about it. However... |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1110 | |
| 1111 | To define a custom merge driver `filfre`, add a section to your |
| 1112 | `$GIT_DIR/config` file (or `$HOME/.gitconfig` file) like this: |
| 1113 | |
| 1114 | ---------------------------------------------------------------- |
| 1115 | [merge "filfre"] |
| 1116 | name = feel-free merge driver |
Junio C Hamano | df5384b | 2015-06-24 21:12:12 | [diff] [blame] | 1117 | driver = filfre %O %A %B %L %P |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1118 | recursive = binary |
| 1119 | ---------------------------------------------------------------- |
| 1120 | |
| 1121 | The `merge.*.name` variable gives the driver a human-readable |
| 1122 | name. |
| 1123 | |
| 1124 | The `merge.*.driver` variable's value is used to construct a |
| 1125 | command to run to merge ancestor's version (`%O`), current |
| 1126 | version (`%A`) and the other branches' version (`%B`). These |
| 1127 | three tokens are replaced with the names of temporary files that |
| 1128 | hold the contents of these versions when the command line is |
Junio C Hamano | 86660db | 2010-02-24 01:12:54 | [diff] [blame] | 1129 | built. Additionally, %L will be replaced with the conflict marker |
| 1130 | size (see below). |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1131 | |
| 1132 | The merge driver is expected to leave the result of the merge in |
| 1133 | the file named with `%A` by overwriting it, and exit with zero |
| 1134 | status if it managed to merge them cleanly, or non-zero if there |
Junio C Hamano | 2e885d8 | 2023-06-30 19:50:46 | [diff] [blame] | 1135 | were conflicts. When the driver crashes (e.g. killed by SEGV), |
| 1136 | it is expected to exit with non-zero status that are higher than |
| 1137 | 128, and in such a case, the merge results in a failure (which is |
| 1138 | different from producing a conflict). |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1139 | |
| 1140 | The `merge.*.recursive` variable specifies what other merge |
| 1141 | driver to use when the merge driver is called for an internal |
| 1142 | merge between common ancestors, when there are more than one. |
| 1143 | When left unspecified, the driver itself is used for both |
| 1144 | internal merge and the final merge. |
| 1145 | |
Junio C Hamano | df5384b | 2015-06-24 21:12:12 | [diff] [blame] | 1146 | The merge driver can learn the pathname in which the merged result |
| 1147 | will be stored via placeholder `%P`. |
| 1148 | |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1149 | |
Junio C Hamano | 1aa40d2 | 2010-01-21 17:46:43 | [diff] [blame] | 1150 | `conflict-marker-size` |
| 1151 | ^^^^^^^^^^^^^^^^^^^^^^ |
| 1152 | |
| 1153 | This attribute controls the length of conflict markers left in |
| 1154 | the work tree file during a conflicted merge. Only setting to |
| 1155 | the value to a positive integer has any meaningful effect. |
| 1156 | |
| 1157 | For example, this line in `.gitattributes` can be used to tell the merge |
| 1158 | machinery to leave much longer (instead of the usual 7-character-long) |
| 1159 | conflict markers when merging the file `Documentation/git-merge.txt` |
| 1160 | results in a conflict. |
| 1161 | |
| 1162 | ------------------------ |
| 1163 | Documentation/git-merge.txt conflict-marker-size=32 |
| 1164 | ------------------------ |
| 1165 | |
| 1166 | |
Junio C Hamano | 942b35e | 2007-12-09 10:19:33 | [diff] [blame] | 1167 | Checking whitespace errors |
| 1168 | ~~~~~~~~~~~~~~~~~~~~~~~~~~ |
| 1169 | |
| 1170 | `whitespace` |
| 1171 | ^^^^^^^^^^^^ |
| 1172 | |
| 1173 | The `core.whitespace` configuration variable allows you to define what |
Junio C Hamano | ba4b928 | 2008-07-06 05:20:31 | [diff] [blame] | 1174 | 'diff' and 'apply' should consider whitespace errors for all paths in |
Junio C Hamano | 35738e8 | 2008-01-07 07:55:46 | [diff] [blame] | 1175 | the project (See linkgit:git-config[1]). This attribute gives you finer |
Junio C Hamano | 942b35e | 2007-12-09 10:19:33 | [diff] [blame] | 1176 | control per path. |
| 1177 | |
| 1178 | Set:: |
| 1179 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 1180 | Notice all types of potential whitespace errors known to Git. |
Junio C Hamano | eef01fe | 2010-12-13 08:31:58 | [diff] [blame] | 1181 | The tab width is taken from the value of the `core.whitespace` |
| 1182 | configuration variable. |
Junio C Hamano | 942b35e | 2007-12-09 10:19:33 | [diff] [blame] | 1183 | |
| 1184 | Unset:: |
| 1185 | |
| 1186 | Do not notice anything as error. |
| 1187 | |
| 1188 | Unspecified:: |
| 1189 | |
Junio C Hamano | eef01fe | 2010-12-13 08:31:58 | [diff] [blame] | 1190 | Use the value of the `core.whitespace` configuration variable to |
Junio C Hamano | 942b35e | 2007-12-09 10:19:33 | [diff] [blame] | 1191 | decide what to notice as error. |
| 1192 | |
| 1193 | String:: |
| 1194 | |
Junio C Hamano | ac204f4 | 2023-01-05 06:34:54 | [diff] [blame] | 1195 | Specify a comma separated list of common whitespace problems to |
Junio C Hamano | eef01fe | 2010-12-13 08:31:58 | [diff] [blame] | 1196 | notice in the same format as the `core.whitespace` configuration |
Junio C Hamano | 942b35e | 2007-12-09 10:19:33 | [diff] [blame] | 1197 | variable. |
| 1198 | |
| 1199 | |
Junio C Hamano | ba4b928 | 2008-07-06 05:20:31 | [diff] [blame] | 1200 | Creating an archive |
| 1201 | ~~~~~~~~~~~~~~~~~~~ |
| 1202 | |
| 1203 | `export-ignore` |
| 1204 | ^^^^^^^^^^^^^^^ |
| 1205 | |
| 1206 | Files and directories with the attribute `export-ignore` won't be added to |
| 1207 | archive files. |
| 1208 | |
| 1209 | `export-subst` |
| 1210 | ^^^^^^^^^^^^^^ |
| 1211 | |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 1212 | If the attribute `export-subst` is set for a file then Git will expand |
Junio C Hamano | ba4b928 | 2008-07-06 05:20:31 | [diff] [blame] | 1213 | several placeholders when adding this file to an archive. The |
| 1214 | expansion depends on the availability of a commit ID, i.e., if |
| 1215 | linkgit:git-archive[1] has been given a tree instead of a commit or a |
| 1216 | tag then no replacement will be done. The placeholders are the same |
| 1217 | as those for the option `--pretty=format:` of linkgit:git-log[1], |
| 1218 | except that they need to be wrapped like this: `$Format:PLACEHOLDERS$` |
| 1219 | in the file. E.g. the string `$Format:%H$` will be replaced by the |
Junio C Hamano | 458281a | 2021-03-22 21:20:26 | [diff] [blame] | 1220 | commit hash. However, only one `%(describe)` placeholder is expanded |
| 1221 | per archive to avoid denial-of-service attacks. |
Junio C Hamano | ba4b928 | 2008-07-06 05:20:31 | [diff] [blame] | 1222 | |
| 1223 | |
Junio C Hamano | a10d8d2 | 2009-10-22 03:09:05 | [diff] [blame] | 1224 | Packing objects |
| 1225 | ~~~~~~~~~~~~~~~ |
| 1226 | |
| 1227 | `delta` |
| 1228 | ^^^^^^^ |
| 1229 | |
| 1230 | Delta compression will not be attempted for blobs for paths with the |
| 1231 | attribute `delta` set to false. |
| 1232 | |
| 1233 | |
Junio C Hamano | d3452a0 | 2008-11-15 08:07:55 | [diff] [blame] | 1234 | Viewing files in GUI tools |
| 1235 | ~~~~~~~~~~~~~~~~~~~~~~~~~~ |
| 1236 | |
| 1237 | `encoding` |
| 1238 | ^^^^^^^^^^ |
| 1239 | |
| 1240 | The value of this attribute specifies the character encoding that should |
| 1241 | be used by GUI tools (e.g. linkgit:gitk[1] and linkgit:git-gui[1]) to |
| 1242 | display the contents of the relevant file. Note that due to performance |
| 1243 | considerations linkgit:gitk[1] does not use this attribute unless you |
| 1244 | manually enable per-file encodings in its options. |
| 1245 | |
| 1246 | If this attribute is not set or has an invalid value, the value of the |
| 1247 | `gui.encoding` configuration variable is used instead |
| 1248 | (See linkgit:git-config[1]). |
| 1249 | |
| 1250 | |
Junio C Hamano | 86d8436 | 2011-08-04 00:21:51 | [diff] [blame] | 1251 | USING MACRO ATTRIBUTES |
Junio C Hamano | fcb740a | 2008-08-31 06:32:05 | [diff] [blame] | 1252 | ---------------------- |
| 1253 | |
| 1254 | You do not want any end-of-line conversions applied to, nor textual diffs |
| 1255 | produced for, any binary file you track. You would need to specify e.g. |
| 1256 | |
| 1257 | ------------ |
Junio C Hamano | bb88cf4 | 2010-06-21 15:23:55 | [diff] [blame] | 1258 | *.jpg -text -diff |
Junio C Hamano | fcb740a | 2008-08-31 06:32:05 | [diff] [blame] | 1259 | ------------ |
| 1260 | |
| 1261 | but that may become cumbersome, when you have many attributes. Using |
Junio C Hamano | 86d8436 | 2011-08-04 00:21:51 | [diff] [blame] | 1262 | macro attributes, you can define an attribute that, when set, also |
| 1263 | sets or unsets a number of other attributes at the same time. The |
| 1264 | system knows a built-in macro attribute, `binary`: |
Junio C Hamano | fcb740a | 2008-08-31 06:32:05 | [diff] [blame] | 1265 | |
| 1266 | ------------ |
| 1267 | *.jpg binary |
| 1268 | ------------ |
| 1269 | |
Junio C Hamano | 86d8436 | 2011-08-04 00:21:51 | [diff] [blame] | 1270 | Setting the "binary" attribute also unsets the "text" and "diff" |
| 1271 | attributes as above. Note that macro attributes can only be "Set", |
| 1272 | though setting one might have the effect of setting or unsetting other |
| 1273 | attributes or even returning other attributes to the "Unspecified" |
| 1274 | state. |
Junio C Hamano | fcb740a | 2008-08-31 06:32:05 | [diff] [blame] | 1275 | |
| 1276 | |
Junio C Hamano | 86d8436 | 2011-08-04 00:21:51 | [diff] [blame] | 1277 | DEFINING MACRO ATTRIBUTES |
Junio C Hamano | fcb740a | 2008-08-31 06:32:05 | [diff] [blame] | 1278 | ------------------------- |
| 1279 | |
Junio C Hamano | c5bd79e | 2014-01-27 21:31:26 | [diff] [blame] | 1280 | Custom macro attributes can be defined only in top-level gitattributes |
| 1281 | files (`$GIT_DIR/info/attributes`, the `.gitattributes` file at the |
| 1282 | top level of the working tree, or the global or system-wide |
| 1283 | gitattributes files), not in `.gitattributes` files in working tree |
| 1284 | subdirectories. The built-in macro attribute "binary" is equivalent |
| 1285 | to: |
Junio C Hamano | fcb740a | 2008-08-31 06:32:05 | [diff] [blame] | 1286 | |
| 1287 | ------------ |
Junio C Hamano | a3cfb76 | 2012-09-15 05:32:14 | [diff] [blame] | 1288 | [attr]binary -diff -merge -text |
Junio C Hamano | fcb740a | 2008-08-31 06:32:05 | [diff] [blame] | 1289 | ------------ |
| 1290 | |
Junio C Hamano | 51a55bc | 2021-05-12 05:22:19 | [diff] [blame] | 1291 | NOTES |
| 1292 | ----- |
| 1293 | |
| 1294 | Git does not follow symbolic links when accessing a `.gitattributes` |
| 1295 | file in the working tree. This keeps behavior consistent when the file |
| 1296 | is accessed from the index or a tree versus from the filesystem. |
Junio C Hamano | fcb740a | 2008-08-31 06:32:05 | [diff] [blame] | 1297 | |
Junio C Hamano | b9d9d90 | 2018-05-23 07:07:42 | [diff] [blame] | 1298 | EXAMPLES |
| 1299 | -------- |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1300 | |
| 1301 | If you have these three `gitattributes` file: |
| 1302 | |
| 1303 | ---------------------------------------------------------------- |
| 1304 | (in $GIT_DIR/info/attributes) |
| 1305 | |
| 1306 | a* foo !bar -baz |
| 1307 | |
| 1308 | (in .gitattributes) |
| 1309 | abc foo bar baz |
| 1310 | |
| 1311 | (in t/.gitattributes) |
| 1312 | ab* merge=filfre |
| 1313 | abc -foo -bar |
| 1314 | *.c frotz |
| 1315 | ---------------------------------------------------------------- |
| 1316 | |
| 1317 | the attributes given to path `t/abc` are computed as follows: |
| 1318 | |
| 1319 | 1. By examining `t/.gitattributes` (which is in the same |
Junio C Hamano | 076ffcc | 2013-02-06 05:13:21 | [diff] [blame] | 1320 | directory as the path in question), Git finds that the first |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1321 | line matches. `merge` attribute is set. It also finds that |
| 1322 | the second line matches, and attributes `foo` and `bar` |
| 1323 | are unset. |
| 1324 | |
| 1325 | 2. Then it examines `.gitattributes` (which is in the parent |
| 1326 | directory), and finds that the first line matches, but |
| 1327 | `t/.gitattributes` file already decided how `merge`, `foo` |
| 1328 | and `bar` attributes should be given to this path, so it |
| 1329 | leaves `foo` and `bar` unset. Attribute `baz` is set. |
| 1330 | |
Junio C Hamano | bdeff82 | 2007-07-30 09:07:17 | [diff] [blame] | 1331 | 3. Finally it examines `$GIT_DIR/info/attributes`. This file |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1332 | is used to override the in-tree settings. The first line is |
| 1333 | a match, and `foo` is set, `bar` is reverted to unspecified |
| 1334 | state, and `baz` is unset. |
| 1335 | |
Junio C Hamano | a638742 | 2007-08-25 03:54:27 | [diff] [blame] | 1336 | As the result, the attributes assignment to `t/abc` becomes: |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1337 | |
| 1338 | ---------------------------------------------------------------- |
| 1339 | foo set to true |
| 1340 | bar unspecified |
| 1341 | baz set to false |
| 1342 | merge set to string value "filfre" |
| 1343 | frotz unspecified |
| 1344 | ---------------------------------------------------------------- |
| 1345 | |
| 1346 | |
Junio C Hamano | f727901 | 2011-08-18 06:13:13 | [diff] [blame] | 1347 | SEE ALSO |
| 1348 | -------- |
| 1349 | linkgit:git-check-attr[1]. |
Junio C Hamano | ef4c2d1 | 2007-09-10 08:11:27 | [diff] [blame] | 1350 | |
Junio C Hamano | e90436a | 2007-04-22 07:22:13 | [diff] [blame] | 1351 | GIT |
| 1352 | --- |
Junio C Hamano | f7c042d | 2008-06-06 22:50:53 | [diff] [blame] | 1353 | Part of the linkgit:git[1] suite |