This causes cgit to serve error pages, which is undesirable. This reverts commit5229c9b232, reversing changes made tof2b211131f.
		
			
				
	
	
		
			4691 lines
		
	
	
	
		
			171 KiB
		
	
	
	
		
			Text
		
	
	
	
	
	
			
		
		
	
	
			4691 lines
		
	
	
	
		
			171 KiB
		
	
	
	
		
			Text
		
	
	
	
	
	
| Git User Manual
 | |
| ===============
 | |
| 
 | |
| Git is a fast distributed revision control system.
 | |
| 
 | |
| This manual is designed to be readable by someone with basic UNIX
 | |
| command-line skills, but no previous knowledge of Git.
 | |
| 
 | |
| <<repositories-and-branches>> and <<exploring-git-history>> explain how
 | |
| to fetch and study a project using git--read these chapters to learn how
 | |
| to build and test a particular version of a software project, search for
 | |
| regressions, and so on.
 | |
| 
 | |
| People needing to do actual development will also want to read
 | |
| <<Developing-With-git>> and <<sharing-development>>.
 | |
| 
 | |
| Further chapters cover more specialized topics.
 | |
| 
 | |
| Comprehensive reference documentation is available through the man
 | |
| pages, or linkgit:git-help[1] command.  For example, for the command
 | |
| `git clone <repo>`, you can either use:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ man git-clone
 | |
| ------------------------------------------------
 | |
| 
 | |
| or:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git help clone
 | |
| ------------------------------------------------
 | |
| 
 | |
| With the latter, you can use the manual viewer of your choice; see
 | |
| linkgit:git-help[1] for more information.
 | |
| 
 | |
| See also <<git-quick-start>> for a brief overview of Git commands,
 | |
| without any explanation.
 | |
| 
 | |
| Finally, see <<todo>> for ways that you can help make this manual more
 | |
| complete.
 | |
| 
 | |
| 
 | |
| [[repositories-and-branches]]
 | |
| Repositories and Branches
 | |
| =========================
 | |
| 
 | |
| [[how-to-get-a-git-repository]]
 | |
| How to get a Git repository
 | |
| ---------------------------
 | |
| 
 | |
| It will be useful to have a Git repository to experiment with as you
 | |
| read this manual.
 | |
| 
 | |
| The best way to get one is by using the linkgit:git-clone[1] command to
 | |
| download a copy of an existing repository.  If you don't already have a
 | |
| project in mind, here are some interesting examples:
 | |
| 
 | |
| ------------------------------------------------
 | |
| 	# Git itself (approx. 40MB download):
 | |
| $ git clone git://git.kernel.org/pub/scm/git/git.git
 | |
| 	# the Linux kernel (approx. 640MB download):
 | |
| $ git clone git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
 | |
| ------------------------------------------------
 | |
| 
 | |
| The initial clone may be time-consuming for a large project, but you
 | |
| will only need to clone once.
 | |
| 
 | |
| The clone command creates a new directory named after the project
 | |
| (`git` or `linux` in the examples above).  After you cd into this
 | |
| directory, you will see that it contains a copy of the project files,
 | |
| called the <<def_working_tree,working tree>>, together with a special
 | |
| top-level directory named `.git`, which contains all the information
 | |
| about the history of the project.
 | |
| 
 | |
| [[how-to-check-out]]
 | |
| How to check out a different version of a project
 | |
| -------------------------------------------------
 | |
| 
 | |
| Git is best thought of as a tool for storing the history of a collection
 | |
| of files.  It stores the history as a compressed collection of
 | |
| interrelated snapshots of the project's contents.  In Git each such
 | |
| version is called a <<def_commit,commit>>.
 | |
| 
 | |
| Those snapshots aren't necessarily all arranged in a single line from
 | |
| oldest to newest; instead, work may simultaneously proceed along
 | |
| parallel lines of development, called <<def_branch,branches>>, which may
 | |
| merge and diverge.
 | |
| 
 | |
| A single Git repository can track development on multiple branches.  It
 | |
| does this by keeping a list of <<def_head,heads>> which reference the
 | |
| latest commit on each branch; the linkgit:git-branch[1] command shows
 | |
| you the list of branch heads:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git branch
 | |
| * master
 | |
| ------------------------------------------------
 | |
| 
 | |
| A freshly cloned repository contains a single branch head, by default
 | |
| named "master", with the working directory initialized to the state of
 | |
| the project referred to by that branch head.
 | |
| 
 | |
| Most projects also use <<def_tag,tags>>.  Tags, like heads, are
 | |
| references into the project's history, and can be listed using the
 | |
| linkgit:git-tag[1] command:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git tag -l
 | |
| v2.6.11
 | |
| v2.6.11-tree
 | |
| v2.6.12
 | |
| v2.6.12-rc2
 | |
| v2.6.12-rc3
 | |
| v2.6.12-rc4
 | |
| v2.6.12-rc5
 | |
| v2.6.12-rc6
 | |
| v2.6.13
 | |
| ...
 | |
| ------------------------------------------------
 | |
| 
 | |
| Tags are expected to always point at the same version of a project,
 | |
| while heads are expected to advance as development progresses.
 | |
| 
 | |
| Create a new branch head pointing to one of these versions and check it
 | |
| out using linkgit:git-switch[1]:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git switch -c new v2.6.13
 | |
| ------------------------------------------------
 | |
| 
 | |
| The working directory then reflects the contents that the project had
 | |
| when it was tagged v2.6.13, and linkgit:git-branch[1] shows two
 | |
| branches, with an asterisk marking the currently checked-out branch:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git branch
 | |
|   master
 | |
| * new
 | |
| ------------------------------------------------
 | |
| 
 | |
| If you decide that you'd rather see version 2.6.17, you can modify
 | |
| the current branch to point at v2.6.17 instead, with
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git reset --hard v2.6.17
 | |
| ------------------------------------------------
 | |
| 
 | |
| Note that if the current branch head was your only reference to a
 | |
| particular point in history, then resetting that branch may leave you
 | |
| with no way to find the history it used to point to; so use this command
 | |
| carefully.
 | |
| 
 | |
| [[understanding-commits]]
 | |
| Understanding History: Commits
 | |
| ------------------------------
 | |
| 
 | |
| Every change in the history of a project is represented by a commit.
 | |
| The linkgit:git-show[1] command shows the most recent commit on the
 | |
| current branch:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git show
 | |
| commit 17cf781661e6d38f737f15f53ab552f1e95960d7
 | |
| Author: Linus Torvalds <torvalds@ppc970.osdl.org.(none)>
 | |
| Date:   Tue Apr 19 14:11:06 2005 -0700
 | |
| 
 | |
|     Remove duplicate getenv(DB_ENVIRONMENT) call
 | |
| 
 | |
|     Noted by Tony Luck.
 | |
| 
 | |
| diff --git a/init-db.c b/init-db.c
 | |
| index 65898fa..b002dc6 100644
 | |
| --- a/init-db.c
 | |
| +++ b/init-db.c
 | |
| @@ -7,7 +7,7 @@
 | |
|  
 | |
|  int main(int argc, char **argv)
 | |
|  {
 | |
| -	char *sha1_dir = getenv(DB_ENVIRONMENT), *path;
 | |
| +	char *sha1_dir, *path;
 | |
|  	int len, i;
 | |
|  
 | |
|  	if (mkdir(".git", 0755) < 0) {
 | |
| ------------------------------------------------
 | |
| 
 | |
| As you can see, a commit shows who made the latest change, what they
 | |
| did, and why.
 | |
| 
 | |
| Every commit has a 40-hexdigit id, sometimes called the "object name" or the
 | |
| "SHA-1 id", shown on the first line of the `git show` output.  You can usually
 | |
| refer to a commit by a shorter name, such as a tag or a branch name, but this
 | |
| longer name can also be useful.  Most importantly, it is a globally unique
 | |
| name for this commit: so if you tell somebody else the object name (for
 | |
| example in email), then you are guaranteed that name will refer to the same
 | |
| commit in their repository that it does in yours (assuming their repository
 | |
| has that commit at all).  Since the object name is computed as a hash over the
 | |
| contents of the commit, you are guaranteed that the commit can never change
 | |
| without its name also changing.
 | |
| 
 | |
| In fact, in <<git-concepts>> we shall see that everything stored in Git
 | |
| history, including file data and directory contents, is stored in an object
 | |
| with a name that is a hash of its contents.
 | |
| 
 | |
| [[understanding-reachability]]
 | |
| Understanding history: commits, parents, and reachability
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| Every commit (except the very first commit in a project) also has a
 | |
| parent commit which shows what happened before this commit.
 | |
| Following the chain of parents will eventually take you back to the
 | |
| beginning of the project.
 | |
| 
 | |
| However, the commits do not form a simple list; Git allows lines of
 | |
| development to diverge and then reconverge, and the point where two
 | |
| lines of development reconverge is called a "merge".  The commit
 | |
| representing a merge can therefore have more than one parent, with
 | |
| each parent representing the most recent commit on one of the lines
 | |
| of development leading to that point.
 | |
| 
 | |
| The best way to see how this works is using the linkgit:gitk[1]
 | |
| command; running gitk now on a Git repository and looking for merge
 | |
| commits will help understand how Git organizes history.
 | |
| 
 | |
| In the following, we say that commit X is "reachable" from commit Y
 | |
| if commit X is an ancestor of commit Y.  Equivalently, you could say
 | |
| that Y is a descendant of X, or that there is a chain of parents
 | |
| leading from commit Y to commit X.
 | |
| 
 | |
| [[history-diagrams]]
 | |
| Understanding history: History diagrams
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| We will sometimes represent Git history using diagrams like the one
 | |
| below.  Commits are shown as "o", and the links between them with
 | |
| lines drawn with - / and \.  Time goes left to right:
 | |
| 
 | |
| 
 | |
| ................................................
 | |
|          o--o--o <-- Branch A
 | |
|         /
 | |
|  o--o--o <-- master
 | |
|         \
 | |
|          o--o--o <-- Branch B
 | |
| ................................................
 | |
| 
 | |
| If we need to talk about a particular commit, the character "o" may
 | |
| be replaced with another letter or number.
 | |
| 
 | |
| [[what-is-a-branch]]
 | |
| Understanding history: What is a branch?
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| When we need to be precise, we will use the word "branch" to mean a line
 | |
| of development, and "branch head" (or just "head") to mean a reference
 | |
| to the most recent commit on a branch.  In the example above, the branch
 | |
| head named "A" is a pointer to one particular commit, but we refer to
 | |
| the line of three commits leading up to that point as all being part of
 | |
| "branch A".
 | |
| 
 | |
| However, when no confusion will result, we often just use the term
 | |
| "branch" both for branches and for branch heads.
 | |
| 
 | |
| [[manipulating-branches]]
 | |
| Manipulating branches
 | |
| ---------------------
 | |
| 
 | |
| Creating, deleting, and modifying branches is quick and easy; here's
 | |
| a summary of the commands:
 | |
| 
 | |
| `git branch`::
 | |
| 	list all branches.
 | |
| `git branch <branch>`::
 | |
| 	create a new branch named `<branch>`, referencing the same
 | |
| 	point in history as the current branch.
 | |
| `git branch <branch> <start-point>`::
 | |
| 	create a new branch named `<branch>`, referencing
 | |
| 	`<start-point>`, which may be specified any way you like,
 | |
| 	including using a branch name or a tag name.
 | |
| `git branch -d <branch>`::
 | |
| 	delete the branch `<branch>`; if the branch is not fully
 | |
| 	merged in its upstream branch or contained in the current branch,
 | |
| 	this command will fail with a warning.
 | |
| `git branch -D <branch>`::
 | |
| 	delete the branch `<branch>` irrespective of its merged status.
 | |
| `git switch <branch>`::
 | |
| 	make the current branch `<branch>`, updating the working
 | |
| 	directory to reflect the version referenced by `<branch>`.
 | |
| `git switch -c <new> <start-point>`::
 | |
| 	create a new branch `<new>` referencing `<start-point>`, and
 | |
| 	check it out.
 | |
| 
 | |
| The special symbol "HEAD" can always be used to refer to the current
 | |
| branch.  In fact, Git uses a file named `HEAD` in the `.git` directory
 | |
| to remember which branch is current:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ cat .git/HEAD
 | |
| ref: refs/heads/master
 | |
| ------------------------------------------------
 | |
| 
 | |
| [[detached-head]]
 | |
| Examining an old version without creating a new branch
 | |
| ------------------------------------------------------
 | |
| 
 | |
| The `git switch` command normally expects a branch head, but will also
 | |
| accept an arbitrary commit when invoked with --detach; for example,
 | |
| you can check out the commit referenced by a tag:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git switch --detach v2.6.17
 | |
| Note: checking out 'v2.6.17'.
 | |
| 
 | |
| You are in 'detached HEAD' state. You can look around, make experimental
 | |
| changes and commit them, and you can discard any commits you make in this
 | |
| state without impacting any branches by performing another switch.
 | |
| 
 | |
| If you want to create a new branch to retain commits you create, you may
 | |
| do so (now or later) by using -c with the switch command again. Example:
 | |
| 
 | |
|   git switch -c new_branch_name
 | |
| 
 | |
| HEAD is now at 427abfa Linux v2.6.17
 | |
| ------------------------------------------------
 | |
| 
 | |
| The HEAD then refers to the SHA-1 of the commit instead of to a branch,
 | |
| and git branch shows that you are no longer on a branch:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ cat .git/HEAD
 | |
| 427abfa28afedffadfca9dd8b067eb6d36bac53f
 | |
| $ git branch
 | |
| * (detached from v2.6.17)
 | |
|   master
 | |
| ------------------------------------------------
 | |
| 
 | |
| In this case we say that the HEAD is "detached".
 | |
| 
 | |
| This is an easy way to check out a particular version without having to
 | |
| make up a name for the new branch.   You can still create a new branch
 | |
| (or tag) for this version later if you decide to.
 | |
| 
 | |
| [[examining-remote-branches]]
 | |
| Examining branches from a remote repository
 | |
| -------------------------------------------
 | |
| 
 | |
| The "master" branch that was created at the time you cloned is a copy
 | |
| of the HEAD in the repository that you cloned from.  That repository
 | |
| may also have had other branches, though, and your local repository
 | |
| keeps branches which track each of those remote branches, called
 | |
| remote-tracking branches, which you
 | |
| can view using the `-r` option to linkgit:git-branch[1]:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git branch -r
 | |
|   origin/HEAD
 | |
|   origin/html
 | |
|   origin/maint
 | |
|   origin/man
 | |
|   origin/master
 | |
|   origin/next
 | |
|   origin/pu
 | |
|   origin/todo
 | |
| ------------------------------------------------
 | |
| 
 | |
| In this example, "origin" is called a remote repository, or "remote"
 | |
| for short. The branches of this repository are called "remote
 | |
| branches" from our point of view. The remote-tracking branches listed
 | |
| above were created based on the remote branches at clone time and will
 | |
| be updated by `git fetch` (hence `git pull`) and `git push`. See
 | |
| <<Updating-a-repository-With-git-fetch>> for details.
 | |
| 
 | |
| You might want to build on one of these remote-tracking branches
 | |
| on a branch of your own, just as you would for a tag:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git switch -c my-todo-copy origin/todo
 | |
| ------------------------------------------------
 | |
| 
 | |
| You can also check out `origin/todo` directly to examine it or
 | |
| write a one-off patch.  See <<detached-head,detached head>>.
 | |
| 
 | |
| Note that the name "origin" is just the name that Git uses by default
 | |
| to refer to the repository that you cloned from.
 | |
| 
 | |
| [[how-git-stores-references]]
 | |
| Naming branches, tags, and other references
 | |
| -------------------------------------------
 | |
| 
 | |
| Branches, remote-tracking branches, and tags are all references to
 | |
| commits.  All references are named with a slash-separated path name
 | |
| starting with `refs`; the names we've been using so far are actually
 | |
| shorthand:
 | |
| 
 | |
| 	- The branch `test` is short for `refs/heads/test`.
 | |
| 	- The tag `v2.6.18` is short for `refs/tags/v2.6.18`.
 | |
| 	- `origin/master` is short for `refs/remotes/origin/master`.
 | |
| 
 | |
| The full name is occasionally useful if, for example, there ever
 | |
| exists a tag and a branch with the same name.
 | |
| 
 | |
| (Newly created refs are actually stored in the `.git/refs` directory,
 | |
| under the path given by their name.  However, for efficiency reasons
 | |
| they may also be packed together in a single file; see
 | |
| linkgit:git-pack-refs[1]).
 | |
| 
 | |
| As another useful shortcut, the "HEAD" of a repository can be referred
 | |
| to just using the name of that repository.  So, for example, "origin"
 | |
| is usually a shortcut for the HEAD branch in the repository "origin".
 | |
| 
 | |
| For the complete list of paths which Git checks for references, and
 | |
| the order it uses to decide which to choose when there are multiple
 | |
| references with the same shorthand name, see the "SPECIFYING
 | |
| REVISIONS" section of linkgit:gitrevisions[7].
 | |
| 
 | |
| [[Updating-a-repository-With-git-fetch]]
 | |
| Updating a repository with git fetch
 | |
| ------------------------------------
 | |
| 
 | |
| After you clone a repository and commit a few changes of your own, you
 | |
| may wish to check the original repository for updates.
 | |
| 
 | |
| The `git-fetch` command, with no arguments, will update all of the
 | |
| remote-tracking branches to the latest version found in the original
 | |
| repository.  It will not touch any of your own branches--not even the
 | |
| "master" branch that was created for you on clone.
 | |
| 
 | |
| [[fetching-branches]]
 | |
| Fetching branches from other repositories
 | |
| -----------------------------------------
 | |
| 
 | |
| You can also track branches from repositories other than the one you
 | |
| cloned from, using linkgit:git-remote[1]:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git remote add staging git://git.kernel.org/.../gregkh/staging.git
 | |
| $ git fetch staging
 | |
| ...
 | |
| From git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/staging
 | |
|  * [new branch]      master     -> staging/master
 | |
|  * [new branch]      staging-linus -> staging/staging-linus
 | |
|  * [new branch]      staging-next -> staging/staging-next
 | |
| -------------------------------------------------
 | |
| 
 | |
| New remote-tracking branches will be stored under the shorthand name
 | |
| that you gave `git remote add`, in this case `staging`:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git branch -r
 | |
|   origin/HEAD -> origin/master
 | |
|   origin/master
 | |
|   staging/master
 | |
|   staging/staging-linus
 | |
|   staging/staging-next
 | |
| -------------------------------------------------
 | |
| 
 | |
| If you run `git fetch <remote>` later, the remote-tracking branches
 | |
| for the named `<remote>` will be updated.
 | |
| 
 | |
| If you examine the file `.git/config`, you will see that Git has added
 | |
| a new stanza:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ cat .git/config
 | |
| ...
 | |
| [remote "staging"]
 | |
| 	url = git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/staging.git
 | |
| 	fetch = +refs/heads/*:refs/remotes/staging/*
 | |
| ...
 | |
| -------------------------------------------------
 | |
| 
 | |
| This is what causes Git to track the remote's branches; you may modify
 | |
| or delete these configuration options by editing `.git/config` with a
 | |
| text editor.  (See the "CONFIGURATION FILE" section of
 | |
| linkgit:git-config[1] for details.)
 | |
| 
 | |
| [[exploring-git-history]]
 | |
| Exploring Git history
 | |
| =====================
 | |
| 
 | |
| Git is best thought of as a tool for storing the history of a
 | |
| collection of files.  It does this by storing compressed snapshots of
 | |
| the contents of a file hierarchy, together with "commits" which show
 | |
| the relationships between these snapshots.
 | |
| 
 | |
| Git provides extremely flexible and fast tools for exploring the
 | |
| history of a project.
 | |
| 
 | |
| We start with one specialized tool that is useful for finding the
 | |
| commit that introduced a bug into a project.
 | |
| 
 | |
| [[using-bisect]]
 | |
| How to use bisect to find a regression
 | |
| --------------------------------------
 | |
| 
 | |
| Suppose version 2.6.18 of your project worked, but the version at
 | |
| "master" crashes.  Sometimes the best way to find the cause of such a
 | |
| regression is to perform a brute-force search through the project's
 | |
| history to find the particular commit that caused the problem.  The
 | |
| linkgit:git-bisect[1] command can help you do this:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git bisect start
 | |
| $ git bisect good v2.6.18
 | |
| $ git bisect bad master
 | |
| Bisecting: 3537 revisions left to test after this
 | |
| [65934a9a028b88e83e2b0f8b36618fe503349f8e] BLOCK: Make USB storage depend on SCSI rather than selecting it [try #6]
 | |
| -------------------------------------------------
 | |
| 
 | |
| If you run `git branch` at this point, you'll see that Git has
 | |
| temporarily moved you in "(no branch)". HEAD is now detached from any
 | |
| branch and points directly to a commit (with commit id 65934) that
 | |
| is reachable from "master" but not from v2.6.18. Compile and test it,
 | |
| and see whether it crashes. Assume it does crash. Then:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git bisect bad
 | |
| Bisecting: 1769 revisions left to test after this
 | |
| [7eff82c8b1511017ae605f0c99ac275a7e21b867] i2c-core: Drop useless bitmaskings
 | |
| -------------------------------------------------
 | |
| 
 | |
| checks out an older version.  Continue like this, telling Git at each
 | |
| stage whether the version it gives you is good or bad, and notice
 | |
| that the number of revisions left to test is cut approximately in
 | |
| half each time.
 | |
| 
 | |
| After about 13 tests (in this case), it will output the commit id of
 | |
| the guilty commit.  You can then examine the commit with
 | |
| linkgit:git-show[1], find out who wrote it, and mail them your bug
 | |
| report with the commit id.  Finally, run
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git bisect reset
 | |
| -------------------------------------------------
 | |
| 
 | |
| to return you to the branch you were on before.
 | |
| 
 | |
| Note that the version which `git bisect` checks out for you at each
 | |
| point is just a suggestion, and you're free to try a different
 | |
| version if you think it would be a good idea.  For example,
 | |
| occasionally you may land on a commit that broke something unrelated;
 | |
| run
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git bisect visualize
 | |
| -------------------------------------------------
 | |
| 
 | |
| which will run gitk and label the commit it chose with a marker that
 | |
| says "bisect".  Choose a safe-looking commit nearby, note its commit
 | |
| id, and check it out with:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git reset --hard fb47ddb2db
 | |
| -------------------------------------------------
 | |
| 
 | |
| then test, run `bisect good` or `bisect bad` as appropriate, and
 | |
| continue.
 | |
| 
 | |
| Instead of `git bisect visualize` and then `git reset --hard
 | |
| fb47ddb2db`, you might just want to tell Git that you want to skip
 | |
| the current commit:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git bisect skip
 | |
| -------------------------------------------------
 | |
| 
 | |
| In this case, though, Git may not eventually be able to tell the first
 | |
| bad one between some first skipped commits and a later bad commit.
 | |
| 
 | |
| There are also ways to automate the bisecting process if you have a
 | |
| test script that can tell a good from a bad commit. See
 | |
| linkgit:git-bisect[1] for more information about this and other `git
 | |
| bisect` features.
 | |
| 
 | |
| [[naming-commits]]
 | |
| Naming commits
 | |
| --------------
 | |
| 
 | |
| We have seen several ways of naming commits already:
 | |
| 
 | |
| 	- 40-hexdigit object name
 | |
| 	- branch name: refers to the commit at the head of the given
 | |
| 	  branch
 | |
| 	- tag name: refers to the commit pointed to by the given tag
 | |
| 	  (we've seen branches and tags are special cases of
 | |
| 	  <<how-git-stores-references,references>>).
 | |
| 	- HEAD: refers to the head of the current branch
 | |
| 
 | |
| There are many more; see the "SPECIFYING REVISIONS" section of the
 | |
| linkgit:gitrevisions[7] man page for the complete list of ways to
 | |
| name revisions.  Some examples:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git show fb47ddb2 # the first few characters of the object name
 | |
| 		    # are usually enough to specify it uniquely
 | |
| $ git show HEAD^    # the parent of the HEAD commit
 | |
| $ git show HEAD^^   # the grandparent
 | |
| $ git show HEAD~4   # the great-great-grandparent
 | |
| -------------------------------------------------
 | |
| 
 | |
| Recall that merge commits may have more than one parent; by default,
 | |
| `^` and `~` follow the first parent listed in the commit, but you can
 | |
| also choose:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git show HEAD^1   # show the first parent of HEAD
 | |
| $ git show HEAD^2   # show the second parent of HEAD
 | |
| -------------------------------------------------
 | |
| 
 | |
| In addition to HEAD, there are several other special names for
 | |
| commits:
 | |
| 
 | |
| Merges (to be discussed later), as well as operations such as
 | |
| `git reset`, which change the currently checked-out commit, generally
 | |
| set ORIG_HEAD to the value HEAD had before the current operation.
 | |
| 
 | |
| The `git fetch` operation always stores the head of the last fetched
 | |
| branch in FETCH_HEAD.  For example, if you run `git fetch` without
 | |
| specifying a local branch as the target of the operation
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git fetch git://example.com/proj.git theirbranch
 | |
| -------------------------------------------------
 | |
| 
 | |
| the fetched commits will still be available from FETCH_HEAD.
 | |
| 
 | |
| When we discuss merges we'll also see the special name MERGE_HEAD,
 | |
| which refers to the other branch that we're merging in to the current
 | |
| branch.
 | |
| 
 | |
| The linkgit:git-rev-parse[1] command is a low-level command that is
 | |
| occasionally useful for translating some name for a commit to the object
 | |
| name for that commit:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git rev-parse origin
 | |
| e05db0fd4f31dde7005f075a84f96b360d05984b
 | |
| -------------------------------------------------
 | |
| 
 | |
| [[creating-tags]]
 | |
| Creating tags
 | |
| -------------
 | |
| 
 | |
| We can also create a tag to refer to a particular commit; after
 | |
| running
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git tag stable-1 1b2e1d63ff
 | |
| -------------------------------------------------
 | |
| 
 | |
| You can use `stable-1` to refer to the commit 1b2e1d63ff.
 | |
| 
 | |
| This creates a "lightweight" tag.  If you would also like to include a
 | |
| comment with the tag, and possibly sign it cryptographically, then you
 | |
| should create a tag object instead; see the linkgit:git-tag[1] man page
 | |
| for details.
 | |
| 
 | |
| [[browsing-revisions]]
 | |
| Browsing revisions
 | |
| ------------------
 | |
| 
 | |
| The linkgit:git-log[1] command can show lists of commits.  On its
 | |
| own, it shows all commits reachable from the parent commit; but you
 | |
| can also make more specific requests:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git log v2.5..	# commits since (not reachable from) v2.5
 | |
| $ git log test..master	# commits reachable from master but not test
 | |
| $ git log master..test	# ...reachable from test but not master
 | |
| $ git log master...test	# ...reachable from either test or master,
 | |
| 			#    but not both
 | |
| $ git log --since="2 weeks ago" # commits from the last 2 weeks
 | |
| $ git log Makefile      # commits which modify Makefile
 | |
| $ git log fs/		# ... which modify any file under fs/
 | |
| $ git log -S'foo()'	# commits which add or remove any file data
 | |
| 			# matching the string 'foo()'
 | |
| -------------------------------------------------
 | |
| 
 | |
| And of course you can combine all of these; the following finds
 | |
| commits since v2.5 which touch the `Makefile` or any file under `fs`:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git log v2.5.. Makefile fs/
 | |
| -------------------------------------------------
 | |
| 
 | |
| You can also ask git log to show patches:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git log -p
 | |
| -------------------------------------------------
 | |
| 
 | |
| See the `--pretty` option in the linkgit:git-log[1] man page for more
 | |
| display options.
 | |
| 
 | |
| Note that git log starts with the most recent commit and works
 | |
| backwards through the parents; however, since Git history can contain
 | |
| multiple independent lines of development, the particular order that
 | |
| commits are listed in may be somewhat arbitrary.
 | |
| 
 | |
| [[generating-diffs]]
 | |
| Generating diffs
 | |
| ----------------
 | |
| 
 | |
| You can generate diffs between any two versions using
 | |
| linkgit:git-diff[1]:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git diff master..test
 | |
| -------------------------------------------------
 | |
| 
 | |
| That will produce the diff between the tips of the two branches.  If
 | |
| you'd prefer to find the diff from their common ancestor to test, you
 | |
| can use three dots instead of two:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git diff master...test
 | |
| -------------------------------------------------
 | |
| 
 | |
| Sometimes what you want instead is a set of patches; for this you can
 | |
| use linkgit:git-format-patch[1]:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git format-patch master..test
 | |
| -------------------------------------------------
 | |
| 
 | |
| will generate a file with a patch for each commit reachable from test
 | |
| but not from master.
 | |
| 
 | |
| [[viewing-old-file-versions]]
 | |
| Viewing old file versions
 | |
| -------------------------
 | |
| 
 | |
| You can always view an old version of a file by just checking out the
 | |
| correct revision first.  But sometimes it is more convenient to be
 | |
| able to view an old version of a single file without checking
 | |
| anything out; this command does that:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git show v2.5:fs/locks.c
 | |
| -------------------------------------------------
 | |
| 
 | |
| Before the colon may be anything that names a commit, and after it
 | |
| may be any path to a file tracked by Git.
 | |
| 
 | |
| [[history-examples]]
 | |
| Examples
 | |
| --------
 | |
| 
 | |
| [[counting-commits-on-a-branch]]
 | |
| Counting the number of commits on a branch
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| Suppose you want to know how many commits you've made on `mybranch`
 | |
| since it diverged from `origin`:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git log --pretty=oneline origin..mybranch | wc -l
 | |
| -------------------------------------------------
 | |
| 
 | |
| Alternatively, you may often see this sort of thing done with the
 | |
| lower-level command linkgit:git-rev-list[1], which just lists the SHA-1's
 | |
| of all the given commits:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git rev-list origin..mybranch | wc -l
 | |
| -------------------------------------------------
 | |
| 
 | |
| [[checking-for-equal-branches]]
 | |
| Check whether two branches point at the same history
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| Suppose you want to check whether two branches point at the same point
 | |
| in history.
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git diff origin..master
 | |
| -------------------------------------------------
 | |
| 
 | |
| will tell you whether the contents of the project are the same at the
 | |
| two branches; in theory, however, it's possible that the same project
 | |
| contents could have been arrived at by two different historical
 | |
| routes.  You could compare the object names:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git rev-list origin
 | |
| e05db0fd4f31dde7005f075a84f96b360d05984b
 | |
| $ git rev-list master
 | |
| e05db0fd4f31dde7005f075a84f96b360d05984b
 | |
| -------------------------------------------------
 | |
| 
 | |
| Or you could recall that the `...` operator selects all commits
 | |
| reachable from either one reference or the other but not
 | |
| both; so
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git log origin...master
 | |
| -------------------------------------------------
 | |
| 
 | |
| will return no commits when the two branches are equal.
 | |
| 
 | |
| [[finding-tagged-descendants]]
 | |
| Find first tagged version including a given fix
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| Suppose you know that the commit e05db0fd fixed a certain problem.
 | |
| You'd like to find the earliest tagged release that contains that
 | |
| fix.
 | |
| 
 | |
| Of course, there may be more than one answer--if the history branched
 | |
| after commit e05db0fd, then there could be multiple "earliest" tagged
 | |
| releases.
 | |
| 
 | |
| You could just visually inspect the commits since e05db0fd:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ gitk e05db0fd..
 | |
| -------------------------------------------------
 | |
| 
 | |
| or you can use linkgit:git-name-rev[1], which will give the commit a
 | |
| name based on any tag it finds pointing to one of the commit's
 | |
| descendants:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git name-rev --tags e05db0fd
 | |
| e05db0fd tags/v1.5.0-rc1^0~23
 | |
| -------------------------------------------------
 | |
| 
 | |
| The linkgit:git-describe[1] command does the opposite, naming the
 | |
| revision using a tag on which the given commit is based:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git describe e05db0fd
 | |
| v1.5.0-rc0-260-ge05db0f
 | |
| -------------------------------------------------
 | |
| 
 | |
| but that may sometimes help you guess which tags might come after the
 | |
| given commit.
 | |
| 
 | |
| If you just want to verify whether a given tagged version contains a
 | |
| given commit, you could use linkgit:git-merge-base[1]:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git merge-base e05db0fd v1.5.0-rc1
 | |
| e05db0fd4f31dde7005f075a84f96b360d05984b
 | |
| -------------------------------------------------
 | |
| 
 | |
| The merge-base command finds a common ancestor of the given commits,
 | |
| and always returns one or the other in the case where one is a
 | |
| descendant of the other; so the above output shows that e05db0fd
 | |
| actually is an ancestor of v1.5.0-rc1.
 | |
| 
 | |
| Alternatively, note that
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git log v1.5.0-rc1..e05db0fd
 | |
| -------------------------------------------------
 | |
| 
 | |
| will produce empty output if and only if v1.5.0-rc1 includes e05db0fd,
 | |
| because it outputs only commits that are not reachable from v1.5.0-rc1.
 | |
| 
 | |
| As yet another alternative, the linkgit:git-show-branch[1] command lists
 | |
| the commits reachable from its arguments with a display on the left-hand
 | |
| side that indicates which arguments that commit is reachable from.
 | |
| So, if you run something like
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git show-branch e05db0fd v1.5.0-rc0 v1.5.0-rc1 v1.5.0-rc2
 | |
| ! [e05db0fd] Fix warnings in sha1_file.c - use C99 printf format if
 | |
| available
 | |
|  ! [v1.5.0-rc0] GIT v1.5.0 preview
 | |
|   ! [v1.5.0-rc1] GIT v1.5.0-rc1
 | |
|    ! [v1.5.0-rc2] GIT v1.5.0-rc2
 | |
| ...
 | |
| -------------------------------------------------
 | |
| 
 | |
| then a line like
 | |
| 
 | |
| -------------------------------------------------
 | |
| + ++ [e05db0fd] Fix warnings in sha1_file.c - use C99 printf format if
 | |
| available
 | |
| -------------------------------------------------
 | |
| 
 | |
| shows that e05db0fd is reachable from itself, from v1.5.0-rc1,
 | |
| and from v1.5.0-rc2, and not from v1.5.0-rc0.
 | |
| 
 | |
| [[showing-commits-unique-to-a-branch]]
 | |
| Showing commits unique to a given branch
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| Suppose you would like to see all the commits reachable from the branch
 | |
| head named `master` but not from any other head in your repository.
 | |
| 
 | |
| We can list all the heads in this repository with
 | |
| linkgit:git-show-ref[1]:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git show-ref --heads
 | |
| bf62196b5e363d73353a9dcf094c59595f3153b7 refs/heads/core-tutorial
 | |
| db768d5504c1bb46f63ee9d6e1772bd047e05bf9 refs/heads/maint
 | |
| a07157ac624b2524a059a3414e99f6f44bebc1e7 refs/heads/master
 | |
| 24dbc180ea14dc1aebe09f14c8ecf32010690627 refs/heads/tutorial-2
 | |
| 1e87486ae06626c2f31eaa63d26fc0fd646c8af2 refs/heads/tutorial-fixes
 | |
| -------------------------------------------------
 | |
| 
 | |
| We can get just the branch-head names, and remove `master`, with
 | |
| the help of the standard utilities cut and grep:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git show-ref --heads | cut -d' ' -f2 | grep -v '^refs/heads/master'
 | |
| refs/heads/core-tutorial
 | |
| refs/heads/maint
 | |
| refs/heads/tutorial-2
 | |
| refs/heads/tutorial-fixes
 | |
| -------------------------------------------------
 | |
| 
 | |
| And then we can ask to see all the commits reachable from master
 | |
| but not from these other heads:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ gitk master --not $( git show-ref --heads | cut -d' ' -f2 |
 | |
| 				grep -v '^refs/heads/master' )
 | |
| -------------------------------------------------
 | |
| 
 | |
| Obviously, endless variations are possible; for example, to see all
 | |
| commits reachable from some head but not from any tag in the repository:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ gitk $( git show-ref --heads ) --not  $( git show-ref --tags )
 | |
| -------------------------------------------------
 | |
| 
 | |
| (See linkgit:gitrevisions[7] for explanations of commit-selecting
 | |
| syntax such as `--not`.)
 | |
| 
 | |
| [[making-a-release]]
 | |
| Creating a changelog and tarball for a software release
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| The linkgit:git-archive[1] command can create a tar or zip archive from
 | |
| any version of a project; for example:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git archive -o latest.tar.gz --prefix=project/ HEAD
 | |
| -------------------------------------------------
 | |
| 
 | |
| will use HEAD to produce a gzipped tar archive in which each filename
 | |
| is preceded by `project/`.  The output file format is inferred from
 | |
| the output file extension if possible, see linkgit:git-archive[1] for
 | |
| details.
 | |
| 
 | |
| Versions of Git older than 1.7.7 don't know about the `tar.gz` format,
 | |
| you'll need to use gzip explicitly:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git archive --format=tar --prefix=project/ HEAD | gzip >latest.tar.gz
 | |
| -------------------------------------------------
 | |
| 
 | |
| If you're releasing a new version of a software project, you may want
 | |
| to simultaneously make a changelog to include in the release
 | |
| announcement.
 | |
| 
 | |
| Linus Torvalds, for example, makes new kernel releases by tagging them,
 | |
| then running:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ release-script 2.6.12 2.6.13-rc6 2.6.13-rc7
 | |
| -------------------------------------------------
 | |
| 
 | |
| where release-script is a shell script that looks like:
 | |
| 
 | |
| -------------------------------------------------
 | |
| #!/bin/sh
 | |
| stable="$1"
 | |
| last="$2"
 | |
| new="$3"
 | |
| echo "# git tag v$new"
 | |
| echo "git archive --prefix=linux-$new/ v$new | gzip -9 > ../linux-$new.tar.gz"
 | |
| echo "git diff v$stable v$new | gzip -9 > ../patch-$new.gz"
 | |
| echo "git log --no-merges v$new ^v$last > ../ChangeLog-$new"
 | |
| echo "git shortlog --no-merges v$new ^v$last > ../ShortLog"
 | |
| echo "git diff --stat --summary -M v$last v$new > ../diffstat-$new"
 | |
| -------------------------------------------------
 | |
| 
 | |
| and then he just cut-and-pastes the output commands after verifying that
 | |
| they look OK.
 | |
| 
 | |
| [[Finding-commits-With-given-Content]]
 | |
| Finding commits referencing a file with given content
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| Somebody hands you a copy of a file, and asks which commits modified a
 | |
| file such that it contained the given content either before or after the
 | |
| commit.  You can find out with this:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $  git log --raw --abbrev=40 --pretty=oneline |
 | |
| 	grep -B 1 `git hash-object filename`
 | |
| -------------------------------------------------
 | |
| 
 | |
| Figuring out why this works is left as an exercise to the (advanced)
 | |
| student.  The linkgit:git-log[1], linkgit:git-diff-tree[1], and
 | |
| linkgit:git-hash-object[1] man pages may prove helpful.
 | |
| 
 | |
| [[Developing-With-git]]
 | |
| Developing with Git
 | |
| ===================
 | |
| 
 | |
| [[telling-git-your-name]]
 | |
| Telling Git your name
 | |
| ---------------------
 | |
| 
 | |
| Before creating any commits, you should introduce yourself to Git.
 | |
| The easiest way to do so is to use linkgit:git-config[1]:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git config --global user.name 'Your Name Comes Here'
 | |
| $ git config --global user.email 'you@yourdomain.example.com'
 | |
| ------------------------------------------------
 | |
| 
 | |
| Which will add the following to a file named `.gitconfig` in your
 | |
| home directory:
 | |
| 
 | |
| ------------------------------------------------
 | |
| [user]
 | |
| 	name = Your Name Comes Here
 | |
| 	email = you@yourdomain.example.com
 | |
| ------------------------------------------------
 | |
| 
 | |
| See the "CONFIGURATION FILE" section of linkgit:git-config[1] for
 | |
| details on the configuration file.  The file is plain text, so you can
 | |
| also edit it with your favorite editor.
 | |
| 
 | |
| 
 | |
| [[creating-a-new-repository]]
 | |
| Creating a new repository
 | |
| -------------------------
 | |
| 
 | |
| Creating a new repository from scratch is very easy:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ mkdir project
 | |
| $ cd project
 | |
| $ git init
 | |
| -------------------------------------------------
 | |
| 
 | |
| If you have some initial content (say, a tarball):
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ tar xzvf project.tar.gz
 | |
| $ cd project
 | |
| $ git init
 | |
| $ git add . # include everything below ./ in the first commit:
 | |
| $ git commit
 | |
| -------------------------------------------------
 | |
| 
 | |
| [[how-to-make-a-commit]]
 | |
| How to make a commit
 | |
| --------------------
 | |
| 
 | |
| Creating a new commit takes three steps:
 | |
| 
 | |
| 	1. Making some changes to the working directory using your
 | |
| 	   favorite editor.
 | |
| 	2. Telling Git about your changes.
 | |
| 	3. Creating the commit using the content you told Git about
 | |
| 	   in step 2.
 | |
| 
 | |
| In practice, you can interleave and repeat steps 1 and 2 as many
 | |
| times as you want: in order to keep track of what you want committed
 | |
| at step 3, Git maintains a snapshot of the tree's contents in a
 | |
| special staging area called "the index."
 | |
| 
 | |
| At the beginning, the content of the index will be identical to
 | |
| that of the HEAD.  The command `git diff --cached`, which shows
 | |
| the difference between the HEAD and the index, should therefore
 | |
| produce no output at that point.
 | |
| 
 | |
| Modifying the index is easy:
 | |
| 
 | |
| To update the index with the contents of a new or modified file, use
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git add path/to/file
 | |
| -------------------------------------------------
 | |
| 
 | |
| To remove a file from the index and from the working tree, use
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git rm path/to/file
 | |
| -------------------------------------------------
 | |
| 
 | |
| After each step you can verify that
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git diff --cached
 | |
| -------------------------------------------------
 | |
| 
 | |
| always shows the difference between the HEAD and the index file--this
 | |
| is what you'd commit if you created the commit now--and that
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git diff
 | |
| -------------------------------------------------
 | |
| 
 | |
| shows the difference between the working tree and the index file.
 | |
| 
 | |
| Note that `git add` always adds just the current contents of a file
 | |
| to the index; further changes to the same file will be ignored unless
 | |
| you run `git add` on the file again.
 | |
| 
 | |
| When you're ready, just run
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git commit
 | |
| -------------------------------------------------
 | |
| 
 | |
| and Git will prompt you for a commit message and then create the new
 | |
| commit.  Check to make sure it looks like what you expected with
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git show
 | |
| -------------------------------------------------
 | |
| 
 | |
| As a special shortcut,
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git commit -a
 | |
| -------------------------------------------------
 | |
| 
 | |
| will update the index with any files that you've modified or removed
 | |
| and create a commit, all in one step.
 | |
| 
 | |
| A number of commands are useful for keeping track of what you're
 | |
| about to commit:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git diff --cached # difference between HEAD and the index; what
 | |
| 		    # would be committed if you ran "commit" now.
 | |
| $ git diff	    # difference between the index file and your
 | |
| 		    # working directory; changes that would not
 | |
| 		    # be included if you ran "commit" now.
 | |
| $ git diff HEAD	    # difference between HEAD and working tree; what
 | |
| 		    # would be committed if you ran "commit -a" now.
 | |
| $ git status	    # a brief per-file summary of the above.
 | |
| -------------------------------------------------
 | |
| 
 | |
| You can also use linkgit:git-gui[1] to create commits, view changes in
 | |
| the index and the working tree files, and individually select diff hunks
 | |
| for inclusion in the index (by right-clicking on the diff hunk and
 | |
| choosing "Stage Hunk For Commit").
 | |
| 
 | |
| [[creating-good-commit-messages]]
 | |
| Creating good commit messages
 | |
| -----------------------------
 | |
| 
 | |
| Though not required, it's a good idea to begin the commit message
 | |
| with a single short (less than 50 character) line summarizing the
 | |
| change, followed by a blank line and then a more thorough
 | |
| description.  The text up to the first blank line in a commit
 | |
| message is treated as the commit title, and that title is used
 | |
| throughout Git.  For example, linkgit:git-format-patch[1] turns a
 | |
| commit into email, and it uses the title on the Subject line and the
 | |
| rest of the commit in the body.
 | |
| 
 | |
| 
 | |
| [[ignoring-files]]
 | |
| Ignoring files
 | |
| --------------
 | |
| 
 | |
| A project will often generate files that you do 'not' want to track with Git.
 | |
| This typically includes files generated by a build process or temporary
 | |
| backup files made by your editor. Of course, 'not' tracking files with Git
 | |
| is just a matter of 'not' calling `git add` on them. But it quickly becomes
 | |
| annoying to have these untracked files lying around; e.g. they make
 | |
| `git add .` practically useless, and they keep showing up in the output of
 | |
| `git status`.
 | |
| 
 | |
| You can tell Git to ignore certain files by creating a file called
 | |
| `.gitignore` in the top level of your working directory, with contents
 | |
| such as:
 | |
| 
 | |
| -------------------------------------------------
 | |
| # Lines starting with '#' are considered comments.
 | |
| # Ignore any file named foo.txt.
 | |
| foo.txt
 | |
| # Ignore (generated) html files,
 | |
| *.html
 | |
| # except foo.html which is maintained by hand.
 | |
| !foo.html
 | |
| # Ignore objects and archives.
 | |
| *.[oa]
 | |
| -------------------------------------------------
 | |
| 
 | |
| See linkgit:gitignore[5] for a detailed explanation of the syntax.  You can
 | |
| also place .gitignore files in other directories in your working tree, and they
 | |
| will apply to those directories and their subdirectories.  The `.gitignore`
 | |
| files can be added to your repository like any other files (just run `git add
 | |
| .gitignore` and `git commit`, as usual), which is convenient when the exclude
 | |
| patterns (such as patterns matching build output files) would also make sense
 | |
| for other users who clone your repository.
 | |
| 
 | |
| If you wish the exclude patterns to affect only certain repositories
 | |
| (instead of every repository for a given project), you may instead put
 | |
| them in a file in your repository named `.git/info/exclude`, or in any
 | |
| file specified by the `core.excludesFile` configuration variable.
 | |
| Some Git commands can also take exclude patterns directly on the
 | |
| command line.  See linkgit:gitignore[5] for the details.
 | |
| 
 | |
| [[how-to-merge]]
 | |
| How to merge
 | |
| ------------
 | |
| 
 | |
| You can rejoin two diverging branches of development using
 | |
| linkgit:git-merge[1]:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git merge branchname
 | |
| -------------------------------------------------
 | |
| 
 | |
| merges the development in the branch `branchname` into the current
 | |
| branch.
 | |
| 
 | |
| A merge is made by combining the changes made in `branchname` and the
 | |
| changes made up to the latest commit in your current branch since
 | |
| their histories forked. The work tree is overwritten by the result of
 | |
| the merge when this combining is done cleanly, or overwritten by a
 | |
| half-merged results when this combining results in conflicts.
 | |
| Therefore, if you have uncommitted changes touching the same files as
 | |
| the ones impacted by the merge, Git will refuse to proceed. Most of
 | |
| the time, you will want to commit your changes before you can merge,
 | |
| and if you don't, then linkgit:git-stash[1] can take these changes
 | |
| away while you're doing the merge, and reapply them afterwards.
 | |
| 
 | |
| If the changes are independent enough, Git will automatically complete
 | |
| the merge and commit the result (or reuse an existing commit in case
 | |
| of <<fast-forwards,fast-forward>>, see below). On the other hand,
 | |
| if there are conflicts--for example, if the same file is
 | |
| modified in two different ways in the remote branch and the local
 | |
| branch--then you are warned; the output may look something like this:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git merge next
 | |
|  100% (4/4) done
 | |
| Auto-merged file.txt
 | |
| CONFLICT (content): Merge conflict in file.txt
 | |
| Automatic merge failed; fix conflicts and then commit the result.
 | |
| -------------------------------------------------
 | |
| 
 | |
| Conflict markers are left in the problematic files, and after
 | |
| you resolve the conflicts manually, you can update the index
 | |
| with the contents and run Git commit, as you normally would when
 | |
| creating a new file.
 | |
| 
 | |
| If you examine the resulting commit using gitk, you will see that it
 | |
| has two parents, one pointing to the top of the current branch, and
 | |
| one to the top of the other branch.
 | |
| 
 | |
| [[resolving-a-merge]]
 | |
| Resolving a merge
 | |
| -----------------
 | |
| 
 | |
| When a merge isn't resolved automatically, Git leaves the index and
 | |
| the working tree in a special state that gives you all the
 | |
| information you need to help resolve the merge.
 | |
| 
 | |
| Files with conflicts are marked specially in the index, so until you
 | |
| resolve the problem and update the index, linkgit:git-commit[1] will
 | |
| fail:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git commit
 | |
| file.txt: needs merge
 | |
| -------------------------------------------------
 | |
| 
 | |
| Also, linkgit:git-status[1] will list those files as "unmerged", and the
 | |
| files with conflicts will have conflict markers added, like this:
 | |
| 
 | |
| -------------------------------------------------
 | |
| <<<<<<< HEAD:file.txt
 | |
| Hello world
 | |
| =======
 | |
| Goodbye
 | |
| >>>>>>> 77976da35a11db4580b80ae27e8d65caf5208086:file.txt
 | |
| -------------------------------------------------
 | |
| 
 | |
| All you need to do is edit the files to resolve the conflicts, and then
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git add file.txt
 | |
| $ git commit
 | |
| -------------------------------------------------
 | |
| 
 | |
| Note that the commit message will already be filled in for you with
 | |
| some information about the merge.  Normally you can just use this
 | |
| default message unchanged, but you may add additional commentary of
 | |
| your own if desired.
 | |
| 
 | |
| The above is all you need to know to resolve a simple merge.  But Git
 | |
| also provides more information to help resolve conflicts:
 | |
| 
 | |
| [[conflict-resolution]]
 | |
| Getting conflict-resolution help during a merge
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| All of the changes that Git was able to merge automatically are
 | |
| already added to the index file, so linkgit:git-diff[1] shows only
 | |
| the conflicts.  It uses an unusual syntax:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git diff
 | |
| diff --cc file.txt
 | |
| index 802992c,2b60207..0000000
 | |
| --- a/file.txt
 | |
| +++ b/file.txt
 | |
| @@@ -1,1 -1,1 +1,5 @@@
 | |
| ++<<<<<<< HEAD:file.txt
 | |
|  +Hello world
 | |
| ++=======
 | |
| + Goodbye
 | |
| ++>>>>>>> 77976da35a11db4580b80ae27e8d65caf5208086:file.txt
 | |
| -------------------------------------------------
 | |
| 
 | |
| Recall that the commit which will be committed after we resolve this
 | |
| conflict will have two parents instead of the usual one: one parent
 | |
| will be HEAD, the tip of the current branch; the other will be the
 | |
| tip of the other branch, which is stored temporarily in MERGE_HEAD.
 | |
| 
 | |
| During the merge, the index holds three versions of each file.  Each of
 | |
| these three "file stages" represents a different version of the file:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git show :1:file.txt	# the file in a common ancestor of both branches
 | |
| $ git show :2:file.txt	# the version from HEAD.
 | |
| $ git show :3:file.txt	# the version from MERGE_HEAD.
 | |
| -------------------------------------------------
 | |
| 
 | |
| When you ask linkgit:git-diff[1] to show the conflicts, it runs a
 | |
| three-way diff between the conflicted merge results in the work tree with
 | |
| stages 2 and 3 to show only hunks whose contents come from both sides,
 | |
| mixed (in other words, when a hunk's merge results come only from stage 2,
 | |
| that part is not conflicting and is not shown.  Same for stage 3).
 | |
| 
 | |
| The diff above shows the differences between the working-tree version of
 | |
| file.txt and the stage 2 and stage 3 versions.  So instead of preceding
 | |
| each line by a single `+` or `-`, it now uses two columns: the first
 | |
| column is used for differences between the first parent and the working
 | |
| directory copy, and the second for differences between the second parent
 | |
| and the working directory copy.  (See the "COMBINED DIFF FORMAT" section
 | |
| of linkgit:git-diff-files[1] for a details of the format.)
 | |
| 
 | |
| After resolving the conflict in the obvious way (but before updating the
 | |
| index), the diff will look like:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git diff
 | |
| diff --cc file.txt
 | |
| index 802992c,2b60207..0000000
 | |
| --- a/file.txt
 | |
| +++ b/file.txt
 | |
| @@@ -1,1 -1,1 +1,1 @@@
 | |
| - Hello world
 | |
|  -Goodbye
 | |
| ++Goodbye world
 | |
| -------------------------------------------------
 | |
| 
 | |
| This shows that our resolved version deleted "Hello world" from the
 | |
| first parent, deleted "Goodbye" from the second parent, and added
 | |
| "Goodbye world", which was previously absent from both.
 | |
| 
 | |
| Some special diff options allow diffing the working directory against
 | |
| any of these stages:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git diff -1 file.txt		# diff against stage 1
 | |
| $ git diff --base file.txt	# same as the above
 | |
| $ git diff -2 file.txt		# diff against stage 2
 | |
| $ git diff --ours file.txt	# same as the above
 | |
| $ git diff -3 file.txt		# diff against stage 3
 | |
| $ git diff --theirs file.txt	# same as the above.
 | |
| -------------------------------------------------
 | |
| 
 | |
| The linkgit:git-log[1] and linkgit:gitk[1] commands also provide special help
 | |
| for merges:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git log --merge
 | |
| $ gitk --merge
 | |
| -------------------------------------------------
 | |
| 
 | |
| These will display all commits which exist only on HEAD or on
 | |
| MERGE_HEAD, and which touch an unmerged file.
 | |
| 
 | |
| You may also use linkgit:git-mergetool[1], which lets you merge the
 | |
| unmerged files using external tools such as Emacs or kdiff3.
 | |
| 
 | |
| Each time you resolve the conflicts in a file and update the index:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git add file.txt
 | |
| -------------------------------------------------
 | |
| 
 | |
| the different stages of that file will be "collapsed", after which
 | |
| `git diff` will (by default) no longer show diffs for that file.
 | |
| 
 | |
| [[undoing-a-merge]]
 | |
| Undoing a merge
 | |
| ---------------
 | |
| 
 | |
| If you get stuck and decide to just give up and throw the whole mess
 | |
| away, you can always return to the pre-merge state with
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git merge --abort
 | |
| -------------------------------------------------
 | |
| 
 | |
| Or, if you've already committed the merge that you want to throw away,
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git reset --hard ORIG_HEAD
 | |
| -------------------------------------------------
 | |
| 
 | |
| However, this last command can be dangerous in some cases--never
 | |
| throw away a commit you have already committed if that commit may
 | |
| itself have been merged into another branch, as doing so may confuse
 | |
| further merges.
 | |
| 
 | |
| [[fast-forwards]]
 | |
| Fast-forward merges
 | |
| -------------------
 | |
| 
 | |
| There is one special case not mentioned above, which is treated
 | |
| differently.  Normally, a merge results in a merge commit, with two
 | |
| parents, one pointing at each of the two lines of development that
 | |
| were merged.
 | |
| 
 | |
| However, if the current branch is an ancestor of the other--so every commit
 | |
| present in the current branch is already contained in the other branch--then Git
 | |
| just performs a "fast-forward"; the head of the current branch is moved forward
 | |
| to point at the head of the merged-in branch, without any new commits being
 | |
| created.
 | |
| 
 | |
| [[fixing-mistakes]]
 | |
| Fixing mistakes
 | |
| ---------------
 | |
| 
 | |
| If you've messed up the working tree, but haven't yet committed your
 | |
| mistake, you can return the entire working tree to the last committed
 | |
| state with
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git restore --staged --worktree :/
 | |
| -------------------------------------------------
 | |
| 
 | |
| If you make a commit that you later wish you hadn't, there are two
 | |
| fundamentally different ways to fix the problem:
 | |
| 
 | |
| 	1. You can create a new commit that undoes whatever was done
 | |
| 	by the old commit.  This is the correct thing if your
 | |
| 	mistake has already been made public.
 | |
| 
 | |
| 	2. You can go back and modify the old commit.  You should
 | |
| 	never do this if you have already made the history public;
 | |
| 	Git does not normally expect the "history" of a project to
 | |
| 	change, and cannot correctly perform repeated merges from
 | |
| 	a branch that has had its history changed.
 | |
| 
 | |
| [[reverting-a-commit]]
 | |
| Fixing a mistake with a new commit
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| Creating a new commit that reverts an earlier change is very easy;
 | |
| just pass the linkgit:git-revert[1] command a reference to the bad
 | |
| commit; for example, to revert the most recent commit:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git revert HEAD
 | |
| -------------------------------------------------
 | |
| 
 | |
| This will create a new commit which undoes the change in HEAD.  You
 | |
| will be given a chance to edit the commit message for the new commit.
 | |
| 
 | |
| You can also revert an earlier change, for example, the next-to-last:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git revert HEAD^
 | |
| -------------------------------------------------
 | |
| 
 | |
| In this case Git will attempt to undo the old change while leaving
 | |
| intact any changes made since then.  If more recent changes overlap
 | |
| with the changes to be reverted, then you will be asked to fix
 | |
| conflicts manually, just as in the case of <<resolving-a-merge,
 | |
| resolving a merge>>.
 | |
| 
 | |
| [[fixing-a-mistake-by-rewriting-history]]
 | |
| Fixing a mistake by rewriting history
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| If the problematic commit is the most recent commit, and you have not
 | |
| yet made that commit public, then you may just
 | |
| <<undoing-a-merge,destroy it using `git reset`>>.
 | |
| 
 | |
| Alternatively, you
 | |
| can edit the working directory and update the index to fix your
 | |
| mistake, just as if you were going to <<how-to-make-a-commit,create a
 | |
| new commit>>, then run
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git commit --amend
 | |
| -------------------------------------------------
 | |
| 
 | |
| which will replace the old commit by a new commit incorporating your
 | |
| changes, giving you a chance to edit the old commit message first.
 | |
| 
 | |
| Again, you should never do this to a commit that may already have
 | |
| been merged into another branch; use linkgit:git-revert[1] instead in
 | |
| that case.
 | |
| 
 | |
| It is also possible to replace commits further back in the history, but
 | |
| this is an advanced topic to be left for
 | |
| <<cleaning-up-history,another chapter>>.
 | |
| 
 | |
| [[checkout-of-path]]
 | |
| Checking out an old version of a file
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| In the process of undoing a previous bad change, you may find it
 | |
| useful to check out an older version of a particular file using
 | |
| linkgit:git-restore[1]. The command
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git restore --source=HEAD^ path/to/file
 | |
| -------------------------------------------------
 | |
| 
 | |
| replaces path/to/file by the contents it had in the commit HEAD^, and
 | |
| also updates the index to match.  It does not change branches.
 | |
| 
 | |
| If you just want to look at an old version of the file, without
 | |
| modifying the working directory, you can do that with
 | |
| linkgit:git-show[1]:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git show HEAD^:path/to/file
 | |
| -------------------------------------------------
 | |
| 
 | |
| which will display the given version of the file.
 | |
| 
 | |
| [[interrupted-work]]
 | |
| Temporarily setting aside work in progress
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| While you are in the middle of working on something complicated, you
 | |
| find an unrelated but obvious and trivial bug.  You would like to fix it
 | |
| before continuing.  You can use linkgit:git-stash[1] to save the current
 | |
| state of your work, and after fixing the bug (or, optionally after doing
 | |
| so on a different branch and then coming back), unstash the
 | |
| work-in-progress changes.
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git stash push -m "work in progress for foo feature"
 | |
| ------------------------------------------------
 | |
| 
 | |
| This command will save your changes away to the `stash`, and
 | |
| reset your working tree and the index to match the tip of your
 | |
| current branch.  Then you can make your fix as usual.
 | |
| 
 | |
| ------------------------------------------------
 | |
| ... edit and test ...
 | |
| $ git commit -a -m "blorpl: typofix"
 | |
| ------------------------------------------------
 | |
| 
 | |
| After that, you can go back to what you were working on with
 | |
| `git stash pop`:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git stash pop
 | |
| ------------------------------------------------
 | |
| 
 | |
| 
 | |
| [[ensuring-good-performance]]
 | |
| Ensuring good performance
 | |
| -------------------------
 | |
| 
 | |
| On large repositories, Git depends on compression to keep the history
 | |
| information from taking up too much space on disk or in memory.  Some
 | |
| Git commands may automatically run linkgit:git-gc[1], so you don't
 | |
| have to worry about running it manually.  However, compressing a large
 | |
| repository may take a while, so you may want to call `gc` explicitly
 | |
| to avoid automatic compression kicking in when it is not convenient.
 | |
| 
 | |
| 
 | |
| [[ensuring-reliability]]
 | |
| Ensuring reliability
 | |
| --------------------
 | |
| 
 | |
| [[checking-for-corruption]]
 | |
| Checking the repository for corruption
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| The linkgit:git-fsck[1] command runs a number of self-consistency checks
 | |
| on the repository, and reports on any problems.  This may take some
 | |
| time.
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git fsck
 | |
| dangling commit 7281251ddd2a61e38657c827739c57015671a6b3
 | |
| dangling commit 2706a059f258c6b245f298dc4ff2ccd30ec21a63
 | |
| dangling commit 13472b7c4b80851a1bc551779171dcb03655e9b5
 | |
| dangling blob 218761f9d90712d37a9c5e36f406f92202db07eb
 | |
| dangling commit bf093535a34a4d35731aa2bd90fe6b176302f14f
 | |
| dangling commit 8e4bec7f2ddaa268bef999853c25755452100f8e
 | |
| dangling tree d50bb86186bf27b681d25af89d3b5b68382e4085
 | |
| dangling tree b24c2473f1fd3d91352a624795be026d64c8841f
 | |
| ...
 | |
| -------------------------------------------------
 | |
| 
 | |
| You will see informational messages on dangling objects. They are objects
 | |
| that still exist in the repository but are no longer referenced by any of
 | |
| your branches, and can (and will) be removed after a while with `gc`.
 | |
| You can run `git fsck --no-dangling` to suppress these messages, and still
 | |
| view real errors.
 | |
| 
 | |
| [[recovering-lost-changes]]
 | |
| Recovering lost changes
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| [[reflogs]]
 | |
| Reflogs
 | |
| ^^^^^^^
 | |
| 
 | |
| Say you modify a branch with <<fixing-mistakes,`git reset --hard`>>,
 | |
| and then realize that the branch was the only reference you had to
 | |
| that point in history.
 | |
| 
 | |
| Fortunately, Git also keeps a log, called a "reflog", of all the
 | |
| previous values of each branch.  So in this case you can still find the
 | |
| old history using, for example,
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git log master@{1}
 | |
| -------------------------------------------------
 | |
| 
 | |
| This lists the commits reachable from the previous version of the
 | |
| `master` branch head.  This syntax can be used with any Git command
 | |
| that accepts a commit, not just with `git log`.  Some other examples:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git show master@{2}		# See where the branch pointed 2,
 | |
| $ git show master@{3}		# 3, ... changes ago.
 | |
| $ gitk master@{yesterday}	# See where it pointed yesterday,
 | |
| $ gitk master@{"1 week ago"}	# ... or last week
 | |
| $ git log --walk-reflogs master	# show reflog entries for master
 | |
| -------------------------------------------------
 | |
| 
 | |
| A separate reflog is kept for the HEAD, so
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git show HEAD@{"1 week ago"}
 | |
| -------------------------------------------------
 | |
| 
 | |
| will show what HEAD pointed to one week ago, not what the current branch
 | |
| pointed to one week ago.  This allows you to see the history of what
 | |
| you've checked out.
 | |
| 
 | |
| The reflogs are kept by default for 30 days, after which they may be
 | |
| pruned.  See linkgit:git-reflog[1] and linkgit:git-gc[1] to learn
 | |
| how to control this pruning, and see the "SPECIFYING REVISIONS"
 | |
| section of linkgit:gitrevisions[7] for details.
 | |
| 
 | |
| Note that the reflog history is very different from normal Git history.
 | |
| While normal history is shared by every repository that works on the
 | |
| same project, the reflog history is not shared: it tells you only about
 | |
| how the branches in your local repository have changed over time.
 | |
| 
 | |
| [[dangling-object-recovery]]
 | |
| Examining dangling objects
 | |
| ^^^^^^^^^^^^^^^^^^^^^^^^^^
 | |
| 
 | |
| In some situations the reflog may not be able to save you.  For example,
 | |
| suppose you delete a branch, then realize you need the history it
 | |
| contained.  The reflog is also deleted; however, if you have not yet
 | |
| pruned the repository, then you may still be able to find the lost
 | |
| commits in the dangling objects that `git fsck` reports.  See
 | |
| <<dangling-objects>> for the details.
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git fsck
 | |
| dangling commit 7281251ddd2a61e38657c827739c57015671a6b3
 | |
| dangling commit 2706a059f258c6b245f298dc4ff2ccd30ec21a63
 | |
| dangling commit 13472b7c4b80851a1bc551779171dcb03655e9b5
 | |
| ...
 | |
| -------------------------------------------------
 | |
| 
 | |
| You can examine
 | |
| one of those dangling commits with, for example,
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ gitk 7281251ddd --not --all
 | |
| ------------------------------------------------
 | |
| 
 | |
| which does what it sounds like: it says that you want to see the commit
 | |
| history that is described by the dangling commit(s), but not the
 | |
| history that is described by all your existing branches and tags.  Thus
 | |
| you get exactly the history reachable from that commit that is lost.
 | |
| (And notice that it might not be just one commit: we only report the
 | |
| "tip of the line" as being dangling, but there might be a whole deep
 | |
| and complex commit history that was dropped.)
 | |
| 
 | |
| If you decide you want the history back, you can always create a new
 | |
| reference pointing to it, for example, a new branch:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git branch recovered-branch 7281251ddd
 | |
| ------------------------------------------------
 | |
| 
 | |
| Other types of dangling objects (blobs and trees) are also possible, and
 | |
| dangling objects can arise in other situations.
 | |
| 
 | |
| 
 | |
| [[sharing-development]]
 | |
| Sharing development with others
 | |
| ===============================
 | |
| 
 | |
| [[getting-updates-With-git-pull]]
 | |
| Getting updates with git pull
 | |
| -----------------------------
 | |
| 
 | |
| After you clone a repository and commit a few changes of your own, you
 | |
| may wish to check the original repository for updates and merge them
 | |
| into your own work.
 | |
| 
 | |
| We have already seen <<Updating-a-repository-With-git-fetch,how to
 | |
| keep remote-tracking branches up to date>> with linkgit:git-fetch[1],
 | |
| and how to merge two branches.  So you can merge in changes from the
 | |
| original repository's master branch with:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git fetch
 | |
| $ git merge origin/master
 | |
| -------------------------------------------------
 | |
| 
 | |
| However, the linkgit:git-pull[1] command provides a way to do this in
 | |
| one step:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git pull origin master
 | |
| -------------------------------------------------
 | |
| 
 | |
| In fact, if you have `master` checked out, then this branch has been
 | |
| configured by `git clone` to get changes from the HEAD branch of the
 | |
| origin repository.  So often you can
 | |
| accomplish the above with just a simple
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git pull
 | |
| -------------------------------------------------
 | |
| 
 | |
| This command will fetch changes from the remote branches to your
 | |
| remote-tracking branches `origin/*`, and merge the default branch into
 | |
| the current branch.
 | |
| 
 | |
| More generally, a branch that is created from a remote-tracking branch
 | |
| will pull
 | |
| by default from that branch.  See the descriptions of the
 | |
| `branch.<name>.remote` and `branch.<name>.merge` options in
 | |
| linkgit:git-config[1], and the discussion of the `--track` option in
 | |
| linkgit:git-checkout[1], to learn how to control these defaults.
 | |
| 
 | |
| In addition to saving you keystrokes, `git pull` also helps you by
 | |
| producing a default commit message documenting the branch and
 | |
| repository that you pulled from.
 | |
| 
 | |
| (But note that no such commit will be created in the case of a
 | |
| <<fast-forwards,fast-forward>>; instead, your branch will just be
 | |
| updated to point to the latest commit from the upstream branch.)
 | |
| 
 | |
| The `git pull` command can also be given `.` as the "remote" repository,
 | |
| in which case it just merges in a branch from the current repository; so
 | |
| the commands
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git pull . branch
 | |
| $ git merge branch
 | |
| -------------------------------------------------
 | |
| 
 | |
| are roughly equivalent.
 | |
| 
 | |
| [[submitting-patches]]
 | |
| Submitting patches to a project
 | |
| -------------------------------
 | |
| 
 | |
| If you just have a few changes, the simplest way to submit them may
 | |
| just be to send them as patches in email:
 | |
| 
 | |
| First, use linkgit:git-format-patch[1]; for example:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git format-patch origin
 | |
| -------------------------------------------------
 | |
| 
 | |
| will produce a numbered series of files in the current directory, one
 | |
| for each patch in the current branch but not in `origin/HEAD`.
 | |
| 
 | |
| `git format-patch` can include an initial "cover letter". You can insert
 | |
| commentary on individual patches after the three dash line which
 | |
| `format-patch` places after the commit message but before the patch
 | |
| itself.  If you use `git notes` to track your cover letter material,
 | |
| `git format-patch --notes` will include the commit's notes in a similar
 | |
| manner.
 | |
| 
 | |
| You can then import these into your mail client and send them by
 | |
| hand.  However, if you have a lot to send at once, you may prefer to
 | |
| use the linkgit:git-send-email[1] script to automate the process.
 | |
| Consult the mailing list for your project first to determine
 | |
| their requirements for submitting patches.
 | |
| 
 | |
| [[importing-patches]]
 | |
| Importing patches to a project
 | |
| ------------------------------
 | |
| 
 | |
| Git also provides a tool called linkgit:git-am[1] (am stands for
 | |
| "apply mailbox"), for importing such an emailed series of patches.
 | |
| Just save all of the patch-containing messages, in order, into a
 | |
| single mailbox file, say `patches.mbox`, then run
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git am -3 patches.mbox
 | |
| -------------------------------------------------
 | |
| 
 | |
| Git will apply each patch in order; if any conflicts are found, it
 | |
| will stop, and you can fix the conflicts as described in
 | |
| "<<resolving-a-merge,Resolving a merge>>".  (The `-3` option tells
 | |
| Git to perform a merge; if you would prefer it just to abort and
 | |
| leave your tree and index untouched, you may omit that option.)
 | |
| 
 | |
| Once the index is updated with the results of the conflict
 | |
| resolution, instead of creating a new commit, just run
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git am --continue
 | |
| -------------------------------------------------
 | |
| 
 | |
| and Git will create the commit for you and continue applying the
 | |
| remaining patches from the mailbox.
 | |
| 
 | |
| The final result will be a series of commits, one for each patch in
 | |
| the original mailbox, with authorship and commit log message each
 | |
| taken from the message containing each patch.
 | |
| 
 | |
| [[public-repositories]]
 | |
| Public Git repositories
 | |
| -----------------------
 | |
| 
 | |
| Another way to submit changes to a project is to tell the maintainer
 | |
| of that project to pull the changes from your repository using
 | |
| linkgit:git-pull[1].  In the section "<<getting-updates-With-git-pull,
 | |
| Getting updates with `git pull`>>" we described this as a way to get
 | |
| updates from the "main" repository, but it works just as well in the
 | |
| other direction.
 | |
| 
 | |
| If you and the maintainer both have accounts on the same machine, then
 | |
| you can just pull changes from each other's repositories directly;
 | |
| commands that accept repository URLs as arguments will also accept a
 | |
| local directory name:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git clone /path/to/repository
 | |
| $ git pull /path/to/other/repository
 | |
| -------------------------------------------------
 | |
| 
 | |
| or an ssh URL:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git clone ssh://yourhost/~you/repository
 | |
| -------------------------------------------------
 | |
| 
 | |
| For projects with few developers, or for synchronizing a few private
 | |
| repositories, this may be all you need.
 | |
| 
 | |
| However, the more common way to do this is to maintain a separate public
 | |
| repository (usually on a different host) for others to pull changes
 | |
| from.  This is usually more convenient, and allows you to cleanly
 | |
| separate private work in progress from publicly visible work.
 | |
| 
 | |
| You will continue to do your day-to-day work in your personal
 | |
| repository, but periodically "push" changes from your personal
 | |
| repository into your public repository, allowing other developers to
 | |
| pull from that repository.  So the flow of changes, in a situation
 | |
| where there is one other developer with a public repository, looks
 | |
| like this:
 | |
| 
 | |
|                         you push
 | |
|   your personal repo ------------------> your public repo
 | |
| 	^                                     |
 | |
| 	|                                     |
 | |
| 	| you pull                            | they pull
 | |
| 	|                                     |
 | |
| 	|                                     |
 | |
|         |               they push             V
 | |
|   their public repo <------------------- their repo
 | |
| 
 | |
| We explain how to do this in the following sections.
 | |
| 
 | |
| [[setting-up-a-public-repository]]
 | |
| Setting up a public repository
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| Assume your personal repository is in the directory `~/proj`.  We
 | |
| first create a new clone of the repository and tell `git daemon` that it
 | |
| is meant to be public:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git clone --bare ~/proj proj.git
 | |
| $ touch proj.git/git-daemon-export-ok
 | |
| -------------------------------------------------
 | |
| 
 | |
| The resulting directory proj.git contains a "bare" git repository--it is
 | |
| just the contents of the `.git` directory, without any files checked out
 | |
| around it.
 | |
| 
 | |
| Next, copy `proj.git` to the server where you plan to host the
 | |
| public repository.  You can use scp, rsync, or whatever is most
 | |
| convenient.
 | |
| 
 | |
| [[exporting-via-git]]
 | |
| Exporting a Git repository via the Git protocol
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| This is the preferred method.
 | |
| 
 | |
| If someone else administers the server, they should tell you what
 | |
| directory to put the repository in, and what `git://` URL it will
 | |
| appear at.  You can then skip to the section
 | |
| "<<pushing-changes-to-a-public-repository,Pushing changes to a public
 | |
| repository>>", below.
 | |
| 
 | |
| Otherwise, all you need to do is start linkgit:git-daemon[1]; it will
 | |
| listen on port 9418.  By default, it will allow access to any directory
 | |
| that looks like a Git directory and contains the magic file
 | |
| git-daemon-export-ok.  Passing some directory paths as `git daemon`
 | |
| arguments will further restrict the exports to those paths.
 | |
| 
 | |
| You can also run `git daemon` as an inetd service; see the
 | |
| linkgit:git-daemon[1] man page for details.  (See especially the
 | |
| examples section.)
 | |
| 
 | |
| [[exporting-via-http]]
 | |
| Exporting a git repository via HTTP
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| The Git protocol gives better performance and reliability, but on a
 | |
| host with a web server set up, HTTP exports may be simpler to set up.
 | |
| 
 | |
| All you need to do is place the newly created bare Git repository in
 | |
| a directory that is exported by the web server, and make some
 | |
| adjustments to give web clients some extra information they need:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ mv proj.git /home/you/public_html/proj.git
 | |
| $ cd proj.git
 | |
| $ git --bare update-server-info
 | |
| $ mv hooks/post-update.sample hooks/post-update
 | |
| -------------------------------------------------
 | |
| 
 | |
| (For an explanation of the last two lines, see
 | |
| linkgit:git-update-server-info[1] and linkgit:githooks[5].)
 | |
| 
 | |
| Advertise the URL of `proj.git`.  Anybody else should then be able to
 | |
| clone or pull from that URL, for example with a command line like:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git clone http://yourserver.com/~you/proj.git
 | |
| -------------------------------------------------
 | |
| 
 | |
| (See also
 | |
| link:howto/setup-git-server-over-http.html[setup-git-server-over-http]
 | |
| for a slightly more sophisticated setup using WebDAV which also
 | |
| allows pushing over HTTP.)
 | |
| 
 | |
| [[pushing-changes-to-a-public-repository]]
 | |
| Pushing changes to a public repository
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| Note that the two techniques outlined above (exporting via
 | |
| <<exporting-via-http,http>> or <<exporting-via-git,git>>) allow other
 | |
| maintainers to fetch your latest changes, but they do not allow write
 | |
| access, which you will need to update the public repository with the
 | |
| latest changes created in your private repository.
 | |
| 
 | |
| The simplest way to do this is using linkgit:git-push[1] and ssh; to
 | |
| update the remote branch named `master` with the latest state of your
 | |
| branch named `master`, run
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git push ssh://yourserver.com/~you/proj.git master:master
 | |
| -------------------------------------------------
 | |
| 
 | |
| or just
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git push ssh://yourserver.com/~you/proj.git master
 | |
| -------------------------------------------------
 | |
| 
 | |
| As with `git fetch`, `git push` will complain if this does not result in a
 | |
| <<fast-forwards,fast-forward>>; see the following section for details on
 | |
| handling this case.
 | |
| 
 | |
| Note that the target of a `push` is normally a
 | |
| <<def_bare_repository,bare>> repository.  You can also push to a
 | |
| repository that has a checked-out working tree, but a push to update the
 | |
| currently checked-out branch is denied by default to prevent confusion.
 | |
| See the description of the receive.denyCurrentBranch option
 | |
| in linkgit:git-config[1] for details.
 | |
| 
 | |
| As with `git fetch`, you may also set up configuration options to
 | |
| save typing; so, for example:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git remote add public-repo ssh://yourserver.com/~you/proj.git
 | |
| -------------------------------------------------
 | |
| 
 | |
| adds the following to `.git/config`:
 | |
| 
 | |
| -------------------------------------------------
 | |
| [remote "public-repo"]
 | |
| 	url = yourserver.com:proj.git
 | |
| 	fetch = +refs/heads/*:refs/remotes/example/*
 | |
| -------------------------------------------------
 | |
| 
 | |
| which lets you do the same push with just
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git push public-repo master
 | |
| -------------------------------------------------
 | |
| 
 | |
| See the explanations of the `remote.<name>.url`,
 | |
| `branch.<name>.remote`, and `remote.<name>.push` options in
 | |
| linkgit:git-config[1] for details.
 | |
| 
 | |
| [[forcing-push]]
 | |
| What to do when a push fails
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| If a push would not result in a <<fast-forwards,fast-forward>> of the
 | |
| remote branch, then it will fail with an error like:
 | |
| 
 | |
| -------------------------------------------------
 | |
|  ! [rejected]        master -> master (non-fast-forward)
 | |
| error: failed to push some refs to '...'
 | |
| hint: Updates were rejected because the tip of your current branch is behind
 | |
| hint: its remote counterpart. Integrate the remote changes (e.g.
 | |
| hint: 'git pull ...') before pushing again.
 | |
| hint: See the 'Note about fast-forwards' in 'git push --help' for details.
 | |
| -------------------------------------------------
 | |
| 
 | |
| This can happen, for example, if you:
 | |
| 
 | |
| 	- use `git reset --hard` to remove already-published commits, or
 | |
| 	- use `git commit --amend` to replace already-published commits
 | |
| 	  (as in <<fixing-a-mistake-by-rewriting-history>>), or
 | |
| 	- use `git rebase` to rebase any already-published commits (as
 | |
| 	  in <<using-git-rebase>>).
 | |
| 
 | |
| You may force `git push` to perform the update anyway by preceding the
 | |
| branch name with a plus sign:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git push ssh://yourserver.com/~you/proj.git +master
 | |
| -------------------------------------------------
 | |
| 
 | |
| Note the addition of the `+` sign.  Alternatively, you can use the
 | |
| `-f` flag to force the remote update, as in:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git push -f ssh://yourserver.com/~you/proj.git master
 | |
| -------------------------------------------------
 | |
| 
 | |
| Normally whenever a branch head in a public repository is modified, it
 | |
| is modified to point to a descendant of the commit that it pointed to
 | |
| before.  By forcing a push in this situation, you break that convention.
 | |
| (See <<problems-With-rewriting-history>>.)
 | |
| 
 | |
| Nevertheless, this is a common practice for people that need a simple
 | |
| way to publish a work-in-progress patch series, and it is an acceptable
 | |
| compromise as long as you warn other developers that this is how you
 | |
| intend to manage the branch.
 | |
| 
 | |
| It's also possible for a push to fail in this way when other people have
 | |
| the right to push to the same repository.  In that case, the correct
 | |
| solution is to retry the push after first updating your work: either by a
 | |
| pull, or by a fetch followed by a rebase; see the
 | |
| <<setting-up-a-shared-repository,next section>> and
 | |
| linkgit:gitcvs-migration[7] for more.
 | |
| 
 | |
| [[setting-up-a-shared-repository]]
 | |
| Setting up a shared repository
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| Another way to collaborate is by using a model similar to that
 | |
| commonly used in CVS, where several developers with special rights
 | |
| all push to and pull from a single shared repository.  See
 | |
| linkgit:gitcvs-migration[7] for instructions on how to
 | |
| set this up.
 | |
| 
 | |
| However, while there is nothing wrong with Git's support for shared
 | |
| repositories, this mode of operation is not generally recommended,
 | |
| simply because the mode of collaboration that Git supports--by
 | |
| exchanging patches and pulling from public repositories--has so many
 | |
| advantages over the central shared repository:
 | |
| 
 | |
| 	- Git's ability to quickly import and merge patches allows a
 | |
| 	  single maintainer to process incoming changes even at very
 | |
| 	  high rates.  And when that becomes too much, `git pull` provides
 | |
| 	  an easy way for that maintainer to delegate this job to other
 | |
| 	  maintainers while still allowing optional review of incoming
 | |
| 	  changes.
 | |
| 	- Since every developer's repository has the same complete copy
 | |
| 	  of the project history, no repository is special, and it is
 | |
| 	  trivial for another developer to take over maintenance of a
 | |
| 	  project, either by mutual agreement, or because a maintainer
 | |
| 	  becomes unresponsive or difficult to work with.
 | |
| 	- The lack of a central group of "committers" means there is
 | |
| 	  less need for formal decisions about who is "in" and who is
 | |
| 	  "out".
 | |
| 
 | |
| [[setting-up-gitweb]]
 | |
| Allowing web browsing of a repository
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| The gitweb cgi script provides users an easy way to browse your
 | |
| project's revisions, file contents and logs without having to install
 | |
| Git. Features like RSS/Atom feeds and blame/annotation details may
 | |
| optionally be enabled.
 | |
| 
 | |
| The linkgit:git-instaweb[1] command provides a simple way to start
 | |
| browsing the repository using gitweb. The default server when using
 | |
| instaweb is lighttpd.
 | |
| 
 | |
| See the file gitweb/INSTALL in the Git source tree and
 | |
| linkgit:gitweb[1] for instructions on details setting up a permanent
 | |
| installation with a CGI or Perl capable server.
 | |
| 
 | |
| [[how-to-get-a-git-repository-with-minimal-history]]
 | |
| How to get a Git repository with minimal history
 | |
| ------------------------------------------------
 | |
| 
 | |
| A <<def_shallow_clone,shallow clone>>, with its truncated
 | |
| history, is useful when one is interested only in recent history
 | |
| of a project and getting full history from the upstream is
 | |
| expensive.
 | |
| 
 | |
| A <<def_shallow_clone,shallow clone>> is created by specifying
 | |
| the linkgit:git-clone[1] `--depth` switch. The depth can later be
 | |
| changed with the linkgit:git-fetch[1] `--depth` switch, or full
 | |
| history restored with `--unshallow`.
 | |
| 
 | |
| Merging inside a <<def_shallow_clone,shallow clone>> will work as long
 | |
| as a merge base is in the recent history.
 | |
| Otherwise, it will be like merging unrelated histories and may
 | |
| have to result in huge conflicts.  This limitation may make such
 | |
| a repository unsuitable to be used in merge based workflows.
 | |
| 
 | |
| [[sharing-development-examples]]
 | |
| Examples
 | |
| --------
 | |
| 
 | |
| [[maintaining-topic-branches]]
 | |
| Maintaining topic branches for a Linux subsystem maintainer
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| This describes how Tony Luck uses Git in his role as maintainer of the
 | |
| IA64 architecture for the Linux kernel.
 | |
| 
 | |
| He uses two public branches:
 | |
| 
 | |
|  - A "test" tree into which patches are initially placed so that they
 | |
|    can get some exposure when integrated with other ongoing development.
 | |
|    This tree is available to Andrew for pulling into -mm whenever he
 | |
|    wants.
 | |
| 
 | |
|  - A "release" tree into which tested patches are moved for final sanity
 | |
|    checking, and as a vehicle to send them upstream to Linus (by sending
 | |
|    him a "please pull" request.)
 | |
| 
 | |
| He also uses a set of temporary branches ("topic branches"), each
 | |
| containing a logical grouping of patches.
 | |
| 
 | |
| To set this up, first create your work tree by cloning Linus's public
 | |
| tree:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git clone git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git work
 | |
| $ cd work
 | |
| -------------------------------------------------
 | |
| 
 | |
| Linus's tree will be stored in the remote-tracking branch named origin/master,
 | |
| and can be updated using linkgit:git-fetch[1]; you can track other
 | |
| public trees using linkgit:git-remote[1] to set up a "remote" and
 | |
| linkgit:git-fetch[1] to keep them up to date; see
 | |
| <<repositories-and-branches>>.
 | |
| 
 | |
| Now create the branches in which you are going to work; these start out
 | |
| at the current tip of origin/master branch, and should be set up (using
 | |
| the `--track` option to linkgit:git-branch[1]) to merge changes in from
 | |
| Linus by default.
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git branch --track test origin/master
 | |
| $ git branch --track release origin/master
 | |
| -------------------------------------------------
 | |
| 
 | |
| These can be easily kept up to date using linkgit:git-pull[1].
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git switch test && git pull
 | |
| $ git switch release && git pull
 | |
| -------------------------------------------------
 | |
| 
 | |
| Important note!  If you have any local changes in these branches, then
 | |
| this merge will create a commit object in the history (with no local
 | |
| changes Git will simply do a "fast-forward" merge).  Many people dislike
 | |
| the "noise" that this creates in the Linux history, so you should avoid
 | |
| doing this capriciously in the `release` branch, as these noisy commits
 | |
| will become part of the permanent history when you ask Linus to pull
 | |
| from the release branch.
 | |
| 
 | |
| A few configuration variables (see linkgit:git-config[1]) can
 | |
| make it easy to push both branches to your public tree.  (See
 | |
| <<setting-up-a-public-repository>>.)
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ cat >> .git/config <<EOF
 | |
| [remote "mytree"]
 | |
| 	url =  master.kernel.org:/pub/scm/linux/kernel/git/aegl/linux.git
 | |
| 	push = release
 | |
| 	push = test
 | |
| EOF
 | |
| -------------------------------------------------
 | |
| 
 | |
| Then you can push both the test and release trees using
 | |
| linkgit:git-push[1]:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git push mytree
 | |
| -------------------------------------------------
 | |
| 
 | |
| or push just one of the test and release branches using:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git push mytree test
 | |
| -------------------------------------------------
 | |
| 
 | |
| or
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git push mytree release
 | |
| -------------------------------------------------
 | |
| 
 | |
| Now to apply some patches from the community.  Think of a short
 | |
| snappy name for a branch to hold this patch (or related group of
 | |
| patches), and create a new branch from a recent stable tag of
 | |
| Linus's branch. Picking a stable base for your branch will:
 | |
| 1) help you: by avoiding inclusion of unrelated and perhaps lightly
 | |
| tested changes
 | |
| 2) help future bug hunters that use `git bisect` to find problems
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git switch -c speed-up-spinlocks v2.6.35
 | |
| -------------------------------------------------
 | |
| 
 | |
| Now you apply the patch(es), run some tests, and commit the change(s).  If
 | |
| the patch is a multi-part series, then you should apply each as a separate
 | |
| commit to this branch.
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ ... patch ... test  ... commit [ ... patch ... test ... commit ]*
 | |
| -------------------------------------------------
 | |
| 
 | |
| When you are happy with the state of this change, you can merge it into the
 | |
| "test" branch in preparation to make it public:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git switch test && git merge speed-up-spinlocks
 | |
| -------------------------------------------------
 | |
| 
 | |
| It is unlikely that you would have any conflicts here ... but you might if you
 | |
| spent a while on this step and had also pulled new versions from upstream.
 | |
| 
 | |
| Sometime later when enough time has passed and testing done, you can pull the
 | |
| same branch into the `release` tree ready to go upstream.  This is where you
 | |
| see the value of keeping each patch (or patch series) in its own branch.  It
 | |
| means that the patches can be moved into the `release` tree in any order.
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git switch release && git merge speed-up-spinlocks
 | |
| -------------------------------------------------
 | |
| 
 | |
| After a while, you will have a number of branches, and despite the
 | |
| well chosen names you picked for each of them, you may forget what
 | |
| they are for, or what status they are in.  To get a reminder of what
 | |
| changes are in a specific branch, use:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git log linux..branchname | git shortlog
 | |
| -------------------------------------------------
 | |
| 
 | |
| To see whether it has already been merged into the test or release branches,
 | |
| use:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git log test..branchname
 | |
| -------------------------------------------------
 | |
| 
 | |
| or
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git log release..branchname
 | |
| -------------------------------------------------
 | |
| 
 | |
| (If this branch has not yet been merged, you will see some log entries.
 | |
| If it has been merged, then there will be no output.)
 | |
| 
 | |
| Once a patch completes the great cycle (moving from test to release,
 | |
| then pulled by Linus, and finally coming back into your local
 | |
| `origin/master` branch), the branch for this change is no longer needed.
 | |
| You detect this when the output from:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git log origin..branchname
 | |
| -------------------------------------------------
 | |
| 
 | |
| is empty.  At this point the branch can be deleted:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git branch -d branchname
 | |
| -------------------------------------------------
 | |
| 
 | |
| Some changes are so trivial that it is not necessary to create a separate
 | |
| branch and then merge into each of the test and release branches.  For
 | |
| these changes, just apply directly to the `release` branch, and then
 | |
| merge that into the `test` branch.
 | |
| 
 | |
| After pushing your work to `mytree`, you can use
 | |
| linkgit:git-request-pull[1] to prepare a "please pull" request message
 | |
| to send to Linus:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git push mytree
 | |
| $ git request-pull origin mytree release
 | |
| -------------------------------------------------
 | |
| 
 | |
| Here are some of the scripts that simplify all this even further.
 | |
| 
 | |
| -------------------------------------------------
 | |
| ==== update script ====
 | |
| # Update a branch in my Git tree.  If the branch to be updated
 | |
| # is origin, then pull from kernel.org.  Otherwise merge
 | |
| # origin/master branch into test|release branch
 | |
| 
 | |
| case "$1" in
 | |
| test|release)
 | |
| 	git checkout $1 && git pull . origin
 | |
| 	;;
 | |
| origin)
 | |
| 	before=$(git rev-parse refs/remotes/origin/master)
 | |
| 	git fetch origin
 | |
| 	after=$(git rev-parse refs/remotes/origin/master)
 | |
| 	if [ $before != $after ]
 | |
| 	then
 | |
| 		git log $before..$after | git shortlog
 | |
| 	fi
 | |
| 	;;
 | |
| *)
 | |
| 	echo "usage: $0 origin|test|release" 1>&2
 | |
| 	exit 1
 | |
| 	;;
 | |
| esac
 | |
| -------------------------------------------------
 | |
| 
 | |
| -------------------------------------------------
 | |
| ==== merge script ====
 | |
| # Merge a branch into either the test or release branch
 | |
| 
 | |
| pname=$0
 | |
| 
 | |
| usage()
 | |
| {
 | |
| 	echo "usage: $pname branch test|release" 1>&2
 | |
| 	exit 1
 | |
| }
 | |
| 
 | |
| git show-ref -q --verify -- refs/heads/"$1" || {
 | |
| 	echo "Can't see branch <$1>" 1>&2
 | |
| 	usage
 | |
| }
 | |
| 
 | |
| case "$2" in
 | |
| test|release)
 | |
| 	if [ $(git log $2..$1 | wc -c) -eq 0 ]
 | |
| 	then
 | |
| 		echo $1 already merged into $2 1>&2
 | |
| 		exit 1
 | |
| 	fi
 | |
| 	git checkout $2 && git pull . $1
 | |
| 	;;
 | |
| *)
 | |
| 	usage
 | |
| 	;;
 | |
| esac
 | |
| -------------------------------------------------
 | |
| 
 | |
| -------------------------------------------------
 | |
| ==== status script ====
 | |
| # report on status of my ia64 Git tree
 | |
| 
 | |
| gb=$(tput setab 2)
 | |
| rb=$(tput setab 1)
 | |
| restore=$(tput setab 9)
 | |
| 
 | |
| if [ `git rev-list test..release | wc -c` -gt 0 ]
 | |
| then
 | |
| 	echo $rb Warning: commits in release that are not in test $restore
 | |
| 	git log test..release
 | |
| fi
 | |
| 
 | |
| for branch in `git show-ref --heads | sed 's|^.*/||'`
 | |
| do
 | |
| 	if [ $branch = test -o $branch = release ]
 | |
| 	then
 | |
| 		continue
 | |
| 	fi
 | |
| 
 | |
| 	echo -n $gb ======= $branch ====== $restore " "
 | |
| 	status=
 | |
| 	for ref in test release origin/master
 | |
| 	do
 | |
| 		if [ `git rev-list $ref..$branch | wc -c` -gt 0 ]
 | |
| 		then
 | |
| 			status=$status${ref:0:1}
 | |
| 		fi
 | |
| 	done
 | |
| 	case $status in
 | |
| 	trl)
 | |
| 		echo $rb Need to pull into test $restore
 | |
| 		;;
 | |
| 	rl)
 | |
| 		echo "In test"
 | |
| 		;;
 | |
| 	l)
 | |
| 		echo "Waiting for linus"
 | |
| 		;;
 | |
| 	"")
 | |
| 		echo $rb All done $restore
 | |
| 		;;
 | |
| 	*)
 | |
| 		echo $rb "<$status>" $restore
 | |
| 		;;
 | |
| 	esac
 | |
| 	git log origin/master..$branch | git shortlog
 | |
| done
 | |
| -------------------------------------------------
 | |
| 
 | |
| 
 | |
| [[cleaning-up-history]]
 | |
| Rewriting history and maintaining patch series
 | |
| ==============================================
 | |
| 
 | |
| Normally commits are only added to a project, never taken away or
 | |
| replaced.  Git is designed with this assumption, and violating it will
 | |
| cause Git's merge machinery (for example) to do the wrong thing.
 | |
| 
 | |
| However, there is a situation in which it can be useful to violate this
 | |
| assumption.
 | |
| 
 | |
| [[patch-series]]
 | |
| Creating the perfect patch series
 | |
| ---------------------------------
 | |
| 
 | |
| Suppose you are a contributor to a large project, and you want to add a
 | |
| complicated feature, and to present it to the other developers in a way
 | |
| that makes it easy for them to read your changes, verify that they are
 | |
| correct, and understand why you made each change.
 | |
| 
 | |
| If you present all of your changes as a single patch (or commit), they
 | |
| may find that it is too much to digest all at once.
 | |
| 
 | |
| If you present them with the entire history of your work, complete with
 | |
| mistakes, corrections, and dead ends, they may be overwhelmed.
 | |
| 
 | |
| So the ideal is usually to produce a series of patches such that:
 | |
| 
 | |
| 	1. Each patch can be applied in order.
 | |
| 
 | |
| 	2. Each patch includes a single logical change, together with a
 | |
| 	   message explaining the change.
 | |
| 
 | |
| 	3. No patch introduces a regression: after applying any initial
 | |
| 	   part of the series, the resulting project still compiles and
 | |
| 	   works, and has no bugs that it didn't have before.
 | |
| 
 | |
| 	4. The complete series produces the same end result as your own
 | |
| 	   (probably much messier!) development process did.
 | |
| 
 | |
| We will introduce some tools that can help you do this, explain how to
 | |
| use them, and then explain some of the problems that can arise because
 | |
| you are rewriting history.
 | |
| 
 | |
| [[using-git-rebase]]
 | |
| Keeping a patch series up to date using git rebase
 | |
| --------------------------------------------------
 | |
| 
 | |
| Suppose that you create a branch `mywork` on a remote-tracking branch
 | |
| `origin`, and create some commits on top of it:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git switch -c mywork origin
 | |
| $ vi file.txt
 | |
| $ git commit
 | |
| $ vi otherfile.txt
 | |
| $ git commit
 | |
| ...
 | |
| -------------------------------------------------
 | |
| 
 | |
| You have performed no merges into mywork, so it is just a simple linear
 | |
| sequence of patches on top of `origin`:
 | |
| 
 | |
| ................................................
 | |
|  o--o--O <-- origin
 | |
|         \
 | |
| 	 a--b--c <-- mywork
 | |
| ................................................
 | |
| 
 | |
| Some more interesting work has been done in the upstream project, and
 | |
| `origin` has advanced:
 | |
| 
 | |
| ................................................
 | |
|  o--o--O--o--o--o <-- origin
 | |
|         \
 | |
|          a--b--c <-- mywork
 | |
| ................................................
 | |
| 
 | |
| At this point, you could use `pull` to merge your changes back in;
 | |
| the result would create a new merge commit, like this:
 | |
| 
 | |
| ................................................
 | |
|  o--o--O--o--o--o <-- origin
 | |
|         \        \
 | |
|          a--b--c--m <-- mywork
 | |
| ................................................
 | |
| 
 | |
| However, if you prefer to keep the history in mywork a simple series of
 | |
| commits without any merges, you may instead choose to use
 | |
| linkgit:git-rebase[1]:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git switch mywork
 | |
| $ git rebase origin
 | |
| -------------------------------------------------
 | |
| 
 | |
| This will remove each of your commits from mywork, temporarily saving
 | |
| them as patches (in a directory named `.git/rebase-apply`), update mywork to
 | |
| point at the latest version of origin, then apply each of the saved
 | |
| patches to the new mywork.  The result will look like:
 | |
| 
 | |
| 
 | |
| ................................................
 | |
|  o--o--O--o--o--o <-- origin
 | |
| 		 \
 | |
| 		  a'--b'--c' <-- mywork
 | |
| ................................................
 | |
| 
 | |
| In the process, it may discover conflicts.  In that case it will stop
 | |
| and allow you to fix the conflicts; after fixing conflicts, use `git add`
 | |
| to update the index with those contents, and then, instead of
 | |
| running `git commit`, just run
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git rebase --continue
 | |
| -------------------------------------------------
 | |
| 
 | |
| and Git will continue applying the rest of the patches.
 | |
| 
 | |
| At any point you may use the `--abort` option to abort this process and
 | |
| return mywork to the state it had before you started the rebase:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git rebase --abort
 | |
| -------------------------------------------------
 | |
| 
 | |
| If you need to reorder or edit a number of commits in a branch, it may
 | |
| be easier to use `git rebase -i`, which allows you to reorder and
 | |
| squash commits, as well as marking them for individual editing during
 | |
| the rebase.  See <<interactive-rebase>> for details, and
 | |
| <<reordering-patch-series>> for alternatives.
 | |
| 
 | |
| [[rewriting-one-commit]]
 | |
| Rewriting a single commit
 | |
| -------------------------
 | |
| 
 | |
| We saw in <<fixing-a-mistake-by-rewriting-history>> that you can replace the
 | |
| most recent commit using
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git commit --amend
 | |
| -------------------------------------------------
 | |
| 
 | |
| which will replace the old commit by a new commit incorporating your
 | |
| changes, giving you a chance to edit the old commit message first.
 | |
| This is useful for fixing typos in your last commit, or for adjusting
 | |
| the patch contents of a poorly staged commit.
 | |
| 
 | |
| If you need to amend commits from deeper in your history, you can
 | |
| use <<interactive-rebase,interactive rebase's `edit` instruction>>.
 | |
| 
 | |
| [[reordering-patch-series]]
 | |
| Reordering or selecting from a patch series
 | |
| -------------------------------------------
 | |
| 
 | |
| Sometimes you want to edit a commit deeper in your history.  One
 | |
| approach is to use `git format-patch` to create a series of patches
 | |
| and then reset the state to before the patches:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git format-patch origin
 | |
| $ git reset --hard origin
 | |
| -------------------------------------------------
 | |
| 
 | |
| Then modify, reorder, or eliminate patches as needed before applying
 | |
| them again with linkgit:git-am[1]:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git am *.patch
 | |
| -------------------------------------------------
 | |
| 
 | |
| [[interactive-rebase]]
 | |
| Using interactive rebases
 | |
| -------------------------
 | |
| 
 | |
| You can also edit a patch series with an interactive rebase.  This is
 | |
| the same as <<reordering-patch-series,reordering a patch series using
 | |
| `format-patch`>>, so use whichever interface you like best.
 | |
| 
 | |
| Rebase your current HEAD on the last commit you want to retain as-is.
 | |
| For example, if you want to reorder the last 5 commits, use:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git rebase -i HEAD~5
 | |
| -------------------------------------------------
 | |
| 
 | |
| This will open your editor with a list of steps to be taken to perform
 | |
| your rebase.
 | |
| 
 | |
| -------------------------------------------------
 | |
| pick deadbee The oneline of this commit
 | |
| pick fa1afe1 The oneline of the next commit
 | |
| ...
 | |
| 
 | |
| # Rebase c0ffeee..deadbee onto c0ffeee
 | |
| #
 | |
| # Commands:
 | |
| #  p, pick = use commit
 | |
| #  r, reword = use commit, but edit the commit message
 | |
| #  e, edit = use commit, but stop for amending
 | |
| #  s, squash = use commit, but meld into previous commit
 | |
| #  f, fixup = like "squash", but discard this commit's log message
 | |
| #  x, exec = run command (the rest of the line) using shell
 | |
| #
 | |
| # These lines can be re-ordered; they are executed from top to bottom.
 | |
| #
 | |
| # If you remove a line here THAT COMMIT WILL BE LOST.
 | |
| #
 | |
| # However, if you remove everything, the rebase will be aborted.
 | |
| #
 | |
| # Note that empty commits are commented out
 | |
| -------------------------------------------------
 | |
| 
 | |
| As explained in the comments, you can reorder commits, squash them
 | |
| together, edit commit messages, etc. by editing the list.  Once you
 | |
| are satisfied, save the list and close your editor, and the rebase
 | |
| will begin.
 | |
| 
 | |
| The rebase will stop where `pick` has been replaced with `edit` or
 | |
| when a step in the list fails to mechanically resolve conflicts and
 | |
| needs your help.  When you are done editing and/or resolving conflicts
 | |
| you can continue with `git rebase --continue`.  If you decide that
 | |
| things are getting too hairy, you can always bail out with `git rebase
 | |
| --abort`.  Even after the rebase is complete, you can still recover
 | |
| the original branch by using the <<reflogs,reflog>>.
 | |
| 
 | |
| For a more detailed discussion of the procedure and additional tips,
 | |
| see the "INTERACTIVE MODE" section of linkgit:git-rebase[1].
 | |
| 
 | |
| [[patch-series-tools]]
 | |
| Other tools
 | |
| -----------
 | |
| 
 | |
| There are numerous other tools, such as StGit, which exist for the
 | |
| purpose of maintaining a patch series.  These are outside of the scope of
 | |
| this manual.
 | |
| 
 | |
| [[problems-With-rewriting-history]]
 | |
| Problems with rewriting history
 | |
| -------------------------------
 | |
| 
 | |
| The primary problem with rewriting the history of a branch has to do
 | |
| with merging.  Suppose somebody fetches your branch and merges it into
 | |
| their branch, with a result something like this:
 | |
| 
 | |
| ................................................
 | |
|  o--o--O--o--o--o <-- origin
 | |
|         \        \
 | |
|          t--t--t--m <-- their branch:
 | |
| ................................................
 | |
| 
 | |
| Then suppose you modify the last three commits:
 | |
| 
 | |
| ................................................
 | |
| 	 o--o--o <-- new head of origin
 | |
| 	/
 | |
|  o--o--O--o--o--o <-- old head of origin
 | |
| ................................................
 | |
| 
 | |
| If we examined all this history together in one repository, it will
 | |
| look like:
 | |
| 
 | |
| ................................................
 | |
| 	 o--o--o <-- new head of origin
 | |
| 	/
 | |
|  o--o--O--o--o--o <-- old head of origin
 | |
|         \        \
 | |
|          t--t--t--m <-- their branch:
 | |
| ................................................
 | |
| 
 | |
| Git has no way of knowing that the new head is an updated version of
 | |
| the old head; it treats this situation exactly the same as it would if
 | |
| two developers had independently done the work on the old and new heads
 | |
| in parallel.  At this point, if someone attempts to merge the new head
 | |
| in to their branch, Git will attempt to merge together the two (old and
 | |
| new) lines of development, instead of trying to replace the old by the
 | |
| new.  The results are likely to be unexpected.
 | |
| 
 | |
| You may still choose to publish branches whose history is rewritten,
 | |
| and it may be useful for others to be able to fetch those branches in
 | |
| order to examine or test them, but they should not attempt to pull such
 | |
| branches into their own work.
 | |
| 
 | |
| For true distributed development that supports proper merging,
 | |
| published branches should never be rewritten.
 | |
| 
 | |
| [[bisect-merges]]
 | |
| Why bisecting merge commits can be harder than bisecting linear history
 | |
| -----------------------------------------------------------------------
 | |
| 
 | |
| The linkgit:git-bisect[1] command correctly handles history that
 | |
| includes merge commits.  However, when the commit that it finds is a
 | |
| merge commit, the user may need to work harder than usual to figure out
 | |
| why that commit introduced a problem.
 | |
| 
 | |
| Imagine this history:
 | |
| 
 | |
| ................................................
 | |
|       ---Z---o---X---...---o---A---C---D
 | |
|           \                       /
 | |
|            o---o---Y---...---o---B
 | |
| ................................................
 | |
| 
 | |
| Suppose that on the upper line of development, the meaning of one
 | |
| of the functions that exists at Z is changed at commit X.  The
 | |
| commits from Z leading to A change both the function's
 | |
| implementation and all calling sites that exist at Z, as well
 | |
| as new calling sites they add, to be consistent.  There is no
 | |
| bug at A.
 | |
| 
 | |
| Suppose that in the meantime on the lower line of development somebody
 | |
| adds a new calling site for that function at commit Y.  The
 | |
| commits from Z leading to B all assume the old semantics of that
 | |
| function and the callers and the callee are consistent with each
 | |
| other.  There is no bug at B, either.
 | |
| 
 | |
| Suppose further that the two development lines merge cleanly at C,
 | |
| so no conflict resolution is required.
 | |
| 
 | |
| Nevertheless, the code at C is broken, because the callers added
 | |
| on the lower line of development have not been converted to the new
 | |
| semantics introduced on the upper line of development.  So if all
 | |
| you know is that D is bad, that Z is good, and that
 | |
| linkgit:git-bisect[1] identifies C as the culprit, how will you
 | |
| figure out that the problem is due to this change in semantics?
 | |
| 
 | |
| When the result of a `git bisect` is a non-merge commit, you should
 | |
| normally be able to discover the problem by examining just that commit.
 | |
| Developers can make this easy by breaking their changes into small
 | |
| self-contained commits.  That won't help in the case above, however,
 | |
| because the problem isn't obvious from examination of any single
 | |
| commit; instead, a global view of the development is required.  To
 | |
| make matters worse, the change in semantics in the problematic
 | |
| function may be just one small part of the changes in the upper
 | |
| line of development.
 | |
| 
 | |
| On the other hand, if instead of merging at C you had rebased the
 | |
| history between Z to B on top of A, you would have gotten this
 | |
| linear history:
 | |
| 
 | |
| ................................................................
 | |
|     ---Z---o---X--...---o---A---o---o---Y*--...---o---B*--D*
 | |
| ................................................................
 | |
| 
 | |
| Bisecting between Z and D* would hit a single culprit commit Y*,
 | |
| and understanding why Y* was broken would probably be easier.
 | |
| 
 | |
| Partly for this reason, many experienced Git users, even when
 | |
| working on an otherwise merge-heavy project, keep the history
 | |
| linear by rebasing against the latest upstream version before
 | |
| publishing.
 | |
| 
 | |
| [[advanced-branch-management]]
 | |
| Advanced branch management
 | |
| ==========================
 | |
| 
 | |
| [[fetching-individual-branches]]
 | |
| Fetching individual branches
 | |
| ----------------------------
 | |
| 
 | |
| Instead of using linkgit:git-remote[1], you can also choose just
 | |
| to update one branch at a time, and to store it locally under an
 | |
| arbitrary name:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git fetch origin todo:my-todo-work
 | |
| -------------------------------------------------
 | |
| 
 | |
| The first argument, `origin`, just tells Git to fetch from the
 | |
| repository you originally cloned from.  The second argument tells Git
 | |
| to fetch the branch named `todo` from the remote repository, and to
 | |
| store it locally under the name `refs/heads/my-todo-work`.
 | |
| 
 | |
| You can also fetch branches from other repositories; so
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git fetch git://example.com/proj.git master:example-master
 | |
| -------------------------------------------------
 | |
| 
 | |
| will create a new branch named `example-master` and store in it the
 | |
| branch named `master` from the repository at the given URL.  If you
 | |
| already have a branch named example-master, it will attempt to
 | |
| <<fast-forwards,fast-forward>> to the commit given by example.com's
 | |
| master branch.  In more detail:
 | |
| 
 | |
| [[fetch-fast-forwards]]
 | |
| git fetch and fast-forwards
 | |
| ---------------------------
 | |
| 
 | |
| In the previous example, when updating an existing branch, `git fetch`
 | |
| checks to make sure that the most recent commit on the remote
 | |
| branch is a descendant of the most recent commit on your copy of the
 | |
| branch before updating your copy of the branch to point at the new
 | |
| commit.  Git calls this process a <<fast-forwards,fast-forward>>.
 | |
| 
 | |
| A fast-forward looks something like this:
 | |
| 
 | |
| ................................................
 | |
|  o--o--o--o <-- old head of the branch
 | |
|            \
 | |
|             o--o--o <-- new head of the branch
 | |
| ................................................
 | |
| 
 | |
| 
 | |
| In some cases it is possible that the new head will *not* actually be
 | |
| a descendant of the old head.  For example, the developer may have
 | |
| realized she made a serious mistake, and decided to backtrack,
 | |
| resulting in a situation like:
 | |
| 
 | |
| ................................................
 | |
|  o--o--o--o--a--b <-- old head of the branch
 | |
|            \
 | |
|             o--o--o <-- new head of the branch
 | |
| ................................................
 | |
| 
 | |
| In this case, `git fetch` will fail, and print out a warning.
 | |
| 
 | |
| In that case, you can still force Git to update to the new head, as
 | |
| described in the following section.  However, note that in the
 | |
| situation above this may mean losing the commits labeled `a` and `b`,
 | |
| unless you've already created a reference of your own pointing to
 | |
| them.
 | |
| 
 | |
| [[forcing-fetch]]
 | |
| Forcing git fetch to do non-fast-forward updates
 | |
| ------------------------------------------------
 | |
| 
 | |
| If git fetch fails because the new head of a branch is not a
 | |
| descendant of the old head, you may force the update with:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git fetch git://example.com/proj.git +master:refs/remotes/example/master
 | |
| -------------------------------------------------
 | |
| 
 | |
| Note the addition of the `+` sign.  Alternatively, you can use the `-f`
 | |
| flag to force updates of all the fetched branches, as in:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git fetch -f origin
 | |
| -------------------------------------------------
 | |
| 
 | |
| Be aware that commits that the old version of example/master pointed at
 | |
| may be lost, as we saw in the previous section.
 | |
| 
 | |
| [[remote-branch-configuration]]
 | |
| Configuring remote-tracking branches
 | |
| ------------------------------------
 | |
| 
 | |
| We saw above that `origin` is just a shortcut to refer to the
 | |
| repository that you originally cloned from.  This information is
 | |
| stored in Git configuration variables, which you can see using
 | |
| linkgit:git-config[1]:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git config -l
 | |
| core.repositoryformatversion=0
 | |
| core.filemode=true
 | |
| core.logallrefupdates=true
 | |
| remote.origin.url=git://git.kernel.org/pub/scm/git/git.git
 | |
| remote.origin.fetch=+refs/heads/*:refs/remotes/origin/*
 | |
| branch.master.remote=origin
 | |
| branch.master.merge=refs/heads/master
 | |
| -------------------------------------------------
 | |
| 
 | |
| If there are other repositories that you also use frequently, you can
 | |
| create similar configuration options to save typing; for example,
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git remote add example git://example.com/proj.git
 | |
| -------------------------------------------------
 | |
| 
 | |
| adds the following to `.git/config`:
 | |
| 
 | |
| -------------------------------------------------
 | |
| [remote "example"]
 | |
| 	url = git://example.com/proj.git
 | |
| 	fetch = +refs/heads/*:refs/remotes/example/*
 | |
| -------------------------------------------------
 | |
| 
 | |
| Also note that the above configuration can be performed by directly
 | |
| editing the file `.git/config` instead of using linkgit:git-remote[1].
 | |
| 
 | |
| After configuring the remote, the following three commands will do the
 | |
| same thing:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git fetch git://example.com/proj.git +refs/heads/*:refs/remotes/example/*
 | |
| $ git fetch example +refs/heads/*:refs/remotes/example/*
 | |
| $ git fetch example
 | |
| -------------------------------------------------
 | |
| 
 | |
| See linkgit:git-config[1] for more details on the configuration
 | |
| options mentioned above and linkgit:git-fetch[1] for more details on
 | |
| the refspec syntax.
 | |
| 
 | |
| 
 | |
| [[git-concepts]]
 | |
| Git concepts
 | |
| ============
 | |
| 
 | |
| Git is built on a small number of simple but powerful ideas.  While it
 | |
| is possible to get things done without understanding them, you will find
 | |
| Git much more intuitive if you do.
 | |
| 
 | |
| We start with the most important, the  <<def_object_database,object
 | |
| database>> and the <<def_index,index>>.
 | |
| 
 | |
| [[the-object-database]]
 | |
| The Object Database
 | |
| -------------------
 | |
| 
 | |
| 
 | |
| We already saw in <<understanding-commits>> that all commits are stored
 | |
| under a 40-digit "object name".  In fact, all the information needed to
 | |
| represent the history of a project is stored in objects with such names.
 | |
| In each case the name is calculated by taking the SHA-1 hash of the
 | |
| contents of the object.  The SHA-1 hash is a cryptographic hash function.
 | |
| What that means to us is that it is impossible to find two different
 | |
| objects with the same name.  This has a number of advantages; among
 | |
| others:
 | |
| 
 | |
| - Git can quickly determine whether two objects are identical or not,
 | |
|   just by comparing names.
 | |
| - Since object names are computed the same way in every repository, the
 | |
|   same content stored in two repositories will always be stored under
 | |
|   the same name.
 | |
| - Git can detect errors when it reads an object, by checking that the
 | |
|   object's name is still the SHA-1 hash of its contents.
 | |
| 
 | |
| (See <<object-details>> for the details of the object formatting and
 | |
| SHA-1 calculation.)
 | |
| 
 | |
| There are four different types of objects: "blob", "tree", "commit", and
 | |
| "tag".
 | |
| 
 | |
| - A <<def_blob_object,"blob" object>> is used to store file data.
 | |
| - A <<def_tree_object,"tree" object>> ties one or more
 | |
|   "blob" objects into a directory structure. In addition, a tree object
 | |
|   can refer to other tree objects, thus creating a directory hierarchy.
 | |
| - A <<def_commit_object,"commit" object>> ties such directory hierarchies
 | |
|   together into a <<def_DAG,directed acyclic graph>> of revisions--each
 | |
|   commit contains the object name of exactly one tree designating the
 | |
|   directory hierarchy at the time of the commit. In addition, a commit
 | |
|   refers to "parent" commit objects that describe the history of how we
 | |
|   arrived at that directory hierarchy.
 | |
| - A <<def_tag_object,"tag" object>> symbolically identifies and can be
 | |
|   used to sign other objects. It contains the object name and type of
 | |
|   another object, a symbolic name (of course!) and, optionally, a
 | |
|   signature.
 | |
| 
 | |
| The object types in some more detail:
 | |
| 
 | |
| [[commit-object]]
 | |
| Commit Object
 | |
| ~~~~~~~~~~~~~
 | |
| 
 | |
| The "commit" object links a physical state of a tree with a description
 | |
| of how we got there and why.  Use the `--pretty=raw` option to
 | |
| linkgit:git-show[1] or linkgit:git-log[1] to examine your favorite
 | |
| commit:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git show -s --pretty=raw 2be7fcb476
 | |
| commit 2be7fcb4764f2dbcee52635b91fedb1b3dcf7ab4
 | |
| tree fb3a8bdd0ceddd019615af4d57a53f43d8cee2bf
 | |
| parent 257a84d9d02e90447b149af58b271c19405edb6a
 | |
| author Dave Watson <dwatson@mimvista.com> 1187576872 -0400
 | |
| committer Junio C Hamano <gitster@pobox.com> 1187591163 -0700
 | |
| 
 | |
|     Fix misspelling of 'suppress' in docs
 | |
| 
 | |
|     Signed-off-by: Junio C Hamano <gitster@pobox.com>
 | |
| ------------------------------------------------
 | |
| 
 | |
| As you can see, a commit is defined by:
 | |
| 
 | |
| - a tree: The SHA-1 name of a tree object (as defined below), representing
 | |
|   the contents of a directory at a certain point in time.
 | |
| - parent(s): The SHA-1 name(s) of some number of commits which represent the
 | |
|   immediately previous step(s) in the history of the project.  The
 | |
|   example above has one parent; merge commits may have more than
 | |
|   one.  A commit with no parents is called a "root" commit, and
 | |
|   represents the initial revision of a project.  Each project must have
 | |
|   at least one root.  A project can also have multiple roots, though
 | |
|   that isn't common (or necessarily a good idea).
 | |
| - an author: The name of the person responsible for this change, together
 | |
|   with its date.
 | |
| - a committer: The name of the person who actually created the commit,
 | |
|   with the date it was done.  This may be different from the author, for
 | |
|   example, if the author was someone who wrote a patch and emailed it
 | |
|   to the person who used it to create the commit.
 | |
| - a comment describing this commit.
 | |
| 
 | |
| Note that a commit does not itself contain any information about what
 | |
| actually changed; all changes are calculated by comparing the contents
 | |
| of the tree referred to by this commit with the trees associated with
 | |
| its parents.  In particular, Git does not attempt to record file renames
 | |
| explicitly, though it can identify cases where the existence of the same
 | |
| file data at changing paths suggests a rename.  (See, for example, the
 | |
| `-M` option to linkgit:git-diff[1]).
 | |
| 
 | |
| A commit is usually created by linkgit:git-commit[1], which creates a
 | |
| commit whose parent is normally the current HEAD, and whose tree is
 | |
| taken from the content currently stored in the index.
 | |
| 
 | |
| [[tree-object]]
 | |
| Tree Object
 | |
| ~~~~~~~~~~~
 | |
| 
 | |
| The ever-versatile linkgit:git-show[1] command can also be used to
 | |
| examine tree objects, but linkgit:git-ls-tree[1] will give you more
 | |
| details:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git ls-tree fb3a8bdd0ce
 | |
| 100644 blob 63c918c667fa005ff12ad89437f2fdc80926e21c    .gitignore
 | |
| 100644 blob 5529b198e8d14decbe4ad99db3f7fb632de0439d    .mailmap
 | |
| 100644 blob 6ff87c4664981e4397625791c8ea3bbb5f2279a3    COPYING
 | |
| 040000 tree 2fb783e477100ce076f6bf57e4a6f026013dc745    Documentation
 | |
| 100755 blob 3c0032cec592a765692234f1cba47dfdcc3a9200    GIT-VERSION-GEN
 | |
| 100644 blob 289b046a443c0647624607d471289b2c7dcd470b    INSTALL
 | |
| 100644 blob 4eb463797adc693dc168b926b6932ff53f17d0b1    Makefile
 | |
| 100644 blob 548142c327a6790ff8821d67c2ee1eff7a656b52    README
 | |
| ...
 | |
| ------------------------------------------------
 | |
| 
 | |
| As you can see, a tree object contains a list of entries, each with a
 | |
| mode, object type, SHA-1 name, and name, sorted by name.  It represents
 | |
| the contents of a single directory tree.
 | |
| 
 | |
| The object type may be a blob, representing the contents of a file, or
 | |
| another tree, representing the contents of a subdirectory.  Since trees
 | |
| and blobs, like all other objects, are named by the SHA-1 hash of their
 | |
| contents, two trees have the same SHA-1 name if and only if their
 | |
| contents (including, recursively, the contents of all subdirectories)
 | |
| are identical.  This allows Git to quickly determine the differences
 | |
| between two related tree objects, since it can ignore any entries with
 | |
| identical object names.
 | |
| 
 | |
| (Note: in the presence of submodules, trees may also have commits as
 | |
| entries.  See <<submodules>> for documentation.)
 | |
| 
 | |
| Note that the files all have mode 644 or 755: Git actually only pays
 | |
| attention to the executable bit.
 | |
| 
 | |
| [[blob-object]]
 | |
| Blob Object
 | |
| ~~~~~~~~~~~
 | |
| 
 | |
| You can use linkgit:git-show[1] to examine the contents of a blob; take,
 | |
| for example, the blob in the entry for `COPYING` from the tree above:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git show 6ff87c4664
 | |
| 
 | |
|  Note that the only valid version of the GPL as far as this project
 | |
|  is concerned is _this_ particular version of the license (ie v2, not
 | |
|  v2.2 or v3.x or whatever), unless explicitly otherwise stated.
 | |
| ...
 | |
| ------------------------------------------------
 | |
| 
 | |
| A "blob" object is nothing but a binary blob of data.  It doesn't refer
 | |
| to anything else or have attributes of any kind.
 | |
| 
 | |
| Since the blob is entirely defined by its data, if two files in a
 | |
| directory tree (or in multiple different versions of the repository)
 | |
| have the same contents, they will share the same blob object. The object
 | |
| is totally independent of its location in the directory tree, and
 | |
| renaming a file does not change the object that file is associated with.
 | |
| 
 | |
| Note that any tree or blob object can be examined using
 | |
| linkgit:git-show[1] with the <revision>:<path> syntax.  This can
 | |
| sometimes be useful for browsing the contents of a tree that is not
 | |
| currently checked out.
 | |
| 
 | |
| [[trust]]
 | |
| Trust
 | |
| ~~~~~
 | |
| 
 | |
| If you receive the SHA-1 name of a blob from one source, and its contents
 | |
| from another (possibly untrusted) source, you can still trust that those
 | |
| contents are correct as long as the SHA-1 name agrees.  This is because
 | |
| the SHA-1 is designed so that it is infeasible to find different contents
 | |
| that produce the same hash.
 | |
| 
 | |
| Similarly, you need only trust the SHA-1 name of a top-level tree object
 | |
| to trust the contents of the entire directory that it refers to, and if
 | |
| you receive the SHA-1 name of a commit from a trusted source, then you
 | |
| can easily verify the entire history of commits reachable through
 | |
| parents of that commit, and all of those contents of the trees referred
 | |
| to by those commits.
 | |
| 
 | |
| So to introduce some real trust in the system, the only thing you need
 | |
| to do is to digitally sign just 'one' special note, which includes the
 | |
| name of a top-level commit.  Your digital signature shows others
 | |
| that you trust that commit, and the immutability of the history of
 | |
| commits tells others that they can trust the whole history.
 | |
| 
 | |
| In other words, you can easily validate a whole archive by just
 | |
| sending out a single email that tells the people the name (SHA-1 hash)
 | |
| of the top commit, and digitally sign that email using something
 | |
| like GPG/PGP.
 | |
| 
 | |
| To assist in this, Git also provides the tag object...
 | |
| 
 | |
| [[tag-object]]
 | |
| Tag Object
 | |
| ~~~~~~~~~~
 | |
| 
 | |
| A tag object contains an object, object type, tag name, the name of the
 | |
| person ("tagger") who created the tag, and a message, which may contain
 | |
| a signature, as can be seen using linkgit:git-cat-file[1]:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git cat-file tag v1.5.0
 | |
| object 437b1b20df4b356c9342dac8d38849f24ef44f27
 | |
| type commit
 | |
| tag v1.5.0
 | |
| tagger Junio C Hamano <junkio@cox.net> 1171411200 +0000
 | |
| 
 | |
| GIT 1.5.0
 | |
| -----BEGIN PGP SIGNATURE-----
 | |
| Version: GnuPG v1.4.6 (GNU/Linux)
 | |
| 
 | |
| iD8DBQBF0lGqwMbZpPMRm5oRAuRiAJ9ohBLd7s2kqjkKlq1qqC57SbnmzQCdG4ui
 | |
| nLE/L9aUXdWeTFPron96DLA=
 | |
| =2E+0
 | |
| -----END PGP SIGNATURE-----
 | |
| ------------------------------------------------
 | |
| 
 | |
| See the linkgit:git-tag[1] command to learn how to create and verify tag
 | |
| objects.  (Note that linkgit:git-tag[1] can also be used to create
 | |
| "lightweight tags", which are not tag objects at all, but just simple
 | |
| references whose names begin with `refs/tags/`).
 | |
| 
 | |
| [[pack-files]]
 | |
| How Git stores objects efficiently: pack files
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| Newly created objects are initially created in a file named after the
 | |
| object's SHA-1 hash (stored in `.git/objects`).
 | |
| 
 | |
| Unfortunately this system becomes inefficient once a project has a
 | |
| lot of objects.  Try this on an old project:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git count-objects
 | |
| 6930 objects, 47620 kilobytes
 | |
| ------------------------------------------------
 | |
| 
 | |
| The first number is the number of objects which are kept in
 | |
| individual files.  The second is the amount of space taken up by
 | |
| those "loose" objects.
 | |
| 
 | |
| You can save space and make Git faster by moving these loose objects in
 | |
| to a "pack file", which stores a group of objects in an efficient
 | |
| compressed format; the details of how pack files are formatted can be
 | |
| found in link:technical/pack-format.html[pack format].
 | |
| 
 | |
| To put the loose objects into a pack, just run git repack:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git repack
 | |
| Counting objects: 6020, done.
 | |
| Delta compression using up to 4 threads.
 | |
| Compressing objects: 100% (6020/6020), done.
 | |
| Writing objects: 100% (6020/6020), done.
 | |
| Total 6020 (delta 4070), reused 0 (delta 0)
 | |
| ------------------------------------------------
 | |
| 
 | |
| This creates a single "pack file" in .git/objects/pack/
 | |
| containing all currently unpacked objects.  You can then run
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git prune
 | |
| ------------------------------------------------
 | |
| 
 | |
| to remove any of the "loose" objects that are now contained in the
 | |
| pack.  This will also remove any unreferenced objects (which may be
 | |
| created when, for example, you use `git reset` to remove a commit).
 | |
| You can verify that the loose objects are gone by looking at the
 | |
| `.git/objects` directory or by running
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git count-objects
 | |
| 0 objects, 0 kilobytes
 | |
| ------------------------------------------------
 | |
| 
 | |
| Although the object files are gone, any commands that refer to those
 | |
| objects will work exactly as they did before.
 | |
| 
 | |
| The linkgit:git-gc[1] command performs packing, pruning, and more for
 | |
| you, so is normally the only high-level command you need.
 | |
| 
 | |
| [[dangling-objects]]
 | |
| Dangling objects
 | |
| ~~~~~~~~~~~~~~~~
 | |
| 
 | |
| The linkgit:git-fsck[1] command will sometimes complain about dangling
 | |
| objects.  They are not a problem.
 | |
| 
 | |
| The most common cause of dangling objects is that you've rebased a
 | |
| branch, or you have pulled from somebody else who rebased a branch--see
 | |
| <<cleaning-up-history>>.  In that case, the old head of the original
 | |
| branch still exists, as does everything it pointed to. The branch
 | |
| pointer itself just doesn't, since you replaced it with another one.
 | |
| 
 | |
| There are also other situations that cause dangling objects. For
 | |
| example, a "dangling blob" may arise because you did a `git add` of a
 | |
| file, but then, before you actually committed it and made it part of the
 | |
| bigger picture, you changed something else in that file and committed
 | |
| that *updated* thing--the old state that you added originally ends up
 | |
| not being pointed to by any commit or tree, so it's now a dangling blob
 | |
| object.
 | |
| 
 | |
| Similarly, when the "recursive" merge strategy runs, and finds that
 | |
| there are criss-cross merges and thus more than one merge base (which is
 | |
| fairly unusual, but it does happen), it will generate one temporary
 | |
| midway tree (or possibly even more, if you had lots of criss-crossing
 | |
| merges and more than two merge bases) as a temporary internal merge
 | |
| base, and again, those are real objects, but the end result will not end
 | |
| up pointing to them, so they end up "dangling" in your repository.
 | |
| 
 | |
| Generally, dangling objects aren't anything to worry about. They can
 | |
| even be very useful: if you screw something up, the dangling objects can
 | |
| be how you recover your old tree (say, you did a rebase, and realized
 | |
| that you really didn't want to--you can look at what dangling objects
 | |
| you have, and decide to reset your head to some old dangling state).
 | |
| 
 | |
| For commits, you can just use:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ gitk <dangling-commit-sha-goes-here> --not --all
 | |
| ------------------------------------------------
 | |
| 
 | |
| This asks for all the history reachable from the given commit but not
 | |
| from any branch, tag, or other reference.  If you decide it's something
 | |
| you want, you can always create a new reference to it, e.g.,
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git branch recovered-branch <dangling-commit-sha-goes-here>
 | |
| ------------------------------------------------
 | |
| 
 | |
| For blobs and trees, you can't do the same, but you can still examine
 | |
| them.  You can just do
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git show <dangling-blob/tree-sha-goes-here>
 | |
| ------------------------------------------------
 | |
| 
 | |
| to show what the contents of the blob were (or, for a tree, basically
 | |
| what the `ls` for that directory was), and that may give you some idea
 | |
| of what the operation was that left that dangling object.
 | |
| 
 | |
| Usually, dangling blobs and trees aren't very interesting. They're
 | |
| almost always the result of either being a half-way mergebase (the blob
 | |
| will often even have the conflict markers from a merge in it, if you
 | |
| have had conflicting merges that you fixed up by hand), or simply
 | |
| because you interrupted a `git fetch` with ^C or something like that,
 | |
| leaving _some_ of the new objects in the object database, but just
 | |
| dangling and useless.
 | |
| 
 | |
| Anyway, once you are sure that you're not interested in any dangling
 | |
| state, you can just prune all unreachable objects:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git prune
 | |
| ------------------------------------------------
 | |
| 
 | |
| and they'll be gone. (You should only run `git prune` on a quiescent
 | |
| repository--it's kind of like doing a filesystem fsck recovery: you
 | |
| don't want to do that while the filesystem is mounted.
 | |
| `git prune` is designed not to cause any harm in such cases of concurrent
 | |
| accesses to a repository but you might receive confusing or scary messages.)
 | |
| 
 | |
| [[recovering-from-repository-corruption]]
 | |
| Recovering from repository corruption
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| By design, Git treats data trusted to it with caution.  However, even in
 | |
| the absence of bugs in Git itself, it is still possible that hardware or
 | |
| operating system errors could corrupt data.
 | |
| 
 | |
| The first defense against such problems is backups.  You can back up a
 | |
| Git directory using clone, or just using cp, tar, or any other backup
 | |
| mechanism.
 | |
| 
 | |
| As a last resort, you can search for the corrupted objects and attempt
 | |
| to replace them by hand.  Back up your repository before attempting this
 | |
| in case you corrupt things even more in the process.
 | |
| 
 | |
| We'll assume that the problem is a single missing or corrupted blob,
 | |
| which is sometimes a solvable problem.  (Recovering missing trees and
 | |
| especially commits is *much* harder).
 | |
| 
 | |
| Before starting, verify that there is corruption, and figure out where
 | |
| it is with linkgit:git-fsck[1]; this may be time-consuming.
 | |
| 
 | |
| Assume the output looks like this:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git fsck --full --no-dangling
 | |
| broken link from    tree 2d9263c6d23595e7cb2a21e5ebbb53655278dff8
 | |
|               to    blob 4b9458b3786228369c63936db65827de3cc06200
 | |
| missing blob 4b9458b3786228369c63936db65827de3cc06200
 | |
| ------------------------------------------------
 | |
| 
 | |
| Now you know that blob 4b9458b3 is missing, and that the tree 2d9263c6
 | |
| points to it.  If you could find just one copy of that missing blob
 | |
| object, possibly in some other repository, you could move it into
 | |
| `.git/objects/4b/9458b3...` and be done.  Suppose you can't.  You can
 | |
| still examine the tree that pointed to it with linkgit:git-ls-tree[1],
 | |
| which might output something like:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git ls-tree 2d9263c6d23595e7cb2a21e5ebbb53655278dff8
 | |
| 100644 blob 8d14531846b95bfa3564b58ccfb7913a034323b8	.gitignore
 | |
| 100644 blob ebf9bf84da0aab5ed944264a5db2a65fe3a3e883	.mailmap
 | |
| 100644 blob ca442d313d86dc67e0a2e5d584b465bd382cbf5c	COPYING
 | |
| ...
 | |
| 100644 blob 4b9458b3786228369c63936db65827de3cc06200	myfile
 | |
| ...
 | |
| ------------------------------------------------
 | |
| 
 | |
| So now you know that the missing blob was the data for a file named
 | |
| `myfile`.  And chances are you can also identify the directory--let's
 | |
| say it's in `somedirectory`.  If you're lucky the missing copy might be
 | |
| the same as the copy you have checked out in your working tree at
 | |
| `somedirectory/myfile`; you can test whether that's right with
 | |
| linkgit:git-hash-object[1]:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git hash-object -w somedirectory/myfile
 | |
| ------------------------------------------------
 | |
| 
 | |
| which will create and store a blob object with the contents of
 | |
| somedirectory/myfile, and output the SHA-1 of that object.  if you're
 | |
| extremely lucky it might be 4b9458b3786228369c63936db65827de3cc06200, in
 | |
| which case you've guessed right, and the corruption is fixed!
 | |
| 
 | |
| Otherwise, you need more information.  How do you tell which version of
 | |
| the file has been lost?
 | |
| 
 | |
| The easiest way to do this is with:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git log --raw --all --full-history -- somedirectory/myfile
 | |
| ------------------------------------------------
 | |
| 
 | |
| Because you're asking for raw output, you'll now get something like
 | |
| 
 | |
| ------------------------------------------------
 | |
| commit abc
 | |
| Author:
 | |
| Date:
 | |
| ...
 | |
| :100644 100644 4b9458b newsha M somedirectory/myfile
 | |
| 
 | |
| 
 | |
| commit xyz
 | |
| Author:
 | |
| Date:
 | |
| 
 | |
| ...
 | |
| :100644 100644 oldsha 4b9458b M somedirectory/myfile
 | |
| ------------------------------------------------
 | |
| 
 | |
| This tells you that the immediately following version of the file was
 | |
| "newsha", and that the immediately preceding version was "oldsha".
 | |
| You also know the commit messages that went with the change from oldsha
 | |
| to 4b9458b and with the change from 4b9458b to newsha.
 | |
| 
 | |
| If you've been committing small enough changes, you may now have a good
 | |
| shot at reconstructing the contents of the in-between state 4b9458b.
 | |
| 
 | |
| If you can do that, you can now recreate the missing object with
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git hash-object -w <recreated-file>
 | |
| ------------------------------------------------
 | |
| 
 | |
| and your repository is good again!
 | |
| 
 | |
| (Btw, you could have ignored the `fsck`, and started with doing a
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git log --raw --all
 | |
| ------------------------------------------------
 | |
| 
 | |
| and just looked for the sha of the missing object (4b9458b) in that
 | |
| whole thing. It's up to you--Git does *have* a lot of information, it is
 | |
| just missing one particular blob version.
 | |
| 
 | |
| [[the-index]]
 | |
| The index
 | |
| ---------
 | |
| 
 | |
| The index is a binary file (generally kept in `.git/index`) containing a
 | |
| sorted list of path names, each with permissions and the SHA-1 of a blob
 | |
| object; linkgit:git-ls-files[1] can show you the contents of the index:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git ls-files --stage
 | |
| 100644 63c918c667fa005ff12ad89437f2fdc80926e21c 0	.gitignore
 | |
| 100644 5529b198e8d14decbe4ad99db3f7fb632de0439d 0	.mailmap
 | |
| 100644 6ff87c4664981e4397625791c8ea3bbb5f2279a3 0	COPYING
 | |
| 100644 a37b2152bd26be2c2289e1f57a292534a51a93c7 0	Documentation/.gitignore
 | |
| 100644 fbefe9a45b00a54b58d94d06eca48b03d40a50e0 0	Documentation/Makefile
 | |
| ...
 | |
| 100644 2511aef8d89ab52be5ec6a5e46236b4b6bcd07ea 0	xdiff/xtypes.h
 | |
| 100644 2ade97b2574a9f77e7ae4002a4e07a6a38e46d07 0	xdiff/xutils.c
 | |
| 100644 d5de8292e05e7c36c4b68857c1cf9855e3d2f70a 0	xdiff/xutils.h
 | |
| -------------------------------------------------
 | |
| 
 | |
| Note that in older documentation you may see the index called the
 | |
| "current directory cache" or just the "cache".  It has three important
 | |
| properties:
 | |
| 
 | |
| 1. The index contains all the information necessary to generate a single
 | |
| (uniquely determined) tree object.
 | |
| +
 | |
| For example, running linkgit:git-commit[1] generates this tree object
 | |
| from the index, stores it in the object database, and uses it as the
 | |
| tree object associated with the new commit.
 | |
| 
 | |
| 2. The index enables fast comparisons between the tree object it defines
 | |
| and the working tree.
 | |
| +
 | |
| It does this by storing some additional data for each entry (such as
 | |
| the last modified time).  This data is not displayed above, and is not
 | |
| stored in the created tree object, but it can be used to determine
 | |
| quickly which files in the working directory differ from what was
 | |
| stored in the index, and thus save Git from having to read all of the
 | |
| data from such files to look for changes.
 | |
| 
 | |
| 3. It can efficiently represent information about merge conflicts
 | |
| between different tree objects, allowing each pathname to be
 | |
| associated with sufficient information about the trees involved that
 | |
| you can create a three-way merge between them.
 | |
| +
 | |
| We saw in <<conflict-resolution>> that during a merge the index can
 | |
| store multiple versions of a single file (called "stages").  The third
 | |
| column in the linkgit:git-ls-files[1] output above is the stage
 | |
| number, and will take on values other than 0 for files with merge
 | |
| conflicts.
 | |
| 
 | |
| The index is thus a sort of temporary staging area, which is filled with
 | |
| a tree which you are in the process of working on.
 | |
| 
 | |
| If you blow the index away entirely, you generally haven't lost any
 | |
| information as long as you have the name of the tree that it described.
 | |
| 
 | |
| [[submodules]]
 | |
| Submodules
 | |
| ==========
 | |
| 
 | |
| Large projects are often composed of smaller, self-contained modules.  For
 | |
| example, an embedded Linux distribution's source tree would include every
 | |
| piece of software in the distribution with some local modifications; a movie
 | |
| player might need to build against a specific, known-working version of a
 | |
| decompression library; several independent programs might all share the same
 | |
| build scripts.
 | |
| 
 | |
| With centralized revision control systems this is often accomplished by
 | |
| including every module in one single repository.  Developers can check out
 | |
| all modules or only the modules they need to work with.  They can even modify
 | |
| files across several modules in a single commit while moving things around
 | |
| or updating APIs and translations.
 | |
| 
 | |
| Git does not allow partial checkouts, so duplicating this approach in Git
 | |
| would force developers to keep a local copy of modules they are not
 | |
| interested in touching.  Commits in an enormous checkout would be slower
 | |
| than you'd expect as Git would have to scan every directory for changes.
 | |
| If modules have a lot of local history, clones would take forever.
 | |
| 
 | |
| On the plus side, distributed revision control systems can much better
 | |
| integrate with external sources.  In a centralized model, a single arbitrary
 | |
| snapshot of the external project is exported from its own revision control
 | |
| and then imported into the local revision control on a vendor branch.  All
 | |
| the history is hidden.  With distributed revision control you can clone the
 | |
| entire external history and much more easily follow development and re-merge
 | |
| local changes.
 | |
| 
 | |
| Git's submodule support allows a repository to contain, as a subdirectory, a
 | |
| checkout of an external project.  Submodules maintain their own identity;
 | |
| the submodule support just stores the submodule repository location and
 | |
| commit ID, so other developers who clone the containing project
 | |
| ("superproject") can easily clone all the submodules at the same revision.
 | |
| Partial checkouts of the superproject are possible: you can tell Git to
 | |
| clone none, some or all of the submodules.
 | |
| 
 | |
| The linkgit:git-submodule[1] command is available since Git 1.5.3.  Users
 | |
| with Git 1.5.2 can look up the submodule commits in the repository and
 | |
| manually check them out; earlier versions won't recognize the submodules at
 | |
| all.
 | |
| 
 | |
| To see how submodule support works, create four example
 | |
| repositories that can be used later as a submodule:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ mkdir ~/git
 | |
| $ cd ~/git
 | |
| $ for i in a b c d
 | |
| do
 | |
| 	mkdir $i
 | |
| 	cd $i
 | |
| 	git init
 | |
| 	echo "module $i" > $i.txt
 | |
| 	git add $i.txt
 | |
| 	git commit -m "Initial commit, submodule $i"
 | |
| 	cd ..
 | |
| done
 | |
| -------------------------------------------------
 | |
| 
 | |
| Now create the superproject and add all the submodules:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ mkdir super
 | |
| $ cd super
 | |
| $ git init
 | |
| $ for i in a b c d
 | |
| do
 | |
| 	git submodule add ~/git/$i $i
 | |
| done
 | |
| -------------------------------------------------
 | |
| 
 | |
| NOTE: Do not use local URLs here if you plan to publish your superproject!
 | |
| 
 | |
| See what files `git submodule` created:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ ls -a
 | |
| .  ..  .git  .gitmodules  a  b  c  d
 | |
| -------------------------------------------------
 | |
| 
 | |
| The `git submodule add <repo> <path>` command does a couple of things:
 | |
| 
 | |
| - It clones the submodule from `<repo>` to the given `<path>` under the
 | |
|   current directory and by default checks out the master branch.
 | |
| - It adds the submodule's clone path to the linkgit:gitmodules[5] file and
 | |
|   adds this file to the index, ready to be committed.
 | |
| - It adds the submodule's current commit ID to the index, ready to be
 | |
|   committed.
 | |
| 
 | |
| Commit the superproject:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git commit -m "Add submodules a, b, c and d."
 | |
| -------------------------------------------------
 | |
| 
 | |
| Now clone the superproject:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ cd ..
 | |
| $ git clone super cloned
 | |
| $ cd cloned
 | |
| -------------------------------------------------
 | |
| 
 | |
| The submodule directories are there, but they're empty:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ ls -a a
 | |
| .  ..
 | |
| $ git submodule status
 | |
| -d266b9873ad50488163457f025db7cdd9683d88b a
 | |
| -e81d457da15309b4fef4249aba9b50187999670d b
 | |
| -c1536a972b9affea0f16e0680ba87332dc059146 c
 | |
| -d96249ff5d57de5de093e6baff9e0aafa5276a74 d
 | |
| -------------------------------------------------
 | |
| 
 | |
| NOTE: The commit object names shown above would be different for you, but they
 | |
| should match the HEAD commit object names of your repositories.  You can check
 | |
| it by running `git ls-remote ../a`.
 | |
| 
 | |
| Pulling down the submodules is a two-step process. First run `git submodule
 | |
| init` to add the submodule repository URLs to `.git/config`:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git submodule init
 | |
| -------------------------------------------------
 | |
| 
 | |
| Now use `git submodule update` to clone the repositories and check out the
 | |
| commits specified in the superproject:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git submodule update
 | |
| $ cd a
 | |
| $ ls -a
 | |
| .  ..  .git  a.txt
 | |
| -------------------------------------------------
 | |
| 
 | |
| One major difference between `git submodule update` and `git submodule add` is
 | |
| that `git submodule update` checks out a specific commit, rather than the tip
 | |
| of a branch. It's like checking out a tag: the head is detached, so you're not
 | |
| working on a branch.
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git branch
 | |
| * (detached from d266b98)
 | |
|   master
 | |
| -------------------------------------------------
 | |
| 
 | |
| If you want to make a change within a submodule and you have a detached head,
 | |
| then you should create or checkout a branch, make your changes, publish the
 | |
| change within the submodule, and then update the superproject to reference the
 | |
| new commit:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git switch master
 | |
| -------------------------------------------------
 | |
| 
 | |
| or
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git switch -c fix-up
 | |
| -------------------------------------------------
 | |
| 
 | |
| then
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ echo "adding a line again" >> a.txt
 | |
| $ git commit -a -m "Updated the submodule from within the superproject."
 | |
| $ git push
 | |
| $ cd ..
 | |
| $ git diff
 | |
| diff --git a/a b/a
 | |
| index d266b98..261dfac 160000
 | |
| --- a/a
 | |
| +++ b/a
 | |
| @@ -1 +1 @@
 | |
| -Subproject commit d266b9873ad50488163457f025db7cdd9683d88b
 | |
| +Subproject commit 261dfac35cb99d380eb966e102c1197139f7fa24
 | |
| $ git add a
 | |
| $ git commit -m "Updated submodule a."
 | |
| $ git push
 | |
| -------------------------------------------------
 | |
| 
 | |
| You have to run `git submodule update` after `git pull` if you want to update
 | |
| submodules, too.
 | |
| 
 | |
| Pitfalls with submodules
 | |
| ------------------------
 | |
| 
 | |
| Always publish the submodule change before publishing the change to the
 | |
| superproject that references it. If you forget to publish the submodule change,
 | |
| others won't be able to clone the repository:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ cd ~/git/super/a
 | |
| $ echo i added another line to this file >> a.txt
 | |
| $ git commit -a -m "doing it wrong this time"
 | |
| $ cd ..
 | |
| $ git add a
 | |
| $ git commit -m "Updated submodule a again."
 | |
| $ git push
 | |
| $ cd ~/git/cloned
 | |
| $ git pull
 | |
| $ git submodule update
 | |
| error: pathspec '261dfac35cb99d380eb966e102c1197139f7fa24' did not match any file(s) known to git.
 | |
| Did you forget to 'git add'?
 | |
| Unable to checkout '261dfac35cb99d380eb966e102c1197139f7fa24' in submodule path 'a'
 | |
| -------------------------------------------------
 | |
| 
 | |
| In older Git versions it could be easily forgotten to commit new or modified
 | |
| files in a submodule, which silently leads to similar problems as not pushing
 | |
| the submodule changes. Starting with Git 1.7.0 both `git status` and `git diff`
 | |
| in the superproject show submodules as modified when they contain new or
 | |
| modified files to protect against accidentally committing such a state. `git
 | |
| diff` will also add a `-dirty` to the work tree side when generating patch
 | |
| output or used with the `--submodule` option:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git diff
 | |
| diff --git a/sub b/sub
 | |
| --- a/sub
 | |
| +++ b/sub
 | |
| @@ -1 +1 @@
 | |
| -Subproject commit 3f356705649b5d566d97ff843cf193359229a453
 | |
| +Subproject commit 3f356705649b5d566d97ff843cf193359229a453-dirty
 | |
| $ git diff --submodule
 | |
| Submodule sub 3f35670..3f35670-dirty:
 | |
| -------------------------------------------------
 | |
| 
 | |
| You also should not rewind branches in a submodule beyond commits that were
 | |
| ever recorded in any superproject.
 | |
| 
 | |
| It's not safe to run `git submodule update` if you've made and committed
 | |
| changes within a submodule without checking out a branch first. They will be
 | |
| silently overwritten:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ cat a.txt
 | |
| module a
 | |
| $ echo line added from private2 >> a.txt
 | |
| $ git commit -a -m "line added inside private2"
 | |
| $ cd ..
 | |
| $ git submodule update
 | |
| Submodule path 'a': checked out 'd266b9873ad50488163457f025db7cdd9683d88b'
 | |
| $ cd a
 | |
| $ cat a.txt
 | |
| module a
 | |
| -------------------------------------------------
 | |
| 
 | |
| NOTE: The changes are still visible in the submodule's reflog.
 | |
| 
 | |
| If you have uncommitted changes in your submodule working tree, `git
 | |
| submodule update` will not overwrite them.  Instead, you get the usual
 | |
| warning about not being able switch from a dirty branch.
 | |
| 
 | |
| [[low-level-operations]]
 | |
| Low-level Git operations
 | |
| ========================
 | |
| 
 | |
| Many of the higher-level commands were originally implemented as shell
 | |
| scripts using a smaller core of low-level Git commands.  These can still
 | |
| be useful when doing unusual things with Git, or just as a way to
 | |
| understand its inner workings.
 | |
| 
 | |
| [[object-manipulation]]
 | |
| Object access and manipulation
 | |
| ------------------------------
 | |
| 
 | |
| The linkgit:git-cat-file[1] command can show the contents of any object,
 | |
| though the higher-level linkgit:git-show[1] is usually more useful.
 | |
| 
 | |
| The linkgit:git-commit-tree[1] command allows constructing commits with
 | |
| arbitrary parents and trees.
 | |
| 
 | |
| A tree can be created with linkgit:git-write-tree[1] and its data can be
 | |
| accessed by linkgit:git-ls-tree[1].  Two trees can be compared with
 | |
| linkgit:git-diff-tree[1].
 | |
| 
 | |
| A tag is created with linkgit:git-mktag[1], and the signature can be
 | |
| verified by linkgit:git-verify-tag[1], though it is normally simpler to
 | |
| use linkgit:git-tag[1] for both.
 | |
| 
 | |
| [[the-workflow]]
 | |
| The Workflow
 | |
| ------------
 | |
| 
 | |
| High-level operations such as linkgit:git-commit[1] and
 | |
| linkgit:git-restore[1] work by moving data
 | |
| between the working tree, the index, and the object database.  Git
 | |
| provides low-level operations which perform each of these steps
 | |
| individually.
 | |
| 
 | |
| Generally, all Git operations work on the index file. Some operations
 | |
| work *purely* on the index file (showing the current state of the
 | |
| index), but most operations move data between the index file and either
 | |
| the database or the working directory. Thus there are four main
 | |
| combinations:
 | |
| 
 | |
| [[working-directory-to-index]]
 | |
| working directory -> index
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| The linkgit:git-update-index[1] command updates the index with
 | |
| information from the working directory.  You generally update the
 | |
| index information by just specifying the filename you want to update,
 | |
| like so:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git update-index filename
 | |
| -------------------------------------------------
 | |
| 
 | |
| but to avoid common mistakes with filename globbing etc., the command
 | |
| will not normally add totally new entries or remove old entries,
 | |
| i.e. it will normally just update existing cache entries.
 | |
| 
 | |
| To tell Git that yes, you really do realize that certain files no
 | |
| longer exist, or that new files should be added, you
 | |
| should use the `--remove` and `--add` flags respectively.
 | |
| 
 | |
| NOTE! A `--remove` flag does 'not' mean that subsequent filenames will
 | |
| necessarily be removed: if the files still exist in your directory
 | |
| structure, the index will be updated with their new status, not
 | |
| removed. The only thing `--remove` means is that update-index will be
 | |
| considering a removed file to be a valid thing, and if the file really
 | |
| does not exist any more, it will update the index accordingly.
 | |
| 
 | |
| As a special case, you can also do `git update-index --refresh`, which
 | |
| will refresh the "stat" information of each index to match the current
 | |
| stat information. It will 'not' update the object status itself, and
 | |
| it will only update the fields that are used to quickly test whether
 | |
| an object still matches its old backing store object.
 | |
| 
 | |
| The previously introduced linkgit:git-add[1] is just a wrapper for
 | |
| linkgit:git-update-index[1].
 | |
| 
 | |
| [[index-to-object-database]]
 | |
| index -> object database
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| You write your current index file to a "tree" object with the program
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git write-tree
 | |
| -------------------------------------------------
 | |
| 
 | |
| that doesn't come with any options--it will just write out the
 | |
| current index into the set of tree objects that describe that state,
 | |
| and it will return the name of the resulting top-level tree. You can
 | |
| use that tree to re-generate the index at any time by going in the
 | |
| other direction:
 | |
| 
 | |
| [[object-database-to-index]]
 | |
| object database -> index
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| You read a "tree" file from the object database, and use that to
 | |
| populate (and overwrite--don't do this if your index contains any
 | |
| unsaved state that you might want to restore later!) your current
 | |
| index.  Normal operation is just
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git read-tree <SHA-1 of tree>
 | |
| -------------------------------------------------
 | |
| 
 | |
| and your index file will now be equivalent to the tree that you saved
 | |
| earlier. However, that is only your 'index' file: your working
 | |
| directory contents have not been modified.
 | |
| 
 | |
| [[index-to-working-directory]]
 | |
| index -> working directory
 | |
| ~~~~~~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| You update your working directory from the index by "checking out"
 | |
| files. This is not a very common operation, since normally you'd just
 | |
| keep your files updated, and rather than write to your working
 | |
| directory, you'd tell the index files about the changes in your
 | |
| working directory (i.e. `git update-index`).
 | |
| 
 | |
| However, if you decide to jump to a new version, or check out somebody
 | |
| else's version, or just restore a previous tree, you'd populate your
 | |
| index file with read-tree, and then you need to check out the result
 | |
| with
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git checkout-index filename
 | |
| -------------------------------------------------
 | |
| 
 | |
| or, if you want to check out all of the index, use `-a`.
 | |
| 
 | |
| NOTE! `git checkout-index` normally refuses to overwrite old files, so
 | |
| if you have an old version of the tree already checked out, you will
 | |
| need to use the `-f` flag ('before' the `-a` flag or the filename) to
 | |
| 'force' the checkout.
 | |
| 
 | |
| 
 | |
| Finally, there are a few odds and ends which are not purely moving
 | |
| from one representation to the other:
 | |
| 
 | |
| [[tying-it-all-together]]
 | |
| Tying it all together
 | |
| ~~~~~~~~~~~~~~~~~~~~~
 | |
| 
 | |
| To commit a tree you have instantiated with `git write-tree`, you'd
 | |
| create a "commit" object that refers to that tree and the history
 | |
| behind it--most notably the "parent" commits that preceded it in
 | |
| history.
 | |
| 
 | |
| Normally a "commit" has one parent: the previous state of the tree
 | |
| before a certain change was made. However, sometimes it can have two
 | |
| or more parent commits, in which case we call it a "merge", due to the
 | |
| fact that such a commit brings together ("merges") two or more
 | |
| previous states represented by other commits.
 | |
| 
 | |
| In other words, while a "tree" represents a particular directory state
 | |
| of a working directory, a "commit" represents that state in time,
 | |
| and explains how we got there.
 | |
| 
 | |
| You create a commit object by giving it the tree that describes the
 | |
| state at the time of the commit, and a list of parents:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git commit-tree <tree> -p <parent> [(-p <parent2>)...]
 | |
| -------------------------------------------------
 | |
| 
 | |
| and then giving the reason for the commit on stdin (either through
 | |
| redirection from a pipe or file, or by just typing it at the tty).
 | |
| 
 | |
| `git commit-tree` will return the name of the object that represents
 | |
| that commit, and you should save it away for later use. Normally,
 | |
| you'd commit a new `HEAD` state, and while Git doesn't care where you
 | |
| save the note about that state, in practice we tend to just write the
 | |
| result to the file pointed at by `.git/HEAD`, so that we can always see
 | |
| what the last committed state was.
 | |
| 
 | |
| Here is a picture that illustrates how various pieces fit together:
 | |
| 
 | |
| ------------
 | |
| 
 | |
|                      commit-tree
 | |
|                       commit obj
 | |
|                        +----+
 | |
|                        |    |
 | |
|                        |    |
 | |
|                        V    V
 | |
|                     +-----------+
 | |
|                     | Object DB |
 | |
|                     |  Backing  |
 | |
|                     |   Store   |
 | |
|                     +-----------+
 | |
|                        ^
 | |
|            write-tree  |     |
 | |
|              tree obj  |     |
 | |
|                        |     |  read-tree
 | |
|                        |     |  tree obj
 | |
|                              V
 | |
|                     +-----------+
 | |
|                     |   Index   |
 | |
|                     |  "cache"  |
 | |
|                     +-----------+
 | |
|          update-index  ^
 | |
|              blob obj  |     |
 | |
|                        |     |
 | |
|     checkout-index -u  |     |  checkout-index
 | |
|              stat      |     |  blob obj
 | |
|                              V
 | |
|                     +-----------+
 | |
|                     |  Working  |
 | |
|                     | Directory |
 | |
|                     +-----------+
 | |
| 
 | |
| ------------
 | |
| 
 | |
| 
 | |
| [[examining-the-data]]
 | |
| Examining the data
 | |
| ------------------
 | |
| 
 | |
| You can examine the data represented in the object database and the
 | |
| index with various helper tools. For every object, you can use
 | |
| linkgit:git-cat-file[1] to examine details about the
 | |
| object:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git cat-file -t <objectname>
 | |
| -------------------------------------------------
 | |
| 
 | |
| shows the type of the object, and once you have the type (which is
 | |
| usually implicit in where you find the object), you can use
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git cat-file blob|tree|commit|tag <objectname>
 | |
| -------------------------------------------------
 | |
| 
 | |
| to show its contents. NOTE! Trees have binary content, and as a result
 | |
| there is a special helper for showing that content, called
 | |
| `git ls-tree`, which turns the binary content into a more easily
 | |
| readable form.
 | |
| 
 | |
| It's especially instructive to look at "commit" objects, since those
 | |
| tend to be small and fairly self-explanatory. In particular, if you
 | |
| follow the convention of having the top commit name in `.git/HEAD`,
 | |
| you can do
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git cat-file commit HEAD
 | |
| -------------------------------------------------
 | |
| 
 | |
| to see what the top commit was.
 | |
| 
 | |
| [[merging-multiple-trees]]
 | |
| Merging multiple trees
 | |
| ----------------------
 | |
| 
 | |
| Git can help you perform a three-way merge, which can in turn be
 | |
| used for a many-way merge by repeating the merge procedure several
 | |
| times.  The usual situation is that you only do one three-way merge
 | |
| (reconciling two lines of history) and commit the result, but if
 | |
| you like to, you can merge several branches in one go.
 | |
| 
 | |
| To perform a three-way merge, you start with the two commits you
 | |
| want to merge, find their closest common parent (a third commit),
 | |
| and compare the trees corresponding to these three commits.
 | |
| 
 | |
| To get the "base" for the merge, look up the common parent of two
 | |
| commits:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git merge-base <commit1> <commit2>
 | |
| -------------------------------------------------
 | |
| 
 | |
| This prints the name of a commit they are both based on. You should
 | |
| now look up the tree objects of those commits, which you can easily
 | |
| do with
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git cat-file commit <commitname> | head -1
 | |
| -------------------------------------------------
 | |
| 
 | |
| since the tree object information is always the first line in a commit
 | |
| object.
 | |
| 
 | |
| Once you know the three trees you are going to merge (the one "original"
 | |
| tree, aka the common tree, and the two "result" trees, aka the branches
 | |
| you want to merge), you do a "merge" read into the index. This will
 | |
| complain if it has to throw away your old index contents, so you should
 | |
| make sure that you've committed those--in fact you would normally
 | |
| always do a merge against your last commit (which should thus match what
 | |
| you have in your current index anyway).
 | |
| 
 | |
| To do the merge, do
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git read-tree -m -u <origtree> <yourtree> <targettree>
 | |
| -------------------------------------------------
 | |
| 
 | |
| which will do all trivial merge operations for you directly in the
 | |
| index file, and you can just write the result out with
 | |
| `git write-tree`.
 | |
| 
 | |
| 
 | |
| [[merging-multiple-trees-2]]
 | |
| Merging multiple trees, continued
 | |
| ---------------------------------
 | |
| 
 | |
| Sadly, many merges aren't trivial. If there are files that have
 | |
| been added, moved or removed, or if both branches have modified the
 | |
| same file, you will be left with an index tree that contains "merge
 | |
| entries" in it. Such an index tree can 'NOT' be written out to a tree
 | |
| object, and you will have to resolve any such merge clashes using
 | |
| other tools before you can write out the result.
 | |
| 
 | |
| You can examine such index state with `git ls-files --unmerged`
 | |
| command.  An example:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git read-tree -m $orig HEAD $target
 | |
| $ git ls-files --unmerged
 | |
| 100644 263414f423d0e4d70dae8fe53fa34614ff3e2860 1	hello.c
 | |
| 100644 06fa6a24256dc7e560efa5687fa84b51f0263c3a 2	hello.c
 | |
| 100644 cc44c73eb783565da5831b4d820c962954019b69 3	hello.c
 | |
| ------------------------------------------------
 | |
| 
 | |
| Each line of the `git ls-files --unmerged` output begins with
 | |
| the blob mode bits, blob SHA-1, 'stage number', and the
 | |
| filename.  The 'stage number' is Git's way to say which tree it
 | |
| came from: stage 1 corresponds to the `$orig` tree, stage 2 to
 | |
| the `HEAD` tree, and stage 3 to the `$target` tree.
 | |
| 
 | |
| Earlier we said that trivial merges are done inside
 | |
| `git read-tree -m`.  For example, if the file did not change
 | |
| from `$orig` to `HEAD` or `$target`, or if the file changed
 | |
| from `$orig` to `HEAD` and `$orig` to `$target` the same way,
 | |
| obviously the final outcome is what is in `HEAD`.  What the
 | |
| above example shows is that file `hello.c` was changed from
 | |
| `$orig` to `HEAD` and `$orig` to `$target` in a different way.
 | |
| You could resolve this by running your favorite 3-way merge
 | |
| program, e.g.  `diff3`, `merge`, or Git's own merge-file, on
 | |
| the blob objects from these three stages yourself, like this:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ git cat-file blob 263414f >hello.c~1
 | |
| $ git cat-file blob 06fa6a2 >hello.c~2
 | |
| $ git cat-file blob cc44c73 >hello.c~3
 | |
| $ git merge-file hello.c~2 hello.c~1 hello.c~3
 | |
| ------------------------------------------------
 | |
| 
 | |
| This would leave the merge result in `hello.c~2` file, along
 | |
| with conflict markers if there are conflicts.  After verifying
 | |
| the merge result makes sense, you can tell Git what the final
 | |
| merge result for this file is by:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ mv -f hello.c~2 hello.c
 | |
| $ git update-index hello.c
 | |
| -------------------------------------------------
 | |
| 
 | |
| When a path is in the "unmerged" state, running `git update-index` for
 | |
| that path tells Git to mark the path resolved.
 | |
| 
 | |
| The above is the description of a Git merge at the lowest level,
 | |
| to help you understand what conceptually happens under the hood.
 | |
| In practice, nobody, not even Git itself, runs `git cat-file` three times
 | |
| for this.  There is a `git merge-index` program that extracts the
 | |
| stages to temporary files and calls a "merge" script on it:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git merge-index git-merge-one-file hello.c
 | |
| -------------------------------------------------
 | |
| 
 | |
| and that is what higher level `git merge -s resolve` is implemented with.
 | |
| 
 | |
| [[hacking-git]]
 | |
| Hacking Git
 | |
| ===========
 | |
| 
 | |
| This chapter covers internal details of the Git implementation which
 | |
| probably only Git developers need to understand.
 | |
| 
 | |
| [[object-details]]
 | |
| Object storage format
 | |
| ---------------------
 | |
| 
 | |
| All objects have a statically determined "type" which identifies the
 | |
| format of the object (i.e. how it is used, and how it can refer to other
 | |
| objects).  There are currently four different object types: "blob",
 | |
| "tree", "commit", and "tag".
 | |
| 
 | |
| Regardless of object type, all objects share the following
 | |
| characteristics: they are all deflated with zlib, and have a header
 | |
| that not only specifies their type, but also provides size information
 | |
| about the data in the object.  It's worth noting that the SHA-1 hash
 | |
| that is used to name the object is the hash of the original data
 | |
| plus this header, so `sha1sum` 'file' does not match the object name
 | |
| for 'file'.
 | |
| 
 | |
| As a result, the general consistency of an object can always be tested
 | |
| independently of the contents or the type of the object: all objects can
 | |
| be validated by verifying that (a) their hashes match the content of the
 | |
| file and (b) the object successfully inflates to a stream of bytes that
 | |
| forms a sequence of
 | |
| `<ascii type without space> + <space> + <ascii decimal size> +
 | |
| <byte\0> + <binary object data>`.
 | |
| 
 | |
| The structured objects can further have their structure and
 | |
| connectivity to other objects verified. This is generally done with
 | |
| the `git fsck` program, which generates a full dependency graph
 | |
| of all objects, and verifies their internal consistency (in addition
 | |
| to just verifying their superficial consistency through the hash).
 | |
| 
 | |
| [[birdview-on-the-source-code]]
 | |
| A birds-eye view of Git's source code
 | |
| -------------------------------------
 | |
| 
 | |
| It is not always easy for new developers to find their way through Git's
 | |
| source code.  This section gives you a little guidance to show where to
 | |
| start.
 | |
| 
 | |
| A good place to start is with the contents of the initial commit, with:
 | |
| 
 | |
| ----------------------------------------------------
 | |
| $ git switch --detach e83c5163
 | |
| ----------------------------------------------------
 | |
| 
 | |
| The initial revision lays the foundation for almost everything Git has
 | |
| today, but is small enough to read in one sitting.
 | |
| 
 | |
| Note that terminology has changed since that revision.  For example, the
 | |
| README in that revision uses the word "changeset" to describe what we
 | |
| now call a <<def_commit_object,commit>>.
 | |
| 
 | |
| Also, we do not call it "cache" any more, but rather "index"; however, the
 | |
| file is still called `cache.h`.  Remark: Not much reason to change it now,
 | |
| especially since there is no good single name for it anyway, because it is
 | |
| basically _the_ header file which is included by _all_ of Git's C sources.
 | |
| 
 | |
| If you grasp the ideas in that initial commit, you should check out a
 | |
| more recent version and skim `cache.h`, `object.h` and `commit.h`.
 | |
| 
 | |
| In the early days, Git (in the tradition of UNIX) was a bunch of programs
 | |
| which were extremely simple, and which you used in scripts, piping the
 | |
| output of one into another. This turned out to be good for initial
 | |
| development, since it was easier to test new things.  However, recently
 | |
| many of these parts have become builtins, and some of the core has been
 | |
| "libified", i.e. put into libgit.a for performance, portability reasons,
 | |
| and to avoid code duplication.
 | |
| 
 | |
| By now, you know what the index is (and find the corresponding data
 | |
| structures in `cache.h`), and that there are just a couple of object types
 | |
| (blobs, trees, commits and tags) which inherit their common structure from
 | |
| `struct object`, which is their first member (and thus, you can cast e.g.
 | |
| `(struct object *)commit` to achieve the _same_ as `&commit->object`, i.e.
 | |
| get at the object name and flags).
 | |
| 
 | |
| Now is a good point to take a break to let this information sink in.
 | |
| 
 | |
| Next step: get familiar with the object naming.  Read <<naming-commits>>.
 | |
| There are quite a few ways to name an object (and not only revisions!).
 | |
| All of these are handled in `sha1_name.c`. Just have a quick look at
 | |
| the function `get_sha1()`. A lot of the special handling is done by
 | |
| functions like `get_sha1_basic()` or the likes.
 | |
| 
 | |
| This is just to get you into the groove for the most libified part of Git:
 | |
| the revision walker.
 | |
| 
 | |
| Basically, the initial version of `git log` was a shell script:
 | |
| 
 | |
| ----------------------------------------------------------------
 | |
| $ git-rev-list --pretty $(git-rev-parse --default HEAD "$@") | \
 | |
| 	LESS=-S ${PAGER:-less}
 | |
| ----------------------------------------------------------------
 | |
| 
 | |
| What does this mean?
 | |
| 
 | |
| `git rev-list` is the original version of the revision walker, which
 | |
| _always_ printed a list of revisions to stdout.  It is still functional,
 | |
| and needs to, since most new Git commands start out as scripts using
 | |
| `git rev-list`.
 | |
| 
 | |
| `git rev-parse` is not as important any more; it was only used to filter out
 | |
| options that were relevant for the different plumbing commands that were
 | |
| called by the script.
 | |
| 
 | |
| Most of what `git rev-list` did is contained in `revision.c` and
 | |
| `revision.h`.  It wraps the options in a struct named `rev_info`, which
 | |
| controls how and what revisions are walked, and more.
 | |
| 
 | |
| The original job of `git rev-parse` is now taken by the function
 | |
| `setup_revisions()`, which parses the revisions and the common command-line
 | |
| options for the revision walker. This information is stored in the struct
 | |
| `rev_info` for later consumption. You can do your own command-line option
 | |
| parsing after calling `setup_revisions()`. After that, you have to call
 | |
| `prepare_revision_walk()` for initialization, and then you can get the
 | |
| commits one by one with the function `get_revision()`.
 | |
| 
 | |
| If you are interested in more details of the revision walking process,
 | |
| just have a look at the first implementation of `cmd_log()`; call
 | |
| `git show v1.3.0~155^2~4` and scroll down to that function (note that you
 | |
| no longer need to call `setup_pager()` directly).
 | |
| 
 | |
| Nowadays, `git log` is a builtin, which means that it is _contained_ in the
 | |
| command `git`.  The source side of a builtin is
 | |
| 
 | |
| - a function called `cmd_<bla>`, typically defined in `builtin/<bla.c>`
 | |
|   (note that older versions of Git used to have it in `builtin-<bla>.c`
 | |
|   instead), and declared in `builtin.h`.
 | |
| 
 | |
| - an entry in the `commands[]` array in `git.c`, and
 | |
| 
 | |
| - an entry in `BUILTIN_OBJECTS` in the `Makefile`.
 | |
| 
 | |
| Sometimes, more than one builtin is contained in one source file.  For
 | |
| example, `cmd_whatchanged()` and `cmd_log()` both reside in `builtin/log.c`,
 | |
| since they share quite a bit of code.  In that case, the commands which are
 | |
| _not_ named like the `.c` file in which they live have to be listed in
 | |
| `BUILT_INS` in the `Makefile`.
 | |
| 
 | |
| `git log` looks more complicated in C than it does in the original script,
 | |
| but that allows for a much greater flexibility and performance.
 | |
| 
 | |
| Here again it is a good point to take a pause.
 | |
| 
 | |
| Lesson three is: study the code.  Really, it is the best way to learn about
 | |
| the organization of Git (after you know the basic concepts).
 | |
| 
 | |
| So, think about something which you are interested in, say, "how can I
 | |
| access a blob just knowing the object name of it?".  The first step is to
 | |
| find a Git command with which you can do it.  In this example, it is either
 | |
| `git show` or `git cat-file`.
 | |
| 
 | |
| For the sake of clarity, let's stay with `git cat-file`, because it
 | |
| 
 | |
| - is plumbing, and
 | |
| 
 | |
| - was around even in the initial commit (it literally went only through
 | |
|   some 20 revisions as `cat-file.c`, was renamed to `builtin/cat-file.c`
 | |
|   when made a builtin, and then saw less than 10 versions).
 | |
| 
 | |
| So, look into `builtin/cat-file.c`, search for `cmd_cat_file()` and look what
 | |
| it does.
 | |
| 
 | |
| ------------------------------------------------------------------
 | |
|         git_config(git_default_config);
 | |
|         if (argc != 3)
 | |
| 		usage("git cat-file [-t|-s|-e|-p|<type>] <sha1>");
 | |
|         if (get_sha1(argv[2], sha1))
 | |
|                 die("Not a valid object name %s", argv[2]);
 | |
| ------------------------------------------------------------------
 | |
| 
 | |
| Let's skip over the obvious details; the only really interesting part
 | |
| here is the call to `get_sha1()`.  It tries to interpret `argv[2]` as an
 | |
| object name, and if it refers to an object which is present in the current
 | |
| repository, it writes the resulting SHA-1 into the variable `sha1`.
 | |
| 
 | |
| Two things are interesting here:
 | |
| 
 | |
| - `get_sha1()` returns 0 on _success_.  This might surprise some new
 | |
|   Git hackers, but there is a long tradition in UNIX to return different
 | |
|   negative numbers in case of different errors--and 0 on success.
 | |
| 
 | |
| - the variable `sha1` in the function signature of `get_sha1()` is `unsigned
 | |
|   char *`, but is actually expected to be a pointer to `unsigned
 | |
|   char[20]`.  This variable will contain the 160-bit SHA-1 of the given
 | |
|   commit.  Note that whenever a SHA-1 is passed as `unsigned char *`, it
 | |
|   is the binary representation, as opposed to the ASCII representation in
 | |
|   hex characters, which is passed as `char *`.
 | |
| 
 | |
| You will see both of these things throughout the code.
 | |
| 
 | |
| Now, for the meat:
 | |
| 
 | |
| -----------------------------------------------------------------------------
 | |
|         case 0:
 | |
|                 buf = read_object_with_reference(sha1, argv[1], &size, NULL);
 | |
| -----------------------------------------------------------------------------
 | |
| 
 | |
| This is how you read a blob (actually, not only a blob, but any type of
 | |
| object).  To know how the function `read_object_with_reference()` actually
 | |
| works, find the source code for it (something like `git grep
 | |
| read_object_with | grep ":[a-z]"` in the Git repository), and read
 | |
| the source.
 | |
| 
 | |
| To find out how the result can be used, just read on in `cmd_cat_file()`:
 | |
| 
 | |
| -----------------------------------
 | |
|         write_or_die(1, buf, size);
 | |
| -----------------------------------
 | |
| 
 | |
| Sometimes, you do not know where to look for a feature.  In many such cases,
 | |
| it helps to search through the output of `git log`, and then `git show` the
 | |
| corresponding commit.
 | |
| 
 | |
| Example: If you know that there was some test case for `git bundle`, but
 | |
| do not remember where it was (yes, you _could_ `git grep bundle t/`, but that
 | |
| does not illustrate the point!):
 | |
| 
 | |
| ------------------------
 | |
| $ git log --no-merges t/
 | |
| ------------------------
 | |
| 
 | |
| In the pager (`less`), just search for "bundle", go a few lines back,
 | |
| and see that it is in commit 18449ab0.  Now just copy this object name,
 | |
| and paste it into the command line
 | |
| 
 | |
| -------------------
 | |
| $ git show 18449ab0
 | |
| -------------------
 | |
| 
 | |
| Voila.
 | |
| 
 | |
| Another example: Find out what to do in order to make some script a
 | |
| builtin:
 | |
| 
 | |
| -------------------------------------------------
 | |
| $ git log --no-merges --diff-filter=A builtin/*.c
 | |
| -------------------------------------------------
 | |
| 
 | |
| You see, Git is actually the best tool to find out about the source of Git
 | |
| itself!
 | |
| 
 | |
| [[glossary]]
 | |
| Git Glossary
 | |
| ============
 | |
| 
 | |
| [[git-explained]]
 | |
| Git explained
 | |
| -------------
 | |
| 
 | |
| include::glossary-content.txt[]
 | |
| 
 | |
| [[git-quick-start]]
 | |
| Appendix A: Git Quick Reference
 | |
| ===============================
 | |
| 
 | |
| This is a quick summary of the major commands; the previous chapters
 | |
| explain how these work in more detail.
 | |
| 
 | |
| [[quick-creating-a-new-repository]]
 | |
| Creating a new repository
 | |
| -------------------------
 | |
| 
 | |
| From a tarball:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ tar xzf project.tar.gz
 | |
| $ cd project
 | |
| $ git init
 | |
| Initialized empty Git repository in .git/
 | |
| $ git add .
 | |
| $ git commit
 | |
| -----------------------------------------------
 | |
| 
 | |
| From a remote repository:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git clone git://example.com/pub/project.git
 | |
| $ cd project
 | |
| -----------------------------------------------
 | |
| 
 | |
| [[managing-branches]]
 | |
| Managing branches
 | |
| -----------------
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git branch			# list all local branches in this repo
 | |
| $ git switch test	        # switch working directory to branch "test"
 | |
| $ git branch new		# create branch "new" starting at current HEAD
 | |
| $ git branch -d new		# delete branch "new"
 | |
| -----------------------------------------------
 | |
| 
 | |
| Instead of basing a new branch on current HEAD (the default), use:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git branch new test    # branch named "test"
 | |
| $ git branch new v2.6.15 # tag named v2.6.15
 | |
| $ git branch new HEAD^   # commit before the most recent
 | |
| $ git branch new HEAD^^  # commit before that
 | |
| $ git branch new test~10 # ten commits before tip of branch "test"
 | |
| -----------------------------------------------
 | |
| 
 | |
| Create and switch to a new branch at the same time:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git switch -c new v2.6.15
 | |
| -----------------------------------------------
 | |
| 
 | |
| Update and examine branches from the repository you cloned from:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git fetch		# update
 | |
| $ git branch -r		# list
 | |
|   origin/master
 | |
|   origin/next
 | |
|   ...
 | |
| $ git switch -c masterwork origin/master
 | |
| -----------------------------------------------
 | |
| 
 | |
| Fetch a branch from a different repository, and give it a new
 | |
| name in your repository:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git fetch git://example.com/project.git theirbranch:mybranch
 | |
| $ git fetch git://example.com/project.git v2.6.15:mybranch
 | |
| -----------------------------------------------
 | |
| 
 | |
| Keep a list of repositories you work with regularly:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git remote add example git://example.com/project.git
 | |
| $ git remote			# list remote repositories
 | |
| example
 | |
| origin
 | |
| $ git remote show example	# get details
 | |
| * remote example
 | |
|   URL: git://example.com/project.git
 | |
|   Tracked remote branches
 | |
|     master
 | |
|     next
 | |
|     ...
 | |
| $ git fetch example		# update branches from example
 | |
| $ git branch -r			# list all remote branches
 | |
| -----------------------------------------------
 | |
| 
 | |
| 
 | |
| [[exploring-history]]
 | |
| Exploring history
 | |
| -----------------
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ gitk			    # visualize and browse history
 | |
| $ git log		    # list all commits
 | |
| $ git log src/		    # ...modifying src/
 | |
| $ git log v2.6.15..v2.6.16  # ...in v2.6.16, not in v2.6.15
 | |
| $ git log master..test	    # ...in branch test, not in branch master
 | |
| $ git log test..master	    # ...in branch master, but not in test
 | |
| $ git log test...master	    # ...in one branch, not in both
 | |
| $ git log -S'foo()'	    # ...where difference contain "foo()"
 | |
| $ git log --since="2 weeks ago"
 | |
| $ git log -p		    # show patches as well
 | |
| $ git show		    # most recent commit
 | |
| $ git diff v2.6.15..v2.6.16 # diff between two tagged versions
 | |
| $ git diff v2.6.15..HEAD    # diff with current head
 | |
| $ git grep "foo()"	    # search working directory for "foo()"
 | |
| $ git grep v2.6.15 "foo()"  # search old tree for "foo()"
 | |
| $ git show v2.6.15:a.txt    # look at old version of a.txt
 | |
| -----------------------------------------------
 | |
| 
 | |
| Search for regressions:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git bisect start
 | |
| $ git bisect bad		# current version is bad
 | |
| $ git bisect good v2.6.13-rc2	# last known good revision
 | |
| Bisecting: 675 revisions left to test after this
 | |
| 				# test here, then:
 | |
| $ git bisect good		# if this revision is good, or
 | |
| $ git bisect bad		# if this revision is bad.
 | |
| 				# repeat until done.
 | |
| -----------------------------------------------
 | |
| 
 | |
| [[making-changes]]
 | |
| Making changes
 | |
| --------------
 | |
| 
 | |
| Make sure Git knows who to blame:
 | |
| 
 | |
| ------------------------------------------------
 | |
| $ cat >>~/.gitconfig <<\EOF
 | |
| [user]
 | |
| 	name = Your Name Comes Here
 | |
| 	email = you@yourdomain.example.com
 | |
| EOF
 | |
| ------------------------------------------------
 | |
| 
 | |
| Select file contents to include in the next commit, then make the
 | |
| commit:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git add a.txt    # updated file
 | |
| $ git add b.txt    # new file
 | |
| $ git rm c.txt     # old file
 | |
| $ git commit
 | |
| -----------------------------------------------
 | |
| 
 | |
| Or, prepare and create the commit in one step:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git commit d.txt # use latest content only of d.txt
 | |
| $ git commit -a	   # use latest content of all tracked files
 | |
| -----------------------------------------------
 | |
| 
 | |
| [[merging]]
 | |
| Merging
 | |
| -------
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git merge test   # merge branch "test" into the current branch
 | |
| $ git pull git://example.com/project.git master
 | |
| 		   # fetch and merge in remote branch
 | |
| $ git pull . test  # equivalent to git merge test
 | |
| -----------------------------------------------
 | |
| 
 | |
| [[sharing-your-changes]]
 | |
| Sharing your changes
 | |
| --------------------
 | |
| 
 | |
| Importing or exporting patches:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git format-patch origin..HEAD # format a patch for each commit
 | |
| 				# in HEAD but not in origin
 | |
| $ git am mbox # import patches from the mailbox "mbox"
 | |
| -----------------------------------------------
 | |
| 
 | |
| Fetch a branch in a different Git repository, then merge into the
 | |
| current branch:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git pull git://example.com/project.git theirbranch
 | |
| -----------------------------------------------
 | |
| 
 | |
| Store the fetched branch into a local branch before merging into the
 | |
| current branch:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git pull git://example.com/project.git theirbranch:mybranch
 | |
| -----------------------------------------------
 | |
| 
 | |
| After creating commits on a local branch, update the remote
 | |
| branch with your commits:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git push ssh://example.com/project.git mybranch:theirbranch
 | |
| -----------------------------------------------
 | |
| 
 | |
| When remote and local branch are both named "test":
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git push ssh://example.com/project.git test
 | |
| -----------------------------------------------
 | |
| 
 | |
| Shortcut version for a frequently used remote repository:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git remote add example ssh://example.com/project.git
 | |
| $ git push example test
 | |
| -----------------------------------------------
 | |
| 
 | |
| [[repository-maintenance]]
 | |
| Repository maintenance
 | |
| ----------------------
 | |
| 
 | |
| Check for corruption:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git fsck
 | |
| -----------------------------------------------
 | |
| 
 | |
| Recompress, remove unused cruft:
 | |
| 
 | |
| -----------------------------------------------
 | |
| $ git gc
 | |
| -----------------------------------------------
 | |
| 
 | |
| 
 | |
| [[todo]]
 | |
| Appendix B: Notes and todo list for this manual
 | |
| ===============================================
 | |
| 
 | |
| [[todo-list]]
 | |
| Todo list
 | |
| ---------
 | |
| 
 | |
| This is a work in progress.
 | |
| 
 | |
| The basic requirements:
 | |
| 
 | |
| - It must be readable in order, from beginning to end, by someone
 | |
|   intelligent with a basic grasp of the UNIX command line, but without
 | |
|   any special knowledge of Git.  If necessary, any other prerequisites
 | |
|   should be specifically mentioned as they arise.
 | |
| - Whenever possible, section headings should clearly describe the task
 | |
|   they explain how to do, in language that requires no more knowledge
 | |
|   than necessary: for example, "importing patches into a project" rather
 | |
|   than "the `git am` command"
 | |
| 
 | |
| Think about how to create a clear chapter dependency graph that will
 | |
| allow people to get to important topics without necessarily reading
 | |
| everything in between.
 | |
| 
 | |
| Scan `Documentation/` for other stuff left out; in particular:
 | |
| 
 | |
| - howto's
 | |
| - some of `technical/`?
 | |
| - hooks
 | |
| - list of commands in linkgit:git[1]
 | |
| 
 | |
| Scan email archives for other stuff left out
 | |
| 
 | |
| Scan man pages to see if any assume more background than this manual
 | |
| provides.
 | |
| 
 | |
| Add more good examples.  Entire sections of just cookbook examples
 | |
| might be a good idea; maybe make an "advanced examples" section a
 | |
| standard end-of-chapter section?
 | |
| 
 | |
| Include cross-references to the glossary, where appropriate.
 | |
| 
 | |
| Add a section on working with other version control systems, including
 | |
| CVS, Subversion, and just imports of series of release tarballs.
 | |
| 
 | |
| Write a chapter on using plumbing and writing scripts.
 | |
| 
 | |
| Alternates, clone -reference, etc.
 | |
| 
 | |
| More on recovery from repository corruption.  See:
 | |
| 	http://marc.info/?l=git&m=117263864820799&w=2
 | |
| 	http://marc.info/?l=git&m=117147855503798&w=2
 |