git/builtin at ab6eea6f7b9a5289d72c05476da19ab2bb457fd3 - git

admin/git


Jeff King ab6eea6f7b receive-pack: use oidset to de-duplicate .have lines

If you have an alternate object store with a very large
number of refs, the peak memory usage of the sha1_array can
grow high, even if most of them are duplicates that end up
not being printed at all.

The similar for_each_alternate_ref() code-paths in
fetch-pack solve this by using flags in "struct object" to
de-duplicate (and so are relying on obj_hash at the core).

But we don't have a "struct object" at all in this case. We
could call lookup_unknown_object() to get one, but if our
goal is reducing memory footprint, it's not great:

 - an unknown object is as large as the largest object type
   (a commit), which is bigger than an oidset entry

 - we can free the memory after our ref advertisement, but
   "struct object" entries persist forever (and the
   receive-pack may hang around for a long time, as the
   bottleneck is often client upload bandwidth).

So let's use an oidset. Note that unlike a sha1-array it
doesn't sort the output as a side effect. However, our
output is at least stable, because for_each_alternate_ref()
will give us the sha1s in ref-sorted order.
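
The de-duplication idea described above can be sketched in a few lines of C. This is only an illustration under stated assumptions, not the code from this commit: `toy_id`, `toy_idset`, and `dedup_ids()` are hypothetical names invented here, and the set is a simple open-addressing table rather than git's actual oidset. The point it shows is that each id is emitted only the first time it is seen, so the output keeps the input order and memory grows with the number of unique ids, not the total.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

#define ID_LEN 20 /* size of a raw SHA-1 */

/* Hypothetical stand-in for an object id; not git's struct object_id. */
struct toy_id { unsigned char hash[ID_LEN]; };

/* Hypothetical open-addressing set of ids; not git's oidset. */
struct toy_idset {
	struct toy_id *slots;
	unsigned char *used;
	size_t cap; /* always a power of two */
	size_t nr;  /* number of ids stored */
};

static void toy_idset_init(struct toy_idset *s, size_t cap)
{
	assert(cap && (cap & (cap - 1)) == 0);
	s->slots = calloc(cap, sizeof(*s->slots));
	s->used = calloc(cap, 1);
	assert(s->slots && s->used);
	s->cap = cap;
	s->nr = 0;
}

static void toy_idset_clear(struct toy_idset *s)
{
	free(s->slots);
	free(s->used);
	memset(s, 0, sizeof(*s));
}

/* The ids are already hashes, so their leading bytes serve as the hash. */
static size_t toy_hash(const struct toy_id *id)
{
	size_t h;
	memcpy(&h, id->hash, sizeof(h));
	return h;
}

static int toy_idset_insert(struct toy_idset *s, const struct toy_id *id);

/* Double the table and re-insert every stored id. */
static void toy_idset_grow(struct toy_idset *s)
{
	struct toy_idset bigger;
	size_t i;

	toy_idset_init(&bigger, s->cap * 2);
	for (i = 0; i < s->cap; i++)
		if (s->used[i])
			toy_idset_insert(&bigger, &s->slots[i]);
	toy_idset_clear(s);
	*s = bigger;
}

/* Returns 1 if the id was already in the set, 0 if it was newly added. */
static int toy_idset_insert(struct toy_idset *s, const struct toy_id *id)
{
	size_t i;

	if ((s->nr + 1) * 2 > s->cap) /* keep the table at most half full */
		toy_idset_grow(s);
	i = toy_hash(id) & (s->cap - 1);
	while (s->used[i]) {
		if (!memcmp(s->slots[i].hash, id->hash, ID_LEN))
			return 1;
		i = (i + 1) & (s->cap - 1); /* linear probing */
	}
	s->slots[i] = *id;
	s->used[i] = 1;
	s->nr++;
	return 0;
}

/*
 * Copy the first occurrence of each id from in[] to out[], keeping the
 * input order. Returns the number of ids written to out[].
 */
static size_t dedup_ids(const struct toy_id *in, size_t n, struct toy_id *out)
{
	struct toy_idset seen;
	size_t i, kept = 0;

	toy_idset_init(&seen, 16);
	for (i = 0; i < n; i++)
		if (!toy_idset_insert(&seen, &in[i]))
			out[kept++] = in[i];
	toy_idset_clear(&seen); /* the set can be freed once output is done */
	return kept;
}
```

A caller would pass the ids in the order they arrive and write out only the `kept` entries. Because nothing is sorted, the output is stable only to the extent that the input order is stable, which matches the remark above about ref-sorted order.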

In one particularly pathological case with an alternate that
has 60,000 unique refs out of 80 million total, this reduced
the peak heap usage of "git receive-pack . </dev/null" from
13GB to 14MB.

Signed-off-by: Jeff King <peff@peff.net>
Signed-off-by: Junio C Hamano <gitster@pobox.com>

2017-02-08 15:39:55 -08:00

add.c

hold_locked_index(): align error handling with hold_lockfile_for_update()

2016-12-07 11:31:59 -08:00

am.c

Merge branch 'sb/sequencer-abort-safety'

2016-12-21 14:55:01 -08:00

annotate.c

…

apply.c

Convert read_mmblob to take struct object_id.

2016-09-07 12:59:42 -07:00

archive.c

archive: read local configuration

2016-11-22 13:55:20 -08:00

bisect--helper.c

…

blame.c

use oid_to_hex_r() for converting struct object_id hashes to hex strings

2017-01-30 14:23:40 -08:00

branch.c

Merge branch 'nd/for-each-ref-ignore-case'

2016-12-19 14:45:31 -08:00

bundle.c

…

cat-file.c

Merge branch 'jk/pack-objects-optim-mru'

2016-10-10 14:03:47 -07:00

check-attr.c

…

check-ignore.c

…

check-mailmap.c

…

check-ref-format.c

…

checkout-index.c

hold_locked_index(): align error handling with hold_lockfile_for_update()

2016-12-07 11:31:59 -08:00

checkout.c

Merge branch 'cw/log-updates-for-all-refs-really'

2017-02-03 11:25:19 -08:00

clean.c

i18n: clean.c: match string with git-add--interactive.perl

2016-12-14 11:00:05 -08:00

clone.c

Merge branch 'rs/absolute-pathdup'

2017-02-02 13:36:55 -08:00

column.c

…

commit-tree.c

builtin/commit-tree: convert to struct object_id

2016-09-07 12:59:43 -07:00

commit.c

builtin/commit.c: switch to strbuf, instead of snprintf()

2017-01-31 10:09:00 -08:00

config.c

i18n: config: mark error message for translation

2016-09-15 13:17:32 -07:00

count-objects.c

alternates: use fspathcmp to detect duplicates

2016-10-10 13:52:37 -07:00

credential.c

…

describe.c

use QSORT

2016-09-29 15:42:18 -07:00

diff-files.c

diff: run arguments through precompose_argv

2016-05-13 14:35:49 -07:00

diff-index.c

diff: run arguments through precompose_argv

2016-05-13 14:35:49 -07:00

diff-tree.c

Merge branch 'ar/diff-args-osx-precompose' into maint

2016-06-06 14:27:35 -07:00

diff.c

Merge branch 'jk/setup-sequence-update'

2016-09-21 15:15:24 -07:00

difftool.c

difftool: hack around -Wzero-length-format warning

2017-01-25 13:28:34 -08:00

fast-export.c

use QSORT

2016-09-29 15:42:18 -07:00

fetch-pack.c

Merge branch 'nd/shallow-deepen'

2016-10-10 14:03:50 -07:00

fetch.c

Merge branch 'js/remote-rename-with-half-configured-remote'

2017-01-31 13:14:59 -08:00

fmt-merge-msg.c

remove unnecessary check before QSORT

2016-09-29 15:42:18 -07:00

for-each-ref.c

tag, branch, for-each-ref: add --ignore-case for sorting and filtering

2016-12-05 14:59:29 -08:00

fsck.c

Merge branch 'jk/fsck-connectivity-check-fix'

2017-01-31 13:15:01 -08:00

gc.c

auto gc: don't write bitmaps for incremental repacks

2016-12-29 13:45:35 -08:00

get-tar-commit-id.c

…

grep.c

grep: search history of moved submodules

2016-12-22 11:47:33 -08:00

hash-object.c

hash-object: always try to set up the git repository

2016-09-13 15:45:45 -07:00

help.c

Merge branch 'js/no-html-bypass-on-windows' into maint

2016-09-08 21:35:55 -07:00

index-pack.c

index-pack: skip collision check when not in repository

2016-12-16 13:57:19 -08:00

init-db.c

refs: add option core.logAllRefUpdates = always

2017-01-31 10:01:24 -08:00

interpret-trailers.c

Merge branch 'jk/parseopt-string-list' into jk/string-list-static-init

2016-06-13 10:37:48 -07:00

log.c

Merge branch 'jt/format-patch-rfc'

2016-09-26 16:09:17 -07:00

ls-files.c

Merge branch 'bw/ls-files-recurse-submodules'

2016-10-26 13:14:44 -07:00

ls-remote.c

…

ls-tree.c

ls-tree: convert show_recursive to use the pathspec struct interface

2017-01-08 18:04:17 -08:00

mailinfo.c

mailinfo: read local configuration

2016-11-22 13:13:16 -08:00

mailsplit.c

mailsplit: support unescaping mboxrd messages

2016-06-06 11:14:43 -07:00

merge-base.c

merge-base: handle --fork-point without reflog

2016-10-12 14:30:16 -07:00

merge-file.c

builtin/merge-file.c: use error_errno()

2016-05-09 12:29:08 -07:00

merge-index.c

use oid_to_hex_r() for converting struct object_id hashes to hex strings

2017-01-30 14:23:40 -08:00

merge-ours.c

…

merge-recursive.c

i18n: merge-recursive: mark verbose message for translation

2016-09-15 13:17:32 -07:00

merge-tree.c

struct name_entry: use struct object_id instead of unsigned char sha1[20]

2016-04-25 14:23:42 -07:00

merge.c

Merge branch 'cp/merge-continue'

2016-12-27 00:11:41 -08:00

mktag.c

…

mktree.c

use QSORT

2016-09-29 15:42:18 -07:00

mv.c

Merge branch 'bw/pathspec-cleanup'

2017-01-18 15:12:15 -08:00

name-rev.c

use QSORT

2016-09-29 15:42:18 -07:00

notes.c

notes: spell first word of error messages in lowercase

2016-09-15 13:17:32 -07:00

pack-objects.c

compression: unify pack.compression configuration parsing

2016-11-15 21:16:22 -08:00

pack-redundant.c

…

pack-refs.c

…

patch-id.c

Merge branch 'rs/patch-id-use-skip-prefix'

2016-06-03 14:38:03 -07:00

prune-packed.c

…

prune.c

…

pull.c

Merge branch 'jc/pull-rebase-ff' into maint

2017-01-17 15:11:05 -08:00

push.c

Merge branch 'bw/push-submodule-only'

2017-01-31 13:14:56 -08:00

read-tree.c

read-tree: use OPT_BOOL instead of OPT_SET_INT

2017-01-11 13:17:16 -08:00

receive-pack.c

receive-pack: use oidset to de-duplicate .have lines

2017-02-08 15:39:55 -08:00

reflog.c

struct name_entry: use struct object_id instead of unsigned char sha1[20]

2016-04-25 14:23:42 -07:00

remote-ext.c

pkt-line: rename packet_write() to packet_write_fmt()

2016-10-17 11:36:50 -07:00

remote-fd.c

…

remote.c

Merge branch 'js/remote-rename-with-half-configured-remote'

2017-01-31 13:14:59 -08:00

repack.c

repack: die on incremental + write-bitmap-index

2016-12-29 13:45:37 -08:00

replace.c

Merge branch 'js/replace-edit-use-editor-configuration' into maint

2016-05-06 14:53:24 -07:00

rerere.c

…

reset.c

hold_locked_index(): align error handling with hold_lockfile_for_update()

2016-12-07 11:31:59 -08:00

rev-list.c

use oid_to_hex_r() for converting struct object_id hashes to hex strings

2017-01-30 14:23:40 -08:00

rev-parse.c

Merge branch 'jk/rev-parse-symbolic-parents-fix' into maint

2017-01-17 14:49:26 -08:00

revert.c

sequencer: get rid of the subcommand field

2016-10-21 09:32:34 -07:00

rm.c

Merge branch 'sb/submodule-rm-absorb'

2017-01-18 15:12:11 -08:00

send-pack.c

Merge branch 'sk/send-pack-all-fix' into maint

2016-04-29 14:15:57 -07:00

shortlog.c

shortlog: group by committer information

2016-12-15 16:19:13 -08:00

show-branch.c

show-branch: use QSORT

2016-10-03 12:46:47 -07:00

show-ref.c

show-ref: remove a stale comment

2017-01-23 18:51:56 -08:00

stripspace.c

stripspace: respect repository config

2016-11-21 11:00:38 -08:00

submodule--helper.c

Merge branch 'rs/absolute-pathdup'

2017-02-02 13:36:55 -08:00

symbolic-ref.c

symbolic-ref -d: do not allow removal of HEAD

2016-09-02 09:01:38 -07:00

tag.c

Merge branch 'st/verify-tag'

2017-01-31 13:14:58 -08:00

unpack-file.c

…

unpack-objects.c

unpack-objects: add --max-input-size=<size> option

2016-08-24 12:31:05 -07:00

update-index.c

hold_locked_index(): align error handling with hold_lockfile_for_update()

2016-12-07 11:31:59 -08:00

update-ref.c

…

update-server-info.c

…

upload-archive.c

archive: read local configuration

2016-11-22 13:55:20 -08:00

var.c

…

verify-commit.c

…

verify-pack.c

…

verify-tag.c

builtin/verify-tag: add --format to verify-tag

2017-01-17 16:10:22 -08:00

worktree.c

worktree list: keep the list sorted

2016-11-28 13:18:51 -08:00

write-tree.c

…