Tutorial: Safely Reducing the Size of a Git Repository

Before starting any of these procedures, make sure to backup your repository.

Tutorial: Safely Reducing the Size of a Git Repository

Prerequisites:
  • A local Git repository.
  • Backup the repository before making changes.
  • Optional: Install tools like BFG Repo-Cleaner.
Step 1: Basic Repository Cleanup

1.1. Run Git Garbage Collection

Start by running the Git built-in garbage collection command, which can help clean up unnecessary files and optimize the repository.

bash 复制代码
cd /path/to/your/repo
git gc --aggressive --prune=now
  • --aggressive: More thorough cleanup.
  • --prune=now: Removes objects that are no longer needed.

1.2. Clean Reflogs

Reflogs record when the tips of branches and other references were updated in the repo. They can consume space, especially in large projects.

bash 复制代码
git reflog expire --expire=now --all
Step 2: Identify and Remove Large Files

2.1. Find Large Files

Use a script to find large files in your repository's history.

bash 复制代码
git rev-list --objects --all |
git cat-file --batch-check='%(objecttype) %(objectname) %(objectsize) %(rest)' |
sort -n -k 3 |
tail -n 10

This command will list the top 10 largest objects in the repo.

2.2. Remove Large Files Using BFG

If you find large files that should not be in the repository, use BFG Repo-Cleaner, which is faster and simpler than git filter-branch.

First, download and run BFG:

bash 复制代码
java -jar bfg.jar --strip-blobs-bigger-than 100M /path/to/your/repo

2.3. Alternative: Use git filter-branch

If you prefer not to use BFG, you can manually remove large files with git filter-branch:

bash 复制代码
git filter-branch --force --index-filter \
  "git rm --cached --ignore-unmatch PATH_TO_LARGE_FILE" \
  --prune-empty --tag-name-filter cat -- --all

Replace PATH_TO_LARGE_FILE with the path to the file you wish to remove.

Step 3: Clone the Repository Afresh

After cleaning up the history, it might be beneficial to clone the repository afresh to start with a new, smaller .git directory.

bash 复制代码
cd ..
git clone --mirror /path/to/old/repo new-repo
cd new-repo
git reflog expire --expire=now --all
git gc --aggressive --prune=now
Step 4: Replace Old Repository

Once you are satisfied with the new repository's state, you can replace the old repository:

bash 复制代码
mv /path/to/old/repo /path/to/old/repo-old
mv new-repo /path/to/old/repo
Final Notes
  • After performing these actions, especially if you changed the history, you will need to force-push to any remotes and inform collaborators to re-clone the repository.
  • Always ensure you have backups and confirm that no critical data is lost during the cleanup.

This tutorial will guide you through reducing the size of your Git repository effectively and safely. Remember, these changes affect the repository's history, which can impact collaborative workflows.

相关推荐
桃花岛主706 分钟前
命令提交git到github上的步骤
git·github
AI逐月14 分钟前
Git 停止追踪已提交文件问题
大数据·git·elasticsearch
是毛毛吧25 分钟前
开发环境配置指南:解决 GitHub 连接超时与依赖下载失败的问题
网络·git·网络安全·docker·信息与通信
bj_zhb1 小时前
git stash 用法
git
rockmelodies1 小时前
本地 PyCharm 生成的 Git 仓库与 GitHub 远程仓库关联起来
git·pycharm·github
阿萨德528号2 小时前
Git 与 SVN 对比详解
git·svn
郑州光合科技余经理2 小时前
定制开发实战:海外版外卖系统PHP全栈解决方案
java·服务器·开发语言·javascript·git·uni-app·php
艾小码2 小时前
解锁Git隐藏力量:从熟练到大师的高级命令指南
git
BestOrNothing_20152 小时前
Git分支实操指南:本地学习分支创建+远程同步避坑全解析
git·分支管理·版本控制·开发规范·新手教程
Lenyiin4 小时前
Linux 项目托管 `git`
linux·运维·服务器·git·lenyiin