Hugging Face GGUF 模型可视化

Hugging Face GGUF 模型可视化

  • [1. Finding GGUF files (检索 GGUF 模型)](#1. Finding GGUF files (检索 GGUF 模型))
  • [2. Viewer for metadata & tensors info (可视化 GGUF 模型)](#2. Viewer for metadata & tensors info (可视化 GGUF 模型))
  • References

无知小儿,仙家雄霸天下,依附强者才是唯一的出路。否则天地虽大,也让你们无路可走!

GGUF
https://huggingface.co/docs/hub/gguf

Hugging Face Hub supports all file formats, but has built-in features for GGUF format, a binary format that is optimized for quick loading and saving of models, making it highly efficient for inference purposes.

GGUF
https://github.com/ggerganov/ggml/blob/master/docs/gguf.md

GGUF is designed for use with GGML and other executors. GGUF was developed by Georgi Gerganov who is also the developer of llama.cpp, a popular C/C++ LLM inference framework. Models initially developed in frameworks like PyTorch can be converted to GGUF format for use with those engines.

Georgi Gerganov
https://huggingface.co/ggerganov

As we can see in this graph, unlike tensor-only file formats like safetensors - which is also a recommended model format for the Hub - GGUF encodes both the tensors and a standardized set of metadata.

Safetensors
https://huggingface.co/docs/safetensors

1. Finding GGUF files (检索 GGUF 模型)

You can browse all models with GGUF files filtering by the GGUF tag: https://huggingface.co/models?library=gguf.

Moreover, you can use https://huggingface.co/spaces/ggml-org/gguf-my-repo tool to convert/quantize your model weights into GGUF weights.

For example, you can check out TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF for seeing GGUF files in action.

TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF
https://huggingface.co/TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF

deepseek-ai/DeepSeek-V3
https://huggingface.co/deepseek-ai/DeepSeek-V3

2. Viewer for metadata & tensors info (可视化 GGUF 模型)

The Hub has a viewer for GGUF files that lets a user check out metadata & tensors info (name, shape, precison). The viewer is available on model page & files page.

点击下图绿色矩形框区域图标即可进入 Hugging Face GGUF 模型可视化界面。

TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF
https://huggingface.co/TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF/tree/main

MaziyarPanahi/Llama-3.2-1B-Instruct-GGUF
https://huggingface.co/MaziyarPanahi/Llama-3.2-1B-Instruct-GGUF/tree/main

https://huggingface.co/MaziyarPanahi/Llama-3.2-1B-Instruct-GGUF/tree/main?show_file_info=Llama-3.2-1B-Instruct.Q4_K_M.gguf

References

1\] Yongqiang Cheng, \[2\] huggingface/gguf,

相关推荐
HuggingFace4 天前
LeRobot v0.4.0 正式发布:全面提升开源机器人的学习能力
hugging face·lerobot·开源机器人
叶庭云13 天前
一文了解开源大语言模型文件结构,以 Hugging Face DeepSeek-V3.1 模型仓库为例
人工智能·大语言模型·hugging face·1024程序员节·llms·开源模型文件结构·deepseek-v3.1
闲人编程19 天前
深入浅出Transformer:使用Hugging Face库快速上手NLP
python·深度学习·自然语言处理·nlp·transformer·hugging face·codecapsule
安娜的信息安全说1 个月前
Hugging Face vs Ollama:云端协作与本地隐私的AI平台对决
人工智能·hugging face·ollama
martian6652 个月前
大模型部署:AI大模型在医学影像分类中的ONNX、TensorRT和Docker应用
人工智能·docker·分类·ai大模型·hugging face
一粒马豆2 个月前
chromadb使用hugging face模型时利用镜像网站下载注意事项
python·embedding·chroma·词嵌入·hugging face·词向量·chromadb
小喵要摸鱼7 个月前
【Hugging Face 开源库】Diffusers 库 —— 扩散模型
开源·扩散模型·hugging face·diffusers 库
ai_lian_shuo8 个月前
Hugging Face的Transformers核心模块:Pipelines(参数说明,各种模型类型调用案例)
人工智能·pipeline·transformers·hugging face