Georgi Gerganov - ggml - llama.cpp - whisper.cpp

Georgi Gerganov - ggml - llama.cpp - whisper.cpp

  • [1. Georgi Gerganov](#1. Georgi Gerganov)
    • [1.1. Projects](#1.1. Projects)
  • [2. `ggml`](#2. ggml)
  • [3. `llama.cpp`](#3. llama.cpp)
  • [4. `whisper.cpp`](#4. whisper.cpp)
  • References

1. Georgi Gerganov

https://github.com/ggerganov
https://ggerganov.com/

ggml-org
https://github.com/ggml-org

GGML - AI at the edge
https://ggml.ai/

ggml.ai is a company founded by Georgi Gerganov to support the development of ggml.

ggml is a tensor library for machine learning to enable large models and high performance on commodity hardware. It is used by llama.cpp and whisper.cpp.

1.1. Projects

  • whisper.cpp

https://github.com/ggerganov/whisper.cpp

High-performance inference of OpenAI's Whisper automatic speech recognition model

The project provides a high-quality speech-to-text solution that runs on Mac, Windows, Linux, iOS, Android, Raspberry Pi, and Web

  • llama.cpp

https://github.com/ggerganov/llama.cpp

Inference of Meta's LLaMA model (and others) in pure C/C++

The project provides efficient inference across a wide range of hardware and serves as the foundation for numerous LLM-based applications

2. ggml

https://github.com/ggerganov/ggml

Tensor library for machine learning

Some of the development is currently happening in the llama.cpp and whisper.cpp repos.

复制代码
sync : llama.cpp
sync : whisper.cpp

3. llama.cpp

https://github.com/ggerganov/llama.cpp

Inference of Meta's LLaMA model (and others) in pure C/C++

The llama.cpp project is the main playground for developing new features for the ggml library.

复制代码
sync : ggml

4. whisper.cpp

https://github.com/ggerganov/whisper.cpp

High-performance inference of OpenAI's Whisper automatic speech recognition (ASR) model.

The entire high-level implementation of the model is contained in whisper.h and whisper.cpp. The rest of the code is part of the ggml machine learning library.

复制代码
sync : ggml
sync : ggml + llama.cpp
sync : ggml and llama.cpp

References

1\] Yongqiang Cheng, \[2\] Introduction to ggml,

相关推荐
cooldream20095 天前
利用网络算力使用 Unsloth 实现llama大模型的微调部署调用
大模型微调·llama.cpp·unsloth
JOJO___8 天前
【2026】记录在windows编译llama.cpp步骤,AMD CPU本地部署千问3.5本地大模型,内存占用低
windows·cpu·amd·llama.cpp·llama.cpp编译·千问3.5·本地大语言模型
曦云沐10 天前
MacOS 上使用 Metal GPU 加速编译 llama.cpp 完整指南
macos·llama.cpp
feasibility.17 天前
多模态模型Qwen-3.5在Llama-Factory使用+llama.cpp量化导出+部署流程(含报错处理)
人工智能·llm·多模态·量化·llama.cpp·vlm·llama-factory
晨欣1 个月前
单卡 L20 48GB实测 | 同是 Q8_0,为什么 Qwen3.6 在 llama.cpp 长上下文下比 Qwen3.5 更慢?
llama.cpp·qwen3.6-35b-a3b·qwen3.5-35b-a3b
d1z8882 个月前
(二十)32天GPU测试从入门到精通-llama.cpp CPU/GPU 混合推理day18
人工智能·llama·显卡·llama.cpp
gergul2 个月前
在llama-cpp-python中使用自己编译的llama.cpp,解决pip install llama-cpp-python报错
python·llama·llama.cpp·llamacpppython
晨欣2 个月前
单卡 48GB 实测:Gemma 4 26B A4B、Gemma 4 31B、gpt-oss-20b 三模型部署与并发对比
google·openai·nvidia·vllm·llama.cpp·gpt-oss-20b·gemma4
belldeep2 个月前
AI: ggml llama.cpp 与 BitNet 模型介绍
人工智能·llama.cpp·bitnet·gguf·ggml
未来之窗软件服务2 个月前
SenseVoicecpp ggml-rpc.cpp大模型[AI人工智能(七十七)]—东方仙盟
人工智能·rpc·ggml·仙盟创梦ide·东方仙盟