LLM-Intro to Large Language Models

mrbone112023-12-04 12:15

LLM

some LLM's model and weight are not opened to user

what is?

Llama 270b model

2 files
- parameters file
  - parameter or weight of neural network
  - parameter -- 2bytes, float number
- code run parameters(inference)
  - c or python, etc
  - for c, 500 lines code without dependency to run
  - self contained package(no network need)
how to get parameters?
- lossy compress large chunk of text (10TB) with 6000 GPU for 12 days (cost 200$) to 140G zip file(gestalt of the text, weights and parameters)
what neural do is trying to predict the next word in a sequence. parameters are dispersed throughout the neural network and neurons are connected to each other, fire in a certain way
prediction has strong relationship with compression
LLM create a correct form of text and fill it with its knowedge. not create a copy of text that was be trained.
how does it work？

training stage

pre-training
- expensive
- base model. get a document generator model
- it's about knowledge
- internet documents
fine tuning
- cheaper
- assistant model. get a assistant model
- it's about alighment
- Q&A document
- training with high quality conversation(question and answer).write labeling instructions to specify how assistant should behave
- focus on quality not amount
stage 3(optional)
- use comparison label
- reenforcement learning from human feedback

labeling is a human-machine collaboration

rank of LLM

LLM scaling laws：

more D and N will get better model

multimodality. now some LLM like GPT can use different tools to help it with answering questions. browser, calculator, python interpreter.
future directions of development in LLM

give LLM system 2 ablility

LLM now only have system one(instinctive)
convert time to accuracy

self-improvement

in narrow domain it is possible to self-improve

customization

experts in certain domain

future of LLM

上一篇：Linux——基本指令（一）

下一篇：TCP传输的三次握手四次挥手策略

热门推荐

01GitHub 镜像站点 02【保姆级教程】免费使用Gemini3的5种方法！免翻墙/国内直连 03BongoCat - 跨平台键盘猫动画工具 04UV安装并设置国内源 05安娜的档案(Anna’s Archive) 镜像网站/国内最新可访问入口（持续更新）06Linux下V2Ray安装配置指南 07Google Antigravity：无法登录？早期错误、登录修复和用户反馈指南 08Labelme从安装到标注：零基础完整指南 09全球最强模型Grok4，国内已可免费使用！（附教程）1046个Nano-banana 精选提示词，持续更新中