Install PostgreSQL with pgvector

PostgreSQL is a powerful relational database. We use it to store not only ordinary data but also vector data.

PostgreSQL supports vector data types through an extension called pgvector.

In this article, we install PostgreSQL and pgvector.

Prerequisites

We assume a Linux system; ours runs CentOS 8.

PostgreSQL 15 is the last version available for our system. pgvector, for its part, requires PostgreSQL 11 or later.

Install PostgreSQL

Go to the official site to download the rpm package.

Or you can just copy the following command to install it.

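bash
# Install the repository RPM.
# The EL-7 segment is the OS major version; use the directory that matches your system (for example EL-8 on CentOS 8).
yum install -y https://download.postgresql.org/pub/repos/yum/reporpms/EL-7-x86_64/pgdg-redhat-repo-latest.noarch.rpm

# On EL 8, disable the built-in PostgreSQL module first:
# dnf -qy module disable postgresql

# Install PostgreSQL:
yum install -y postgresql15-server

# Optionally initialize the database and enable automatic start:
/usr/pgsql-15/bin/postgresql-15-setup initdb
systemctl enable postgresql-15
systemctl start postgresql-15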
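
The repository URL contains the OS major version (the EL-7 part), so it has to match the machine you install on. The following is a minimal sketch of deriving that URL from /etc/os-release instead of typing it by hand. The function names el_major and pgdg_repo_url are made up for this example, and the sketch assumes an x86_64 system whose os-release file has a VERSION_ID line.

```shell
#!/usr/bin/env bash
# Sketch: derive the PGDG repository RPM URL from the OS major version.
# Assumption: the os-release file has a line like VERSION_ID="8" or VERSION_ID="7.9".

# Print the major version number found in an os-release style file.
el_major() {
    local file="${1:-/etc/os-release}"
    # Take the VERSION_ID value, skip an optional quote, keep the leading digits.
    sed -n 's/^VERSION_ID="\{0,1\}\([0-9][0-9]*\).*/\1/p' "$file" | head -n 1
}

# Print the repository RPM URL for a given major version (x86_64 only).
pgdg_repo_url() {
    local major="$1"
    echo "https://download.postgresql.org/pub/repos/yum/reporpms/EL-${major}-x86_64/pgdg-redhat-repo-latest.noarch.rpm"
}

# Print the URL for this machine, if the version can be detected.
if [ -r /etc/os-release ]; then
    major="$(el_major /etc/os-release)"
    if [ -n "$major" ]; then
        pgdg_repo_url "$major"
    fi
fi
```

The script only prints the URL. Check it before passing it to yum install -y, since it is only meaningful on a RHEL-compatible system.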

Check that PostgreSQL was installed successfully.

bash
psql --version
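
Since pgvector needs PostgreSQL 11 or later, the version check can also be scripted. This is a rough sketch that assumes psql --version prints a line of the form "psql (PostgreSQL) 15.4". The helper names pg_major and meets_minimum are invented for this example.

```shell
#!/usr/bin/env bash
# Sketch: check that the installed PostgreSQL major version is high enough.

# Extract the major version from `psql --version` style output.
pg_major() {
    # Example input: "psql (PostgreSQL) 15.4"
    echo "$1" | sed -n 's/.*(PostgreSQL) \([0-9][0-9]*\).*/\1/p'
}

# Succeed when the major version ($1) is at least the required minimum ($2).
meets_minimum() {
    [ "$1" -ge "$2" ] 2>/dev/null
}

# Only run the check when psql is actually on the PATH.
if command -v psql >/dev/null 2>&1; then
    major="$(pg_major "$(psql --version)")"
    if meets_minimum "$major" 11; then
        echo "PostgreSQL $major is new enough for pgvector"
    else
        echo "PostgreSQL version could not be confirmed as 11 or later" >&2
    fi
fi
```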

Some useful commands:

bash
systemctl status postgresql-15
systemctl stop postgresql-15
systemctl restart postgresql-15

Set up PostgreSQL

The installation automatically creates a Linux user named postgres. Switch to that user.

bash
su - postgres

Now open the PostgreSQL shell and change the password of the postgres database user. The password below is only a placeholder; choose a strong one.

bash
psql

sql
ALTER USER postgres WITH PASSWORD 'postgres';
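
A password such as 'postgres' is easy to guess. As a minimal sketch, a random password can be generated in the shell and dropped into the same statement. The function name gen_password is made up for this example, and the sketch assumes /dev/urandom and the base64 utility are available.

```shell
#!/usr/bin/env bash
# Sketch: generate a random password and print the matching ALTER USER statement.

# Generate a random 24-character password from the kernel's random source.
gen_password() {
    # 18 random bytes encode to exactly 24 base64 characters (no padding).
    # '+' and '/' are mapped to '_' and '-' so the result is safe inside a quoted SQL string.
    head -c 18 /dev/urandom | base64 | tr -d '\n' | tr '+/' '_-'
}

pw="$(gen_password)"

# Print the statement instead of running it, so it can be reviewed first.
printf "ALTER USER postgres WITH PASSWORD '%s';\n" "$pw"
```

The printed statement can be pasted into psql. Keep in mind that the password then shows up in the terminal scrollback.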

When you are done, enter the exit command to leave the PostgreSQL shell.

The password of the Linux postgres user should also be changed from its default.

bash
# Change the password of the Linux postgres user.
# Note: the default password is easy to guess and therefore easy to attack.
sudo passwd -d postgres
sudo -u postgres passwd

Install pgvector

By default, pgvector is installed by compiling it from source. In our experience, however, that was a tough task with a lot of errors.

Eventually, we found that pgvector can be installed from an RPM package instead. It is much easier and works well.

bash
yum install pgvector_15

Create a user

Enter the PostgreSQL shell again with psql.

sql
-- Create a user
CREATE USER dbroot WITH PASSWORD '123root';

-- Create a database
CREATE DATABASE vectdb OWNER dbroot;

-- Grant privileges
GRANT ALL PRIVILEGES ON DATABASE vectdb TO dbroot;

-- Grant superuser rights, needed to install the pgvector extension
ALTER USER dbroot WITH SUPERUSER;

Then create a table with a vector column to test that it works.

bash
# Log in locally
psql -U dbroot -h localhost -p 5432 -d vectdb

sql
-- Create the extension
CREATE EXTENSION vector;

-- Create a table with a vector column
CREATE TABLE items (id bigserial PRIMARY KEY, embedding vector(3));

-- Insert two rows
INSERT INTO items (embedding) VALUES ('[1,2,3]'), ('[4,5,6]');

-- Vector search
SELECT * FROM items ORDER BY embedding <-> '[3,1,2]' LIMIT 5;
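
The <-> operator in the query orders rows by Euclidean (L2) distance to the given vector. To see why the row [1,2,3] comes back before [4,5,6], the same distances can be computed by hand. This is a small sketch using awk. The function name l2_distance is made up for this example and is not part of pgvector.

```shell
#!/usr/bin/env bash
# Sketch: compute the Euclidean (L2) distance between two comma-separated vectors.

l2_distance() {
    awk -v a="$1" -v b="$2" 'BEGIN {
        n = split(a, x, ",")
        m = split(b, y, ",")
        # Both vectors must have the same number of dimensions.
        if (n != m) { print "dimension mismatch" > "/dev/stderr"; exit 1 }
        # Sum the squared differences, then take the square root.
        for (i = 1; i <= n; i++) sum += (x[i] - y[i]) ^ 2
        printf "%.4f\n", sqrt(sum)
    }'
}

l2_distance "1,2,3" "3,1,2"   # prints 2.4495
l2_distance "4,5,6" "3,1,2"   # prints 5.7446
```

The first row is closer to the query vector [3,1,2], so it is returned first.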

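Allow remote access to PostgreSQL

bash
cd /var/lib/pgsql/15/data/

vim postgresql.conf
# Change the listen address to *.
# listen_addresses is commented out by default, so you can simply add this line at the top of the file.
listen_addresses='*'

vim pg_hba.conf
# Add this line at the end of the file.
# Note: allowing access from every external address makes the service easy to attack.
host  all  all 0.0.0.0/0 md5

# Restart the PostgreSQL service
sudo systemctl restart postgresql-15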
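
Because opening the server to 0.0.0.0/0 exposes it to everyone, a narrower rule is often enough. The following is a minimal sketch of appending a rule for a single subnet without duplicating it on repeated runs. The function name add_hba_rule and the subnet 192.168.1.0/24 are assumptions for illustration, and the demo writes to a scratch file rather than the real pg_hba.conf.

```shell
#!/usr/bin/env bash
# Sketch: add a pg_hba.conf rule for one subnet, only if it is not there yet.

# Append a host rule to a pg_hba.conf style file unless it is already present.
add_hba_rule() {
    local file="$1" cidr="$2"
    local rule="host  all  all  ${cidr}  md5"
    # -x matches the whole line, -F treats the rule as a fixed string.
    grep -qxF "$rule" "$file" 2>/dev/null || echo "$rule" >> "$file"
}

# Demo on a scratch file; the subnet is a hypothetical example.
demo="$(mktemp)"
add_hba_rule "$demo" "192.168.1.0/24"
add_hba_rule "$demo" "192.168.1.0/24"   # second call adds nothing
cat "$demo"
rm -f "$demo"
```

On a real server the first argument would be /var/lib/pgsql/15/data/pg_hba.conf, followed by a restart of the postgresql-15 service.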

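Some useful psql commands

sql
-- List all databases
\l

-- Switch to another database
\c vectdb

-- List all tables in the current database
\d

-- Quit psql
\q

-- Show the location of the configuration file
SHOW config_file;

-- Show the data directory
SHOW data_directory;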

Thanks to these articles that helped me a lot!
