Superset安装

Superset的安装官网上也有比较详细的步骤,参考官方文档。不想踩坑当然推荐还是用docker方式,当然因为Superset本身是用python实现的,对于Python开发者来说,用pypi部署也不是难事,也是一个学习机会。

本文主要是整理了下我部署过程的资料,以供借鉴。

核心步骤不难,主要是各个组件部署在不同的linux环境中,会踩到很多系统组件&动态链接库缺失的问题。

创建Python虚拟环境

我们是在单个机器上部署多个Python服务,所以需要做依赖包的隔离,这里我们采用来管理Python环境。Pyenv:Pyenv安装指引

bash 复制代码
# 安装Python
pyenv install 3.10.14
# 创建虚拟环境
pyenv virtualenv 3.10.14 superset-env
# 激活superset专用python环境
pyenv activate superset-env

安装Superset相关组件

bash 复制代码
# 安装superset本尊
pip install apache-superset
# 安装superset元数据库的连接驱动,如果是mysql
# if failed mysqlclient, then:
# sudo yum install -y python3-devel mysql-devel mysql-server
# 如果都无法解决,试试降级安装pip install mysqlclient==2.1.1
pip install mysqlclient

Superset初始化

编辑初始化配置

先把配置文件写到superset_config.py文件里,后面初始化时会用到这个文件的配置

python 复制代码
from celery.schedules import crontab

# Superset specific config
ROW_LIMIT = 10000

# Flask App Builder configuration
# Your App secret key will be used for securely signing the session cookie
# and encrypting sensitive information on the database
# Make sure you are changing this key for your deployment with a strong key.
# Alternatively you can set it with `SUPERSET_SECRET_KEY` environment variable.
# You MUST set this for production environments or the server will refuse
# to start and you will see an error in the logs accordingly.
SECRET_KEY = ''

# The SQLAlchemy connection string to your database backend
# This connection defines the path to the database that stores your
# superset metadata (slices, connections, tables, dashboards, ...).
# Note that the connection information to connect to the datasources
# you want to explore are managed directly in the web UI
# The check_same_thread=false property ensures the sqlite client does not attempt
# to enforce single-threaded access, which may be problematic in some edge cases
SQLALCHEMY_DATABASE_URI = ''

# Flask-WTF flag for CSRF
WTF_CSRF_ENABLED = True
# Add endpoints that need to be exempt from CSRF protection
WTF_CSRF_EXEMPT_LIST = []
# A CSRF token that expires in 1 year
WTF_CSRF_TIME_LIMIT = 60 * 60 * 24 * 1

# Set this API key to enable Mapbox visualizations
MAPBOX_API_KEY = ''

# activate Jinja templating
FEATURE_FLAGS = {
  "ENABLE_TEMPLATE_PROCESSING": True,
  "ALERT_REPORTS": True
}

引用初始化配置文件

bash 复制代码
export SUPERSET_CONFIG_PATH=superset_config.py
export FLASK_APP=superset

初始化数据库和元数据

bash 复制代码
# Then, you need to initialize the database:
superset db upgrade
# Create an admin user in your metadata database (use `admin` as username to be able to load the examples)
superset fab create-admin
# Load some data to play with
superset load_examples
# Create default roles and permissions
superset init

启动服务

接下来就可以直接启动服务了,以下我整理的启停脚本:

bash 复制代码
#!/bin/bash
# superset_start.sh
pyenv activate superset-env
export SUPERSET_CONFIG_PATH=superset_config.py
export FLASK_APP=superset

function start() {
  python -m gunicorn --workers 2 --timeout 120 --bind 0.0.0.0:8088 "superset.app:create_app()" --daemon

}

function stop() {
  local name="superset.app"
  pid=$(pgrep -f $name)
  if [ -z "$pid" ]; then
    echo "$ENV $name is not running"
  else
    echo "Stopping $ENV $name, pid=$pid"
    kill $pid
    sleep 3
  fi
}

case $1 in
  start)
    start
  ;;
  stop)
    stop
  ;;
  restart)
    stop
    start
  ;;
  check)
    ps -ef |grep superset.app
  ;;
  *)
    echo "Usage: $0 {start|stop|restart|check}"
esac
相关推荐
feifeikon22 分钟前
Python Day5 进阶语法(列表表达式/三元/断言/with-as/异常捕获/字符串方法/lambda函数
开发语言·python
杰仔正在努力1 小时前
python成长技能之枚举类
开发语言·python
Eiceblue1 小时前
通过Python 调整Excel行高、列宽
开发语言·vscode·python·pycharm·excel
Jam-Young1 小时前
Python中的面向对象编程,类,对象,封装,继承,多态
开发语言·python
Light601 小时前
低代码牵手 AI 接口:开启智能化开发新征程
人工智能·python·深度学习·低代码·链表·线性回归
墨绿色的摆渡人1 小时前
用 Python 从零开始创建神经网络(六):优化(Optimization)介绍
人工智能·python·深度学习·神经网络
小han的日常2 小时前
pycharm分支提交操作
python·pycharm
明月清风徐徐2 小时前
Scrapy爬取豆瓣电影Top250排行榜
python·selenium·scrapy
theLuckyLong2 小时前
SpringBoot后端解决跨域问题
spring boot·后端·python
Yongqiang Cheng2 小时前
Python operator.itemgetter(item) and operator.itemgetter(*items)
python·operator·itemgetter