Superset安装

Superset的安装官网上也有比较详细的步骤,参考官方文档。不想踩坑当然推荐还是用docker方式,当然因为Superset本身是用python实现的,对于Python开发者来说,用pypi部署也不是难事,也是一个学习机会。

本文主要是整理了下我部署过程的资料,以供借鉴。

核心步骤不难,主要是各个组件部署在不同的linux环境中,会踩到很多系统组件&动态链接库缺失的问题。

创建Python虚拟环境

我们是在单个机器上部署多个Python服务,所以需要做依赖包的隔离,这里我们采用来管理Python环境。Pyenv:Pyenv安装指引

bash 复制代码
# 安装Python
pyenv install 3.10.14
# 创建虚拟环境
pyenv virtualenv 3.10.14 superset-env
# 激活superset专用python环境
pyenv activate superset-env

安装Superset相关组件

bash 复制代码
# 安装superset本尊
pip install apache-superset
# 安装superset元数据库的连接驱动,如果是mysql
# if failed mysqlclient, then:
# sudo yum install -y python3-devel mysql-devel mysql-server
# 如果都无法解决,试试降级安装pip install mysqlclient==2.1.1
pip install mysqlclient

Superset初始化

编辑初始化配置

先把配置文件写到superset_config.py文件里,后面初始化时会用到这个文件的配置

python 复制代码
from celery.schedules import crontab

# Superset specific config
ROW_LIMIT = 10000

# Flask App Builder configuration
# Your App secret key will be used for securely signing the session cookie
# and encrypting sensitive information on the database
# Make sure you are changing this key for your deployment with a strong key.
# Alternatively you can set it with `SUPERSET_SECRET_KEY` environment variable.
# You MUST set this for production environments or the server will refuse
# to start and you will see an error in the logs accordingly.
SECRET_KEY = ''

# The SQLAlchemy connection string to your database backend
# This connection defines the path to the database that stores your
# superset metadata (slices, connections, tables, dashboards, ...).
# Note that the connection information to connect to the datasources
# you want to explore are managed directly in the web UI
# The check_same_thread=false property ensures the sqlite client does not attempt
# to enforce single-threaded access, which may be problematic in some edge cases
SQLALCHEMY_DATABASE_URI = ''

# Flask-WTF flag for CSRF
WTF_CSRF_ENABLED = True
# Add endpoints that need to be exempt from CSRF protection
WTF_CSRF_EXEMPT_LIST = []
# A CSRF token that expires in 1 year
WTF_CSRF_TIME_LIMIT = 60 * 60 * 24 * 1

# Set this API key to enable Mapbox visualizations
MAPBOX_API_KEY = ''

# activate Jinja templating
FEATURE_FLAGS = {
  "ENABLE_TEMPLATE_PROCESSING": True,
  "ALERT_REPORTS": True
}

引用初始化配置文件

bash 复制代码
export SUPERSET_CONFIG_PATH=superset_config.py
export FLASK_APP=superset

初始化数据库和元数据

bash 复制代码
# Then, you need to initialize the database:
superset db upgrade
# Create an admin user in your metadata database (use `admin` as username to be able to load the examples)
superset fab create-admin
# Load some data to play with
superset load_examples
# Create default roles and permissions
superset init

启动服务

接下来就可以直接启动服务了,以下我整理的启停脚本:

bash 复制代码
#!/bin/bash
# superset_start.sh
pyenv activate superset-env
export SUPERSET_CONFIG_PATH=superset_config.py
export FLASK_APP=superset

function start() {
  python -m gunicorn --workers 2 --timeout 120 --bind 0.0.0.0:8088 "superset.app:create_app()" --daemon

}

function stop() {
  local name="superset.app"
  pid=$(pgrep -f $name)
  if [ -z "$pid" ]; then
    echo "$ENV $name is not running"
  else
    echo "Stopping $ENV $name, pid=$pid"
    kill $pid
    sleep 3
  fi
}

case $1 in
  start)
    start
  ;;
  stop)
    stop
  ;;
  restart)
    stop
    start
  ;;
  check)
    ps -ef |grep superset.app
  ;;
  *)
    echo "Usage: $0 {start|stop|restart|check}"
esac
相关推荐
云泽野2 小时前
【Java|集合类】list遍历的6种方式
java·python·list
IMPYLH3 小时前
Python 的内置函数 reversed
笔记·python
小赖同学啊5 小时前
物联网数据安全区块链服务
开发语言·python·区块链
码荼6 小时前
学习开发之hashmap
java·python·学习·哈希算法·个人开发·小白学开发·不花钱不花时间crud
小陈phd7 小时前
李宏毅机器学习笔记——梯度下降法
人工智能·python·机器学习
kk爱闹7 小时前
【挑战14天学完python和pytorch】- day01
android·pytorch·python
Blossom.1187 小时前
机器学习在智能建筑中的应用:能源管理与环境优化
人工智能·python·深度学习·神经网络·机器学习·机器人·sklearn
亚力山大抵7 小时前
实验六-使用PyMySQL数据存储的Flask登录系统-实验七-集成Flask-SocketIO的实时通信系统
后端·python·flask
showyoui7 小时前
Python 闭包(Closure)实战总结
开发语言·python
amazinging8 小时前
北京-4年功能测试2年空窗-报培训班学测开-第四十一天
python·学习·appium