Superset安装

Superset的安装官网上也有比较详细的步骤,参考官方文档。不想踩坑当然推荐还是用docker方式,当然因为Superset本身是用python实现的,对于Python开发者来说,用pypi部署也不是难事,也是一个学习机会。

本文主要是整理了下我部署过程的资料,以供借鉴。

核心步骤不难,主要是各个组件部署在不同的linux环境中,会踩到很多系统组件&动态链接库缺失的问题。

创建Python虚拟环境

我们是在单个机器上部署多个Python服务,所以需要做依赖包的隔离,这里我们采用来管理Python环境。Pyenv:Pyenv安装指引

bash 复制代码
# 安装Python
pyenv install 3.10.14
# 创建虚拟环境
pyenv virtualenv 3.10.14 superset-env
# 激活superset专用python环境
pyenv activate superset-env

安装Superset相关组件

bash 复制代码
# 安装superset本尊
pip install apache-superset
# 安装superset元数据库的连接驱动,如果是mysql
# if failed mysqlclient, then:
# sudo yum install -y python3-devel mysql-devel mysql-server
# 如果都无法解决,试试降级安装pip install mysqlclient==2.1.1
pip install mysqlclient

Superset初始化

编辑初始化配置

先把配置文件写到superset_config.py文件里,后面初始化时会用到这个文件的配置

python 复制代码
from celery.schedules import crontab

# Superset specific config
ROW_LIMIT = 10000

# Flask App Builder configuration
# Your App secret key will be used for securely signing the session cookie
# and encrypting sensitive information on the database
# Make sure you are changing this key for your deployment with a strong key.
# Alternatively you can set it with `SUPERSET_SECRET_KEY` environment variable.
# You MUST set this for production environments or the server will refuse
# to start and you will see an error in the logs accordingly.
SECRET_KEY = ''

# The SQLAlchemy connection string to your database backend
# This connection defines the path to the database that stores your
# superset metadata (slices, connections, tables, dashboards, ...).
# Note that the connection information to connect to the datasources
# you want to explore are managed directly in the web UI
# The check_same_thread=false property ensures the sqlite client does not attempt
# to enforce single-threaded access, which may be problematic in some edge cases
SQLALCHEMY_DATABASE_URI = ''

# Flask-WTF flag for CSRF
WTF_CSRF_ENABLED = True
# Add endpoints that need to be exempt from CSRF protection
WTF_CSRF_EXEMPT_LIST = []
# A CSRF token that expires in 1 year
WTF_CSRF_TIME_LIMIT = 60 * 60 * 24 * 1

# Set this API key to enable Mapbox visualizations
MAPBOX_API_KEY = ''

# activate Jinja templating
FEATURE_FLAGS = {
  "ENABLE_TEMPLATE_PROCESSING": True,
  "ALERT_REPORTS": True
}

引用初始化配置文件

bash 复制代码
export SUPERSET_CONFIG_PATH=superset_config.py
export FLASK_APP=superset

初始化数据库和元数据

bash 复制代码
# Then, you need to initialize the database:
superset db upgrade
# Create an admin user in your metadata database (use `admin` as username to be able to load the examples)
superset fab create-admin
# Load some data to play with
superset load_examples
# Create default roles and permissions
superset init

启动服务

接下来就可以直接启动服务了,以下我整理的启停脚本:

bash 复制代码
#!/bin/bash
# superset_start.sh
pyenv activate superset-env
export SUPERSET_CONFIG_PATH=superset_config.py
export FLASK_APP=superset

function start() {
  python -m gunicorn --workers 2 --timeout 120 --bind 0.0.0.0:8088 "superset.app:create_app()" --daemon

}

function stop() {
  local name="superset.app"
  pid=$(pgrep -f $name)
  if [ -z "$pid" ]; then
    echo "$ENV $name is not running"
  else
    echo "Stopping $ENV $name, pid=$pid"
    kill $pid
    sleep 3
  fi
}

case $1 in
  start)
    start
  ;;
  stop)
    stop
  ;;
  restart)
    stop
    start
  ;;
  check)
    ps -ef |grep superset.app
  ;;
  *)
    echo "Usage: $0 {start|stop|restart|check}"
esac
相关推荐
Yan-英杰34 分钟前
百度搜索和文心智能体接入DeepSeek满血版——AI搜索的新纪元
图像处理·人工智能·python·深度学习·deepseek
weixin_307779132 小时前
Azure上基于OpenAI GPT-4模型验证行政区域数据的设计方案
数据仓库·python·云计算·aws
玩电脑的辣条哥3 小时前
Python如何播放本地音乐并在web页面播放
开发语言·前端·python
多想和从前一样5 小时前
Django 创建表时 “__str__ ”方法的使用
后端·python·django
小喵要摸鱼7 小时前
【Pytorch 库】自定义数据集相关的类
pytorch·python
bdawn7 小时前
深度集成DeepSeek大模型:WebSocket流式聊天实现
python·websocket·openai·api·实时聊天·deepseek大模型·流式输出
Jackson@ML7 小时前
Python数据可视化简介
开发语言·python·数据可视化
mosquito_lover17 小时前
怎么把pyqt界面做的像web一样漂亮
前端·python·pyqt
mengyoufengyu7 小时前
算法12-贪心算法
python·算法·贪心算法
T_Y99438 小时前
pythonrsa加密与sha256加密
python