开源免费ETL工具==PYTHON实现

方便自己快速处理一些基于文件的聚合计算,比如CSV。

https://github.com/hebian1994/etl_react_flow

🛠️ ETL Flow Builder

A powerful, visual ETL (Extract-Transform-Load) tool built with React , React Flow , Material-UI , and a Python Flask backend. Design and manage complex data pipelines with a user-friendly interface and DAG-based execution.


📸 Preview

📸 Structure


🚀 Features

🔄 Flow Management

  • Create, edit, and delete ETL flows
  • Flow version history and versioning
  • Real-time configuration and validation

🧩 Node System

  • Modular nodes for various ETL operations:
    • File Input
    • Data Viewer
    • Filter
    • Left Join
  • Custom node configuration panels
  • Node connection validation and schema enforcement

⚙️ Data Processing

  • DAG-based flow execution
  • Schema propagation and management
  • Preview intermediate data at any node
  • Configuration validation before execution

🧱 System Architecture

🖥️ Frontend

  • Framework: React + TypeScript
  • Visualization: React Flow
  • UI Library: Material-UI (MUI)
  • Engine Library: Polars
  • Core Components:
    • FlowList: Dashboard for managing flows
    • Designer: Drag-and-drop interface for building flows
    • History: View and restore previous versions
    • Custom Node UI and Config Panels

🔧 Backend

  • Framework: Python + Flask
  • Database: SQLite (via SQLAlchemy ORM)
  • API: RESTful endpoints for flow and node operations
  • Core Services:
    • FlowService: Handles flow CRUD and metadata
    • NodeService: Manages node lifecycle and configurations
    • ETLService: Executes DAGs and manages schema propagation

🧬 Data Models

复制代码
Flow    ──>  stores flow structure and metadata
Node    ──>  represents each ETL operation
Config  ──>  holds node-specific configuration
Schema  ──>  manages and validates data schema

🔒 State & Validation

  • Node configuration status tracking
  • Flow validation before execution
  • Schema-aware transformations and previews
  • UI-managed configuration state

🛠️ Tech Stack

Layer Technology
Frontend React, TypeScript, React Flow, MUI
Backend Polars, Python, Flask, SQLAlchemy
Database SQLite
Architecture REST API + DAG Executor

📦 Getting Started

🔧 Prerequisites

  • Node.js (v16+)
  • Python (v3.8+)
  • pipenv or virtualenv

🖥️ Frontend Setup

复制代码
cd frontend
npm install
npm run dev

🐍 Backend Setup

复制代码
cd backend
pipenv install
pipenv run flask run
相关推荐
WSSWWWSSW42 分钟前
Seaborn数据可视化实战:Seaborn数据可视化基础-从内置数据集到外部数据集的应用
python·信息可视化·数据分析·matplotlib·seaborn
Small___ming42 分钟前
Matplotlib 可视化大师系列(七):专属篇 - 绘制误差线、等高线与更多特殊图表
python·信息可视化·matplotlib
荼蘼3 小时前
CUDA安装,pytorch库安装
人工智能·pytorch·python
杨荧4 小时前
基于Python的农作物病虫害防治网站 Python+Django+Vue.js
大数据·前端·vue.js·爬虫·python
骑驴看星星a4 小时前
数学建模--Topsis(Python)
开发语言·python·学习·数学建模
学习3人组4 小时前
JupyterLab在线调试实验室
python
颜如玉5 小时前
Redis scan高位进位加法机制浅析
redis·后端·开源
Moment5 小时前
毕业一年了,分享一下我的四个开源项目!😊😊😊
前端·后端·开源
ygy.白茶5 小时前
线性回归入门级
人工智能·python·机器学习
刘同学Python学习日记7 小时前
Python 办公自动化实战:Excel 批量处理 + 自动发邮件
python