Developing a Custom Data Source in Flink

1 Implement SourceFunction or ParallelSourceFunction

import org.apache.flink.streaming.api.functions.source.SourceFunction;

Override the run() and cancel() methods.
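
The run()/cancel() contract can be tried out without Flink on the classpath. The sketch below is plain Java and only illustrates the pattern: SimpleSource is a hypothetical stand-in interface, not a Flink type, and RandomNumberSource and emitUntilCancelled are made-up names. It assumes that cancel() is called from a different thread than run(), which is why the flag is volatile and why the sleep sits inside the loop.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

public class CancellableSourceSketch {

    /** Hypothetical stand-in for the run()/cancel() contract; not a Flink type. */
    interface SimpleSource<T> {
        void run(List<T> out) throws Exception; // emit records until cancelled
        void cancel();                          // invoked from another thread
    }

    /** Emits a random number every few milliseconds until cancel() is called. */
    static class RandomNumberSource implements SimpleSource<Integer> {
        // volatile so the write in cancel() is visible to the thread in run()
        private volatile boolean running = true;

        @Override
        public void run(List<Integer> out) throws Exception {
            Random random = new Random();
            while (running) {
                out.add(random.nextInt(1000));
                Thread.sleep(5); // throttle inside the loop, once per record
            }
        }

        @Override
        public void cancel() {
            running = false;
        }
    }

    /** Runs the source on a worker thread, cancels it after waitMillis, returns what was emitted. */
    public static List<Integer> emitUntilCancelled(long waitMillis) throws InterruptedException {
        RandomNumberSource source = new RandomNumberSource();
        List<Integer> out = Collections.synchronizedList(new ArrayList<>());
        Thread worker = new Thread(() -> {
            try {
                source.run(out);
            } catch (Exception e) {
                Thread.currentThread().interrupt();
            }
        });
        worker.start();
        Thread.sleep(waitMillis);
        source.cancel();   // flips the flag; the loop exits after its current iteration
        worker.join(1000);
        if (worker.isAlive()) {
            throw new IllegalStateException("source did not stop after cancel()");
        }
        return out;
    }

    public static void main(String[] args) throws InterruptedException {
        List<Integer> records = emitUntilCancelled(100);
        System.out.println("emitted " + records.size() + " records before cancel");
    }
}
```

The number of records printed varies from run to run, since it depends on thread scheduling.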

2 AccessSource code

package com.zyb.flink.basic.source;
import com.zyb.flink.basic.bean.Access;
import org.apache.flink.streaming.api.functions.source.SourceFunction;
import java.util.Random;

public class AccessSource implements SourceFunction<Access> {
    // volatile: cancel() is called from a different thread than run()
    private volatile boolean isRunning = true;

    @Override
    public void run(SourceContext<Access> ctx) throws Exception {
        Random random = new Random();
        String[] domains = {"pk1.com","pk2.com","pk3.com","pk4.com","pk5.com"};

        while (isRunning){
            long time = System.currentTimeMillis();
            ctx.collect(new Access(time,domains[random.nextInt(domains.length)],random.nextInt(1000)));
            // Pause inside the loop so that one record is emitted every 2 seconds
            Thread.sleep(2000);
        }
    }

    @Override
    public void cancel() {
        isRunning = false;
    }
}

3 Access code

package com.zyb.flink.basic.bean;

public class Access {
    private long time;
    private String domain;
    private double traffic;

    @Override
    public String toString() {
        return "Access{" +
                "time=" + time +
                ", domain='" + domain + '\'' +
                ", traffic=" + traffic +
                '}';
    }

    public Access() {
    }

    public Access(long time, String domain, double traffic) {
        this.time = time;
        this.domain = domain;
        this.traffic = traffic;
    }

    public long getTime() {
        return time;
    }

    public void setTime(long time) {
        this.time = time;
    }

    public String getDomain() {
        return domain;
    }

    public void setDomain(String domain) {
        this.domain = domain;
    }

    public double getTraffic() {
        return traffic;
    }

    public void setTraffic(double traffic) {
        this.traffic = traffic;
    }
}

4 Test code

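
To try the source, a small driver job along the following lines should work. This is a sketch, assuming Flink 1.x (where SourceFunction and env.addSource are available) and the AccessSource and Access classes from sections 2 and 3. The class name AccessSourceApp and the job name are made up for illustration.

```java
package com.zyb.flink.basic.source;

import com.zyb.flink.basic.bean.Access;
import org.apache.flink.streaming.api.datastream.DataStreamSource;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

// Hypothetical driver class; the name is not from the original post.
public class AccessSourceApp {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();

        // Register the custom source with the job
        DataStreamSource<Access> source = env.addSource(new AccessSource());

        // A plain SourceFunction is non-parallel, so this reports 1
        System.out.println("source parallelism: " + source.getParallelism());

        // Print each Access record using its toString()
        source.print();

        // Nothing runs until execute() is called
        env.execute("AccessSourceApp");
    }
}
```

Because AccessSource implements the plain SourceFunction, it always runs with parallelism 1, and calling setParallelism with a larger value on it is rejected. Implementing ParallelSourceFunction instead allows the source to run with higher parallelism.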