transformers in tabular tiny survey 2024.4.8

推荐阅读

TabLLM

pmlr2023,

Few-shot Classification of Tabular Data with Large Language Models

方法

使用把tabular数据序列化成文字的方法进行classification。
使用的序列化方法有几个,有人工也有AI生成。

效果

做few shot learning的效果
看上去一般。

TransTab

Learning Transferable Tabular Transformers Across Tables

方法

属于transfer learning的方法。对category、binary和numeric值进行embedding后再进行transformers最后进行classification。

使用场景

原文:

  • S(1) Transfer learning . We collect data tables from multiple cancer trials for testing the efficacy

of the same drug on different patients. These tables were designed independently with overlapping

columns. How do we learn ML models for one trial by leveraging tables from all trials?

  • S(2) Incremental learning . Additional columns might be added over time. For example, additional

features are collected across different trial phases. How do we update the ML models using tables

from all trial phases?

  • S(3) Pretraining+Finetuning . The trial outcome label (e.g., mortality) might not be always available

from all table sources. Can we benefit pretraining on those tables without labels? How do we finetune

the model on the target table with labels?

  • S(4) Zero-shot inference . We model the drug efficacy based on our trial records. The next step is to

conduct inference with the model to find patients that can benefit from the drug. However, patient

tables do not share the same columns as trial tables so direct inference is not possible.

效果

具体看原文吧,与当时的baseline比有提升。

MET

Masked Encoding for Tabular Data

tabtransformer

2020年,arxiv,TabTransformer: Tabular Data Modeling Using Contextual Embeddings

方法

transformer无监督训练,mlp监督训练。

原文

we introduce a pre-training procedure to train the Transformer layers using unlabeled data . This is followed by fine-tuning of the pre-trained Transformer layers along with the top MLP layer using the labeled data

效果

跟mlp

跟其他模型

tabnet

2020, arxiv,Google Cloud AI,Attentive Interpretable Tabular Learning, 封装的非常好,都可以当工具包使用了。

方法

跟transformer没关系的。
feature selection用的是17年的某个选择模型,最后agg一下做predict。

相关推荐
武汉唯众智创几秒前
基于五级工的人工智能训练师教学解决方案
人工智能·ai·产教融合·人工智能训练师·五级工·ai训练师
执笔论英雄5 分钟前
【RL】python协程
java·网络·人工智能·python·设计模式
你好~每一天40 分钟前
未来3年,最值得拿下的5个AI证书!
数据结构·人工智能·算法·sqlite·hbase·散列表·模拟退火算法
老前端的功夫1 小时前
前端技术选型的理性之道:构建可量化的ROI评估模型
前端·javascript·人工智能·ubuntu·前端框架
koo3641 小时前
pytorch深度学习笔记
pytorch·笔记·深度学习
Mxsoft6191 小时前
我发现区块链数据同步延迟,某次故障溯源卡顿,动态调整共识机制救场!
人工智能
m0_488913011 小时前
小白也能懂!RAG技术让AI告别知识滞后,收藏学习
人工智能·学习·langchain·大模型·ai大模型·rag·大模型学习
帮帮志1 小时前
【AI大模型对话】流式输出和非流式输出的定义和区别
开发语言·人工智能·python·大模型·anaconda
陈奕昆1 小时前
n8n实战营Day1课时2:核心概念拆解+天气提醒工作流实操
开发语言·人工智能·n8n
邹小邹-AI1 小时前
未来是AI客服的天下
人工智能