transformers in tabular tiny survey 2024.4.8

推荐阅读

TabLLM

pmlr2023,

Few-shot Classification of Tabular Data with Large Language Models

方法

使用把tabular数据序列化成文字的方法进行classification。
使用的序列化方法有几个,有人工也有AI生成。

效果

做few shot learning的效果
看上去一般。

TransTab

Learning Transferable Tabular Transformers Across Tables

方法

属于transfer learning的方法。对category、binary和numeric值进行embedding后再进行transformers最后进行classification。

使用场景

原文:

  • S(1) Transfer learning . We collect data tables from multiple cancer trials for testing the efficacy

of the same drug on different patients. These tables were designed independently with overlapping

columns. How do we learn ML models for one trial by leveraging tables from all trials?

  • S(2) Incremental learning . Additional columns might be added over time. For example, additional

features are collected across different trial phases. How do we update the ML models using tables

from all trial phases?

  • S(3) Pretraining+Finetuning . The trial outcome label (e.g., mortality) might not be always available

from all table sources. Can we benefit pretraining on those tables without labels? How do we finetune

the model on the target table with labels?

  • S(4) Zero-shot inference . We model the drug efficacy based on our trial records. The next step is to

conduct inference with the model to find patients that can benefit from the drug. However, patient

tables do not share the same columns as trial tables so direct inference is not possible.

效果

具体看原文吧,与当时的baseline比有提升。

MET

Masked Encoding for Tabular Data

tabtransformer

2020年,arxiv,TabTransformer: Tabular Data Modeling Using Contextual Embeddings

方法

transformer无监督训练,mlp监督训练。

原文

we introduce a pre-training procedure to train the Transformer layers using unlabeled data . This is followed by fine-tuning of the pre-trained Transformer layers along with the top MLP layer using the labeled data

效果

跟mlp

跟其他模型

tabnet

2020, arxiv,Google Cloud AI,Attentive Interpretable Tabular Learning, 封装的非常好,都可以当工具包使用了。

方法

跟transformer没关系的。
feature selection用的是17年的某个选择模型,最后agg一下做predict。

相关推荐
deephub1 分钟前
构建一个可自我改进的多 Agent RAG 系统:架构、评估,以及带人工审核的 Prompt 反馈闭环
人工智能·python·大语言模型·rag
zhangxingchao3 分钟前
AI应用开发五:RAG高级技术与调优
前端·人工智能·后端
海兰4 分钟前
【第54篇】Graph + Langfuse 可观测性实战
java·人工智能·spring boot·spring ai
KG_LLM图谱增强大模型18 分钟前
scHilda:大模型与知识图谱分层融合,突破单细胞分型瓶颈
数据库·人工智能·知识图谱
元智启20 分钟前
企业AI如何开发:智能体时代的安全治理架构与合规管控实践
人工智能·安全·架构
Appoint_x23 分钟前
别让 LLM 当复读机:我给文件管理系统做 AI 助手时的三个关键设计
人工智能
摄影图25 分钟前
AI设计实用图片素材 适配多元创作推广需求
人工智能·科技·智能手机·aigc·贴图
HS_Tiger28 分钟前
【个人对AI技术的观点验证】
人工智能
小陶来咯33 分钟前
AI Agent 设计模式:ReAct 深度解析
人工智能·react.js·设计模式
Muyuan199834 分钟前
31.Cursor 初体验:用 AI Agent 给 PaperPilot 做一次最小工程重构
人工智能·python·重构·django·fastapi·faiss