[Finance] findpapers: A Tool for Searching and Downloading Papers

findpapers search search.json --query "[Deep Learning] AND [Knowledge Graph] AND ([Quantitative Investment] OR [Algorithmic Trading] OR [Financial Analysis] OR [Risk Assessment] OR [Economic Cycle] OR [Business Cycle])" --databases "arxiv,ssrn,repec,econbiz,semanticscholar" --limit-db 40 --verbose

This command uses findpapers to search five databases (arxiv, ssrn, repec, econbiz, semanticscholar) for academic papers that match a boolean keyword query.

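The command has the general form

findpapers search search_broad.json --query "[...]" --databases "arxiv,pubmed" --limit-db 40 --verbose

The first argument (search_broad.json in this template, search.json in the command above) is the JSON file that receives the results, and --databases takes a comma-separated list of databases. The command above therefore searches the databases "arxiv,ssrn,repec,econbiz,semanticscholar" for academic papers that match the keyword combination

"[Deep Learning] AND [Knowledge Graph] AND ([Quantitative Investment] OR [Algorithmic Trading] OR [Financial Analysis] OR [Risk Assessment] OR [Economic Cycle] OR [Business Cycle])"

and saves the results to search.json.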
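Long boolean queries like the one above are easy to mistype by hand. The following is a minimal sketch of composing the query string from plain term lists; the helper names (bracket, any_of, all_of) are hypothetical and are not part of findpapers. It only builds the string passed to --query.

```python
from typing import List


def bracket(term: str) -> str:
    """Wrap a single search term in square brackets."""
    return f"[{term.strip()}]"


def any_of(terms: List[str]) -> str:
    """Join terms with OR; add parentheses only when there are several terms."""
    joined = " OR ".join(bracket(t) for t in terms)
    return f"({joined})" if len(terms) > 1 else joined


def all_of(groups: List[str]) -> str:
    """Join already-built groups with AND."""
    return " AND ".join(groups)


# Term lists taken from the query in the article.
methods = ["Deep Learning"]
structures = ["Knowledge Graph"]
finance_topics = [
    "Quantitative Investment",
    "Algorithmic Trading",
    "Financial Analysis",
    "Risk Assessment",
    "Economic Cycle",
    "Business Cycle",
]

query = all_of([any_of(methods), any_of(structures), any_of(finance_topics)])
print(query)
```

Keeping the terms in lists makes it simple to add or drop a keyword later without rebalancing brackets and parentheses by hand.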

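The parameters are as follows:

search.json: path of the JSON file that the search results are written to
--query: the search query; each term is wrapped in square brackets, and terms are combined with AND, OR, and parentheses
--databases: comma-separated list of the databases to search
--limit-db: maximum number of papers to collect from each database (40 here)
--verbose: print detailed log output while the command runs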

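When the search finishes, the output file contains organized results like the following (this example shows the result for a single candidate paper):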

{
  "databases": [
    "arxiv",
    "ssrn",
    "repec",
    "econbiz",
    "semanticscholar"
  ],
  "limit": null,
  "limit_per_database": 40,
  "number_of_papers": 1,
  "number_of_papers_by_database": {
    "arXiv": 1
  },
  "papers": [
    {
      "abstract": "Knowledge Graphs have emerged as a compelling abstraction for capturing key\nrelationship among the entities of interest to enterprises and for integrating\ndata from heterogeneous sources. JPMorgan Chase (JPMC) is leading this trend by\nleveraging knowledge graphs across the organization for multiple mission\ncritical applications such as risk assessment, fraud detection, investment\nadvice, etc. A core problem in leveraging a knowledge graph is to link mentions\n(e.g., company names) that are encountered in textual sources to entities in\nthe knowledge graph. Although several techniques exist for entity linking, they\nare tuned for entities that exist in Wikipedia, and fail to generalize for the\nentities that are of interest to an enterprise. In this paper, we propose a\nnovel end-to-end neural entity linking model (JEL) that uses minimal context\ninformation and a margin loss to generate entity embeddings, and a Wide & Deep\nLearning model to match character and semantic information respectively. We\nshow that JEL achieves the state-of-the-art performance to link mentions of\ncompany names in financial news with entities in our knowledge graph. We report\non our efforts to deploy this model in the company-wide system to generate\nalerts in response to financial news. The methodology used for JEL is directly\napplicable and usable by other enterprises who need entity linking solutions\nfor data that are unique to their respective situations.",
      "authors": [
        "Wanying Ding",
        "Vinay K. Chaudhri",
        "Naren Chittar",
        "Krishna Konakanchi"
      ],
      "categories": {},
      "citations": null,
      "comments": "8 pages, 4 figures, IAAI-21",
      "databases": [
        "arXiv"
      ],
      "doi": "10.1609/aaai.v35i17.17796",
      "keywords": [],
      "number_of_pages": null,
      "pages": null,
      "publication": null,
      "publication_date": "2024-11-05",
      "selected": true,
      "title": "JEL: Applying End-to-End Neural Entity Linking in JPMorgan Chase",
      "urls": [
        "http://arxiv.org/abs/2411.02695v1",
        "http://arxiv.org/pdf/2411.02695v1",
        "http://dx.doi.org/10.1609/aaai.v35i17.17796"
      ]
    }
  ],
  "processed_at": "2025-10-08 07:39:04",
  "publication_types": null,
  "query": "[Deep Learning] AND [Knowledge Graph] AND ([Quantitative Investment] OR [Algorithmic Trading] OR [Financial Analysis] OR [Risk Assessment] OR [Economic Cycle] OR [Business Cycle])",
  "since": null,
  "until": null
}
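To check a result file quickly without reading the whole JSON, a short script can summarize it. This is a minimal sketch that relies only on the field names visible in the output above ("papers", "title", "databases", "doi"); the function names summarize and load_results are hypothetical, and the sample dictionary is a trimmed stand-in for the real file.

```python
import json
from collections import Counter
from pathlib import Path


def load_results(path: str) -> dict:
    """Read a findpapers result file from disk."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


def summarize(results: dict) -> dict:
    """Count papers in total and per database, and list papers without a DOI."""
    papers = results.get("papers", [])
    per_db = Counter(db for p in papers for db in p.get("databases", []))
    return {
        "total": len(papers),
        "per_database": dict(per_db),
        "titles": [p.get("title") for p in papers],
        "missing_doi": [p.get("title") for p in papers if not p.get("doi")],
    }


# Trimmed stand-in for search.json, using values from the output above.
sample = {
    "papers": [
        {
            "title": "JEL: Applying End-to-End Neural Entity Linking in JPMorgan Chase",
            "databases": ["arXiv"],
            "doi": "10.1609/aaai.v35i17.17796",
        }
    ]
}

summary = summarize(sample)
print(summary)
```

For a real run, replace the sample with load_results("search.json").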

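The search found only 1 paper, so the constraints need to be relaxed. The method terms are now combined with OR, so machine learning is accepted alongside deep learning and knowledge graphs instead of requiring both deep learning and a knowledge graph; the finance terms are broadened with [Finance] and [Investment]; and the databases are restricted to those that better fit quantitative finance and investment needs. The --since 2020-01-01 option additionally limits the results to papers published on or after that date.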

findpapers search search_broad.json --query "([Machine Learning] OR [Deep Learning] OR [Knowledge Graph]) AND ([Quantitative Investment] OR [Algorithmic Trading] OR [Financial Analysis] OR [Risk Assessment] OR [Finance] OR [Investment])" --databases "arxiv,semanticscholar" --limit-db 40 --since 2020-01-01 --verbose
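When the search is repeated with different constraints, assembling the command in a script avoids shell quoting mistakes. The sketch below builds the argument list using only the options that appear in this article (--query, --databases, --limit-db, --since, --verbose); the function build_search_command is hypothetical, and the code prints the command rather than running it.

```python
import shlex
from typing import List, Optional


def build_search_command(
    output_path: str,
    query: str,
    databases: List[str],
    limit_db: Optional[int] = None,
    since: Optional[str] = None,
    verbose: bool = False,
) -> List[str]:
    """Return the findpapers search command as an argument list."""
    cmd = [
        "findpapers", "search", output_path,
        "--query", query,
        "--databases", ",".join(databases),
    ]
    if limit_db is not None:
        cmd += ["--limit-db", str(limit_db)]
    if since is not None:
        cmd += ["--since", since]
    if verbose:
        cmd.append("--verbose")
    return cmd


relaxed_query = (
    "([Machine Learning] OR [Deep Learning] OR [Knowledge Graph]) AND "
    "([Quantitative Investment] OR [Algorithmic Trading] OR [Financial Analysis] "
    "OR [Risk Assessment] OR [Finance] OR [Investment])"
)

cmd = build_search_command(
    "search_broad.json",
    relaxed_query,
    ["arxiv", "semanticscholar"],
    limit_db=40,
    since="2020-01-01",
    verbose=True,
)

# shlex.join quotes each argument so the line can be pasted into a shell.
print(shlex.join(cmd))
# To execute instead of print: subprocess.run(cmd, check=True)
```

Passing the list to subprocess.run sends the query as a single argument, so the brackets and parentheses need no extra escaping.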

Once the search is complete, run the following refinement step to preselect papers:

findpapers refine search_broad.json

During refinement, you decide for each paper in turn whether to keep it.
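Before moving on, it can help to confirm how many papers were actually kept. This is a minimal sketch that assumes the refine step records each decision in the per-paper "selected" field seen in the result JSON, with true meaning kept; the function name and the placeholder titles are hypothetical.

```python
from typing import List, Tuple


def split_by_selection(results: dict) -> Tuple[List[str], List[str]]:
    """Return (kept titles, other titles) based on the "selected" flag."""
    kept, others = [], []
    for paper in results.get("papers", []):
        target = kept if paper.get("selected") is True else others
        target.append(paper.get("title"))
    return kept, others


# Placeholder data; a real run would load search_broad.json with json.load.
refined = {
    "papers": [
        {"title": "Placeholder paper A", "selected": True},
        {"title": "Placeholder paper B", "selected": False},
        {"title": "Placeholder paper C", "selected": None},  # assumed: not yet reviewed
    ]
}

kept, others = split_by_selection(refined)
print(f"kept {len(kept)} of {len(kept) + len(others)} papers")
```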

When refinement is finished, run the following command to download the papers:

findpapers download search_broad.json ./papers_broad --selected --verbose

After the command starts, the papers are downloaded one by one, but the process is slow: downloading 36 papers took about 1 hour.
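Because the download runs for a long time, a quick check of the output folder shows how far it got. The sketch below assumes the PDFs are saved directly in the output directory with a .pdf extension, which is not confirmed by this article; download_report is a hypothetical helper, and the temporary directory stands in for ./papers_broad.

```python
import tempfile
from pathlib import Path
from typing import Optional


def download_report(output_dir: str, n_selected: int, elapsed_seconds: float) -> dict:
    """Compare the PDFs on disk with the number of selected papers."""
    n_pdfs = len(list(Path(output_dir).glob("*.pdf")))
    per_paper: Optional[float] = elapsed_seconds / n_selected if n_selected else None
    return {
        "downloaded": n_pdfs,
        "missing": max(n_selected - n_pdfs, 0),
        "seconds_per_paper": per_paper,
    }


# Stand-in folder with three empty placeholder PDFs.
with tempfile.TemporaryDirectory() as tmp:
    for name in ("a.pdf", "b.pdf", "c.pdf"):
        (Path(tmp) / name).touch()
    # 36 papers in about 1 hour (3600 s), the figures reported in the article.
    report = download_report(tmp, n_selected=36, elapsed_seconds=3600)

print(report)
```

With the reported figures, the average works out to 100 seconds per paper.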
