【金融】- findpapers:论文搜索与下载工具

金融 - findpapers:论文搜索与下载工具

findpapers:论文搜索与下载工具

复制代码
findpapers search search.json --query "[Deep Learning] AND [Knowledge Graph] AND ([Quantitative Investment] OR [Algorithmic Trading] OR [Financial Analysis] OR [Risk Assessment] OR [Economic Cycle] OR [Business Cycle])" --databases "arxiv,ssrn,repec,econbiz,semanticscholar" --limit-db 40 --verbose

这段代码是一个使用 findpapers工具,在五个专业库中(arxiv,ssrn,repec,econbiz,semanticscholar),进行一定逻辑条件的,学术论文搜索的命令。

其中

复制代码
findpapers search search_broad.json --query "[...]" --databases "arxiv,pubmed" --limit-db 40 --verbose

该命令通过 findpapers工具从"arxiv,ssrn,repec,econbiz,semanticscholar"​数据库中检索符合如下指定关键词组合

复制代码
"[Deep Learning] AND [Knowledge Graph] AND ([Quantitative Investment] OR [Algorithmic Trading] OR [Financial Analysis] OR [Risk Assessment] OR [Economic Cycle] OR [Business Cycle])"

的学术论文,并将结果保存到 search_broad.json文件中。

参数说明如下:

完成后,有类似如下整理好的搜索结果(以下是单篇备选文献的结果),

复制代码
{
  "databases": [
    "arxiv",
    "ssrn",
    "repec",
    "econbiz",
    "semanticscholar"
  ],
  "limit": null,
  "limit_per_database": 40,
  "number_of_papers": 1,
  "number_of_papers_by_database": {
    "arXiv": 1
  },
  "papers": [
    {
      "abstract": "Knowledge Graphs have emerged as a compelling abstraction for capturing key\nrelationship among the entities of interest to enterprises and for integrating\ndata from heterogeneous sources. JPMorgan Chase (JPMC) is leading this trend by\nleveraging knowledge graphs across the organization for multiple mission\ncritical applications such as risk assessment, fraud detection, investment\nadvice, etc. A core problem in leveraging a knowledge graph is to link mentions\n(e.g., company names) that are encountered in textual sources to entities in\nthe knowledge graph. Although several techniques exist for entity linking, they\nare tuned for entities that exist in Wikipedia, and fail to generalize for the\nentities that are of interest to an enterprise. In this paper, we propose a\nnovel end-to-end neural entity linking model (JEL) that uses minimal context\ninformation and a margin loss to generate entity embeddings, and a Wide & Deep\nLearning model to match character and semantic information respectively. We\nshow that JEL achieves the state-of-the-art performance to link mentions of\ncompany names in financial news with entities in our knowledge graph. We report\non our efforts to deploy this model in the company-wide system to generate\nalerts in response to financial news. The methodology used for JEL is directly\napplicable and usable by other enterprises who need entity linking solutions\nfor data that are unique to their respective situations.",
      "authors": [
        "Wanying Ding",
        "Vinay K. Chaudhri",
        "Naren Chittar",
        "Krishna Konakanchi"
      ],
      "categories": {},
      "citations": null,
      "comments": "8 pages, 4 figures, IAAI-21",
      "databases": [
        "arXiv"
      ],
      "doi": "10.1609/aaai.v35i17.17796",
      "keywords": [],
      "number_of_pages": null,
      "pages": null,
      "publication": null,
      "publication_date": "2024-11-05",
      "selected": true,
      "title": "JEL: Applying End-to-End Neural Entity Linking in JPMorgan Chase",
      "urls": [
        "http://arxiv.org/abs/2411.02695v1",
        "http://arxiv.org/pdf/2411.02695v1",
        "http://dx.doi.org/10.1609/aaai.v35i17.17796"
      ]
    }
  ],
  "processed_at": "2025-10-08 07:39:04",
  "publication_types": null,
  "query": "[Deep Learning] AND [Knowledge Graph] AND ([Quantitative Investment] OR [Algorithmic Trading] OR [Financial Analysis] OR [Risk Assessment] OR [Economic Cycle] OR [Business Cycle])",
  "since": null,
  "until": null
}

搜索完成后只搜到了1篇文献,所以需要放宽一下约束条件(不局限于深度学习,包括机器学习),并限定专业库(更贴合金融量化投资需求的库)

复制代码
findpapers search search_broad.json --query "([Machine Learning] OR [Deep Learning] OR [Knowledge Graph]) AND ([Quantitative Investment] OR [Algorithmic Trading] OR [Financial Analysis] OR [Risk Assessment] OR [Finance] OR [Investment])" --databases "arxiv,semanticscholar" --limit-db 40 --since 2020-01-01 --verbose

搜索完成,要执行如下预选精炼:

复制代码
findpapers refine search_broad.json

精炼过程每一篇均要选择是否保留。

结束之后,执行如下代码进行论文下载:

复制代码
findpapers download search_broad.json ./papers_broad --selected --verbose

执行命令后,论文逐步下载,虽然速度较慢(36篇文献的下载耗时约1小时)。

相关推荐
2601_955505254 小时前
行业研究|AI-Ready高质量数据集建设难点与元数据标准化解决方案(基于国家数据局25号文)
人工智能·金融·能源·健康医疗·制造·政务
HavenlonLabs6 小时前
三年内,AI 控制会走向安全的一线
人工智能·安全·金融·架构·安全架构
汇海老周7 小时前
FX110金融历史复盘:1869年黑色星期五事件解析
人工智能·金融
科研小刘带你玩学术7 小时前
【科研快讯】KAIST突破性研究:让机器人“读懂“人类意图——VOTP算法开启Physical AI新纪元
论文·强化学习·机器人视觉·physical ai·人类意图识别·reward function
2601_955505257 小时前
自然人身份确权可信基础设施赋能身份风险等级标签合规
人工智能·网络安全·金融·健康医疗·媒体·教育电商·政务
2601_961963388 小时前
供应链金融中,电子债权凭证(应收账款的数字化)的法律性质
网络·人工智能·安全·金融·区块链·sass·政务
2601_961795749 小时前
瑞德克斯服务建设平台规则贴心吗?
金融
CryptoPP9 小时前
多市场行情 API 接入实战:一套接口打通股票/外汇/期货/加密货币 + WebSocket 实时推送
大数据·网络·人工智能·websocket·网络协议·金融·区块链
zandy10119 小时前
金融制造零售三行业实战:衡石 BI 多场景落地经验分享
金融·制造·零售
机汇五金_21 小时前
深圳钣金外壳OEM定制
金融