neo4j+LLM构建成品油&医疗跨领域知识图谱(初)&neo4j查询某个节点

全词查询

MATCH(n) WHERE n.name = '成品油' RETURN n

关键词查询:

MATCH (n) WHERE n.id CONTAINS '油' RETURN n

计算节点数

MATCH (n) RETURN count(n) AS number_of_nodes;

计算关系数:

MATCH ()-[r]->() RETURN count(r) AS number_of_relationships;

具体代码如下(仅做备份自查):

python 复制代码
from langchain_core.runnables import (
    RunnableBranch,
    RunnableLambda,
    RunnableParallel,
    RunnablePassthrough,
)
from langchain_core.prompts import ChatPromptTemplate
from langchain_core.prompts.prompt import PromptTemplate
from langchain_core.pydantic_v1 import BaseModel, Field
from typing import Tuple, List, Optional
from langchain_core.messages import AIMessage, HumanMessage
from langchain_core.output_parsers import StrOutputParser
import os
from langchain_community.graphs import Neo4jGraph
from langchain.document_loaders import WikipediaLoader
from langchain.text_splitter import TokenTextSplitter
from langchain_openai import ChatOpenAI
from langchain_experimental.graph_transformers import LLMGraphTransformer
from neo4j import GraphDatabase
from yfiles_jupyter_graphs import GraphWidget
from langchain_community.vectorstores import Neo4jVector
from langchain_openai import OpenAIEmbeddings
from langchain_community.vectorstores.neo4j_vector import remove_lucene_chars
from langchain_core.runnables import ConfigurableField, RunnableParallel, RunnablePassthrough

try:
  import google.colab
  from google.colab import output
  output.enable_custom_widget_manager()
except:
  pass
python 复制代码
os.environ["OPENAI_API_KEY"] = "sk-**********"
os.environ["NEO4J_URI"] = "bolt://localhost:7687"
os.environ["NEO4J_USERNAME"] = "neo4j"
os.environ["NEO4J_PASSWORD"] = "123456cys"

graph = Neo4jGraph()
python 复制代码
# 假设你有一个本地文件路径
file_path = 'data/oil072801.txt'  # 替换为你的文件路径

# 读取本地文件内容
with open(file_path, 'r', encoding='utf-8') as file:
    raw_documents = file.read()

# 创建 Document 类,添加 metadata 属性
class Document:
    def __init__(self, content, metadata=None):
        self.page_content = content
        self.metadata = metadata  # 这里添加了 metadata 属性

# 创建文档对象,并设置 metadata 属性
documents = [Document(content, {'source': 'oil0728'}) for content in raw_documents.split('\n')]

# Define chunking strategy
text_splitter = TokenTextSplitter(chunk_size=512, chunk_overlap=24)
split_documents = text_splitter.split_documents(documents)

# Initialize the LLM
llm = ChatOpenAI(temperature=0, model_name="gpt-3.5-turbo-0125")  # gpt-4-0125-preview occasionally has issues
llm_transformer = LLMGraphTransformer(llm=llm)

# Transform documents to graph documents
graph_documents = llm_transformer.convert_to_graph_documents(split_documents)

# 存储到 neo4j
graph.add_graph_documents(
    graph_documents,
    baseEntityLabel=True,
    include_source=True
)
相关推荐
前端双越老师5 分钟前
让我每天沉浸于 AI 编程的大玩具
人工智能
却道天凉_好个秋10 分钟前
OpenCV(十八):绘制文本
人工智能·opencv·计算机视觉
rengang6631 分钟前
105-Spring AI Alibaba Module RAG 使用示例
java·人工智能·spring·rag·spring ai·ai应用编程
说私域33 分钟前
开源AI智能客服、AI智能名片与S2B2C商城小程序在营销运营中的应用与重要性研究
人工智能·小程序·开源
美团技术团队35 分钟前
LongCat-Flash-Omni正式发布并开源:开启全模态实时交互时代
人工智能
rengang6635 分钟前
09-神经网络的结构:描述神经网络的层次化组成和设计
人工智能·深度学习·神经网络
rengang6636 分钟前
07-神经元模型:介绍神经网络中神经元的结构和功能
人工智能·深度学习·神经网络
说私域41 分钟前
开源AI智能名片链动2+1模式S2B2C商城小程序商业化路径优化研究
人工智能·小程序·开源
倦王1 小时前
PyTorch图片数据载入方法
人工智能·pytorch·python