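Research on UHV Engineering Knowledge Graph Construction Technology Based on New Energy Transmission Characteristics from Desert Areas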
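Cite this article: Hu Jie (胡杰)1, Xu Gang (许刚)1, Qi Lizhong (齐立忠)2, Qie Xin (郄鑫)1. Research on UHV Engineering Knowledge Graph Construction Technology Based on New Energy Transmission Characteristics from Desert Areas [J]. Power System and Clean Energy (电网与清洁能源), 2023, 39(11): 1-8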
Funding: Science and Technology Project of State Grid Corporation of China (5100-202113396A-0-0-00)

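Abstract (translated from the Chinese): The desert, Gobi, and barren-land regions of western China have high-quality solar and wind resources. Because this power must be delivered over long distances and at large capacity, UHV projects will become the main means of transmitting it. UHV engineering data is large in volume, highly interrelated, and poorly structured, and traditional collection and analysis methods based on expert experience can no longer keep pace with its growth. Knowledge graph technology can structure engineering data effectively, but traditional substring-based named entity recognition generates a large number of negative substring samples, which harms model accuracy. This paper proposes an improved named entity recognition algorithm. First, the ontology layer of the knowledge graph is built from an analysis of typical UHV engineering texts. Second, a negative sampling technique that takes entity boundaries into account reduces the number of substring samples and improves the efficiency of named entity recognition. Finally, a relation extraction algorithm yields entity pairs and their relation categories. Experiments show that the proposed algorithm differs little in accuracy from the reference algorithm while improving running efficiency by 9%, which verifies the effectiveness of the model.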
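Keywords (translated from the Chinese): UHV engineering; knowledge graph; natural language processing; named entity recognition; relation recognition
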
Research on UHV Engineering Knowledge Graph Construction Technology Based on New Energy Transmission Characteristics from Desert Areas
|
|
Abstract: The desert, Gobi, and barren-land areas of western China have high-quality solar and wind energy resources. Because delivering this power requires long transmission distances and large transmission capacity, UHV projects have become the main means of electricity transmission and have already entered a large-scale construction stage. UHV engineering data is characterized by large volume, high correlation, and poor structure, so traditional engineering data collection and analysis methods based on expert experience can no longer keep pace with the growth of the data. Knowledge graph technology can structure engineering data effectively. However, traditional named entity recognition based on substrings generates a large number of negative substring samples, which harms the accuracy of the model. To address this, the paper proposes an improved named entity recognition algorithm. First, the knowledge graph ontology layer is constructed from an analysis of typical UHV project texts. Second, a negative sampling technique that considers entity boundaries is used to reduce the number of substring samples and improve the efficiency of named entity recognition. Finally, a relation extraction algorithm is used to obtain entity pairs and relation categories. Experiments show that the accuracy of the proposed algorithm differs little from that of the reference algorithm, while running efficiency increases by 9%, which verifies the effectiveness of the model.
Keywords: ultra-high voltage construction project; knowledge graph; natural language processing; named entity recognition; relationship recognition
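
The abstract names boundary-aware negative sampling for substring-based named entity recognition but does not give the sampling rule. The sketch below is only an illustration under stated assumptions, not the authors' method: it enumerates every candidate substring of a sentence, then keeps a fixed number of negatives, preferring those that share a start or end position with a labelled entity. The function names, the weight `near=5.0`, and the limits `max_len` and `k` are all hypothetical.

```python
import random
from typing import List, Set, Tuple

Span = Tuple[int, int]  # (start, end) token offsets, end exclusive


def enumerate_spans(n_tokens: int, max_len: int) -> List[Span]:
    """List every candidate substring of at most max_len tokens."""
    return [(i, j)
            for i in range(n_tokens)
            for j in range(i + 1, min(i + max_len, n_tokens) + 1)]


def boundary_weight(span: Span, gold: Set[Span], near: float = 5.0) -> float:
    """Assumed weighting: a span sharing a start or end with a gold
    entity counts as a harder negative and gets a larger weight."""
    starts = {s for s, _ in gold}
    ends = {e for _, e in gold}
    return near if span[0] in starts or span[1] in ends else 1.0


def sample_negatives(n_tokens: int, gold: Set[Span], max_len: int = 6,
                     k: int = 10, seed: int = 0) -> List[Span]:
    """Keep at most k negative spans instead of all of them."""
    rng = random.Random(seed)
    negatives = [s for s in enumerate_spans(n_tokens, max_len)
                 if s not in gold]
    # Weighted sampling without replacement: draw key u ** (1 / w) per
    # span and keep the k largest keys (Efraimidis-Spirakis scheme).
    keyed = sorted(
        negatives,
        key=lambda s: rng.random() ** (1.0 / boundary_weight(s, gold)),
        reverse=True,
    )
    return keyed[:min(k, len(negatives))]


# A 12-token sentence with two labelled entities (hypothetical offsets).
gold_entities = {(0, 3), (5, 7)}
all_spans = enumerate_spans(12, 6)
kept = sample_negatives(12, gold_entities)
print(len(all_spans), "candidate spans,", len(kept), "negatives kept")
```

Training on the sampled negatives plus the gold spans, rather than on every candidate, is what reduces the number of substring samples; how much this helps in practice depends on choices the abstract does not specify.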