Mineral question-answering system in Chinese based on multi-hop reasoning in knowledge graphs

Xiaohui JI, Yuhang DONG, Zhongji YANG, Mei YANG, Mingyue HE, Yuzhu WANG

PDF(2371 KB)
PDF(2371 KB)
Earth Science Frontiers ›› 2024, Vol. 31 ›› Issue (4) : 37-46. DOI: 10.13745/j.esf.sf.2024.5.11

Mineral question-answering system in Chinese based on multi-hop reasoning in knowledge graphs

Author information +
History +

Abstract

Mineral knowledge is important for geosciences research. Some mineral databases are used for storing and retrieving mineral knowledge, and common search engines can also answer mineral questions. But the mineral databases cannot answer mineral questions in natural language and the answers returned from the common search engines need to be filtered. To solve the above problems knowledge graphs have been used; however, the current mineral question-answering based on knowledge graphs can only answer simple questions involving one triplet, but not complex questions involving multiple triplets and multi-hop reasoning. This paper presents a mineral question-answering system based on multi-hop reasoning in knowledge graphs. The mineral entities, relations and questions are first transformed into vectors of complex domain to obtain their semantic and reasoning relations by using the ComplEx model, and Bert-LSTM-CRF is applied to obtain the head of the question. Candidate entities of the head are then obtained by calculating the edit distance and word segmentation, and a fully connected network is constructed to obtain the most related entity of the head of the question from the candidate entities and the entity is the start of the reasoning. Next, the entity and question vectors are concatenated into an input vector into the fully connected network to get their most related relation; afterward another entity most related to the starting entity/relation can be obtained from the mineral knowledge graph to start the reasoning of the next hop; the question of the next hop is updated by the concatenated vector of this hop to bring the reasoning information of this hop to the next hop. This process continues until the most related relation obtained is the stop sign predefined. The last entity obtained in this process is the answer to the question and the reasoning path is also remembered. This method is implemented using Python under Tensorflow and compared with related methods, which show the effectiveness of the method. Using this method, a question-answering system capable of answering complex mineral questions is developed under the front and back end separation architecture employing RESTful API, React, Ajax, echarts and Flask, which provides a platform for acquiring mineral knowledge and performing geosciences research.

Key words

mineral / question answering / knowledge graph / multi-hop reasoning

Cite this article

Download Citations
Xiaohui JI , Yuhang DONG , Zhongji YANG , et al . Mineral question-answering system in Chinese based on multi-hop reasoning in knowledge graphs. Earth Science Frontiers. 2024, 31(4): 37-46 https://doi.org/10.13745/j.esf.sf.2024.5.11

References

[1]
YU Y Q, WANG Y Z, MU J Q, et al. Chinese mineral named entity recognition based on BERT model[J]. Expert Systems with Applications, 2022, 206: 117727.
[2]
LIU C J, JI X H, DONG Y H, et al. Chinese mineral question and answering system based on knowledge graph[J]. Expert Systems with Applications, 2023, 231: 120841.
[3]
周永章, 张良均, 张奥多, 等. 地球科学大数据挖掘与机器学习[M]. 广州: 中山大学出版社, 2018.
[4]
周永章, 陈川, 张旗, 等. 地质大数据分析的若干工具与应用[J]. 大地构造与成矿学, 2020, 44(2): 173-182.
[5]
周永章, 张前龙, 黄永健, 等. 钦杭成矿带斑岩铜矿知识图谱构建及应用展望[J]. 地学前缘, 2021, 28(3): 67-75.
[6]
周永章, 左仁广, 刘刚, 等. 数学地球科学跨越发展的十年: 大数据、 人工智能算法正在改变地质学[J]. 矿物岩石地球化学通报, 2021, 40(3): 556-573, 777.
[7]
周成虎, 王华, 王成善, 等. 大数据时代的地学知识图谱研究[J]. 中国科学: 地球科学, 2021, 51(7): 1070-1079.
[8]
冉一早, 董少春, 王汝成, 等. 铌钽矿床知识图谱的构建及应用实践[J]. 高校地质学报, 2023, 29(3): 359-371.
[9]
张春菊, 刘文聪, 张雪英, 等. 基于本体的金矿知识图谱构建方法[J]. 地球信息科学学报, 2023, 25(7): 1269-1281.
[10]
张悦. 矿物领域知识图谱构建技术研究与实现[D]. 北京: 中国地质大学(北京), 2021.
[11]
燕群. 矿床知识图谱构建方法及找矿预测应用: 以甘肃寨上-马坞金矿集区为例[D]. 长春: 吉林大学, 2022.
[12]
陈忠良, 袁峰, 李晓晖, 等. 基于BERT-BiLSTM-CRF模型的中文岩石描述文本命名实体与关系联合提取[J]. 地质论评, 2022, 68(2): 742-750.
[13]
YIH W T, CHANG M W, HE X D, et al. Semantic parsing via staged query graph generation: question answering with knowledge base[C]//Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing (volume 1:long papers). Stroudsburg, USA: Association for Computational Linguistics, 2015: 1321-1331.
[14]
ZHENG W G, YU J X, ZOU L, et al. Question answering over knowledge graphs[J]. Proceedings of the VLDB Endowment, 2018, 11(11): 1373-1386.
[15]
SAXENA A, TRIPATHI A, TALUKDAR P. Improving multi-hop question answering over knowledge graphs using knowledge base embeddings[C]// Proceedings of the 58th annual meeting of the association for computational linguistics. Stroudsburg, USA: Association for Computational Linguistics, 2020: 4498-4507.
[16]
张天杭, 李婷婷, 张永刚. 基于知识图谱嵌入的多跳中文知识问答方法[J]. 吉林大学学报(理学版), 2022, 60(1): 119-126.
[17]
叶蕾, 张宇迪, 杨旭华. 利用知识图谱的多跳可解释问答[J/OL]. 小型微型计算机系统, 2023: 1-11[2024-04-24]. https://kns.cnki.net/kcms/detail/21.1106.TP.20230519.0923.002.html.
[18]
DOWNS B. IMA Mineral List[EB/OL]. (2018-03-04)[2024-04-24]. https://rruff.info/ima/.
[19]
BARTHELMY D. Mineralogy database[DB/OL]. (2009-03-05)[2024-04-24]. http://webmineral.com/.
[20]
Hudson Institute of Mineralogy. Mindat[EB/OL]. (2020-05-15)[2024-04-24]. https://www.mindat.org/.
[21]
Mineralogical Society of America. American mineralogist crystal structure database[DB/OL]. (2019-04-01)[2024-04-24]. http://rruff.geo.arizona.edu/AMS/amcsd.php.
[22]
中国地质大学北京. 国家岩矿化石标本资源共享平台[EB/OL]. (2003-01-01)[2024-04-24]. http://www.nimrf.net.cn/.
[23]
ABUJABAL A, YAHYA M, RIEDEWALD M, et al. Automated template generation for question answering over knowledge graphs[C]// Proceedings of the 26th international conference on World Wide Web. Perth, Australia: ACM, 2017: 1191-1200.
[24]
TROUILLON T, WELBL J, RIEDEL S, et al. Complex embeddings for simple link prediction[C]// Proceedings of the 33rd international conference on machine learning, New York: ACM, 2016: 2071-2080.
[25]
WANG Q, MAO Z D, WANG B, et al. Knowledge graph embedding: a survey of approaches and applications[J]. IEEE Transactions on Knowledge and Data Engineering, 2017, 29(12): 2724-2743.
[26]
DEVLIN J, CHANG M, LEE K, et al. Bert: Pre-training of deep bidirectional transformers for language understanding[C]//Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics:human language technologies. Minneapolis, Minnesota: Association for Computational Linguistics, 2019: 4171-4186.

Comments

PDF(2371 KB)

Accesses

Citation

Detail

Sections
Recommended

/