Research and Practice on Construction of Medical Data Knowledge Graph
PDF

Keywords

Medical Data Knowledge Graph
Knowledge Graph Construction
Data Fusion
Clinical Decision Support
Entity Recognition

How to Cite

Han, T., Li, X., Li, A., Zhou, X., & Yang, Z. (2025). Research and Practice on Construction of Medical Data Knowledge Graph. International Theory and Practice in Humanities and Social Sciences, 2(8), 11–19. https://doi.org/10.70693/itphss.v2i8.1225

Abstract

This paper conducts in-depth research and practice on the construction of medi- cal data knowledge graphs, exploring their core value in the medical field. With the explosive growth of medical data from electronic health records, genomic sequenc- ing, and medical literature, data silos and heterogeneity have restricted medical efficiency and intelligent development. Medical data knowledge graphs address this by integrating multi-source heterogeneous data into a structured semantic network, enabling effective organization and application of medical knowledge.

The study details the construction process, including data acquisition from clini- cal databases and literature, preprocessing (cleaning, annotation), core technologies (ontology design, entity recognition, relation extraction, knowledge fusion), and graph database-based storage and query methods. It compares technical routes (rule-based, machine learning, deep learning) and highlights innovations like com- bining BERT and graph neural networks to enhance entity extraction and relation prediction accuracy.

Practical applications in clinical decision support (e.g., accelerating differen- tial diagnosis via disease-symptom-drug relationships), rare disease diagnosis, and public health management (e.g., tracking infectious disease spread) are explored through case studies. The paper also summarizes challenges such as data privacy, dynamic knowledge updates, and terminology standardization, proposing solutions and future directions. This research provides theoretical and practical support for advancing medical informatization and intelligence, contributing to improved service quality, reduced costs, and precision medicine.

https://doi.org/10.70693/itphss.v2i8.1225
PDF

References

1. Abubakar, B., McCarron, H., Smirnov, S., et al. (2023). Healthcare knowledge graph construction: A systematic review of the state-of-the-art, open issues, and opportunities. *Journal of Big Data, 10*(1), 1-33.

2. Tao, X., Pham, T., Zhang, J., et al. (2023). Medical knowledge graph: Data sources, construction, reasoning, and applications. *Big Data Mining and Analytics, 6*(2), 201-219.

3. Shi, L., Li, S., Yang, X., et al. (2020). Real-world data medical knowledge graph: Construction and applications. *Artificial Intelligence in Medicine, 103*, Article 101812.

4. Rotmensch, M., Halpern, Y., Tlimat, A., et al. (2017). Learning a health knowledge graph from electronic medical records. *Scientific Reports, 7*, Article 5994.

5. Chandak, P., Huang, K., & Zitnik, M. (2023). Building a knowledge graph to enable precision medicine. *Scientific Data, 10*, Article 67.

6. Santos, A., Colaço, A. R., Nielsen, A. B., et al. (2020). Clinical knowledge graph integrates proteomics data into clinical decision-making. *Nature Biotechnology, 38*, 1089-1093.

7. Chen, X., Zhu, J., Lu, W., et al. (2020). A general approach for constructing a knowledge graph: A case study in the medical domain. *Journal of Biomedical Informatics, 103*, Article 103384.

8. Bonner, S., Barrett, I. P., Ye, C., et al. (2022). A review of biomedical datasets relating to drug discovery: A knowledge graph perspective. *Briefings in Bioinformatics, 23*(6), Article bbac404.

9. Callahan, T. J., Tripodi, I. J., Stefanski, A. L., et al. (2024). An open source knowledge graph ecosystem for the life sciences. *Scientific Data, 11*, Article 363.

10. Lee, J., Yoon, W., Kim, S., et al. (2020). BioBERT: A pre-trained biomedical language representation model for biomedical text mining. *Bioinformatics, 36*(4), 1234-1240.

11. Nelson, S. J., Zeng, K., Kilbourne, J., et al. (2011). Normalized names for clinical drugs: RxNorm at 6 years. *Journal of the American Medical Informatics Association, 18*(4), 441-448.

12. Tao, C., Chen, J., Fan, F., et al. (2020). Constructing knowledge graphs and their biomedical applications. *Computational and Structural Biotechnology Journal, 18*, 1414-1428.

13. Robinson, I., Webber, J., & Eifrem, E. (2013). *Graph databases*. O'Reilly Media.

14. Miller, J. J. (2013). Graph database applications and concepts with Neo4j. In *Proceedings of the Southern Association for Information Systems Conference* (pp. 141-147).

15. Suchanek, F. M., Abiteboul, S., & Senellart, P. (2011). PARIS: Probabilistic alignment of relations, instances, and schema. *Proceedings of the VLDB Endowment, 5*(3), 157-168.

16. Chen, X., Zhao, Z., Jia, W., et al. (2021). Enterprise knowledge graphs: A literature review. *IEEE Access, 9*, 9335-9355.

17. Ji, S., Pan, S., Cambria, E., et al. (2022). A survey on knowledge graphs: Representation, acquisition, and applications. *IEEE Transactions on Neural Networks and Learning Systems, 33*(2), 494-514.

18. Hogan, A., Blomqvist, E., Cochez, M., et al. (2021). Knowledge graphs. *ACM Computing Surveys, 54*(4), 1-37.

19. Zhu, Y., Pan, L., Liu, P., et al. (2022). Causal knowledge graph construction and evaluation for clinical decision support of diabetic nephropathy. *Journal of Biomedical Informatics, 134*, Article 104212.

20. Wang, Q., Mao, Z., Wang, B., et al. (2017). Knowledge graph embedding: A survey of approaches and applications. *IEEE Transactions on Knowledge and Data Engineering, 29*(12), 2724-2743.

21. Sun, W., Liu, Z., Yuan, C., Zhou, X., Pei, Y., & Wei, C. (2025). RCSAN residual enhanced channel spatial attention network for stock price forecasting. Scientific Reports, 15(1), 21800.

22. Wang, Y., Zhang, H., Yuan, C., Li, X., & Jiang, Z. (2025). An efficient scheduling method in supply chain logistics based on network flow. Processes, 13(4), 969.

23. Liu, Z., Yuan, C., Zhang, Z., Zhou, X., Li, X., Tian, Z., ... & Tian, Z. (2025). A hybrid YOLO-UNet3D framework for automated protein particle annotation in Cryo-ET images. Scientific Reports, 15(1), 25033.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright (c) 2025 Tengyue Han (Co-Authors); Xuanyu Li, Ao Li, Xinyue Zhou, Zhi Yang (Author)

Downloads

Download data is not yet available.