Agentic Knowledge Graph Traversal in Protein-Protein Relation Grounding

7 pages • Published: April 19, 2026

Gabriel Reder, Carl Collins, Larisa Soldatova and Ross King

Abstract

Automated semantic knowledge extraction from scientific literature promises to open vast quantities of scientific knowledge to formal analysis and computationally driven discovery. In this work, we investigate the promise of Large Language Model (LLM) agents for extracting structured knowledge from biomedical texts, specifically for grounding protein-protein interaction (PPI) relations to terms in the PSI-MI ontology of molecular interactions. While LLMs excel at summarization, they struggle to interface with structured knowledge representations. We equipped agents with various knowledge graph interaction strategies and measured their PPI grounding performance. Our central finding is that PageRank-guided traversal, a method rooted in graph topology, consistently outperforms both embedding-based approaches such as retrieval-augmented generation (RAG) and top-down traversal strategies, including breadth-first search (BFS), depth-first search (DFS), and local greedy search, at extracting knowledge previously missed by human curators. Our initial results indicate that the structure of a well-curated knowledge base is itself a powerful source of information, a principle underutilized in current agentic knowledge base interaction methods.
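The abstract does not say how PageRank scores steer the agent, so the following is only a minimal sketch of the general idea under stated assumptions: ontology terms are nodes, each term points to its parent through a child-to-parent edge, and candidate terms are offered to the agent in descending PageRank order. The term names, the edge direction, the damping factor, and the `rank_candidates` helper are all hypothetical and are not taken from the paper or from PSI-MI.

```python
from collections import defaultdict


def pagerank(edges, damping=0.85, iterations=50):
    """Power-iteration PageRank over directed (source, target) edges."""
    nodes = sorted({node for edge in edges for node in edge})
    out_links = defaultdict(list)
    for src, dst in edges:
        out_links[src].append(dst)

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by nodes without outgoing edges is spread uniformly.
        dangling = sum(rank[node] for node in nodes if not out_links[node])
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {node: base for node in nodes}
        for src in nodes:
            for dst in out_links[src]:
                new_rank[dst] += damping * rank[src] / len(out_links[src])
        rank = new_rank
    return rank


def rank_candidates(candidates, scores):
    """Order candidate terms by descending PageRank (hypothetical helper)."""
    return sorted(candidates, key=lambda term: scores[term], reverse=True)


# Toy ontology with placeholder term names (assumption: child -> parent edges).
toy_edges = [
    ("term_a", "root"),
    ("term_b", "root"),
    ("term_c", "term_a"),
    ("term_d", "term_a"),
    ("term_e", "term_a"),
    ("term_f", "term_b"),
]

scores = pagerank(toy_edges)
ordered = rank_candidates(["term_b", "term_a"], scores)
print(ordered)
```

In this toy graph, `term_a` has three children and `term_b` has one, so `term_a` accumulates more rank and is offered first. This illustrates how graph topology alone, without any text embedding, can prioritize which terms an agent inspects.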

Keyphrases: agentic ai, ai for science, biology, knowledge representation, natural language processing, ontologies, relation extraction, research process automation, semantic web

In: Jernej Masnec, Hamid Reza Karimian, Parisa Kordjamshidi and Yan Li (editors). Proceedings of AI for Accelerated Research Symposium, vol 3, pages 50-56.

Links:	https://easychair.org/publications/paper/54ZjN
	https://doi.org/10.29007/8fsx

BibTeX entry

@inproceedings{AIAS2025:Agentic_Knowledge_Graph_Traversal,
  author    = {Gabriel Reder and Carl Collins and Larisa Soldatova and Ross King},
  title     = {Agentic Knowledge Graph Traversal in Protein-Protein Relation Grounding},
  booktitle = {Proceedings of AI for Accelerated Research Symposium},
  editor    = {Jernej Masnec and Hamid Reza Karimian and Parisa Kordjamshidi and Yan Li},
  series    = {EPiC Series in Technology},
  volume    = {3},
  publisher = {EasyChair},
  bibsource = {EasyChair, https://easychair.org},
  issn      = {2516-2322},
  url       = {https://easychair.org/publications/paper/54ZjN},
  doi       = {10.29007/8fsx},
  pages     = {50--56},
  year      = {2026}}
